[MLU] add index select and index_select_grad kernel #4

Open
wants to merge 804 commits into base: mlu-r2.4

PeiyuLau
Collaborator

@PeiyuLau PeiyuLau commented Jun 9, 2023

PR types

New features

PR changes

OPs

Describe

Add the index_select and index_select_grad kernels for MLU.
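
For reviewers who want a quick reference for the expected behaviour, here is a minimal pure-Python sketch of what the two kernels are generally understood to compute: the forward op gathers slices along an axis, and the grad op scatter-adds the output gradient back into a zero tensor shaped like the input, so repeated indices accumulate. This is not the MLU implementation from this PR. The helper names, the 2-D-only restriction, and the nested-list representation are assumptions made for illustration.

```python
def index_select(x, index, axis=0):
    """Gather rows (axis=0) or columns (axis=1) of a 2-D nested list.

    Illustrative reference only; assumes a rectangular 2-D input.
    """
    if axis == 0:
        return [list(x[i]) for i in index]
    if axis == 1:
        return [[row[j] for j in index] for row in x]
    raise ValueError("this sketch only handles axis 0 or 1")


def index_select_grad(out_grad, index, x_shape, axis=0):
    """Scatter-add out_grad into a zero tensor of shape x_shape.

    Positions selected more than once receive the sum of their gradients;
    positions never selected stay at zero.
    """
    rows, cols = x_shape
    x_grad = [[0.0] * cols for _ in range(rows)]
    if axis == 0:
        for k, i in enumerate(index):
            for c in range(cols):
                x_grad[i][c] += out_grad[k][c]
    elif axis == 1:
        for r in range(rows):
            for k, j in enumerate(index):
                x_grad[r][j] += out_grad[r][k]
    else:
        raise ValueError("this sketch only handles axis 0 or 1")
    return x_grad


if __name__ == "__main__":
    x = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
    index = [0, 2, 0]  # row 0 is selected twice
    out = index_select(x, index, axis=0)
    ones = [[1.0] * 3 for _ in index]
    print(out)
    print(index_select_grad(ones, index, (3, 3), axis=0))
```

With a repeated index, the gradient for the repeated row is the sum of the matching output-gradient rows, which is the case a unit test for the grad kernel would most want to cover.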

weishengying and others added 30 commits October 13, 2022 17:22
* fix immutable op quantize bugs

* fix

* fix build bug

* fix test

* notest,test=inference

* fix ppyoloe acc drop bugs

* fix test

* fix test

* add test

* fix

* fix

* fix test

* fix refined name bug

* fix test

* bias fix

* fix matmul weight dequant bug

* re-ci

* fix tester

* fix test

* fix tester

* update weight dequantize func

* update code

* update test for coverage

* update test

* update cmake

* update cmakelist

* update code

* rerun ci

* remove useless code
…addle#46920)

* refine logic of choosing phi kernel

* fix compile bug
* Add more record event in run program op

* Refine code

* Restore code

* Rename event
* Fix hAPI bug of not compatible with LayerHook
* for gpt-gen

* fix reshard

* adapt assign and shape op

* add dist_assign & unittest

* add conditional block unittest

* rename unittest
* update test_image.py

* update test_image.py
* simplify conv_mkldnn op registration

* remove custom type value in conv grad op
* add singleton to custom device

* Update custom_device.cc

Init device_init_flag_ in default
haohongxiang and others added 29 commits October 28, 2022 12:59
…PaddlePaddle#47369)

* fix no sync bugs

* update

* update task chain

fix: update wait chain

feat: add `GetDeviceContext` for gloo

* fix oom

* fix dev

* update

* update

Co-authored-by: LiYuRio
Co-authored-by: ForFishes
* [JITLayer]Enable OneDNN on CPU and Fix zero shape

* remove VLOG
* fix engine build method

* fix import

* update engine cost

* update raise error

* update cmakelist

* revert optimizer

* revert optimizer

* fix unittest

* fix unittest

Co-authored-by: caozhou
…7447)

* add fused_allreduce_gradients_with_group

* add scale

* fix ci
* mapping from dist name scope to single name scope

* update

* fix gen cmake

* support runtype is '' when using test_runner.py

* Revert "fix gen cmake"

This reverts commit d7a653d.

* update gen-ut-cmakelist; test=document_fix

* revert code; test=document_fix
…ls (PaddlePaddle#47469)

* rm kaiser window in audio window function

* rm paddle audio utils which is redundant

* rm kaiser in test_audio_functions.py
* fix python module not found bug

* delete unused cast,test=allcases
* fix typo `Fasle`/`Flase` -> `False`

* fix typo `Ture` -> `True`
@PeiyuLau PeiyuLau changed the title from "Add index select" to "[MLU] add index select and index_select_grad kernel" on Jun 9, 2023