Skip to content

Pull requests: NVIDIA/FasterTransformer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Remove parenthesis from asserts
#699 opened Jul 2, 2023 by miguelusque Loading…
Add cuDNN include path as a common include dir
#724 opened Jul 18, 2023 by jacobkahn Loading…
Fix beam search output_log_prob index error
#732 opened Jul 25, 2023 by cpm0722 Loading…
Add fusion-for-decoder-only for llama
#733 opened Jul 28, 2023 by binxuan Loading…
[Bugfix] GptJ & GptNeoX batch inference error
#742 opened Aug 11, 2023 by YZP17121579 Loading…
Support Seq length up to 8K
#756 opened Sep 4, 2023 by zhen-jia Loading…
Ft llama opt
#762 opened Oct 2, 2023 by dypshong Loading…
Include stdio.h
#770 opened Oct 19, 2023 by JihaoXin Loading…
Update README.md
#776 opened Oct 29, 2023 by eltociear Loading…
ProTip! Follow long discussions with comments:>50.