Pull requests: huggingface/text-generation-inference

Pull requests list

IPEX support FP8 kvcache
#3144 opened Mar 31, 2025 by sywangyi Draft
5 tasks
Use ROCM 6.3.1
#3141 opened Mar 27, 2025 by mht-sharma
5 tasks
WIP: Add VLM transformers backend
#3132 opened Mar 21, 2025 by mht-sharma Draft
4 of 12 tasks
Gaudi: Use exponential growth to replace BATCH_BUCKET_SIZE [label: gaudi]
#3131 opened Mar 21, 2025 by yuanwu2017
5 tasks
Gaudi: clean cuda/rocm code in hpu backend, enable flat_hpu [label: gaudi]
#3113 opened Mar 14, 2025 by sywangyi
5 tasks
feat: align function id with tool call response
#3111 opened Mar 13, 2025 by drbh
wip: comment out prepend full_text
#3079 opened Mar 7, 2025 by jrc2139 Draft
1 of 5 tasks
Pr 2982 ci branch
#3046 opened Feb 20, 2025 by drbh
Support xccl distributed backend
#3034 opened Feb 18, 2025 by dvrogozh
Add 'json_schema' alias to GrammarType.Json
#2982 opened Jan 31, 2025 by aW3st
2 of 5 tasks
[Backend] Introduce vLLM backend
#2976 opened Jan 31, 2025 by mfuntowicz
llava next image encoder to allow un-aligned patch / image sizes
#2936 opened Jan 22, 2025 by jimexist
5 tasks
Update Dockerfile to use devel image for compatibility
#2848 opened Dec 16, 2024 by YaserJaradeh
2 of 5 tasks
Enable qwen2vl video
#2756 opened Nov 18, 2024 by drbh
9 tasks done
[WIP] Add gfx1100 support to AMD pytorch build
#2642 opened Oct 13, 2024 by cazlo Draft
1 of 5 tasks
Add model_load_time metric
#2311 opened Jul 26, 2024 by Edwinhr716
2 of 5 tasks