Pull requests: huggingface/text-embeddings-inference
Pull requests list
fix(docker): detect CUDA version from CUDA UMD Version header on driver 6xx (#870)
#871
opened Jun 3, 2026 by
Anai-Guo
refactor: streamline tensor initialization and rearrange struct fields for clarity
#869
opened May 26, 2026 by
Unmesh100
chore: enable Dependabot weekly GitHub Actions bumps
dependabot
#868
opened May 26, 2026 by
hf-dependantbot-rollout
Bot
Support modular Sentence Transformers cross-encoder rerankers (e.g. ettin-reranker)
#867
opened May 25, 2026 by
hotchpotch
3 of 5 tasks
feat: ROCm flash-attn varlen, triton layer norm, and AMD Dockerfile
#860
opened Apr 9, 2026 by
Abdennacer-Badaoui
Member
Add rate-limited and aggregate logging to reduce log volume at high load
#859
opened Apr 8, 2026 by
dsingal0
Add repository cloning step for local installation
#781
opened Dec 19, 2025 by
smedegaard
1 of 5 tasks
feat: add varlen attention on cpu
#777
opened Dec 17, 2025 by
michaelfeil
Contributor
Draft
5 tasks
candle: health check by queuing on cuda
#775
opened Dec 17, 2025 by
michaelfeil
Contributor
5 tasks
Add Support for XProvence Sentence-Level Context Pruning (naver/xprovence-reranker-bgem3-v1)
#770
opened Dec 4, 2025 by
sigridjineth