issues
search
huggingface
/
optimum-tpu
Google TPU optimizations for transformers models
Apache License 2.0
66
stars
17
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Call function directly
#47
ManfeiBai
closed
4 months ago
0
Minor fix for mispelled stage in TGI dockerfile.
#46
thealmightygrant
closed
4 months ago
0
Align to Transformers 4.41.1
#45
tengomucho
closed
4 months ago
1
Fine tuning with FSDP v2
#44
tengomucho
closed
4 months ago
1
Issue getting Llama3 8b running on GKE
#43
francescov1
opened
4 months ago
22
Fix typo ; Update llama_tuning.md
#42
furkanakkurt1335
closed
4 months ago
1
Update to Pytorch 2.3.0 and transformers v4.40.2
#41
tengomucho
closed
4 months ago
1
Bug doc builder
#40
pagezyhf
closed
4 months ago
0
Basic Llama2 Tuning
#39
tengomucho
closed
4 months ago
10
Update server ip used by client to connect
#38
ManfeiBai
closed
3 months ago
2
Try again to fix nightly builds
#37
tengomucho
closed
5 months ago
0
fix(build): setup.py removed from build_dist dependencies
#36
tengomucho
closed
5 months ago
0
chore(ci): added workflow for nightly tests
#35
tengomucho
closed
5 months ago
0
Include two different stages for building TGI image:
#34
mfuntowicz
closed
5 months ago
1
Fix missing '=' to assign environment variables in the default case w…
#33
mfuntowicz
closed
5 months ago
1
Llama support
#32
tengomucho
closed
5 months ago
4
Sharding in tgi
#31
tengomucho
closed
5 months ago
1
Fix tests with do_sample=True
#30
tengomucho
closed
5 months ago
1
Fix optimum-tpu pip install instructions
#29
mfuntowicz
closed
5 months ago
2
Forward arguments from TGI launcher to the model
#28
mfuntowicz
closed
5 months ago
1
Fix TGI missing import
#27
mfuntowicz
closed
5 months ago
0
Cache en
#26
tengomucho
closed
5 months ago
2
Bump version to 0.1.0.dev2
#25
mfuntowicz
closed
5 months ago
1
Bump version to 0.1.0.dev1
#24
mfuntowicz
closed
5 months ago
1
Weights upcasted to `float32` at load time
#23
mfuntowicz
closed
5 months ago
2
Fix tgi cli invalid import
#22
mfuntowicz
closed
5 months ago
2
Parallel sharding
#21
tengomucho
closed
5 months ago
1
Added some links to Cloud TPU documentation
#20
mikegre-google
closed
5 months ago
1
Fix typo in index.mdx
#19
mfuntowicz
closed
5 months ago
1
Fix rule and instructions for TGI
#18
mfuntowicz
closed
5 months ago
1
Fix layout in README
#17
mfuntowicz
closed
5 months ago
0
Improve readme
#16
mfuntowicz
closed
5 months ago
0
Fix main doc build workflow
#15
regisss
closed
5 months ago
1
Adopt naming convention of transformers API
#14
mfuntowicz
closed
5 months ago
0
Add documentation to the repository
#13
mfuntowicz
closed
5 months ago
0
Xla parallel proxy
#12
tengomucho
closed
5 months ago
0
Add PyPI release workflow
#11
regisss
closed
6 months ago
3
Repo layout
#10
tengomucho
closed
6 months ago
0
feat: use dynamic batching when generating
#9
tengomucho
closed
6 months ago
1
Revert "fix: attention mask should be 1 or 0"
#8
tengomucho
closed
6 months ago
0
Testpbmci
#7
paulinebm
closed
5 months ago
0
Enable compilation
#6
tengomucho
closed
6 months ago
0
Small optimizations
#5
tengomucho
closed
6 months ago
0
Add static KV cache and test on Gemma-2B
#4
tengomucho
closed
6 months ago
0
Fix TGI Dockerfile
#3
shub-kris
closed
6 months ago
1
Enable CI/CD
#2
tengomucho
closed
7 months ago
0
Basic TGI server on XLA
#1
tengomucho
closed
7 months ago
0
Previous