issues
search
google
/
paxml
Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry leading model flop utilization rates.
Apache License 2.0
458
stars
69
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Convert string dtype to jnp dtype during evaluation
#54
ashors1
closed
1 year ago
0
Redirect to NVIDIA Rosetta from GPU section of README
#53
ashors1
closed
1 year ago
0
[NVIDIA] Update 5B config and scripts
#52
ashors1
closed
1 year ago
0
Bump urllib3 from 2.0.6 to 2.0.7 in /paxml/pip_package
#51
dependabot[bot]
closed
1 year ago
1
Bump urllib3 from 1.26.16 to 1.26.17 in /paxml/pip_package
#50
dependabot[bot]
closed
1 year ago
1
[NVIDIA] Support new config option `USE_FP8`
#49
kaixih
closed
1 year ago
1
[NVIDIA] New collection for variables 'overwrite_with_gradient'
#48
kaixih
closed
1 year ago
0
[NVIDIA] Add 27B MoE config
#47
ashors1
closed
1 year ago
0
Add Transformer Engine support to Paxml
#46
ashors1
opened
1 year ago
0
Update GPU Scripts and Configs
#45
ashors1
closed
1 year ago
0
Support FP8 params updating for NVIDIA Hopper GPUs
#44
kaixih
closed
1 year ago
3
Use CPU-only version of TensorFlow
#43
andportnoy
closed
1 year ago
1
Bump tornado from 6.3.2 to 6.3.3 in /paxml/pip_package
#42
dependabot[bot]
closed
1 year ago
1
[NVIDIA] Update scripts to improve configurability
#41
ashors1
closed
1 year ago
0
support for fractional per core batch size
#40
abhinavgoel95
opened
1 year ago
1
Update README.md: fix missing -r argument to pip
#39
joker-eph
opened
1 year ago
1
fix broken link in readme
#38
sadikneipp
closed
1 year ago
0
How to continue training from a checkpoint?
#37
lkm2835
opened
1 year ago
0
[NVIDIA] Fix Default Container in Example Submit File
#36
ashors1
closed
1 year ago
0
Int8 checkpoint
#35
wx-x
opened
1 year ago
0
[NVIDIA] Update GPU Documentation, Configs, and Paths
#34
ashors1
closed
1 year ago
1
Add multislice configs and documentation
#33
michelle-yooh
closed
1 year ago
1
Bump transformers from 4.29.2 to 4.30.0 in /paxml/pip_package
#32
dependabot[bot]
opened
1 year ago
0
Bump transformers from 4.27.4 to 4.30.0 in /paxml/contrib/gpu/scripts_gpu
#31
dependabot[bot]
opened
1 year ago
0
ARM64 Build
#30
joker-eph
opened
1 year ago
4
Bump tornado from 6.2 to 6.3.2 in /paxml/contrib/gpu/scripts_gpu
#29
dependabot[bot]
opened
1 year ago
0
Bump requests from 2.30.0 to 2.31.0 in /paxml/pip_package
#28
dependabot[bot]
closed
1 year ago
1
Bump requests from 2.28.2 to 2.31.0 in /paxml/contrib/gpu/scripts_gpu
#27
dependabot[bot]
opened
1 year ago
0
Anisha lg pax
#26
A9isha
opened
1 year ago
0
Installing paxml from source failed due to dependency problem
#25
yhtang
closed
1 year ago
5
Pin JAX version in Dockerfile
#24
ashors1
closed
1 year ago
0
Correct referenced commit in gpu README
#23
ashors1
closed
1 year ago
0
Yooh multislice
#22
michelle-yooh
closed
1 year ago
0
Enable Latency Hiding Scheduler in Dockerfile
#21
ashors1
closed
1 year ago
0
Bump tensorflow from 2.9.3 to 2.11.1 in /paxml/contrib/gpu/scripts_gpu
#20
dependabot[bot]
closed
4 months ago
1
Update nvidia.py to add more configs
#19
abhinavgoel95
closed
1 year ago
2
Unexpected Overheads with Activation Checkpointing with Pipeline Parallelism
#17
abhinavgoel95
opened
1 year ago
1
Update GPU Dockerfile and Configs
#16
ashors1
closed
1 year ago
0
Bump tensorflow from 2.9.3 to 2.11.1 in /paxml/pip_package
#15
dependabot[bot]
closed
7 months ago
1
Make num_train_steps configurable in gpu configs
#14
ashors1
closed
1 year ago
0
Update Dockerfile and Build Path
#13
ashors1
closed
1 year ago
0
Update Logging for Gradient Accumulation
#12
ashors1
closed
1 year ago
0
Error running Common Crawl example
#11
RobertLiJN
closed
1 year ago
2
Add GPU scripts and dependencies
#10
ashors1
closed
1 year ago
3
Perform gradient clipping on global batch when using gradient accumulation
#9
ashors1
opened
1 year ago
3
Update Metrics and Flags for GPU
#8
ashors1
closed
1 year ago
1
Convert string to jnp dtype
#7
ashors1
closed
1 year ago
0
update docstring
#6
ashors1
closed
1 year ago
2
Pipeline Parallelism: USE_REPEATED_LAYERS bug
#5
abhinavgoel95
closed
1 year ago
3
Pipeline Parallelism: F external/org_tensorflow/tensorflow/compiler/xla/array.h:446] Check failed: n < sizes_size Fatal Python error: Aborted
#4
abhinavgoel95
closed
1 year ago
3
Previous
Next