-
### Describe the feature
Training large model needs too much memory, for example:
```
$ accelerate estimate-memory meta-llama/Llama-2-70b-chat-hf
├───────┬─────────────┬──────────┬───────────────…
-
CPM 使用微调脚本训练, 不开启 --use-delta 这一选项,则出现如下错误:
Traceback (most recent call last):
File "finetune_cpm_bee.py", line 503, in
main()
File "finetune_cpm_bee.py", line 499, in main
finetune(…
-
Using model: Tacotron
Using device: cuda
Initialising Tacotron Model...
\Loading the json with %s
{'sample_rate': 16000, 'n_fft': 800, 'num_mels': 80, 'hop_size': 200, 'win_size': 800, 'fmin'…
-
Hello,
I'm running `bitsandbytes==0.41.1` in a Python 3.10 miniconda environment, 8xA100 GPU (using `accelerate` for multi-GPU), Cuda 12.2.
I'm having problems resuming training (DPO) from a ch…
-
- PyTorch-Forecasting version: `0.10.3`
- PyTorch version: `1.13.0+cu116`
- PyTorch-Lightning version: `1.8.6`
- Python version: `3.8.16`
- Operating System: `Linux 2301df3359e4 5.10.147+ #1 SMP S…
-
We can add it to CGH tutorial. 2D is simple but for 3D there are different approaches to implement it. We can stick to 2D for now
-
1. Hello, may I ask, are all the settings mentioned in the paper? In preprocessing step, we adopt Dlib [14] to carry out face
and landmark detection (another detector Openface[4] is adopt in the abla…
-
TensorFlow is a popular open-source ML framework/ecosystem from Google.
Unfortunately your parser doesn't work well on its docstring. Here's a link to the docstring:
https://github.com/tensorflow/…
-
This code crashes when run with the "Xcode 11 (April 15, 2020)" toolchain, but does not crash when run with a nightly toolchain on Linux.
```swift
import Foundation
import TensorFlow
typealias…
-
31 Dynamo test are failing with "'NoneType' object has no attribute 'profiler'".
## Repro
`PYTORCH_TEST_WITH_DYNAMO=1 pytest profiler/test_profiler.py -v -k test_profiler_correlation_id`
You …