-
第四课的课后作业怎么打开?是打开saved_resource.html文件吗?我用浏览器打开这个文件没有内容。(windows7、qq浏览器)
-
**Describe the bug**
fp8 e4m3 wgrad seems to be extremely slow compared to both FP32 and FP16, often 50x to 100x slower.
I have attached the profiling results in [this Google spreadsheet](https://doc…
-
Hi, first of all, I really appreciated your impressive work.
I just followed your [command](https://github.com/balancap/SSD-Tensorflow#fine-tuning-a-network-trained-on-imagenet) which guide how to …
-
In create_hf_model, what's the purpose of resizing the model embedding?
model.config.end_token_id = tokenizer.eos_token_id
--
44 | model.config.pad_token_id = model.config.eos_token_id
…
-
-
Hi recently i made a introduction course about llama2/3:https://learn.deeplearning.ai/courses/prompt-engineering-with-llama-2/lesson/1/introduction
The teacher was one of the CEO of Llama3: https:…
-
Hi,
I wrote detailes as follows:
Many thanks
**H2O version, Operating System and Environment**
R is connected to the H2O cluster:
H2O cluster uptime: 3 seconds 363 millisecon…
-
### bug描述 Describe the Bug
多机训练时,报 NCCL error(6) 错误,但同样环境下pytorch能正常多机训练。
运行环境:容器,2台机器,内网ip分别为223.0.15.19,223.0.15.22
paddle版本:2.4.2
多机代码demo.py如下:
```python
import os
import paddle
impo…
-
I modified the epoch to 174 ,then
amax@amax:/data/yh/DFANet$ python main.py
./log/dfanet20190822T2152
Re-starting from epoch 174
load weights from ./log/dfanet20190822T2152/model_dfanet_0173.pt …
-
Segy management unit test need to work with publicly available tiny segy file