-
- [x] I have marked all applicable categories:
+ [ ] exception-raising bug
+ [ ] RL algorithm bug
+ [ ] documentation request (i.e. "X is missing from the documentation.")
+ [x] ne…
-
I ran a test using the results of the fine-tuned schema linker together with your open-source SQL generation model.
The schema linker fine-tuning script is as follows:
deepspeed --master_port 29567 train_schema_item_filter.py \
--batch_size 4 \
--gradient_descent_step 8 \
…
-
1. Your model was trained on a 6000-image dataset. How did you crop each image?
2. What were your training and validation accuracies after you finished training the model?
-
### Base Classes
- [x] [nn::Module](https://github.com/arrayfire/arrayfire-ml/pull/30)
- [x] [autograd::Variable](https://github.com/arrayfire/arrayfire-ml/pull/30)
- [x] [Solver](https://github.co…
-
I'm trying to use webdataset for a distributed PyTorch XLA POC. I tried implementing the `ResizedDataset` class but started receiving many errors like the following after ~40 training steps. Any ideas h…
-
I ran "ff_as_attention_cifar10_10samples.yaml" using **run.py**. I want to generate **Table 1** from your paper by running **/paper/ff_as_attention/print_predictive_power.py**, but I have s…
-
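Hi! I ran into this error on a rented cloud server (PyTorch 1.7, CUDA 11.0). Searching Baidu for it suggests that the two arguments passed to BCELoss have different types. The strange part is that the error only appears around the 25th batch, so the code runs fine at first, and the same code runs on my local machine (PyTorch 1.6, CUDA 10.2) without any errors. Since the cloud server's system image cannot be changed to match my local machine's…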
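
If the cause really is a dtype mismatch between the two BCELoss arguments, as the search results suggest, one possible workaround is to cast the target to the prediction's dtype and device before computing the loss. This is only a sketch under that assumption: the helper name `bce_loss_checked` and the sample tensors are hypothetical and do not come from the original code.

```python
import torch
import torch.nn as nn

criterion = nn.BCELoss()

def bce_loss_checked(pred: torch.Tensor, target: torch.Tensor) -> torch.Tensor:
    """Hypothetical helper: align the target's dtype/device with the prediction."""
    # BCELoss expects floating-point targets; integer labels (e.g. int64)
    # are converted here instead of failing inside the loss.
    target = target.to(dtype=pred.dtype, device=pred.device)
    return criterion(pred, target)

# Illustrative data: probabilities in (0, 1) and integer class labels.
pred = torch.sigmoid(torch.randn(4, 1))
target = torch.tensor([[1], [0], [1], [0]])  # dtype is int64 by default
loss = bce_loss_checked(pred, target)
```

Printing `pred.dtype` and `target.dtype` just before the loss call in the failing batch may also help confirm whether a particular batch produces a different type than the earlier ones.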
-
**Describe the bug**
A UserWarning is raised, indicating that a given NumPy array is not writable, and PyTorch does not support non-writable tensors. Additionally, a RuntimeError is encountered stati…
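
For the non-writable array warning specifically, a common workaround is to pass PyTorch a writable copy of the array. The sketch below uses NumPy only and assumes the array is read-only because it wraps an immutable buffer; the helper name `as_writable` is hypothetical, and it does not address the RuntimeError mentioned above.

```python
import numpy as np

def as_writable(array: np.ndarray) -> np.ndarray:
    """Hypothetical helper: return the array if writable, else a writable copy."""
    if array.flags.writeable:
        return array
    # ndarray.copy() allocates new memory that the copy owns, so it is writable.
    return array.copy()

# np.frombuffer over an immutable bytes object yields a read-only array.
read_only = np.frombuffer(b"\x01\x02\x03\x04", dtype=np.uint8)
writable = as_writable(read_only)
```

The returned array can then be handed to `torch.from_numpy` without triggering the writability warning, at the cost of one extra copy.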
-
- [ ] [Answer.AI - You can now train a 70b language model at home](https://www.answer.ai/posts/2024-03-06-fsdp-qlora.html)
# Answer.AI - You can now train a 70b language model at home
**DESCRIPTION:…
-
**System information**
_Some irrelevant fields have been deleted_
- Have I written custom code (as opposed to using a stock example script provided in TensorFlow): Yes
- OS Platform and Distributio…