-
Hi friends! 👋
There are a lot of cool existing resources for how to do *x* with *x* model, and we’d like to showcase and aggregate these resources on a model’s documentation. This’ll help users see…
-
在运行完bash run_finetene_ds.bash后,在output文件夹下没有找到readme中提到的‘test_preds.json’文件。
我想要运行convert.py文件进行格式转换,但是这个目录下并没有test_pred.json文件,请问这个该如何解决呢?
-
If you haven’t already, check out our [contributing guidelines](https://github.com/Expensify/ReactNativeChat/blob/main/contributingGuides/CONTRIBUTING.md) for onboarding and email contributors@expensi…
-
```
import subprocess
CUDA_version = [s for s in subprocess.check_output(["nvcc", "--version"]).decode("UTF-8").split(", ") if s.startswith("release")][0].split(" ")[-1]
print("CUDA version:", …
-
I was given a week to reproduce Google's gT5 with the threat to terminate my PhD study right if I fail, meaning train mT5 with mC4 plus fine-tuning. Isn't that an unreasonable request? The model and d…
-
## Issue description
Some LLMs are currently part of the `popular_on_huggingface` corpus of MLAgility. Those models are significantly large, causing our benchmarking infrastructure to take signific…
-
Error after using the test commands:
![image](https://user-images.githubusercontent.com/132207807/235434898-a24019c7-289e-4d0a-a54f-2574d60171a9.png)
It does pull account information, so only the …
-
cond_output = self.model.generate(
input_ids=batch['input_ids'],
attention_mask=batch['attention_mask'],
force_words_ids=batch['force_words_ids'],
n…
-
运行脚本:
```
#!/bin/bash
# Please run this script under ${project_id} in project directory of
deepspeed_args="--master_port=11000" # Default argument
if [ $# -ge 1 ]; then
deepspeed_args="…
-
Hello, I would like to run the code on a custom chinese dataset. Can you share the format of the processed chinese dataset? So that I can change the format of my dataset to a format that the program c…