-
根据文档中的命令,完成了40B的lora微调训练,微调训练命令如下:
`GPUS=8 PER_DEVICE_BATCH_SIZE=2 sh shell/internvl2.0/2nd_finetune/internvl2_40b_hermes2_yi_34b_dynamic_res_2nd_finetune_lora.sh`
针对微调后的模型,模型可以成功加载,加载命令如下:
`mode…
-
Hi,
To version each project separately in nx, I'm using Independent Mode. However, I'd like to generate a Changelog for all of the projects in the root, rather than on a project-by-project basis.
I…
-
Hello there, I was examining the RAG guide:
https://github.com/anthropics/anthropic-cookbook/blob/main/skills/retrieval_augmented_generation/guide.ipynb
Especially I was trying to understand the…
-
Hello,
I am still playing around with the awesome code you produced and I really like the way you implemented GG20 in the other dev branch using `round-based` etc. While working with the code there…
-
Hello @patil-suraj,
I get the following assertion error:
Code:
```
gen = pipeline("question-generation", model='valhalla/t5-base-qg-hl', ans_model='valhalla/t5-base-qa-qg-hl')
qa = gen('Dire…
-
Because of C# compiler still does not allow to generate expressions from statement lambdas (CS0834), is it feasible to have some workaround in the Salterelle Compiler? For instance my first idea:
An …
-
https://aclanthology.org/2023.tacl-1.1/
-
I am attempting to generate new TagStandard41h* and TagStandard52h* families and I am seeing extremely long generation time estimates. I have attempted to generate some of the larger Standard families…
-
Hello, I have some questions about point cloud coloring in your code. When coloring point clouds, I noticed that you still chose to add noise to the point cloud coordinates and directly predict colors…
-
I'm trying to implement various methods from the [patchscopes](https://arxiv.org/abs/2401.06102) paper, and some of them utilize token generation to eg. explain the meaning of a patched representation…