-
## Summary
Need to create a strong draft of this module so that we can revise and eventually offer this section of the course.
## Sub-Tasks
- [x] Draft module skeleton
- [x] Draft learning o…
-
You may have som bug on type manipulation and thus the model can not be finetuned via DeepSpeed(bf16 mix precision)
File "/deepseek_v2/modeling_deepseek.py", line 1252, in forward
hidden_state…
-
### Community Note
> Please vote by adding a 👍 reaction to the issue to help us prioritize.
> If you are interested to work on this issue, please leave a comment.
### Feature Spec
Wing SDK now h…
-
Hi @andreatramacere ,
I am working on broad-band Blazar SED modeling using `JetSeT`, here we are using the one-zone leptonic model.
But, in some cases, I have to use a two-zone leptonic scenario…
-
**Describe the bug**
I've installed Nemo using below commands in my aws sagemaker
**!sudo yum update -y && yum install -y libsndfile1 ffmpeg
!pip install Cython packaging
BRANCH='main'
!pytho…
-
Larry working with Steven and Alicia on the Sequence Variant Classification v4 guideline modeling.
The goal of this ticket is to be able to represent as much (or most) of the ACMG v4 evidence codes a…
-
I am trying to pretrain a MPT model using [llm-foundry](https://github.com/mosaicml/llm-foundry) using AliBi with flash attention. During pre training, I see the below warning -
```
WARNING: compos…
-
Thanks for your work again!
In the paper the topic modeling of OBELICS is implemented using LDA, and I am wondering what is the specific LDA model was used, what setting was used to train the model, …
-
### Describe the Question
Please provide a clear and concise description of what the question is.
大佬,请问您新增的reward_modeling.py这一脚本是不是也可以用来训练评分器!数据集的形式就和data/reward一样把
-
### Description
I was using the Zoo Modeling App and I believe I was deleting items from the code pane, and then got hit with this page
### Version
v0.22.3