-
Hello,
I am trying to use your algorithm. Could you expand on the features:
CELL_LINE_NAME DRUG_ID LN_IC50 d0 d1 d2 d3 d4 d5 d6 d7 d8 d9 d10 d11 d12 d13 d14 d15 d16 d17 d18 d19 d20 d21 d22 d…
-
### Context
"A user story is an informal, general explanation of a software feature written from the perspective of the end user or customer.
The purpose of a user story is to articulate how a piece…
-
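Hello, while reproducing the Few-shot SuperGLUE experiments (i.e. the `FewGLUE_32dev` data), I found that my results on the CB, WSC, and COPA datasets differ somewhat from those reported in the paper. All models in my reproduction are based on the single pretrained model `albert-xxlarge-v2`, consistent with the paper's setup, and the experiment seed (seed=42) is unmodified: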
![image](https://user-images.githubusercontent.com/26740837/…
-
```
trainer = ORPOTrainer(
model=model,
train_dataset=dataset["train"],
eval_dataset=dataset["test"],
#peft_config=peft_config,
tokenizer=tokeni…
-
### Feature request
How can we take advantage of https://osu-nlp-group.github.io/Mind2Web/ (dataset at https://huggingface.co/datasets/osunlp/Mind2Web)?
### Motivation
_No response_
-
## Project Roadmap: Domain-Specific Knowledge Mesh
**1. Project Goals:**
* **Unified Data Management:** Create a system that ingests and manages data from various sources, including files, data…
-
These models have just been released and appear to be amazing. Links below:
Blog from fal.ai: https://blog.fal.ai/flux-the-largest-open-sourced-text2img-model-now-available-on-fal/
Huggingface:…
-
Hi @ArmelRandy and @loubnabnl
I am fine-tuning StarCoder on my custom dataset and monitoring the training and validation loss.
The training loss seems to decrease; however, in case of eval los…
-
I noticed that the provided training scripts use Qwen2-7B-Instruct for pretraining and Qwen2-7B-Instruct-224K for fine-tuning, which doesn't seem to match the paper's description: "We trained our mode…