human-judgment Search Results

1000+ results for human-judgment

laws-africa/peachjam #2077

As a user of a collection, I would like to see the tags a ju…

For example, here: https://africanlii.org/indexes/case-indexes/case-indexes-human-rights?jurisdictions=African%20Regional%20Bodies&q=&sort=-date under each judgment, I would like to see the terms the…

NotoriousMBB updated 2 months ago
1
Migracode-Barcelona/Module-JS2 #17

[TECH ED] Apply Magic Sauce

### Link to the coursework https://applymagicsauce.com/demo ### Why are we doing this? Companies are very interested in the data provided by software like Apply Magic Sauce. Automated languag…

SallyMcGrath updated 3 days ago
2
princeton-nlp/SimPO #74

Question about tuning set

Did you use a special validation set for ultrafeedback when tuning the hyperparameters in Table 7, or just the `test_pref` set from the [original binarized ultrafeedback data](https://huggingface.co/…

yakazimir updated 1 week ago
1
irthomasthomas/undecidability #901

[2303.16634] G-Eval: NLG Evaluation using GPT-4 with Better …

- [ ] [[2303.16634] G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment](https://arxiv.org/abs/2303.16634) # [2303.16634] G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment …

ShellLM updated 3 months ago
1
tm4roon/survey #43

What makes a good conversation? How controllable attributes …

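# What makes a good conversation? How controllable attributes affect human judgments A study of the question "What makes a good dialogue system?" that examines the relationship between model-side output control and human evaluation. Output control was applied to four attributes: response repetition, response specificity, relatedness between utterance and response, and the balance of questions and responses. …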

tm4roon updated 5 years ago
1
AkihikoWatanabe/paper_notes #1464

Self-Taught Evaluators, Tianlu Wang+, N/A, arXiv'24

# URL - https://arxiv.org/pdf/2408.02666 # Affiliations - Tianlu Wang, N/A - Ilia Kulikov, N/A - Olga Golovneva, N/A - Ping Yu, N/A - Weizhe Yuan, N/A - Jane Dwivedi-Yu, N/A - Richard Yu…

AkihikoWatanabe updated 3 weeks ago
1
GAIR-NLP/scaleeval #1

evaluation dataset

Hi, Thanks for the super cool work. Do you plan to release the evaluation dataset? (with human judgments)

alirezamshi updated 6 months ago
1
zhangmozhi/mrs #1

Human judgements/generations

@zhangmozhi In your ACL paper you've discussed human generations and human judgments. Wondering if any of these are available online? FYI @Palipoor

danyaljj updated 3 years ago
1
acl-org/acl-anthology #3581

Author Metadata: {Fei Liu}

### Author Pages https://aclanthology.org/people/f/fei-liu/ https://aclanthology.org/people/f/fei-liu-utdallas/ ### Type of Author Metadata Correction - [X] The author page wrongly conflates diffe…

feiliu00 updated 3 months ago
1
argilla-io/distilabel #117

[FEATURE] Benchmark existing preference tasks (UltraFeedback…

The idea would be to build and run a benchmark with at least the following datasets: [HHH Alignment](https://huggingface.co/datasets/HuggingFaceH4/hhh_alignment) & [MT Bench Human Judgment](https://hu…

dvsrepo updated 3 months ago
4
