human-preferences Search Results

HumanCompatibleAI/imitation #696

Support human preferences in “Deep RL from human preferences…

Our team [KABasalt](https://github.com/BASALT-2022-Karlsruhe) participated in last year's BASALT competition and we noticed that RLHP currently lacks support for human preferences. ## Problem: On…

mschweizer updated 1 year ago

LAION-AI/Open-Assistant #1888

Stanford Human Preferences Dataset (SHP)

https://huggingface.co/datasets/stanfordnlp/SHP This may best be one of the best first datasets in the training of a model

bennmann updated 1 year ago

tgstation/tgstation #86452

Selecting ears is gone for felinids

### Client Version: 515.1642 ### Issue Summary: You can no longer select your ear type in the character preferences for felinids. The option is gone. ![image](https://github.com/user-attac…

Bm0n updated 2 days ago

silbinarywolf/sw-zombie-fortress #6

add team preferences setting (prefer human or zombie)

Allow player to say what they'd prefer to play as (Zombie / Survivor) and let that weighing occur

silbinarywolf updated 1 year ago

PKU-Alignment/safe-sora #4

[Feature Request] Add traditional methods for comparison

### Required prerequisites - [X] I have searched the [Issue Tracker](https://github.com/PKU-Alignment/safe-sora/issues) and [Discussions](https://github.com/PKU-Alignment/safe-sora/discussions) tha…

calico-1226 updated 2 weeks ago

explodinggradients/ragas #1188

Integrating third-party LLMs for Evaluating Chinese-native R…

Hi there, Thank you for bringing the elegant RAG Assessment framework to the community. I am an AI engineer from Alibaba Cloud, and our team has been fine-tuning LLM-as-a-Judge models based on t…

hurenjun updated 13 hours ago

justlikeazoo/Paper-Review #3

[Paper Review] Fine-Tuning Language Models from Human Prefer…

[[1] Fine-Tuning Langauage Models from Human Preferences.pdf](https://github.com/justlikeazoo/Paper-Review/files/12842664/1.Fine-Tuning.Langauage.Models.from.Human.Preferences.pdf)

justlikeazoo updated 11 months ago

mukulkothari/open_music #5

Challenges in New Music Exploration

- Same Old Songs: • Tired of hearing the same popular tracks? So are we. - Off-Target Suggestions: • Ever feel like the recommendations just don’t get you? - Genre Tunnel Vision: • Discovering ne…

abnvjain updated 1 month ago

openai/lm-human-preferences #22

What is the full link for gs://lm-human-preferences/

Thank you very much @WuTheFWasThat

guotong1988 updated 1 year ago

ruizheng20/robust_data #1

Code of your ICLR paper "Improving Generalization of Alignm…

Hi, Thank you for your work. I want to ask where I can find the code of "Improving Generalization of Alignment with Human Preferences through Group Invariant Learning"

AGTSAAA updated 3 months ago

1000+ results for human-preferences

1000+ results
for human-preferences