-
The ability to create a discussion between different LLMs would be a cool addition. I'm not sure about the use case, but I tried it and it came out well. Here is the [gist](https://gist.github.com/Siddhesh-Agar…
-
Hi! Is there a specific reason that we train the reward model on absolute scores rather than on pairwise human preferences over the same prompts, as most other RLHF work does?
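For context, the pairwise alternative referred to here is usually a Bradley-Terry style objective: the reward model is trained so the preferred response scores higher than the rejected one. A minimal sketch (the `pairwise_reward_loss` helper and the toy scores are illustrative, not from any particular codebase):

```python
import math

def pairwise_reward_loss(r_chosen: float, r_rejected: float) -> float:
    """Bradley-Terry negative log-likelihood for one preference pair:
    -log sigmoid(r_chosen - r_rejected). Minimizing this pushes the
    reward of the human-preferred response above the rejected one."""
    margin = r_chosen - r_rejected
    return -math.log(1.0 / (1.0 + math.exp(-margin)))

# Toy reward scores: when the model already ranks the chosen response
# higher, the loss is small; a reversed ranking is penalized more.
loss_correct_ranking = pairwise_reward_loss(2.0, 0.5)
loss_wrong_ranking = pairwise_reward_loss(0.5, 2.0)
```

Training on absolute scores instead would regress each response's score directly, which requires raters to use a consistent absolute scale rather than just ordering two responses.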
-
### Client Version:
515.1642
### Issue Summary:
Eye surgery never actually gives anyone their vision back. Replacing their eyes does not help, nor does Oculine. I had to delete and re-crea…
-
Preferences, beliefs, and values aren't as neatly separable in deliberative politics as they are in game theory (careful here: "values" is the wrong term to begin with).
Caplan scorns some of this confusion as _p…
-
Hi,
Thank you for your interesting work "Improving Generalization of Alignment with Human Preferences through Group Invariant Learning (ICLR 2024 Spotlight)"! I want to ask where I can find the Gi…
-
There was recently a discussion on Discord whereby someone was unable to change the font size on specific syntaxes. It initially seemed related to https://github.com/SublimeTextIssues/Core/issues/1551…
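The kind of per-syntax override being discussed is a syntax-specific settings file (e.g. a hypothetical `Python.sublime-settings` created via Preferences → Settings – Syntax Specific), with the font size set there:

```json
{
    // Applies only to files using this syntax; "font_size" is the
    // setting users reported being unable to change per-syntax.
    "font_size": 14
}
```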
-
# URL
- https://arxiv.org/abs/2305.18290
# Affiliations
- Rafael Rafailov, N/A
- Archit Sharma, N/A
- Eric Mitchell, N/A
- Stefano Ermon, N/A
- Christopher D. Manning, N/A
- Chelsea Finn, …
-
Hi @mrahtz, thanks for making this repo! I thought it might be useful to you or others to pass along some extra stuff I had to do to get this running on a fresh Ubuntu 18.04 install. Feel free to dele…
-
Hi @mrahtz, thanks for making this repo! I think this algorithm is a milestone in the development of deep reinforcement learning.
We installed all components according to the Pipfile and Pipfile.lock fi…
-
searx goes back to default preferences whenever I try to navigate to another mode (images/videos/it/etc.), go to the next page, or alter the search query... and it's always in gibberish; there's no way to m…