mdingemanse opened 9 months ago
Now has an answer:
The instruction-tuned model was trained on instructions from the P3 and Natural Instructions datasets, which were decontaminated against HELM (you can find more details about the decontamination strategy in this blog post).
You can find the resulting decontaminated dataset used to train the instruction-tuned model here: https://huggingface.co/datasets/togethercomputer/RedPajama-Data-Instruct -- the metadata contains a `source` field pointing to the task/dataset each instance comes from.
Erasmian Language Model (ELM): https://github.com/Joaoffg/ELM/issues/1 (and see #65)
Nanbeige Chat: https://huggingface.co/Nanbeige/Nanbeige2-8B-Chat/discussions/2#6621e15a4d17641cf788cbd5
Update: the request for training data was closed without comment. https://huggingface.co/Nanbeige/Nanbeige2-8B-Chat/discussions/1
Update: the request for post-training data was answered and closed.
This issue is just a place to keep track of requests filed at other GitHub projects or Hugging Face repositories asking for documentation.