-
# URL
- https://arxiv.org/abs/2301.13848
# Affiliations
- Tianyi Zhang, N/A
- Faisal Ladhak, N/A
- Esin Durmus, N/A
- Percy Liang, N/A
- Kathleen McKeown, N/A
- Tatsunori B. Hashimoto, N/A…
-
### Issues Policy acknowledgement
- [X] I have read and agree to submit bug reports in accordance with the [issues policy](https://www.github.com/mlflow/mlflow/blob/master/ISSUE_POLICY.md)
### Where…
-
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling
https://arxiv.org/abs/2312.15166
Fast Inference of Mixture-of-Experts Language Models with Offloading
https://pa…
-
Hi,
I was reading the article [Inflection-2.5: meet the world's best personal AI](https://inflection.ai/inflection-2-5), and in the article it was mentioned that `nearly 25%—of examples in the reason…
-
- [ ] [“Emergent” abilities in LLMs actually develop gradually and predictably – study | Hacker News](https://news.ycombinator.com/item?id=39811155)
# "Emergent" abilities in LLMs actually develop gr…
-
# URL
- https://arxiv.org/abs/2401.10020
# Affiliations
- Weizhe Yuan, N/A
- Richard Yuanzhe Pang, N/A
- Kyunghyun Cho, N/A
- Sainbayar Sukhbaatar, N/A
- Jing Xu, N/A
- Jason Weston, N/A
#…
-
Great library, a light library for all the main evals was really needed!💯
I just came across this [line](https://github.com/huggingface/lighteval/blob/af24080ea4f16eaf1683e353042a2dfc9099f038/src/…
-
Dear GPTCache Team,
we are a security research group. We've used GPTCache for a while and impressed by its design and speed, but as we studied further, more concerns about the security of GPTCache ha…
-
### Title
Best Practices and Lessons Learned on Synthetic Data for Language Models
### Link
[Best Practices and Lessons Learned on Synthetic Data for Language Models.pdf](https://github.com/X-lab…
-
I am starting this thread for feature spec discussion for umBRELA @lintool @ronakice.
Suggestions from my side:
- parameter for specifying the number of samples for inference and later performin…