-
Original Repository: https://github.com/ml-explore/mlx-examples/
Listing out examples from there which would be nice to have. We don't expect the models to work out the moment they are translated to …
-
# URL
- https://arxiv.org/abs/2411.05504
# Authors
- Haoran Lian
- Yizhe Xiong
- Zijia Lin
- Jianwei Niu
- Shasha Mo
- Hui Chen
- Peng Liu
- Guiguang Ding
# Abstract
- The prevalent …
-
## 論文リンク
https://arxiv.org/abs/2005.14165
## 公開日(yyyy/mm/dd)
2020/05/28
## 概要
GPT-3 の論文。
GPT-2 よりも2桁大きい 1750 億というパラメタ数の言語モデルである GPT-3 を作成し、その性能を非常に広い範囲で検証している。
事前学習した大規模言語モデルに対して、モデルの重みを変えな…
-
How can we integrate the use of language models to evaluate language model generations?
Currently, lm-eval evaluates language model generations with conventional metrics such as accuracy, bleu, etc…
-
**Describe the feature**
class Template里面可以做padding,但是Qwen2VLTemplateMixin, InternLMXComposer2Template里面只有im_mask,没有,input_ids的attention_mask,(有PADDING的情形)
能不能把padding attention_mask都放回去呀。
http…
-
This is a feature request to deploy Small Language Models (SLM) (3b or 1b). SLMs are improving quickly and are becoming good choice for narrowed scope usecases.
Examples can be TinyLlama, Minichat…
-
Along the development of small language models, compressed language models play crucial roles as well.
Typical representatives (in time order) would be:
1) sheared-llama (https://arxiv.org/abs/231…
-
### Feature request
Any LLM which could work with russian language.
For example, it may be possible to import this https://github.com/yandex/YaLM-100B
### Motivation
Russian text processing is …
-
Hi,
I would like to activate the context menu to paste into the textarea and sort language models into sublevels (especially English is difficult to consult).
This would make it leaner.
Thanks! :-)…
-
llama-stack install from source:https://github.com/meta-llama/llama-stack/tree/cherrypick-working
### System Info
python -m "torch.utils.collect_env"
/home/kaiwu/miniconda3/envs/llama/lib/pytho…