pre-trained-language-models Search Results

1000+ results
for pre-trained-language-models

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

apple/ml-sigmoid-attention #3

When will you release pre-trained language models based on s…

Thank you for your impressive work, congratulations! I am wondering when you will release pre-trained language models based on sigmoid attention, at least a small demo models? I am looking forward to …

guxm2021 updated 1 week ago
1
unslothai/unsloth #1018

Add support for Qwen2Audio

[Qwen2Audio huggingface docs](https://huggingface.co/docs/transformers/main/en/model_doc/qwen2_audio) I see there's been a couple requests for vision-language model support like LLaVa: https:…

jonflynng updated 3 weeks ago
2
myshell-ai/DreamVoice #3

Other languages?

Hi, does it work with German? Thanks!

philpav updated 2 weeks ago
1
e4exp/paper_manager_abstract #502

Knowledge Inheritance for Pre-trained Language Models

- https://arxiv.org/abs/2105.13880 - 2021 近年、GPT-3に代表される大規模な事前学習済み言語モデル（PLM）の探索により、膨大な量のパラメータを持つPLMの威力が明らかになり、ますます大規模なPLMを学習する波が起こっています。しかし、大規模なPLMの学習には膨大な計算資源が必要であり、時間とコストがかかります。また、既存の大規模PLMは…

e4exp updated 3 years ago
2
Kushal997-das/Project-Guidance #1382

Title: Add Real-Time Translation Model to the Machine Learni…

### Initiative (Required) GSSoC 2024 Extd 🚀 ### Is your feature request related to a problem? Please describe. Hi, I would like to contribute a Real-Time Translation Model to the Advanced section u…

770navyasharma updated 1 day ago
1
AdvikMehta/PCLbot #7

Reading - Pre-Trained Large Language Models for Industrial C…

Donnyye updated 8 months ago
1
huggingface/candle #2525

[QUESTION] Protocol of adding a new model (Stella_en_<*>_v5 …

Hi, I have a working implementation of [Stella_en__v5](https://huggingface.co/dunzhang/stella_en_1.5B_v5) family of models which is one of the top ranking model in the MTEB leaderboard for rerankin…

AnubhabB updated 6 days ago
2
e4exp/paper_manager_abstract #446

Rethinking embedding coupling in pre-trained language models

- https://arxiv.org/abs/2010.12821 - 2020 本稿では、最新の学習済み言語モデルにおいて、入力埋め込みと出力埋め込みの間で重みを共有するという標準的な手法を再評価する。その結果、非結合型の埋め込みによってモデリングの柔軟性が向上し、多言語モデルの入力埋め込みにおけるパラメータ割り当ての効率を大幅に改善できることを示した。入力エンベッディングのパ…

e4exp updated 3 years ago
2
arXivTimes/arXivTimes #206

Language Models with Pre-Trained (GloVe) Word Embeddings

## 一言でいうと RNNを使った言語モデルにword embeddingを組み込むことで性能向上をはかっている話。メモリセルにはGRU、embeddingにはGloVeを使用。n番目の単語ベクトルをn-1個の単語ベクトルから予測している。 ### 論文リンク https://arxiv.org/abs/1610.03759 ### 著者/所属機関 Victor Makarenk…

Hironsan updated 7 years ago
3
stac-extensions/ml-aoi #13

Unpack "training" to account for different kinds of training…

In the [MLM ](https://github.com/wherobots/mlm-form) each model has a geospatial footprint. right now it is pretty loose what this represents, so I expect this to be confusing to use for search and di…

rbavery updated 15 hours ago
1

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for pre-trained-language-models

1000+ results
for pre-trained-language-models