modeling-language Search Results

1000+ results
for modeling-language

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

NVlabs/VILA #109

Issue with Flash Attention on V100 GPU for Llama-3-VILA1.5-8…

Hi, I am encountering an issue when running inference on the Llama-3-VILA1.5-8B model. The error message I receive is: ```RuntimeError: FlashAttention only supports Ampere GPUs or newer.``` I…

vedernikovphoto updated 3 weeks ago
8
showlab/Show-o #21

No module named 'parquet.parquet_dataset'

File "Show-o/parquet/refinedweb_dataset.py", line 20, in from parquet.parquet_dataset import CruiseParquetDataset ModuleNotFoundError: No module named 'parquet.parquet_dataset'

mrswang1 updated 1 month ago
2
intel/neural-compressor #1980

how to evaluate AWQ ?

https://github.com/intel/neural-compressor/blob/master/docs/source/quantization_weight_only.md#examples how to set eval_func? https://github.com/intel/neural-compressor/blob/master/examples/3…

chunniunai220ml updated 1 month ago
7
evolutionaryscale/esm #94

Is it possible to map esm3 embedding back to sequence?

I want to explore the esm3 space but wondering how to map the modified embedding back to sequence.

johnnytam100 updated 1 week ago
1
recbygus/llm #2

Fine tune the model with a dialog dataset

!pip install transformers datasets from transformers import GPT2Tokenizer, GPT2LMHeadModel, Trainer, TrainingArguments from datasets import load_dataset, load_metric tokenizer = GPT2Tokenizer.from_…

recbygus updated 3 weeks ago
1
belgif/ICEGthema-person #2

Associate Person with one or many LanguageProficiency

While not explicitly present in IBZ ITs, it might be useful to record the language skills / preferences of a person. Using a `LanguageProficiency` object allows to stipulate whether the proficiency is…

saxomoose updated 2 days ago
2
facebookresearch/fairseq #4209

PadDataset does not have pad_length (multilingual_language_m…

## 🐛 Bug On [`multilingual_language_modeling.py` the method `build_dataset_for_inference`](https://github.com/pytorch/fairseq/blob/f591cc94caa85098ccf125a4782f91125b6a086d/fairseq/tasks/multilingua…

afcruzs-ms updated 2 years ago
1
arXivTimes/arXivTimes #215

Frustratingly Short Attention Spans in Neural Language Model…

## 一言でいうと Attentionを行う場合、隠れ層のベクトルは次の単語の予測・Attentionの算出・将来の単語に有用な情報の格納、という3つの役割を担っていることになる。なので出力を3つにして役割分担させるアイデア。併せて、単純に過去の隠れ層を結合して入力するだけでも高精度になることを確認 ### 論文リンク https://arxiv.org/abs/1702.045…

icoxfog417 updated 6 years ago
1
karakuri-ai/paper-readings #21

[1998]A Language Modeling Approach to Information Retrieval

## ざっくり言うと - documentからqueryが検索ワードとして生成される確率をモデル化 - 確率モデルは単語`t`の出現確率を工夫してモデル化している - ノンパラメトリックな方法 - tf-idfよりも優れた検索結果を達成 #### キーワード - IR - Language modeling ## 1. 情報 ### 論文リンク https://dl.a…

IkokObi updated 5 years ago
4
facebookresearch/fairseq #5467

Empty 'args' value in Neural Language Modeling "Training a t…

## 🐛 Bug The model trained (in Colab) according to instructions in Neural Language Modeling "Training a transformer language model with the CLI tools" example model has an empty 'args' value result…

lancioni updated 5 months ago
2

上一页 1...3 4 5 6 7 8 9...100 下一页

1000+ results for modeling-language

1000+ results
for modeling-language