-
```
- SLM toolkit from CMU: http://www.speech.cs.cmu.edu/SLM/toolkit.html
- MSRLM: http://research.microsoft.com/apps/pubs/default.aspx?id=70505
- MITLM toolkit: http://code.google.com/p/mitlm/
- Vari…
```
-
## Summary
- Trains a character-level language model using a 64-layer Transformer
- Adds three auxiliary losses to make training of the deep stack feasible
- Achieves SOTA on character-level language modeling on text8 and enwik8
#### Keywords
- character-level
- language model
- transformer
## 1. Information…
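The auxiliary losses are added to the final-layer prediction loss during training. A minimal sketch of that combination in plain Python; the linear decay schedule and the 0.5 weight are my assumptions for illustration, not the paper's exact values:

```python
def combined_loss(final_loss, intermediate_losses, training_progress):
    """Combine the final-layer loss with auxiliary intermediate losses.

    training_progress is in [0, 1]; the auxiliary terms are linearly
    decayed to zero over training (a simplified, hypothetical schedule).
    """
    decay = max(0.0, 1.0 - training_progress)
    aux = decay * sum(intermediate_losses)
    return final_loss + 0.5 * aux  # 0.5 is a hypothetical weight
```

Early in training the auxiliary terms dominate the gradient signal reaching lower layers; by the end, only the final-layer loss remains.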
-
It seems to me that this line should be changed to `if 'tm' in self.name` (https://github.com/rosewang2008/language_modeling_via_stochastic_processes/blob/5cbc3eed581eba6444c471bfe716bd56db0f5253/lang…
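For context, `in` on strings tests substring membership rather than equality, so the suggested check would match any task name containing `tm`, not only the exact string. A tiny illustration (the task names below are made up):

```python
def is_tm_task(name: str) -> bool:
    # Substring check: matches 'tm_small' or 'wikisection_tm',
    # not just the exact name 'tm'.
    return 'tm' in name
```

Whether that broader match is intended depends on how the repository names its tasks.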
-
### Description
Related to issue [1046](https://github.com/tensorflow/tensor2tensor/issues/1046).
Decoding from dataset for language modeling problems (text2self as defined in text_problems.py) i…
-
https://github.com/intel/neural-compressor/blob/master/docs/source/quantization_weight_only.md#examples
How do I set `eval_func`?
https://github.com/intel/neural-compressor/blob/master/examples/3…
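As a general pattern (hedged, not verified against this specific example), `eval_func` is a callable that takes the candidate model and returns a single accuracy-like float, which the tuner uses to compare quantization configurations. A sketch with a hypothetical validation set and toy model:

```python
def make_eval_func(validation_set):
    """Build an eval_func: takes a model, returns one scalar score
    (higher is better) for the tuner to compare across candidates."""
    def eval_func(model):
        correct = sum(int(model(x) == y) for x, y in validation_set)
        return correct / len(validation_set)
    return eval_func

# Toy usage: any callable stands in for the model here.
data = [(1, 1), (2, 4), (3, 9)]
eval_fn = make_eval_func(data)
print(eval_fn(lambda x: x * x))  # 1.0: the toy model is always correct
# In neural-compressor it would then be passed to the tuning entry
# point, e.g.: quantization.fit(model, conf, eval_func=eval_fn)
```

The key contract is the signature: one model in, one comparable scalar out.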
-
Hi,
I am encountering an issue when running inference on the Llama-3-VILA1.5-8B model. The error message I receive is:
`RuntimeError: FlashAttention only supports Ampere GPUs or newer.`
I…
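For context, this error means FlashAttention requires Ampere-class GPUs, i.e. CUDA compute capability 8.0 or higher. A minimal pre-check; the helper name is mine, and in practice the capability tuple would come from `torch.cuda.get_device_capability()`:

```python
def supports_flash_attention(capability):
    """capability is a (major, minor) compute-capability tuple,
    e.g. (7, 0) for V100, (7, 5) for T4, (8, 0) for A100."""
    return capability >= (8, 0)
```

On older GPUs the usual workaround is to fall back to a non-flash attention implementation rather than patch the kernel.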
-
We currently have the following unfortunate naming: https://github.com/facebookresearch/metaseq/blob/4288451502667dda2be71a0a1a9df5066b583ae8/metaseq/tasks/streaming_language_modeling.py#L271-L290
…
-
- https://arxiv.org/abs/2109.12178
- 2021
Vision-and-language pretraining (VLP) improves model performance on downstream tasks that require image and text inputs.
Current VLP approaches differ in:
(i) model architecture (especially the image embedder),
(ii) loss functions, and
(iii) masking policies.
Image embedders use ResNet…
-
[Updated 20240911]
DAY 2 MORNING
| Exercise | Description | Completion |
| -------- | ------- | ------- |
| Q1A | Code present | YES |
| Q1B | `protein_coding` genes count correct | YES |
| Q1…