openai / finetune-transformer-lm

Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
https://s3-us-west-2.amazonaws.com/openai-assets/research-covers/language-unsupervised/language_understanding_paper.pdf
MIT License
2.15k stars 503 forks source link

Error!! :( #39

Open amirakazadeh opened 5 years ago

amirakazadeh commented 5 years ago

hi how can I solve this error??

OSError: [E050] Can't find model 'en'. It doesn't seem to be a shortcut link, a Python package or a valid path to a data directory.

Anybody can help me??

thomwolf commented 5 years ago

Hi, python -m spacy download en should do the trick. cf https://spacy.io/usage/models

amirakazadeh commented 5 years ago

hi :) could you please tell me how can I run this code step by step??

tnx

P.S. I do "python -m spacy download en ." but the error is still here:(

On Fri, Dec 28, 2018 at 2:00 PM Thomas Wolf notifications@github.com wrote:

Hi, python -m spacy download en should do the trick. cf https://spacy.io/usage/models

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/openai/finetune-transformer-lm/issues/39#issuecomment-450337557, or mute the thread https://github.com/notifications/unsubscribe-auth/ArYzMk_X-yf_A6Dz82qWhTlayI3lZFDJks5u9fLLgaJpZM4ZjqXe .

Brandonnogithub commented 5 years ago

hi :) could you please tell me how can I run this code step by step?? tnx P.S. I do "python -m spacy download en ." but the error is still here:( On Fri, Dec 28, 2018 at 2:00 PM Thomas Wolf @.***> wrote: Hi, python -m spacy download en should do the trick. cf https://spacy.io/usage/models — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <#39 (comment)>, or mute the thread https://github.com/notifications/unsubscribe-auth/ArYzMk_X-yf_A6Dz82qWhTlayI3lZFDJks5u9fLLgaJpZM4ZjqXe .

"python -m spacy download en ." is to download a model. maybe you have an error during downloading Try download it again. and test whether you have downloaded it by the following code:

import spacy spacy.load("en")

amirakazadeh commented 5 years ago

I'd solved it. :)

tnx

amirakazadeh commented 5 years ago

hi again :) when I want to run program in "colab'" this error is occurred:

UnicodeDecodeError: 'utf-8' codec can't decode byte 0xe9 in position 1397: invalid continuation byte

how can I fix it? please help me

tnx

Brandonnogithub commented 5 years ago

hi again :) when I want to run program in "colab'" this error is occurred: UnicodeDecodeError: 'utf-8' codec can't decode byte 0xe9 in position 1397: invalid continuation byte how can I fix it? please help me tnx

if you are trying to open the files, just change the encoding parameter. The original data of roc is encoded in "ISO-8859-1" instead of "utf-8"

amelieyu commented 5 years ago

hi :) could you please tell me how can I run this code step by step?? tnx P.S. I do "python -m spacy download en ." but the error is still here:( On Fri, Dec 28, 2018 at 2:00 PM Thomas Wolf @.***> wrote: Hi, python -m spacy download en should do the trick. cf https://spacy.io/usage/models — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <#39 (comment)>, or mute the thread https://github.com/notifications/unsubscribe-auth/ArYzMk_X-yf_A6Dz82qWhTlayI3lZFDJks5u9fLLgaJpZM4ZjqXe .

"python -m spacy download en ." is to download a model. maybe you have an error during downloading Try download it again. and test whether you have downloaded it by the following code:

import spacy spacy.load("en")

i try python -m spacy download en. but i get error: raise ConnectionError(e, request=request) requests.exceptions.ConnectionError: HTTPSConnectionPool(host='raw.githubusercontent.com', port=443): Max retries exceeded with url: /explosion/spacy-models/master/shortcuts-v2.json (Caused by NewConnectionError('<urllib3.connection.VerifiedHTTPSConnection object at 0x7fb84d77f790>: Failed to establish a new connection: [Errno 110] Connection timed out',))

jingenyan commented 5 years ago

FileNotFoundError: [Errno 2] No such file or directory: 'encoder_bpe_40000.json",where to get this file“encoder_bpe_40000.json”,thanks

priyanka-chaudhary commented 5 years ago

@jingenyan : It is at this location https://github.com/openai/finetune-transformer-lm/tree/master/model

little-black-f commented 4 years ago

HI :) where is the file(maybe somethins else) "ROCstroies.tsv" and "rocstories.jsonl"? I can't find it . Thx