facebookresearch / ParlAI

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
https://parl.ai
MIT License
10.48k stars, 2.09k forks

Reddit Movie Dialog no longer exists #3280

Closed andyfangaf closed 3 years ago

andyfangaf commented 3 years ago

Bug description

Reddit Movie Dialog no longer exists.

Reproduction steps

from parlai.scripts.train_model import TrainModel

TrainModel.main(
    # similar to before
    task='empathetic_dialogues,blended_skill_talk,movie_dialog_reddit,convai2,persona_chat',
    model='transformer/generator',
    model_file='from_pretrained/model',

    # initialize with a pretrained model
    init_model='zoo:tutorial_transformer_generator/model',

    # arguments we get from the pretrained model.
    # Unfortunately, these must be looked up separately for each model.
    n_heads=16, n_layers=8, n_positions=512, text_truncate=512,
    label_truncate=128, ffn_size=2048, embedding_size=512,
    activation='gelu', variant='xlm',
    dict_lower=True, dict_tokenizer='bpe',
    dict_file='zoo:tutorial_transformer_generator/model.dict',
    learn_positional_embeddings=True,

    # some training arguments, specific to this fine-tuning
    # use a small learning rate with the Adam optimizer
    lr=1e-5, optimizer='adam',
    warmup_updates=100,
    # early stopping on perplexity
    validation_metric='ppl',
    # train at most 10 minutes, and validate every 0.25 epochs
    max_train_time=600, validation_every_n_epochs=0.25,

    # depends on your GPU. If you have a V100, this is good
    batchsize=12, fp16=True, fp16_impl='mem_efficient',

    # speeds up validation
    skip_generation=True,

    # helps us cram more examples into our gpu at a time
    dynamic_batching='full',
)

Logs

---------------------------------------------------------------------------
ModuleNotFoundError                       Traceback (most recent call last)
<ipython-input-39-ff3044de39fe> in <module>()
     36 
     37     # helps us cram more examples into our gpu at a time
---> 38     dynamic_batching='full',
     39 )

15 frames
/usr/lib/python3.6/importlib/_bootstrap.py in _find_and_load_unlocked(name, import_)

ModuleNotFoundError: No module named 'parlai.tasks.movie_dialog_reddit'

---------------------------------------------------------------------------
NOTE: If your import is failing due to a missing package, you can
manually install dependencies using either !pip or !apt.

To view examples of installing some common dependencies, click the
"Open Examples" button below.
---------------------------------------------------------------------------

andyfangaf commented 3 years ago

Never mind, it's the documentation that needs updating. I just checked the source and the task is in there. The problem is that the docs don't link to it (some of the links are even broken). It'd be nice to also have the task identifiers in the docs 🙂
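
Until the docs list task identifiers, one way to find them is to look at what is actually installed. The traceback above shows that each task name is imported as a submodule of `parlai.tasks`, so listing that package's submodules should give the valid names. The sketch below is not part of ParlAI: `list_submodules` and `missing_tasks` are hypothetical helpers built only on the standard library, and splitting on `:` assumes the usual `task:subtask` form of ParlAI task strings.

```python
import importlib.util
import pkgutil


def list_submodules(package_name):
    """Return the sorted names of the direct submodules of a package.

    Returns an empty list if the package (or one of its parents) is not
    installed, or if the name refers to a plain module rather than a package.
    """
    try:
        spec = importlib.util.find_spec(package_name)
    except ModuleNotFoundError:
        # Raised when a parent package (e.g. `parlai`) cannot be imported.
        return []
    if spec is None or spec.submodule_search_locations is None:
        return []
    paths = list(spec.submodule_search_locations)
    return sorted(module.name for module in pkgutil.iter_modules(paths))


def missing_tasks(task_string, available):
    """Return the names in a comma-separated task string not found in `available`.

    Assumption: anything after the first ':' is a subtask or teacher name,
    so only the part before it is compared against the module names.
    """
    names = [t.split(':')[0].strip() for t in task_string.split(',') if t.strip()]
    return [name for name in names if name not in available]


if __name__ == '__main__':
    # Assumes ParlAI is installed; otherwise `available` is empty and
    # every task is reported as missing.
    available = list_submodules('parlai.tasks')
    tasks = 'empathetic_dialogues,blended_skill_talk,movie_dialog_reddit,convai2,persona_chat'
    print('Unknown task modules:', missing_tasks(tasks, available))
```

Running a check like this before `TrainModel.main` turns a `ModuleNotFoundError` deep in the import machinery into a short list of the task names that need correcting.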

andyfangaf commented 3 years ago

Reopening for a maintainer to triage and comment

stephenroller commented 3 years ago

Did you happen to keep the rest of the stacktrace?

andyfangaf commented 3 years ago

I think the rest were just generic pip install examples

github-actions[bot] commented 3 years ago

This issue has not had activity in 30 days. Please feel free to reopen if you have more issues. You may apply the "never-stale" tag to prevent this from happening.

stephenroller commented 3 years ago

Sorry, what's going wrong with this?

stephenroller commented 3 years ago

It looks like it's supposed to be movie_dialogue maybe? I don't remember a task called movie_dialog_reddit. Did we delete it?

I don't see any docs that refer to movie_dialog_reddit anywhere

stephenroller commented 3 years ago

Oh I see, I found the link you added. Okay, the "code" link is correct, but you inferred the task name from the title.

Got it, good call out.

stephenroller commented 3 years ago

[image]

Hopefully this looks better