Lightning-Universe / lightning-transformers

Flexible components pairing 🤗 Transformers with ⚡ PyTorch Lightning
https://lightning-transformers.readthedocs.io
Apache License 2.0
607 stars, 77 forks

omegaconf.errors.ConfigAttributeError: Missing key datafiles #236

Closed: swcrazyfan closed this issue 2 years ago

swcrazyfan commented 2 years ago

🐛 Bug

I followed everything in the documentation precisely on Kaggle, Colab, and a TPU VM, but it always gives me a "Missing key datafiles" error.

To Reproduce

Steps to reproduce the behavior:

Run this Kaggle notebook. Then, see the final output.

This is the error I get, but you can read the full thing in the notebook:

Error executing job with overrides: ['dataset.cfg.train_file=../input/netkjvsourcetarget/train.json', 'dataset.cfg.validation_file=../input/netkjvsourcetarget/eval.json', 'task=nlp/summarization', 'trainer.gpus=1']
Traceback (most recent call last):
  File "train.py", line 10, in hydra_entry
    main(cfg)
  File "/kaggle/working/lightning-transformers/lightning_transformers/cli/train.py", line 77, in main
    logger=logger,
  File "/kaggle/working/lightning-transformers/lightning_transformers/cli/train.py", line 52, in run
    data_module.setup("fit")
  File "/opt/conda/lib/python3.7/site-packages/pytorch_lightning/core/datamodule.py", line 474, in wrapped_fn
    fn(*args, **kwargs)
  File "/kaggle/working/lightning-transformers/lightning_transformers/core/nlp/data.py", line 31, in setup
    dataset = self.load_dataset()
  File "/kaggle/working/lightning-transformers/lightning_transformers/core/nlp/data.py", line 62, in load_dataset
    elif self.cfg.datafiles:
omegaconf.errors.ConfigAttributeError: Missing key datafiles
    full_key: datafiles
    object_type=dict

Set the environment variable HYDRA_FULL_ERROR=1 for a complete stack trace.

swcrazyfan commented 2 years ago

Can anyone point me in the right direction?

stale[bot] commented 2 years ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

SeanNaren commented 2 years ago

We're moving away from keeping Hydra configs in this repo, and all examples are now pure PyTorch Lightning. Let me know if you run into any issues with this and I can help work around them!