ekatraone / Alpaca-style-Dataset-Generator

This project generates a high-quality Alpaca-style dataset from input text files, PDFs, and Word documents.
MIT License

name 'CONFIG' is not defined #2

Open roymina opened 1 month ago

roymina commented 1 month ago

Running python src/main.py --num_examples 1000 fails with the following error:

Loading input data...
Loading input files:   0%|                                                   | 0/1 [00:00<?, ?file/s]
Traceback (most recent call last):
  File "D:\github_clones\Alpaca-style-Dataset-Generator\src\main.py", line 34, in <module>
    main()
  File "D:\github_clones\Alpaca-style-Dataset-Generator\src\main.py", line 10, in main
    input_texts = load_input_data(CONFIG['input_folder'])
                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "D:\github_clones\Alpaca-style-Dataset-Generator\src\data_loader.py", line 20, in load_input_data
    with ThreadPoolExecutor(max_workers=CONFIG['max_workers']) as executor:
                                        ^^^^^^
NameError: name 'CONFIG' is not defined

bbeartheancient commented 2 weeks ago

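Add "from config import CONFIG" to the top of data_loader.py. The traceback shows data_loader.py reading CONFIG['max_workers'] without importing CONFIG, and the CONFIG name that main.py uses successfully on line 10 is not visible inside other modules.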
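
For anyone hitting the same error, here is a minimal, self-contained sketch of why the import is needed. It does not use the project's real files: the config module is a stand-in registered in sys.modules, the CONFIG values are made up (only the key names come from the traceback), and get_max_workers and run_module are hypothetical helpers. Each Python module has its own global namespace, so a name has to be imported in every module that uses it.

```python
import sys
import types

# Stand-in for the project's config module. The key names come from the
# traceback; the values are invented for this illustration.
config = types.ModuleType("config")
config.CONFIG = {"input_folder": "input", "max_workers": 4}
sys.modules["config"] = config


def run_module(source: str) -> dict:
    """Execute source in a fresh namespace, much as Python does per module."""
    namespace = {"__name__": "data_loader"}
    exec(source, namespace)
    return namespace


# Hypothetical reduction of data_loader.py: CONFIG is used but never imported.
BROKEN = '''
def get_max_workers():
    return CONFIG['max_workers']
'''

# The same code with the import suggested in this thread added at the top.
FIXED = "from config import CONFIG\n" + BROKEN

broken = run_module(BROKEN)
try:
    broken["get_max_workers"]()
    broken_error = None
except NameError as exc:
    # The lookup fails in this module's globals, as in the reported traceback.
    broken_error = str(exc)

fixed = run_module(FIXED)
fixed_workers = fixed["get_max_workers"]()

print(broken_error)
print(fixed_workers)
```

In the actual repository the change is presumably just the one import line at the top of data_loader.py, assuming the config module sits where "from config import CONFIG" can find it when the script is run as python src/main.py.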