CODAIT / Identifying-Incorrect-Labels-In-CoNLL-2003

Research into identifying and correcting incorrect labels in the CoNLL-2003 corpus.
Apache License 2.0

Cleanup and verify scripts #21

Closed BryanCutler closed 3 years ago

BryanCutler commented 3 years ago

Moved all notebooks to scripts/

Fixed directory spelling of human_labels_audited

Replaced Label_Stats with Label_Stats_Fred.ipynb

Ran all scripts and notebooks, fixed any broken data file locations, and verified that each one runs

Fixes #18

review-notebook-app[bot] commented 3 years ago

Check out this pull request on ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.



BryanCutler commented 3 years ago

@kmh4321 and @xuhdev, do we still need apply_sentence_correction.ipynb, sentence_correction_preprocessing.ipynb, and token_correction_preprocessing.ipynb, or can we archive them?

xuhdev commented 3 years ago

apply_sentence_correction.ipynb is already in download_and_correct_corpus.py, and sentence_correction_preprocessing.ipynb is only supposed to be used to generate the correction files. I'm not sure what token_correction_preprocessing.ipynb is for.

BryanCutler commented 3 years ago

OK, I'll archive apply_sentence_correction.ipynb, since I'm sure that one has been ported, but leave the other two.