This repoistory is a collaborative effort by the DCU - ADAPT Team for the EDOS 2023 SemEval Task-10 shared task.
Python3 modules: gensim nltk numpy pandas scikit-learn spacy xgboost transformers sentence-transformers torch
(SpaCy is only needed if requesting its tokeniser with train.py --tokeniser
,
or making predictions with a model that was trained with this option.)
NLTK asks to further run inside python:
import nltk
nltk.download('punkt')
Spacy instructions say to run
python -m spacy download en_core_web_sm
TODO: do we need this model? We only use the tokeniser.
See task specific README.md in src/ directory
See LICENSE.txt
If you use this code or the released models, please cite our paper to appear at SemEval 2023.