CogComp / cogcomp-nlp

CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer, transliteration, verb-sense, and more.
http://nlp.cogcomp.org/

Dataless: duplicate categories #596

Open danyaljj opened 6 years ago

danyaljj commented 6 years ago

https://github.com/CogComp/cogcomp-nlp/blob/master/dataless-classifier/src/main/resources/hierarchies/20newsgroups/parentChildIdMap.txt
https://github.com/CogComp/cogcomp-nlp/blob/master/dataless-classifier/data/hierarchies/20newsgroups/parentChildIdMap.txt

shatu commented 6 years ago

The files inside the src/main/resources folder are intended to serve as the default hierarchy for programmatic use, since files inside the data folder are not readable from inside the jar. The files inside the data folder are meant to serve as templates for the end user. I could delete the files inside the data folder, but I think it's generally good practice to bundle all the additional files there.
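
To illustrate the distinction, here is a minimal sketch of how a loader could prefer a user-edited copy from the data folder and fall back to the default bundled in the jar. This is not the dataless-classifier's actual loading code: the class `HierarchyLoader` and the method `readLines` are hypothetical names, and the classpath location `hierarchies/20newsgroups/parentChildIdMap.txt` assumes the standard Maven layout, where the contents of src/main/resources end up at the root of the jar.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of cogcomp-nlp.
final class HierarchyLoader {

    // Reads the user's file if one is given and readable; otherwise reads
    // the default copy bundled on the classpath (i.e. inside the jar).
    static List<String> readLines(String userPath, String resourcePath) throws IOException {
        if (userPath != null) {
            Path path = Paths.get(userPath);
            if (Files.isReadable(path)) {
                return Files.readAllLines(path, StandardCharsets.UTF_8);
            }
        }
        // Classpath resources must be opened as streams: a file packed
        // inside a jar has no path on the regular file system.
        InputStream in = HierarchyLoader.class.getClassLoader().getResourceAsStream(resourcePath);
        if (in == null) {
            throw new IOException("Resource not found on classpath: " + resourcePath);
        }
        try (BufferedReader reader =
                 new BufferedReader(new InputStreamReader(in, StandardCharsets.UTF_8))) {
            List<String> lines = new ArrayList<>();
            String line;
            while ((line = reader.readLine()) != null) {
                lines.add(line);
            }
            return lines;
        }
    }
}
```

Under these assumptions, a call such as `HierarchyLoader.readLines("data/hierarchies/20newsgroups/parentChildIdMap.txt", "hierarchies/20newsgroups/parentChildIdMap.txt")` would use the template from the data folder when it exists on disk and the bundled default otherwise.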