cpdoc / dhbb-nlp

processamentos DHBB
Other
5 stars 2 forks source link

linguateca's folder now segmented on raw files with the same preprocessing as freeling and opennlp #37

Closed lucasrct closed 4 years ago

lucasrct commented 4 years ago

Linguateca's folder now contains:

  1. The .raw files segmented with linguateca's perl library with the same preprocessing as the files segmented by opennlp and freeling

  2. shell script make.sh to create your own files

  3. perl script perl.pl to use the perl library Lingua::PT::PLNbase to segment the files

  4. README file

arademaker commented 4 years ago

desculpe, não vejo os scripts nem o README.