CODAIT / text-extensions-for-pandas

Natural language processing support for Pandas dataframes.
Apache License 2.0

MINOR: add additional flexibility to a few functions within tp.io #233

Closed ZachEichen closed 2 years ago

ZachEichen commented 2 years ago

Added flexibility to two functions within tp.io:

  1. Added an option to split corpora into documents at sentence boundaries when reading .conllu files with the tp.io.conll module.
  2. Added an option to use custom column names when calling tp.bert.align_bert_tokens_to_corpus_tokens.
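
To illustrate what splitting by sentence boundary means for a .conllu file, here is a minimal standalone sketch. It is not the code in tp.io.conll; the helper name `split_conllu_by_sentence` is hypothetical, and the sample rows are truncated to four fields for brevity (real CoNLL-U token lines have ten). It relies only on the CoNLL-U conventions that blank lines separate sentences and lines starting with `#` are comments.

```python
def split_conllu_by_sentence(text):
    """Return one 'document' per sentence, each a list of token field lists.

    Hypothetical helper for illustration only; not part of tp.io.conll.
    """
    documents = []
    current = []
    for line in text.splitlines():
        if not line.strip():
            # A blank line marks a sentence boundary in CoNLL-U.
            if current:
                documents.append(current)
                current = []
        elif not line.startswith("#"):
            # Token lines are tab-separated; comment lines are skipped.
            current.append(line.split("\t"))
    if current:
        # Flush the last sentence if the file lacks a trailing blank line.
        documents.append(current)
    return documents


# Sample input with two sentences (fields truncated for brevity).
sample = (
    "# text = Hello world\n"
    "1\tHello\thello\tINTJ\n"
    "2\tworld\tworld\tNOUN\n"
    "\n"
    "# text = Bye\n"
    "1\tBye\tbye\tINTJ\n"
)
docs = split_conllu_by_sentence(sample)
print(len(docs))
```

Each returned element could then be turned into its own dataframe, which is roughly the behaviour the new option describes.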