utterworks / fast-bert

Super easy library for BERT based NLP models
Apache License 2.0

Hotfix/support xlsx #313

Closed: lingdoc closed this 1 year ago

lingdoc commented 1 year ago

Adds support for .xlsx files in the pandas import of data files. This avoids various formatting and character-encoding problems that a raw CSV import can introduce, as noted in #311 and possibly other issues.

lingdoc commented 1 year ago

This fix simply checks whether ".xlsx" appears in the filename and, if so, reads the file with read_excel() from the pandas library instead of the CSV reader. Reading .xlsx files may require installing openpyxl via pip.
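For anyone reading along, here is a rough sketch of the kind of check described above. It is not the actual diff from this PR: `pick_reader_name` and `load_table` are hypothetical helper names, and the sketch compares the file suffix rather than searching for ".xlsx" anywhere in the filename, which is a slightly stricter variant of what the PR does. Only `pandas.read_csv` and `pandas.read_excel` are real library calls.

```python
from pathlib import Path


def pick_reader_name(filename):
    """Return the name of the pandas reader to use (hypothetical helper)."""
    # Compare the suffix case-insensitively. The PR checks for ".xlsx" in the
    # filename; testing the suffix is an assumed, slightly stricter variant.
    if Path(filename).suffix.lower() == ".xlsx":
        return "read_excel"
    return "read_csv"


def load_table(filename, **kwargs):
    """Load a data file into a DataFrame (hypothetical, not fast-bert's API)."""
    # Imported lazily so the selection logic above has no dependencies.
    import pandas as pd

    reader = getattr(pd, pick_reader_name(filename))
    # For .xlsx files, read_excel needs an Excel engine, typically openpyxl.
    return reader(filename, **kwargs)
```

With something like this, `load_table("train.xlsx")` would go through `read_excel()` and `load_table("train.csv")` through `read_csv()`. If pandas raises an ImportError about a missing optional dependency, `pip install openpyxl` should resolve it.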