clips / wordkit

Featurize words into orthographic and phonological vectors.
GNU General Public License v3.0
40 stars 10 forks source link

Problem with subtlexus reader #20

Open lyambailey opened 2 years ago

lyambailey commented 2 years ago

When using the subtlexus (reader) function to load in the following corpus: SUBTLEXusfrequencyabove1.xls (obtained from here: http://crr.ugent.be/programs-data/subtitle-frequencies), I get the following error:

subtlexus('SUBTLEXusfrequencyabove1.xls', fields='orthography')


TypeError Traceback (most recent call last)

in ----> 1 subtlexus('SUBTLEXusfrequencyabove1.xls', fields='orthography') ~\anaconda3\lib\site-packages\wordkit\corpora\corpora\subtlex.py in subtlexus(path, fields) 46 def subtlexus(path, 47 fields=("orthography", "frequency")): ---> 48 return subtlex(path, fields, "eng-us") 49 50 ~\anaconda3\lib\site-packages\wordkit\corpora\corpora\subtlex.py in subtlex(path, fields, language) 31 skiprows = 0 32 ---> 33 return reader(path, 34 fields, 35 LANG2FIELD[language], ~\anaconda3\lib\site-packages\wordkit\corpora\base\reader.py in reader(path, fields, field_ids, language, preprocessors, opener, **kwargs) 68 field_ids = {} 69 ---> 70 df = opener(path, **kwargs) 71 # Columns in dataset 72 colnames = set(df.columns) ~\anaconda3\lib\site-packages\wordkit\corpora\base\reader.py in _open(path, **kwargs) 27 extension = os.path.splitext(path)[-1] 28 if extension in {".xls", ".xlsx"}: ---> 29 df = pd.read_excel(path, 30 na_values=nans, 31 keep_default_na=False, ~\anaconda3\lib\site-packages\pandas\util\_decorators.py in wrapper(*args, **kwargs) 294 ) 295 warnings.warn(msg, FutureWarning, stacklevel=stacklevel) --> 296 return func(*args, **kwargs) 297 298 return wrapper TypeError: read_excel() got an unexpected keyword argument 'sep'