Closed mdingemanse closed 1 week ago
final CSV also includes gaze, should have only speech
parsing EAF generates an error (it works, but looks alarming — can this be suppressed or avoided?)
pympi
. I will have to check whether it can be suppressed there. Since it's just a warning, I address 3. first.saving corpus locally throws error
write_csv
encounters a TypeError
if metadata is provided in metadata fields. Proposed solution ready for review in linked pull request.
Working through this Colab notebook I noticed its output is not entirely selfexplanatory yet and also it generates some errors that may throw off beginners:
/usr/local/lib/python3.10/dist-packages/pympi/Elan.py:1471: UserWarning: Parsing unknown version of ELAN spec... This could result in errors... warnings.warn('Parsing unknown version of ELAN spec... '
TypeError Traceback (most recent call last)
8 frames
/usr/local/lib/python3.10/dist-packages/sktalk/corpus/write/writer.py in(x)
52 norm = pd.jsonnormalize(data=metadata, sep="")
53 df = pd.DataFrame(norm)
---> 54 df[:] = np.vectorize(lambda x: ', '.join(
55 x) if isinstance(x, list) else x)(df)
56 return df
TypeError: sequence item 0: expected str instance, dict found