Open mmaz opened 3 years ago
In German, 'null' (zero) is being converted to `NaN` by pandas when it is the only word present in the transcript (due to single-word-target-segments data).
One option is to use `na_filter=False` when reading Common Voice TSVs: https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.read_csv.html
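However, we should also first check for truly missing values in the sentence transcription column, since turning off NaN detection means an empty transcript is read as an empty string rather than flagged as missing.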
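A rough sketch of what this could look like, not tested against the real dataset. The two-column sample, the `path` and `sentence` column names, and the `read_transcripts` helper are assumptions for illustration; real Common Voice TSVs have more columns.

```python
import io

import pandas as pd

# Hypothetical miniature of a Common Voice TSV (real files have more columns).
# Row 1 is the German word "null", row 2 has a truly missing transcript.
SAMPLE_TSV = "path\tsentence\na.mp3\tnull\nb.mp3\t\nc.mp3\thallo\n"


def read_transcripts(source):
    """Read a TSV without NaN detection and drop rows with no transcript.

    Hypothetical helper: returns the cleaned frame and the number of rows
    whose transcript was empty.
    """
    # na_filter=False turns off NaN detection, so "null" stays a string.
    df = pd.read_csv(source, sep="\t", na_filter=False, dtype=str)
    # With detection off, a truly missing transcript shows up as "".
    missing = df["sentence"].str.strip() == ""
    return df[~missing].reset_index(drop=True), int(missing.sum())


# Default behaviour: both "null" and the empty field become NaN.
default_df = pd.read_csv(io.StringIO(SAMPLE_TSV), sep="\t")
print(default_df["sentence"].isna().tolist())

# With na_filter=False: "null" survives, only the empty row is dropped.
clean_df, n_missing = read_transcripts(io.StringIO(SAMPLE_TSV))
print(clean_df["sentence"].tolist(), n_missing)
```

An alternative that keeps pandas' own missing-value handling would be `keep_default_na=False` together with `na_values=[""]`, so that only empty fields are treated as missing.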