Open: colinmorris opened this issue 4 years ago
Hey @colinmorris, thanks for letting me know, and sorry for the delay. I don't know why GitHub didn't notify me about it.
Regarding the issue, you are completely right. This happens because the data is not pre-processed to skip or escape problematic characters like the one you mentioned ("). I'll try to submit a PR fixing this. Thanks!
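For what it's worth, one possible shape for such a fix is to let Python's csv module do the quoting when the file is written, instead of joining fields by hand. This is only a minimal sketch, not the actual export code of this project: the `write_rows` helper and the sample row are hypothetical.

```python
import csv
import io


def write_rows(rows):
    """Serialize rows to CSV text, letting csv.writer handle the quoting.

    Hypothetical helper: the real export script may look quite different.
    """
    buf = io.StringIO()
    # QUOTE_ALL wraps every field in double quotes; embedded double quotes
    # are doubled ("") because doublequote=True is the default.
    writer = csv.writer(buf, quoting=csv.QUOTE_ALL)
    writer.writerows(rows)
    return buf.getvalue()


# Made-up row whose title contains embedded double quotes.
rows = [["0", 'The "Best" Song', "some-artist"]]
text = write_rows(rows)

# Reading the text back should give the original fields unchanged.
parsed = list(csv.reader(io.StringIO(text)))
print(text, end="")
print(parsed == rows)
```

Writing through `csv.writer` avoids having to decide which characters to strip, since the title is kept intact and only its serialized form changes.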
The row in azlyrics_lyrics_l.csv looks like:

There's an extra double-quote in the song title field, which confuses the parser in Python's csv library (and probably most others). Per the csv RFC:
(btw, thank you for publishing this dataset! It's sorely needed.)
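To make the failure mode concrete, here is a small sketch of how Python's csv module treats a stray double quote inside a quoted field. The row below is made up for illustration and is not the actual row from the dataset; the `parse` helper is also hypothetical.

```python
import csv
import io

# Made-up malformed row: the title field has unescaped inner quotes.
bad_line = '0,"Song "Title", Part 2",some-artist\n'
# The same row with the inner quotes doubled, as RFC 4180 expects.
good_line = '0,"Song ""Title"", Part 2",some-artist\n'


def parse(line, strict=False):
    """Parse a single CSV line and return its list of fields."""
    return next(csv.reader(io.StringIO(line), strict=strict))


# By default the reader is lenient: it raises no error, but the comma
# after the stray quote is treated as a field separator, so the row
# no longer has the three intended fields.
lenient = parse(bad_line)

# With strict=True the same input raises csv.Error instead.
try:
    parse(bad_line, strict=True)
    strict_error = None
except csv.Error as exc:
    strict_error = exc

fixed = parse(good_line)
print(len(lenient), len(fixed), strict_error is not None)
```

Passing `strict=True` while loading the file may be a quick way to locate other rows with the same problem, since the lenient default silently shifts the columns.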