Open RIOZHU123 opened 2 years ago
Given that we already support Twitter V2 JSON natively, what does handling one specific tool's CSV output add? Other Twitter data collection tools output different formats, and I think it makes sense to support only the canonical Twitter output format.
Thanks Sam - totally agree!! I attached the Python code to this issue for converting Twarc CSV to CN Toolkit CSV - hope it helps in some circumstances. But again, I agree that the toolkit should target the more general case.
A function for converting a Twitter dataset (the Twitter CSV format produced by twarc2) into the format accepted by preprocess.py might be necessary.
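For anyone landing here, a rough sketch of what such a conversion could look like. This is not the script attached to the issue, and the column names are assumptions: the left side of `COLUMN_MAP` guesses at a few twarc2 CSV headers, and the right side is a purely hypothetical target layout, since the columns preprocess.py expects are not listed in this thread. Adjust both to the real headers.

```python
import csv
import io

# Hypothetical mapping: twarc2 CSV column -> column expected by the toolkit.
# Both sides are placeholders and must be checked against the real files.
COLUMN_MAP = {
    "id": "tweet_id",
    "author_id": "user_id",
    "created_at": "timestamp",
    "text": "text",
}


def convert_twarc_csv(src, dst, column_map=COLUMN_MAP):
    """Copy rows from src to dst, keeping and renaming only mapped columns.

    src and dst are open text file objects. Returns the number of rows written.
    """
    reader = csv.DictReader(src)
    present = reader.fieldnames or []
    missing = [col for col in column_map if col not in present]
    if missing:
        raise ValueError(f"input is missing expected columns: {missing}")

    writer = csv.DictWriter(dst, fieldnames=list(column_map.values()))
    writer.writeheader()

    count = 0
    for row in reader:
        # Unmapped input columns are dropped.
        writer.writerow({new: row[old] for old, new in column_map.items()})
        count += 1
    return count


if __name__ == "__main__":
    # Tiny in-memory demo; real use would pass files opened with newline="".
    demo_in = io.StringIO(
        "id,author_id,created_at,text,lang\n"
        "1,42,2022-01-01T00:00:00.000Z,hello,en\n"
    )
    demo_out = io.StringIO()
    convert_twarc_csv(demo_in, demo_out)
    print(demo_out.getvalue())
```

Keeping the mapping in a plain dict means the same function could be reused for other tools' CSV exports by swapping the mapping, which fits the point above about not tying the toolkit to one tool's format.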