Closed crisgarrillo closed 1 week ago
Hi i'm trying to execute a pipeline of stats. Following your example summary_stats.py, an error occurred :
AttributeError: 'TLDExtract' object has no attribute 'extract_str'. Did you mean: '_extractor'?
I tried both with parquet file than Jsonl. I tried on commonly used dataset like culturax or redpajama...
Any idea or suggestion is very appreciated.
Thanks Chris
Hi i'm trying to execute a pipeline of stats. Following your example summary_stats.py, an error occurred :
AttributeError: 'TLDExtract' object has no attribute 'extract_str'. Did you mean: '_extractor'?
I tried both with parquet file than Jsonl. I tried on commonly used dataset like culturax or redpajama...
Any idea or suggestion is very appreciated.
Thanks Chris