Open ArthurMinovsky opened 1 year ago
Crawl data
Check quality of data // Decide whether each source should be collected, based on the amount of data, whether it is available for collection, and the quality of the text (paragraphs, content, etc.)
Check overlap against the list of already collected data
[ ] Output: List of recommended data to crawl
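The quality and overlap checks above could be sketched roughly as follows. This is only a minimal sketch, not an agreed method: the function names, the `min_chars` and `max_overlap` thresholds, and the use of exact matching after whitespace normalisation are all assumptions made for illustration.

```python
import re


def normalize(text):
    """Collapse whitespace and lowercase so trivially different copies compare equal."""
    return re.sub(r"\s+", " ", text).strip().lower()


def quality_report(texts, min_chars=200):
    """Summarise the amount and rough text quality of one candidate source.

    min_chars is a hypothetical threshold for flagging very short documents.
    """
    lengths = [len(t.strip()) for t in texts]
    non_empty = [n for n in lengths if n > 0]
    return {
        "documents": len(texts),
        "empty": len(texts) - len(non_empty),
        "short": sum(1 for n in non_empty if n < min_chars),
        "mean_chars": sum(non_empty) / len(non_empty) if non_empty else 0.0,
    }


def overlap_ratio(candidate_texts, collected_texts):
    """Fraction of candidate documents already in the collected data (exact match after normalisation)."""
    if not candidate_texts:
        return 0.0
    collected = {normalize(t) for t in collected_texts}
    hits = sum(1 for t in candidate_texts if normalize(t) in collected)
    return hits / len(candidate_texts)


def recommend(sources, collected_texts, max_overlap=0.5):
    """Return names of sources that have some non-empty text and do not mostly duplicate collected data.

    sources maps a source name to its list of document texts.
    max_overlap is a hypothetical cut-off, not a value from this issue.
    """
    recommended = []
    for name, texts in sources.items():
        report = quality_report(texts)
        has_content = report["documents"] - report["empty"] > 0
        if has_content and overlap_ratio(texts, collected_texts) <= max_overlap:
            recommended.append(name)
    return recommended
```

To use it, pass the texts of each candidate source (for example, a text column taken from a dataset such as `thaisum`) together with the texts already collected. Exact matching will miss near-duplicates, so a fuzzier comparison may be needed in practice.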
`from datasets import load_dataset
thsum = load_dataset('thaisum')`
Output:
Example of data source: