Open hugolpz opened 3 years ago
There is a `crawl_ca-valencia.py` within the google/corpuscrawler project, which produces a file visible in their readme.md. Surprisingly, this frequency file didn't make it to UNILEX. As renowned Twitter expert on the Catalan language Unjoanqualsevol puts it:

> Great! But I can't find Catalan (ca) language data 😭

There is no `ca`, `cat`, nor `ca-valencia` document within UNILEX. A quick search [CTRL+F] for `.txt` returns the following results: [search results not preserved]

Q: Is there any plan to rerun the whole chain at any time or periodically?

I proposed PR #13.

For maintenance reasons I plan to remove this PR branch in a week.
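
The manual [CTRL+F] check for missing language files could also be scripted. Below is a minimal sketch, assuming each language is stored as a `<code>.txt` file; the helper name `missing_language_files` and the example paths are hypothetical and are not taken from UNILEX or corpuscrawler.

```python
from pathlib import PurePosixPath


def missing_language_files(filenames, codes):
    """Return the codes (in input order) that have no `<code>.txt` file."""
    # Compare on the file stem so that directory prefixes do not matter.
    stems = {
        PurePosixPath(name).stem
        for name in filenames
        if name.endswith(".txt")
    }
    return [code for code in codes if code not in stems]


if __name__ == "__main__":
    # Hypothetical listing: the real UNILEX file names are not given in this issue.
    listing = ["data/frequency/fr.txt", "data/frequency/es.txt", "README.md"]
    print(missing_language_files(listing, ["ca", "cat", "ca-valencia", "fr"]))
```

With the hypothetical listing above, the three Catalan codes are reported as missing while `fr` is not. Running such a check after each rerun of the chain would make gaps like this one visible early.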