OpenThaiGPT/openthaigpt-pretraining
Apache License 2.0
Crawl news mfa
#265
Open
nattjn opened 1 year ago
nattjn commented 1 year ago
Why this PR
Why do we need this PR? To crawl news from MFA.
Changes
Add functions to crawl news content from MFA
Add scripts for each news topic
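For reviewers, here is a minimal sketch of what a content-extraction function of this kind might look like. It is not the code in this PR: the class name `NewsContentParser`, the function `extract_news`, and the assumption that a news page keeps its headline in `<h1>` and its body in `<p>` tags are all hypothetical, and the sketch parses an HTML string only (no network access).

```python
from html.parser import HTMLParser


class NewsContentParser(HTMLParser):
    """Collect the first <h1> headline and all <p> paragraphs of one page.

    Hypothetical page layout; the real MFA markup may differ.
    """

    def __init__(self):
        super().__init__()
        self.title = ""
        self.paragraphs = []
        self._current_tag = None  # "h1" or "p" while inside one, else None
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        # Start buffering text when a headline or paragraph opens.
        if tag in ("h1", "p"):
            self._current_tag = tag
            self._buffer = []

    def handle_data(self, data):
        # Text inside nested inline tags (<b>, <i>, ...) is kept as well.
        if self._current_tag is not None:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag != self._current_tag:
            return
        # Collapse runs of whitespace into single spaces.
        text = " ".join("".join(self._buffer).split())
        if tag == "h1" and not self.title:
            self.title = text
        elif tag == "p" and text:
            self.paragraphs.append(text)
        self._current_tag = None


def extract_news(html):
    """Return the headline and body text of a news page as a dict."""
    parser = NewsContentParser()
    parser.feed(html)
    parser.close()
    return {"title": parser.title, "content": "\n".join(parser.paragraphs)}


# Made-up sample page, for illustration only.
SAMPLE_HTML = (
    "<html><body><h1> Sample headline </h1>"
    "<p>First   paragraph.</p><p>Second <b>paragraph</b>.</p><p> </p>"
    "</body></html>"
)

if __name__ == "__main__":
    print(extract_news(SAMPLE_HTML))
```

Under this assumption, a per-topic script would only need to supply that topic's list of page URLs and call the same extraction function on each downloaded page.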
Related Issues
Close #
Checklist
[ ] PR should follow the Naming convention
[ ] Assign yourself to Assignees
[ ] Tag related issues
[ ] Constant names should be ALL_CAPITAL, function names should be snake_case, and class names should be CamelCase
[ ] Complex functions/algorithms should have a Docstring
[ ] A PR should not change more than 200 lines (test files excepted). If it does, please open multiple PRs
[ ] At least one PR reviewer must come from the task's team (model, eval, data)
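As a quick illustration of the naming and docstring items in the checklist above, the following short sketch shows one constant, one class, and one function in the required styles. All names here (`MAX_RETRIES`, `NewsArticle`, `count_paragraphs`) are made up for the example and are not taken from this PR.

```python
MAX_RETRIES = 3  # constant: ALL_CAPITAL


class NewsArticle:  # class: CamelCase
    """Hold the title and body text of one crawled article."""

    def __init__(self, title, content):
        self.title = title
        self.content = content


def count_paragraphs(article):  # function: snake_case
    """Return the number of non-empty lines in the article content."""
    return len([line for line in article.content.splitlines() if line.strip()])
```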
nattjn commented 1 year ago
done