bigscience-workshop/metadata
Experiments on including metadata such as URLs, timestamps, website descriptions and HTML tags during pretraining.
Apache License 2.0
feat: tag clean website desc., entity paragraph, and title
#150
Closed
tianjianjiang closed 2 years ago
tianjianjiang commented 2 years ago
close: #148 close: #151
shanyas10 commented 2 years ago
Thanks @tianjianjiang!