bigscience-workshop / metadata

Experiments on including metadata such as URLs, timestamps, website descriptions and HTML tags during pretraining.
Apache License 2.0
30 stars, 12 forks

How do we define a paragraph? #114

Closed by norakassner 2 years ago

norakassner commented 2 years ago

https://docs.google.com/document/d/1S2gQrOZl5UVwILboc9E4wwmXJtO2QmfNq4B_pTdzYS0

tianjianjiang commented 2 years ago

Since @timoschick has noted https://docs.google.com/document/d/1S2gQrOZl5UVwILboc9E4wwmXJtO2QmfNq4B_pTdzYS0, I suppose we can close this ticket.