Pgibby8 / SentenceFusion


Locate suitable training dataset #1

Open Pgibby8 opened 4 years ago

Pgibby8 commented 4 years ago

Right now I'm leaning towards Project Gutenberg for its size, diversity, and availability: http://www.gutenberg.org/

Pgibby8 commented 4 years ago

This repository seems promising for working with Project Gutenberg texts: https://github.com/c-w/gutenberg
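
For reference, here is a rough sketch of turning one downloaded Project Gutenberg plain-text file into candidate training sentences. It uses only the Python standard library and does not rely on the repository linked above. The function names `strip_boilerplate` and `split_sentences` are hypothetical, the marker pattern is an assumption (the exact wording of the START/END lines varies between files), and the sentence splitter is deliberately naive.

```python
import re

# Assumed marker lines; real files vary ("THIS" vs "THE"), so match loosely.
_START = re.compile(
    r"^\*\*\* ?START OF (?:THIS|THE) PROJECT GUTENBERG EBOOK.*$", re.MULTILINE
)
_END = re.compile(
    r"^\*\*\* ?END OF (?:THIS|THE) PROJECT GUTENBERG EBOOK.*$", re.MULTILINE
)


def strip_boilerplate(raw):
    """Return the text between the START and END marker lines.

    If a marker is missing, fall back to the start or end of the input.
    """
    start = _START.search(raw)
    end = _END.search(raw)
    begin = start.end() if start else 0
    finish = end.start() if end else len(raw)
    return raw[begin:finish].strip()


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    flat = " ".join(text.split())  # collapse hard line wraps into spaces
    return [s for s in re.split(r"(?<=[.!?])\s+", flat) if s]
```

A splitter this simple will break on abbreviations such as "Mr." and on quoted dialogue, so it is only a starting point for judging whether the corpus is usable.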