ianb / personal-history-archive

An experiment in creating a dump of your personal browser history for analysis
Mozilla Public License 2.0
33 stars 0 forks source link

Allow importing external documents #20

Open ianb opened 6 years ago

ianb commented 6 years ago

Lots of people use things like IMDB reviews (eh), or the Wikipedia corpus (good!) to enhance or bootstrap NLP analysis. We could do the same. We might make these into a different browser in the database, or mark them specially. Keeping them well marked would help us do comparison experiments to see how these affect the algorithms.

ianb commented 6 years ago

See here for Wikipedia dumps