Lots of people use external corpora like IMDB reviews (eh) or the Wikipedia corpus (good!) to enhance or bootstrap NLP analysis. We could do the same. We might put these into a separate browser in the database, or mark them specially. Keeping them clearly marked would let us run comparison experiments to see how they affect the algorithms.
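As a rough sketch of the "mark them specially" option: each document could carry a source label, so an experiment can be run with and without the external corpora. The `Document` class, the `source` field, and the label values below are all hypothetical, not part of any existing schema.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Document:
    text: str
    source: str  # hypothetical label: "project", "wikipedia", "imdb", ...


def select(docs, include_external):
    """Return our own documents, optionally together with the external corpora."""
    return [d for d in docs if include_external or d.source == "project"]


docs = [
    Document("our own text", "project"),
    Document("an encyclopedia article", "wikipedia"),
    Document("a movie review", "imdb"),
]

# Two inputs for a comparison experiment: same algorithm, different corpora.
baseline = select(docs, include_external=False)
augmented = select(docs, include_external=True)
```

Running the same algorithm on `baseline` and `augmented` would then show how much the extra corpora change the results.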