xpmethod / opensyllabus

sample corpus #56

Closed denten closed 10 years ago

denten commented 10 years ago
  1. Prepare a sample dump (100k) of documents containing the full text.
  2. Sanitize names.
  3. Release on the website and through the list. Thank the folks who were working on name extraction.
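A rough sketch of how steps 1 and 2 might look, in case it helps whoever picks this up. It assumes "100k" means 100,000 documents and that a list of known names is already available from the name-extraction work. The function names `sample_documents` and `sanitize_names` and the `[NAME]` placeholder are hypothetical and not part of the project's code.

```python
import random
import re


def sample_documents(docs, k, seed=0):
    """Reservoir-sample up to k documents from an iterable of any length."""
    rng = random.Random(seed)  # fixed seed so the sample is reproducible
    reservoir = []
    for i, doc in enumerate(docs):
        if i < k:
            # Fill the reservoir with the first k documents.
            reservoir.append(doc)
        else:
            # Replace an existing entry with probability k / (i + 1).
            j = rng.randint(0, i)  # randint is inclusive on both ends
            if j < k:
                reservoir[j] = doc
    return reservoir


def sanitize_names(text, names, placeholder="[NAME]"):
    """Replace each known name in text with a placeholder (whole-word match)."""
    if not names:
        return text
    # Try longer names first so a full name wins over a first name alone.
    ordered = sorted(names, key=len, reverse=True)
    alternation = "|".join(re.escape(name) for name in ordered)
    # Lookarounds keep "Jane" from matching inside "Janet".
    pattern = re.compile(r"(?<!\w)(?:" + alternation + r")(?!\w)", re.IGNORECASE)
    return pattern.sub(placeholder, text)
```

Something like `[sanitize_names(d, names) for d in sample_documents(corpus, 100_000)]` would then produce the dump. Reservoir sampling is used here only because it avoids loading the whole corpus into memory; a plain `random.sample` would do if the corpus fits.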
denten commented 10 years ago

Duplicate