steven-cutting / splitta

Automatically exported from code.google.com/p/splitta
0 stars 0 forks source link

Format of training data? #6

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago

I want to train up the system on a new batch of data.

You allude to the format (noting MXTERMINATOR, which is not well documented).

Could you actually provide a script to convert PTB2 and Brown into the correct 
format?

Or, at the very least, provide a small snippet showing example training data?

Original issue reported on code.google.com by tur...@gmail.com on 12 Aug 2010 at 6:58