loge-gh / jp-tools

Automatically exported from code.google.com/p/jp-tools
1 stars 0 forks source link

Extract grammar/topic markers from the article #7

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
There's a lot of informal markers in the articles:

> кн. рыбы.
> спорт. судья; рефери (в борьбе сумо́).
> бот. спора.
> энт. тли, Aphididae.

> ономат.:
> 1) со стуком (падать, ударяться и т. п.);
> 2) жадно, давясь (пить, есть);
> 3) одиноко, потерянно.

We're to find these and in each case determine if the marker can be safely 
removed and added as appropriate grammar/topic tags.
If not, we're to ban the article in the strict mode.

Original issue reported on code.google.com by himse...@gmail.com on 9 Apr 2013 at 2:04