Closed: ftesser closed this issue 11 years ago
The Wikipedia cleanup process seems to treat the TITLE of a Wikipedia paragraph, together with the text that follows it, as a single full sentence.
Example: the following text is extracted from http://it.wikipedia.org/wiki/Pretty_Rhythm:_Dear_My_Future
Personaggi Prizmmy☆: Ha quattordici anni e la sua mascotte è Mimi.
The entry points for this task are:
Commit 0aea904c3a94db46becc733bbc5ad44f18682218 fixes the TITLE+paragraph text issue. Every title now receives a final dot, so it is interpreted as a complete sentence on its own instead of being merged with the paragraph text.
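For illustration only, here is a minimal Python sketch of the "final dot" idea. It is not the code from the commit: the helper names `terminate_title` and `join_title_and_text` are hypothetical, and the set of sentence-ending characters is an assumption. The usage example also assumes that "Personaggi" is the section title in the example text.

```python
# Hypothetical sketch, not the repository's actual implementation.
# Assumption: a title counts as already terminated if it ends in '.', '!' or '?'.
SENTENCE_END = (".", "!", "?")


def terminate_title(title: str) -> str:
    """Return the title with a trailing dot unless it already ends a sentence."""
    stripped = title.strip()
    if not stripped or stripped.endswith(SENTENCE_END):
        return stripped
    return stripped + "."


def join_title_and_text(title: str, text: str) -> str:
    """Join a section title and its paragraph so a sentence splitter sees two sentences."""
    return f"{terminate_title(title)} {text.strip()}"


if __name__ == "__main__":
    # Assumes "Personaggi" is the section title of the example text.
    print(join_title_and_text(
        "Personaggi",
        "Prizmmy☆: Ha quattordici anni e la sua mascotte è Mimi.",
    ))
```

With the dot in place, a sentence splitter that breaks on terminal punctuation has a boundary between the title and the first sentence of the paragraph.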