mimno / Mallet

MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.
https://mimno.github.io/Mallet/

SimpleTagger: allow input other than file-based #183

Open zidane28 opened 3 years ago

zidane28 commented 3 years ago

Hi, I came across this library and found that SimpleTagger alone already fulfills my tagging needs. However, I am facing a problem integrating it into my project: SimpleTagger takes file-based input, while my project needs text-based, on-demand input.

Do you plan to support other input types for SimpleTagger? If I decide to fork the repository and edit the source code myself, am I allowed to modify it?

Thank you.

mimno commented 3 years ago

A better API version would be great, feel free to submit a pull request!
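
For anyone attempting the fork or pull request discussed above, here is a rough sketch of the general idea: code written against `java.io.Reader` does not care whether the characters come from a file or from a string held in memory. The class `InMemoryTaggerInput` and its `readSequences` methods are hypothetical names made up for this illustration and are not part of MALLET. The sketch assumes the SimpleTagger input layout of one token per line (features separated by spaces) with a blank line between sequences. It only groups the lines and does not call any MALLET code. Whether MALLET's own line-grouping iterator can be handed a `StringReader` in the same way is an assumption to check against the source.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; not part of MALLET.
public class InMemoryTaggerInput {

    // Reads sequences from any Reader (file-backed or in-memory).
    // Each non-blank line is one token; blank lines separate sequences.
    public static List<List<String>> readSequences(Reader source) throws IOException {
        List<List<String>> sequences = new ArrayList<>();
        List<String> current = new ArrayList<>();
        try (BufferedReader reader = new BufferedReader(source)) {
            String line;
            while ((line = reader.readLine()) != null) {
                if (line.trim().isEmpty()) {
                    // A blank line closes the current sequence, if it has tokens.
                    if (!current.isEmpty()) {
                        sequences.add(current);
                        current = new ArrayList<>();
                    }
                } else {
                    current.add(line.trim());
                }
            }
        }
        // Keep the last sequence when the text does not end with a blank line.
        if (!current.isEmpty()) {
            sequences.add(current);
        }
        return sequences;
    }

    // Convenience overload for on-demand text: wraps the string in a StringReader.
    public static List<List<String>> readSequences(String text) {
        try {
            return readSequences(new StringReader(text));
        } catch (IOException e) {
            // Not expected for an in-memory reader; rethrown unchecked for simplicity.
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        String text = "Bill CAPITALIZED\nslept\n\nhere\n";
        for (List<String> sequence : readSequences(text)) {
            System.out.println(sequence);
        }
    }
}
```

The `Reader` overload still works with a `FileReader`, so a change along these lines would keep the file-based path intact while adding a text-based one.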