TeangaNLP / teanga2

Teanga a dó
Apache License 2.0
0 stars 0 forks source link

Corpus transforms #9

Closed jmccrae closed 5 months ago

jmccrae commented 8 months ago

We should be able to support lazy (perform at read) transformation of the corpus

# Word frequency after lower case
corpus.lower().freq("words")
# Word frequency based on reversed words
corpus.transform("text", lambda text[::-1]).freq("words")