langsari / quran-dataset

a collection of quran datasets from various sources
MIT License
7 stars 2 forks source link

NLTK REF #8

Open MidtorJya opened 3 years ago

MidtorJya commented 3 years ago

-NLTK HOWTOs http://www.nltk.org/howto/?fbclid=IwAR0Bkxu7OjzrFAhsFpjPNf1YvC_NnMoUzKVTDjrf5VR7GjpVuuo02HbpXds

-Language Processing and Python(ch1) https://www.nltk.org/book/ch01.html

-Accessing Text Corpora and Lexical Resources(ch2) https://www.nltk.org/book/ch02.html

-Process raw text(ch3) https://www.nltk.org/book/ch03.html?fbclid=IwAR1i2X13s4xeII2IsNEO504vWjm67PWiOiWT9ROyCtgePvD9a-XW_V08n2g

-Writing Structured Programs(ch4) https://www.nltk.org/book/ch04.html

-Categorizing and Tagging Words(ch5) https://www.nltk.org/book_1ed/ch05.html

-Regression Tests: Treebank Tokenizer http://www.nltk.org/howto/tokenize.html?fbclid=IwAR17mxZ7nHc94VnqJD3NM1NURXQNXnVHo2GdI0V6n81l7-GGy6pttjy7tpg

MidtorJya commented 3 years ago

-Natural Language Tool Kit - corpus, punctuation, stop words https://www.youtube.com/watch?v=gRk53jBPYvE&list=PL20GJ8grs0SVPsd9JCbHNkS-616wznVjL&index=6

-Natural Language Tool Kit - stemmer, tokenizer, POS tagger https://www.youtube.com/watch?v=RYgqWufzbA8&list=PL20GJ8grs0SVPsd9JCbHNkS-616wznVjL&index=7

-Natural Language Tool Kit - word2vec, clustering, classifying https://www.youtube.com/watch?v=rJmmh7kiNuQ&list=PL20GJ8grs0SVPsd9JCbHNkS-616wznVjL&index=8

**very useful