vaidap / library

DS4D Project on Encyclopaedia Britannica
0 stars 1 forks source link

Resources #3

Open vaidap opened 4 years ago

vaidap commented 4 years ago

Data Fair google doc: https://docs.google.com/spreadsheets/d/1NmxHYJ5zwznSzADEfPW6ssrs9uph7-78EFLnlcaZ--8/edit#gid=0

vaidap commented 4 years ago

Dumping all the links that seemed interesting, for looking at later

Case Studies

http://vallandingham.me/textvis-talk/#1 (!)

RegEx and cleaning data

https://programminghistorian.org/en/lessons/cleaning-ocrd-text-with-regular-expressions (!) https://sites.temple.edu/tudsc/2014/08/12/text-scrubbing-hacks-cleaning-your-ocred-text/ https://github.com/KBNLresearch/ochre https://towardsdatascience.com/correcting-text-input-by-machine-translation-and-classification-fa9d82087de1

Analysis

https://programminghistorian.org/en/lessons/counting-frequencies https://towardsdatascience.com/a-complete-exploratory-data-analysis-and-visualization-for-text-data-29fb1b96fb6a (!) https://en.wikipedia.org/wiki/Topic_model https://en.wikipedia.org/wiki/Named-entity_recognition https://en.wikipedia.org/wiki/Sentiment_analysis https://dida.do/blog/extracting-information-from-documents text summarization, POS tagging? https://monkeylearn.com/text-analysis/

SpaCy

https://nicschrading.com/project/Intro-to-NLP-with-spaCy/ https://www.analyticsvidhya.com/blog/2017/04/natural-language-processing-made-easy-using-spacy-%E2%80%8Bin-python/ https://stackoverflow.com/questions/45605946/how-to-do-text-pre-processing-using-spacy

vaidap commented 4 years ago

http://sentdex.com/sentiment-analysis/ https://archive.nytimes.com/www.nytimes.com/interactive/2012/09/06/us/politics/convention-word-counts.html#God https://www.tableau.com/trial/data-visualization

vaidap commented 4 years ago

Credit to @Haonan9607: https://medium.com/@colemiller94/topic-modeling-with-spacy-and-gensim-7ecfd3de95f4