DataKind-BLR/PrathamBooks-Sprint-2018
Code and documentation for the collaboration with PrathamBooks during Sprint 2018
MIT License
4 stars, 7 forks
issues
Add 2nd LDA model
#48
goelakash
closed
5 years ago
0
This reverts one commit for LDA model, which was not trained properly.
#47
goelakash
closed
5 years ago
0
lda_model trained over the entire illustration text
#46
yadavabhishekkumar
closed
5 years ago
0
Refactor: move LDA model trained with story_text only to a sub-folder inside model directory
#45
goelakash
closed
5 years ago
0
lda_model trained over the illustration_text
#44
yadavabhishekkumar
closed
5 years ago
0
ENH: Adding a model based on frequency and collations
#43
heaven00
closed
5 years ago
0
UI interface
#42
heaven00
closed
5 years ago
0
Saving model files and some OS related cleanup
#41
goelakash
closed
5 years ago
0
Train LDA model to suggest tags
#40
goelakash
closed
5 years ago
0
Train LDA model and infer top words
#39
goelakash
closed
5 years ago
0
ENH: Adding exact matching evaluation object
#38
heaven00
closed
5 years ago
0
Revert "Added a few utility scripts and a notebook to do LDA topic modelling."
#37
heaven00
closed
5 years ago
0
Exploratory analysis - cache and error clean-up
#36
TheDataAreClean
closed
5 years ago
0
Exploratory analysis code.
#35
TheDataAreClean
closed
5 years ago
0
Added a few utility scripts and a notebook to do LDA topic modelling.
#34
goelakash
closed
5 years ago
0
comparing various keyword extraction methods.
#33
nirmalsinghania2008
opened
5 years ago
0
Jupyter notebook for Gensim LDA and LSI
#32
SahilKuchlous
closed
5 years ago
1
keyword extraction using sommy
#31
yadavabhishekkumar
opened
5 years ago
2
Improving the handling of directory paths and pre-processing the text
#30
arnabbiswas1
closed
5 years ago
0
stories_pages.csv is malformed.
#29
Gunnvant
closed
5 years ago
1
clean_stories_data.py does not validate the input or output directory
#28
arnabbiswas1
closed
5 years ago
0
Create more text corpus
#27
arnabbiswas1
opened
5 years ago
0
Build a pipeline to process the text data (stories)
#26
arnabbiswas1
closed
3 years ago
1
POC on LDA for stories
#25
githubssn
closed
5 years ago
0
Tags Seeding/Validation Approach
#24
umeshprasadk
opened
5 years ago
0
Campaign Tags to be removed
#23
abmath
closed
3 years ago
2
Processing Approach
#22
abmath
opened
5 years ago
0
Summary approach-Abhinav Mathur
#21
qbera
closed
5 years ago
1
Apply TextRank (Summarization : Extraction Based Techniques) for English Stories
#20
arnabbiswas1
opened
5 years ago
0
Apply Latent Semantic Analysis (Summarization : Extraction Based Techniques) for English Stories
#19
arnabbiswas1
opened
5 years ago
0
Apply Non-negative Matrix Factorization (Topic Modeling) for English Stories
#18
arnabbiswas1
opened
5 years ago
0
Apply Latent Dirichlet Allocation (Topic Modeling) for English Stories
#17
arnabbiswas1
opened
5 years ago
2
Apply Latent semantic indexing (Topic Modeling) for English Stories
#16
arnabbiswas1
opened
5 years ago
1
Explore "Key Phrase (Word) Extraction" for English Stories content
#15
arnabbiswas1
opened
5 years ago
0
Added comments, help message and renamed the file.
#14
SahilKuchlous
closed
6 years ago
0
Script to extract content from html pages and then merge pages for each story
#13
arnabbiswas1
closed
5 years ago
1
POC TF-IDF For Stories
#12
githubssn
closed
6 years ago
5
Basic data exploration of story data
#11
githubssn
closed
6 years ago
1
Story Content Data Correction
#10
githubssn
closed
5 years ago
3
Added HTML cleanup script (issue #1)
#9
SahilKuchlous
closed
6 years ago
1
Measurable criteria for suitability of generated tags
#8
proxygeek
opened
6 years ago
2
Explore different tools/algorithms to auto-generate tags
#7
sacmax
closed
5 years ago
8
Updated the document
#6
arnabbiswas1
closed
6 years ago
0
Visualize content of stories (Wordcloud?)
#5
arnabbiswas1
opened
6 years ago
6
Analysis of existing tags and categories
#4
arnabbiswas1
opened
6 years ago
0
Basic Exploratory Analysis of Stories Data
#3
arnabbiswas1
closed
5 years ago
5
For a story, merge content from multiple pages
#2
arnabbiswas1
closed
5 years ago
3
For stories, text content needs to be extracted from html page content
#1
arnabbiswas1
closed
5 years ago
2