DataKind-BLR PrathamBooks-Sprint-2018 issues

DataKind-BLR / PrathamBooks-Sprint-2018

Code and documentation for the collaboration with PrathamBooks during Sprint' 2018

MIT License

4 stars 7 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Add 2nd LDA model

#48 goelakash closed 5 years ago
0
This reverts one commit for LDA model, which was not trained properly.

#47 goelakash closed 5 years ago
0
lda_model trained over the entire illustration text

#46 yadavabhishekkumar closed 5 years ago
0
Refactor: move LDA model trained with story_text only to a sub-folder inside model directory

#45 goelakash closed 5 years ago
0
lda_model trained over the illustration_text

#44 yadavabhishekkumar closed 5 years ago
0
ENH: Adding a model based on frequency and collations

#43 heaven00 closed 5 years ago
0
Ui interface

#42 heaven00 closed 5 years ago
0
Saving model files and some OS related cleanup

#41 goelakash closed 5 years ago
0
Train LDA model to suggest tags

#40 goelakash closed 5 years ago
0
Train LDA model and infer top words

#39 goelakash closed 5 years ago
0
ENH: Adding exact matching evaluation object

#38 heaven00 closed 5 years ago
0
Revert "Added a few utility scripts and a notebook to do LDA topic modelling."

#37 heaven00 closed 5 years ago
0
Exploratory analysis - cache and error clean-up

#36 TheDataAreClean closed 5 years ago
0
Exploratory analysis code.

#35 TheDataAreClean closed 5 years ago
0
Added a few utility scripts and a notebook to do LDA topic modelling.

#34 goelakash closed 5 years ago
0
comparing various keyword extraction methods.

#33 nirmalsinghania2008 opened 5 years ago
0
Jupyter notebook for Gensim LDA and LSI

#32 SahilKuchlous closed 5 years ago
1
keyword extraction using sommy

#31 yadavabhishekkumar opened 5 years ago
2
Improving the handling of directory paths and pre-processing the text

#30 arnabbiswas1 closed 5 years ago
0
stories_pages.csv, is malformed.

#29 Gunnvant closed 5 years ago
1
clean_stories_data.py does validate the input or output directory

#28 arnabbiswas1 closed 5 years ago
0
Create more text corpus

#27 arnabbiswas1 opened 5 years ago
0
Build a pipeline to process the text data (stories)

#26 arnabbiswas1 closed 3 years ago
1
POC on LDA for stories

#25 githubssn closed 5 years ago
0
Tags Seeding/Validation Approach

#24 umeshprasadk opened 5 years ago
0
Campaign Tags to be removed

#23 abmath closed 3 years ago
2
Processing Approach

#22 abmath opened 5 years ago
0
Summary approach-Abhinav Mathur

#21 qbera closed 5 years ago
1
Apply TextRank (Summarization : Extraction Based Techniques) for English Stories

#20 arnabbiswas1 opened 5 years ago
0
Apply Latent Semantic Analysis (Summarization : Extraction Based Techniques) for English Stories

#19 arnabbiswas1 opened 5 years ago
0
Apply Non-negative Matrix Factorization (Topic Modeling) for English Stories

#18 arnabbiswas1 opened 5 years ago
0
Apply Latent Dirichlet Allocation (Topic Modeling) for English Stories

#17 arnabbiswas1 opened 5 years ago
2
Apply Latent semantic indexing (Topic Modeling) for English Stories

#16 arnabbiswas1 opened 5 years ago
1
Explore "Key Phrase (Word) Extraction" for English Stories content

#15 arnabbiswas1 opened 5 years ago
0
Added comments, help message and renamed the file.

#14 SahilKuchlous closed 6 years ago
0
Script to extract content from html pages and then merge pages for each story

#13 arnabbiswas1 closed 5 years ago
1
POC TF-IDF For Stories

#12 githubssn closed 6 years ago
5
Basic data exploration of story data

#11 githubssn closed 6 years ago
1
Story Content Data Correction

#10 githubssn closed 5 years ago
3
Added HTML cleanup script (issue #1)

#9 SahilKuchlous closed 6 years ago
1
Measurable criteria for suitability of generated tags

#8 proxygeek opened 6 years ago
2
Explore different tools/algorithms to auto-generate tags

#7 sacmax closed 5 years ago
8
Updated the document

#6 arnabbiswas1 closed 6 years ago
0
Visualize content of stories (Wordcloud?)

#5 arnabbiswas1 opened 6 years ago
6
Analysis of existing tags and categories

#4 arnabbiswas1 opened 6 years ago
0
Basic Exploratory Analysis of Stories Data

#3 arnabbiswas1 closed 5 years ago
5
For a story, merge content from multiple pages

#2 arnabbiswas1 closed 5 years ago
3
For stories, text content needs to be extracted from html page content

#1 arnabbiswas1 closed 5 years ago
2