issues
search
BIDS-projects
/
topic-modeling
Categorization of various data science institutions into several different topics
Apache License 2.0
1
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Select the most likely topics for institutions
#32
don-han
opened
8 years ago
0
Implement a function that takes in a string and counts the urls from which those strings are coming from
#31
don-han
closed
8 years ago
1
Build a function that can find which url the topic words are coming from
#30
don-han
closed
8 years ago
0
Too many duplicate topics
#29
chewisinho
opened
8 years ago
0
Build 2-D or 3-D scatterplot for analytic aspect of visualization
#28
don-han
closed
8 years ago
1
Figure out where the rest of the boilerplates are coming from
#27
don-han
closed
8 years ago
1
Topic Modeling Result Thread
#26
don-han
opened
8 years ago
1
Changed 'url' to 'src_url'
#25
chewisinho
closed
8 years ago
0
Modified topic modeling to take in filtered data (reads from MongoDB)
#24
chewisinho
closed
8 years ago
0
Change database loading
#23
chewisinho
closed
8 years ago
0
remove geographical locations and unnecessary common nouns using NER
#22
don-han
closed
8 years ago
1
remove boilerplates using justText or other 3rd library
#21
don-han
closed
8 years ago
4
remove numbers/dates/times using regex
#20
don-han
closed
8 years ago
1
Get the most likely topics for each institutions
#19
don-han
opened
8 years ago
0
Better filtering of features
#18
don-han
closed
8 years ago
3
Fix document summarizer
#17
chewisinho
closed
8 years ago
0
Added summarizer
#16
chewisinho
closed
8 years ago
0
Added weighting functions
#15
chewisinho
closed
8 years ago
1
Implement weight function on lda.py
#14
don-han
closed
8 years ago
0
Organize the code for lda.py
#13
don-han
closed
8 years ago
0
Change LDA I/O organization
#12
chewisinho
closed
8 years ago
1
LDA weighting
#11
chewisinho
closed
8 years ago
0
Use Zipf's law weighing
#10
don-han
closed
8 years ago
2
Implement TF-IDF and apply on each document
#9
don-han
closed
8 years ago
0
Finish iterative stopword generative LDA
#8
don-han
closed
8 years ago
1
still having trouble with textmining
#7
don-han
closed
8 years ago
1
clean up bidslda.py
#6
don-han
closed
8 years ago
0
Write LDA for Apache Spark
#5
don-han
closed
8 years ago
1
Filter words
#4
don-han
closed
8 years ago
0
reorganize/rename files
#3
alvinwan
closed
8 years ago
0
Implement an algo that takes in MockItem
#2
don-han
closed
8 years ago
0
Create a MongoDB Loader
#1
don-han
closed
8 years ago
0