tanussingh/Big-Data-Management-Analytics-Project
Final Project for CS 6350.001 - Large Scale Data Collection and Preprocessing in Spark
3 stars, 2 forks
issues
#11 Crawl new sites using Scrapy (opened 5 years ago by ishansharma, 1 comment)
#10 Make overall structure for deduplication algorithm (opened 5 years ago by ishansharma, 2 comments)
#9 Integrate Spacy with Spark (opened 5 years ago by ishansharma, 0 comments)
#8 Integrate Gensim with Spark (opened 5 years ago by ishansharma, 0 comments)
#7 Figure out how to do Doc2Vec with either Gensim or Spacy (opened 5 years ago by ishansharma, 1 comment)
#6 Integrate Spacy with Spark (closed 5 years ago by ishansharma, 0 comments)
#5 Integrate UDPipe with Spark (opened 5 years ago by ishansharma, 0 comments)
#4 Find out how to do NER with Spacy (closed 5 years ago by ishansharma, 1 comment)
#3 Find a way to compare articles based on UDPipe output (opened 5 years ago by ishansharma, 2 comments)
#2 Figure out how to store UDPipe output (opened 5 years ago by ishansharma, 0 comments)
#1 Setup Mongo → Kafka → Spark → Kafka → Mongo Streaming (opened 5 years ago by ishansharma, 1 comment)