issues
search
dviator
/
WeSaidSheSaid
For the development of an application to store, document, and analyze political speeches and the media's interpretation of them.
1
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Fix referential integrity when storing Candidates Names
#43
dviator
opened
9 years ago
0
Put speech translations into text files, point database to text file
#42
aistein
opened
9 years ago
1
Investigate github hosting of speeches
#41
dviator
opened
9 years ago
0
Use http requests to grab full page DOM instead of selenium
#40
dviator
closed
9 years ago
1
Make two databases that can be toggled between for Dev and Persistent Data
#39
dviator
opened
9 years ago
1
Have run.sh spit out a message to command line when scrapy is complete
#38
aistein
closed
9 years ago
0
Validate State Names when collecting location metadata
#37
dviator
opened
9 years ago
0
Make an automated Install for an external server that includes all dependencies
#36
dviator
opened
9 years ago
0
Add Config File option for setting highest logging level for crawler (DEBUG to CRITICAL)
#35
aistein
opened
9 years ago
0
Run Scrapy from a script
#34
dviator
closed
9 years ago
0
Figure out why scraper quits halfway through at random points
#33
dviator
closed
9 years ago
1
Background scrapy process
#32
dviator
closed
9 years ago
0
Account for varying candidate name's in meta-data scraping
#31
aistein
closed
9 years ago
0
Hide Browser window while scraping
#30
dviator
closed
9 years ago
0
Validate Speaker Before Transcribing Video
#29
dviator
closed
9 years ago
1
Figure out how to deal with links to video playlists
#28
aistein
closed
9 years ago
0
Improve speaker validate logic to accept certain matching subsets
#27
dviator
closed
9 years ago
1
Convert printed Debug output to logger classes throughout project
#26
dviator
closed
9 years ago
1
research ORM's (object relational mappers) for putting database on webpage
#25
aistein
opened
9 years ago
0
Read Legal stipulations / licensing agreements from CSPAN
#24
aistein
opened
9 years ago
0
Fix ghost error that occurs while scraping and translating speeches
#23
aistein
opened
9 years ago
2
Database needs secure version for production usage
#22
dviator
opened
9 years ago
0
Turn directory structure into proper python package
#21
aistein
closed
9 years ago
1
General Database Validation
#20
aistein
closed
9 years ago
0
Transcribe Speech data before insert
#19
dviator
closed
9 years ago
1
Format timestamps before db insert
#18
dviator
closed
9 years ago
1
Crawler needs to grab the correct name from 'candidates' item property (possibly having multiple candidate values), before pushing into database
#17
aistein
closed
9 years ago
0
Need to figure out how to escape quotes in InsertCandidates for Mike O'Malley
#16
dviator
closed
9 years ago
1
Database connection could be optimized
#15
dviator
closed
9 years ago
0
In Videos with multiple speakers, identify speech text by candidate
#14
dviator
opened
9 years ago
0
Transcriber Needs to Validate it's Input
#13
dviator
opened
9 years ago
0
Crawler needs Unit Tests
#12
dviator
opened
9 years ago
0
Need to manage dependencies for project
#11
dviator
opened
9 years ago
3
Crawler needs to write to DB
#10
dviator
closed
9 years ago
0
Crawler needs to grab speech metadata along with link
#9
dviator
opened
9 years ago
2
Crawler needs to get to 'show more videos' section of search
#8
dviator
closed
9 years ago
4
Crawler needs to validate links somehow
#7
dviator
closed
9 years ago
2
Transcriber needs to redirect it's stdout to a log file and directory
#6
dviator
closed
9 years ago
0
Test Class Needs to Cleanup after itself.
#5
dviator
opened
9 years ago
0
Scraper for CSPAN
#4
dviator
closed
9 years ago
4
Additional Cleanup on speech output
#3
dviator
opened
9 years ago
1
Turn comparePhrase.py into a class
#2
dviator
closed
9 years ago
1
Refine post_processing.py for cc to speech text and test
#1
dviator
closed
9 years ago
3