Open easherma opened 7 years ago
For now we're using Jupyter/iPython Notebooks and reading the flat files we wrote to disk.
The slowest part is having Beautiful Soup parse the xml, but loading transcripts into Pandas and then searching them seems pretty snappy. We're not dealing with a ton of data here and the transcripts won't really change, so I'm contemplating going with a NoSQL/document based solution over something more heavyweight.
Need a way to search keywords, KISS, TBD
This goes along with determining a database structure