For the MVP we intend to keep things simple with just a frontend over elastic search, but we don't want to potentially pollute elastic search when we import new stuff. We will first store all the disaparate data sources in (SQL or something else?) and make an ETL to Elastic Search in Python.
Outcomes
Ingestion
We will have a command line script that will let anyone to injest data in our system
Well defined and documented workflow for adding new documents in the elastic search
Content
Scripts for various sites from our sources to crawl and index data in Elastic search
For the MVP we intend to keep things simple with just a frontend over elastic search, but we don't want to potentially pollute elastic search when we import new stuff. We will first store all the disaparate data sources in (SQL or something else?) and make an ETL to Elastic Search in Python.
Outcomes
Ingestion
Content