API server of the cs-insights project. This is the main part of storing data and accessing an external data analysis endpoint. It uses a mongoDB instance to store everything and queries the cs-insights-prediction-endpoint to get machine learning results.
Is your feature request related to a problem? Please describe.
Issue https://github.com/ag-gipp/NLP-Land-frontend/issues/22 requires some things to be implemented in the backend. If we refer to topics in this issue, we mean word/gram based analysis.
Describe the solution you'd like
Implement the following endpoints:
[ ] Top k topics per year measured for all years
[ ] Most occurring unigrams in titles and their count (paged)
[ ] Most occurring unigrams in abstracts and their count (paged)
[ ] Optional: Most occurring bigrams in titles/abstracts and their count (paged)
[ ] All endpoints should support filters
[ ] Optimizations, e.g. caching, because the backend might struggle with these calculations
Describe alternatives you've considered
Postpone the implementation until the prediction endpoint is ready and run calculations there.
Is your feature request related to a problem? Please describe. Issue https://github.com/ag-gipp/NLP-Land-frontend/issues/22 requires some things to be implemented in the backend. If we refer to topics in this issue, we mean word/gram based analysis.
Describe the solution you'd like Implement the following endpoints:
Describe alternatives you've considered Postpone the implementation until the prediction endpoint is ready and run calculations there.