ORNL / DataFed

A Federated Scientific Data Management System
https://ornl.github.io/DataFed/
Other
18 stars 14 forks source link

Search: Improve Data Search Interface #596

Closed dvstans closed 3 years ago

dvstans commented 3 years ago

The current data search approach for personal, project, and shared data needs to be simplified and made more performant on the back-end. Currently, the formed queries are a complex join between flat indexes and multiple graph traversals. Need to analyze the search scopes and design a way to flatten the entire search into a single index (the same way that the catalog search works).

Collection-specific searches will need to be handled separately and may need to be restricted in some ways (to avoid joins and limit execution time).

The web portal UI, CLI, and Python API will all need to be updated to work with the new approach.

This is an EPIC issue that will have other issues linked to it.

dvstans commented 3 years ago

To apply an arango view to personal data search, it must support several starting points / filter options: