data-dot-all / dataall

A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS.
https://data-dot-all.github.io/dataall/
Apache License 2.0
233 stars 82 forks source link

Allow searching datasets by bucket name #1437

Open clayreimann opened 3 months ago

clayreimann commented 3 months ago

Scenario I would like to be able to search for datasets by bucket name. When I find code that is using the same data that I want to use I don't have any references to data.all registration, but I do have a bucket name.

Solution When searching datasets, if I provide the exact name of an s3 bucket used by a dataset I would like the search to return that, and only that, dataset.

I think that fuzzy matching or prefix matching might be a desirable extension to this. However those options should have a switch for administrators to disable, should the administrators wish to obfuscate their bucket naming schemes.

dlpzx commented 3 months ago

This is a very cool feature! Thanks @clayreimann for opening an issue. Do you have bandwidth to work on it? I believe it is a good first issue. Otherwise we can pick it up