biodiversitydata-se / biologging-sensor-client

SBDI Biologging Sensor project - Client
GNU General Public License v3.0

Dataset with embargo (isPublic=false) : download button, data available, ... #108

Open mathieuLU opened 1 month ago

mathieuLU commented 1 month ago

2 cases when the download button should not be displayed :

display the embargo date instead ?

in the snippet, the embargo should be mentioned

Make sure that the API cannot serve the data publicly when isPublic=false ? How do we make it available in the client then ?

maybe a subset should be provided ?

in any case, the "total records" in the dataset listing is supposed to show the real number of records in the dataset, not the number of records currently in the database
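A minimal sketch of the snippet rule described in this comment, assuming the dataset summary carries the `isPublic` flag plus an embargo end date. The names `DatasetSummary`, `embargo_end` and `snippet_action` are hypothetical and only illustrate the idea; they are not taken from the client code.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass
class DatasetSummary:
    title: str
    is_public: bool                      # mirrors the isPublic flag from the issue
    embargo_end: Optional[date] = None   # hypothetical field name


def snippet_action(ds: DatasetSummary) -> str:
    """Return what the snippet shows where the download button would be."""
    if ds.is_public:
        return "download"
    if ds.embargo_end is not None:
        # No download button: show the embargo date instead.
        return f"Under embargo until {ds.embargo_end.isoformat()}"
    # Not public and no date known: still no download button.
    return "Not publicly available"
```

For example, `snippet_action(DatasetSummary("Terns", False, date(2025, 1, 1)))` would give the embargo message instead of the download action.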

mathieuLU commented 3 weeks ago

ok so for a public dataset (no embargo, isPublic=true), we have this now : snippet => Screenshot from 2024-09-19 10-58-18 ; detail => Screenshot from 2024-09-19 10-57-45

and for a dataset with embargo (isPublic=false) : snippet (no button + embargo date) => Screenshot from 2024-09-19 10-58-12 ; detail => Screenshot from 2024-09-19 10-57-57

does that look better, @DeboraArlt @blacksparrowhawk ?

DeboraArlt commented 2 weeks ago

looks ok for me. perhaps would be better with an explicit statement about the data set not being currently public, but we can go for this easy solution now. and see what users say.

mathieuLU commented 2 weeks ago

ok so the question is still open. How do we deal with datasets under embargo (isPublic = false) ? Will the data be in the database anyway, with some authentication/filter needed ? How do we use the visualisation tools ? Do we only make a data sample available to download ? etc...

several options :

1- all the data is in the database, but not available to download/retrieve from the API. Which means that the web client (which is using the API) can't use the data for the visualisation tool. So we need to specify a way for the client to access the data anyway. And do we want the client to access the first data available as usual ? (maybe a risk of displaying protected data ?)

2- only a sample is available in the database. This way, the database remains 100% public. No need to authenticate/filter or anything. But it means that a sample has to be created when we publish the dataset. And then it has to be clearly specified somewhere that only a sample is available. Which means that we may display both the "total number of records of the dataset" (for real) and the "total of records AVAILABLE in the database"

3- maybe a mix of these 2 ? hidden data in the database + public sample available for download/API ?
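To make option 3 concrete, here is a rough sketch of the filter the public API would need, assuming each record carries a flag saying whether it belongs to the public sample. `Record`, `in_public_sample` and `public_records` are hypothetical names, not existing code.

```python
from dataclasses import dataclass
from typing import Iterable, List, Set


@dataclass
class Record:
    dataset_id: str
    value: float
    in_public_sample: bool = False  # hypothetical flag set when the sample is created


def public_records(records: Iterable[Record], public_dataset_ids: Set[str]) -> List[Record]:
    """Keep records from public datasets, plus sample records from embargoed ones."""
    return [
        r for r in records
        if r.dataset_id in public_dataset_ids or r.in_public_sample
    ]
```

With option 2 this filter disappears, because only public and sample records are stored in the first place.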

@blacksparrowhawk what are your views on that ?

mathieuLU commented 2 weeks ago

i add to the discussion the comment that Debora wrote :

` I don’t know exactly how it works, but for SOS they talk about the public index, and a sensitive/protected index. I think it’s a bit like two separate databases. Normally one gets only data from the public index. But if you have permission you also get data from the protected index (according to what your permissions allow access to).

For me option 2 works too. The private/embargo dataset provides a subset that is available public. But it seemed like Johan did not quite like this option. But let’s say the subset is only one animal. One could also select just part of the total time series, e.g. just half a year instead of a full annual cycle, i.e. a simple subset for illustrative purpose. I cannot see that anyone would/could misuse that data. `

DeboraArlt commented 2 weeks ago

From the discussion today 24 Sept (Mathieu and Debora) it sounds like the easier option is having only public data in our database. That's the data we display and that one can access via the API. No permissions etc. needed. It would be good though to have public sample data from the non-public (embargo) datasets. This sample data could be explored (graphs) and also downloaded. The data owner decides what he/she wants to release as a sample - could be data for one animal only, and for a selected time period and/or selected location only, i.e. data that the owner deems non-sensitive to show in graphs and have downloadable via the API. I suggest adding this to the metadata - so that the metadata include information about the total no. of records in the dataset, and some information about what the selected public sample is. I will have a look at our metadata form and see where this could be added.
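As an illustration of the metadata suggestion, a small sketch assuming the metadata stores the real total number of records and a free-text description of the public sample. The field names (`total_records`, `sample_description`) and the label wording are assumptions, not the actual metadata form.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class DatasetMetadata:
    title: str
    is_public: bool
    total_records: int                        # real size of the dataset, given by the owner
    sample_description: Optional[str] = None  # e.g. "one animal, Jan-Jun" (free text)


def records_label(meta: DatasetMetadata, available_records: int) -> str:
    """Text for the dataset listing: real total, plus what is actually in the database."""
    if meta.is_public:
        return f"{meta.total_records} records"
    label = f"{meta.total_records} records in total, {available_records} available as public sample"
    if meta.sample_description:
        label += f" ({meta.sample_description})"
    return label
```

This keeps the "total records" in the listing tied to the metadata rather than to a count of what is in the database.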

mathieuLU commented 6 days ago

has been decided :

mathieuLU commented 5 days ago

ok so :