ices-eg / wg_WGSFDGOV

Working Group on Spatial Fisheries Data Governance
http://www.ices.dk/community/groups/Pages/WGSFDGOV.aspx

Development of an anonymity summary of VMS data outputs #6

Open colinpmillar opened 4 years ago

colinpmillar commented 4 years ago

Rationale:

Suggested sections:

The following is taken from WGSFD 2019 (https://doi.org/10.17895/ices.pub.5648):

Further: ICES WGSFD recommends that the following guidelines are followed when publishing data:

  1. Swept area ratios (SAR) are not sensitive and can be published, even if there are fewer than 3 vessels within the aggregation. This information cannot be used to identify individual vessels.
  2. If there is a need to publish data with fewer than 3 vessels within the aggregation level, the data values can be classified, so that only groups are published (e.g. kW groups) that are wide enough that individual vessels can’t be identified.
  3. Published data should not include information that can be used to infer the suppressed value (e.g. if the value of a single unit is suppressed but the total value is published then the suppressed value might be calculated).
  4. One solution for publishing sensitive data, mentioned in the ICES data call, is that data are only made public at ICES rectangle level, but it would be possible to give information on the empirical distribution of values within each ICES rectangle.
  5. Special requests for advice, for example the current work in support of WKBEDPRES2, can be addressed with data calls to national labs to produce rasters from point data which can then be aggregated at an international level.
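
As a rough illustration of guidelines 2 and 3, here is a minimal Python sketch. The record layout `(cell_id, vessel_id, value)`, the function names and the class breaks are all hypothetical and are not taken from the WGSFD report or any ICES code. Cells with at least 3 vessels keep their exact value; cells with fewer get a class label instead.

```python
from collections import defaultdict

MIN_VESSELS = 3  # threshold quoted in the guidelines above


def aggregate_cells(records):
    """Aggregate per-vessel records into one row per cell.

    records: iterable of (cell_id, vessel_id, value) tuples (hypothetical layout).
    """
    vessels = defaultdict(set)
    totals = defaultdict(float)
    for cell, vessel, value in records:
        vessels[cell].add(vessel)
        totals[cell] += value
    return {
        cell: {"n_vessels": len(vessels[cell]), "value": totals[cell]}
        for cell in totals
    }


def classify(value, breaks):
    """Return the label of the class containing value, e.g. '100-500'."""
    lower = 0
    for upper in breaks:
        if value < upper:
            return f"{lower}-{upper}"
        lower = upper
    return f">={lower}"


def publishable(cells, breaks):
    """Exact values for cells meeting the threshold, class labels otherwise."""
    out = {}
    for cell, row in cells.items():
        if row["n_vessels"] >= MIN_VESSELS:
            out[cell] = row["value"]
        else:
            out[cell] = classify(row["value"], breaks)
    return out
```

The sketch deliberately returns no overall total: publishing one alongside the exact cells would let a reader back-calculate a classified value (guideline 3). Whether the classes are "wide enough" still has to be judged per data set.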

neil-ices-dk commented 4 years ago

Thanks @colinpmillar. I have some doubts about the 'steps taken', as we should ensure we are taking the steps outlined in the data call/VMS licence and any other agreement. This is 'standard', so we would refer to these boilerplate texts. But do you mean a kind of checklist to ensure that these things have been adhered to?

colinpmillar commented 4 years ago

Those are the lines I was thinking along. Best case scenario, the aggregation is high enough that we don't need to do anything, and we can report that. But where we do have cells with anonymity issues, we list the standard steps that we had to take (referencing WGSFD 2019 or another reference). I think it would be much safer if WGSFDGOV could review the procedure and report; then we can run this as a standard output for any VMS data request, maybe.
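
To make the idea concrete, a standard output along these lines could be sketched in Python as below. The function name, the input (a mapping of cell id to vessel count) and the wording of the steps are all assumptions for illustration, not agreed text or an existing tool.

```python
def anonymity_summary(vessel_counts, min_vessels=3):
    """Summarise anonymity issues for one data request.

    vessel_counts: dict of cell_id -> number of distinct vessels (hypothetical input).
    """
    # Cells below the vessel threshold are the ones needing action.
    flagged = sorted(c for c, n in vessel_counts.items() if n < min_vessels)
    if not flagged:
        steps = ["No action needed: every cell meets the vessel threshold."]
    else:
        # Illustrative wording only; the real text would come from the
        # data call / licence and the WGSFD 2019 guidelines.
        steps = [
            f"{len(flagged)} cell(s) below the threshold of {min_vessels} vessels.",
            "Values in these cells classified into groups (WGSFD 2019, guideline 2).",
            "Totals that would reveal classified values withheld (WGSFD 2019, guideline 3).",
        ]
    return {"n_cells": len(vessel_counts), "flagged": flagged, "steps": steps}
```

For example, `anonymity_summary({"A": 5, "B": 2})` flags cell `"B"` and lists the steps to report, while a request where every cell has 3 or more vessels returns the single "no action needed" line.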

colinpmillar commented 4 years ago

@christianvdorrien suggests developing a draft of this for the December WGSFDGOV meeting using the data required for a previous HELCOM request, so we can compare the output.

colinpmillar commented 3 years ago

draft report on anonymisation is here: https://github.com/ices-eg/wg_WGSFDGOV/blob/master/anonymity/report.html

and a preview can be seen here: https://htmlpreview.github.io/?https://github.com/ices-eg/wg_WGSFDGOV/blob/master/anonymity/report.html

Unfortunately it looks like the interactive plots are not showing, so I can demonstrate by screen sharing. I think if you download this file it will work when opened in a browser.

colinpmillar commented 3 years ago

Note: the OSPAR product should be published in summer 2021.

colinpmillar commented 3 years ago

Hi @ices-eg/wgsfdgov Just to check I am on the right track,

It would be good to produce a list of priorities / criteria, such as:

Then we can summarise each procedure based on these priorities / criteria.

Anyway, thoughts would be good, so just pop them in as comments on this thread - cheers!

jensr commented 3 years ago

Hi @colinpmillar That sounds really good. I think one of the things I was getting a little confused about yesterday was whether the tool is for decision support only, or if you are looking to do it all in one go. One of the key things for WGSFDGOV to understand is probably the decision criteria, and to ensure these align with data policy/licenses/limitations in general, whereas WGSFD have the expertise to define what is useful and necessary.