DataONEorg / metrics-service

An efficient database and REST API for delivering aggregated data set metrics to clients.
Apache License 2.0
2 stars 1 forks source link

Unique data downloads (file by user/ipaddress) #52

Open vchendrix opened 5 years ago

vchendrix commented 5 years ago

As I understand it the COUNTER implementation, uses a 60 minute time limit to determine a unique download. We have an additional uses case where we merely want to know the unique download without the time component. Basically we want counts of the file by a user or ip address if anonymous. The time component is not as important because the same user may be downloading the same file multple times but we only want that counted once. Would it be possible to get this statistic?

rushirajnenuji commented 5 years ago

Hey @vchendrix - we discussed this issue today - during our regular MDC meeting.

We decided that it would be possible for us to add this support to the metrics-service API. It should be fairly straight-forward - we'll be looking for the subjects associated with the event log, and if the subject is missing (or anonymous), we'll fall back to IP address.

@csjx updated us that you and Cory and other members of ESS_DIVE team are working on resolving the issue of getting the correct subject and IP address associated with the log event. This will definitely help us add support of generating unique metrics by subject + IP address.

Will keep you updated with the progress of this feature.

vchendrix commented 5 years ago

@rushirajnenuji Thanks for the update! We have a fix for the ip address problem in ESS-DIVE. We just need to schedule a release.