mysociety / popit

DEPRECATED - Development on PopIt has stopped and it is no longer being maintained
https://goo.gl/Vvej4Q
Other
76 stars 33 forks source link

Allow for attribution of all data (per-field sourcing) #26

Open evdb opened 12 years ago

evdb commented 12 years ago

Ensure that this suggestion is implemented:

Make sure you include sources i.e. make it fundamental in the user interface that people put the URL and/or a text description of where they got data from.

This is vital for provenance and for later data updating. And most importantly, for the user interface to add credibility to sites made using the data.

mhl commented 10 years ago

Notes from the backlog triage session at the all team meeting: "see #279 - probably sources of information should be attached to changes, so related to the versioning work"

clkao commented 10 years ago

+1

kaerumy commented 10 years ago

This is currently a blocker for Sinar work on representative database. We have several data sources to help build a complete picture of our representatives, and these sources must be attributed to maintain integrity (non technical) of data that is being used by everybody else.

paullenz commented 10 years ago

As this is a blocked for you I have tagged it as contender for review to be added to the next sprint (starts in a week)

kaerumy commented 10 years ago

Thank you for making this a contender.

paullenz commented 10 years ago

Speaking to Sinar - for their needs just an additional field that would enable them to list attribution source would be sufficient for now - more complex versioning is not essential

This is needed ASAP (faster than the multiple language option)

chrismytton commented 10 years ago

Pull request for simplified version of this using popolo source fields is here - https://github.com/mysociety/popit/pull/680.

chrismytton commented 10 years ago

@kaerumy I've just pushed a basic implementation of source fields live. This adds an extra Sources tab to the person/organization view/edit pages. This follows the Popolo metadata format of having each source be a url with a short description.

screen shot 2014-11-13 at 12 55 41

I'm going to leave this ticket open as we plan to do more comprehensive attribution based on versioning in the future, but hopefully this is a useful initial version! :+1:

kaerumy commented 10 years ago

This is good for some of our initial import of representatives, but we will eventually need source per field in near future. https://sinar-malaysia.popit.mysociety.org/organizations/5474018888eca8ff1ed43367#sources is not very useful when we want to track for example the source of posts held/change for each person/post.

Consumer applications like our Accountability tracker will need to easily pull specific source for changes in posts held, or start/end of position. Currently it will have to pull in all Sources and leave it to the user to figure out which one was used to verity the information.

zarino commented 9 years ago

Hi @kaerumy – If you had a way to record sources for each individual change, would you expect it to be optional, or required?

Also, if users were asked for a source only once, when they finally press the "Save changes" button, rather than individually per input field – would that be ok for your use case? If it wouldn't, could you explain why?

zarino commented 9 years ago

Just to add some context, YourNextMP (which is a user interface built on top of the PopIt API) has already made its own decisions about both of these points.

YourNextMP requires you to provide a source whenever you make any changes.

But it doesn't ask per individual field. Instead, on the candidate editing page (which is closest to the default PopIt person/organisation page) there's a single "Source" field next to the "Save" button at the bottom of the page:

screen shot 2015-01-19 at 16 44 53

This means YourNextMP can't track individual sources for each individual change, but it can record a source for each value changed at the same time (what I'd call an "editing session", or in version control might be called a "check-in" or a "commit").

Looking at the diffs on YourNextMP it appears most people only change one thing when they edit a candidate—maybe adding an email address or a twitter profile, or adding a membership—so recording one source per editing session, rather than per field, doesn't make much of a difference. But it does make it a lot simpler for users doing the editing.

The question is, @kaerumy, does Sinar Project need more granularity than that?

kaerumy commented 9 years ago

Hi @zarino, in short we're building a single centralized database of thousands of people & organizations from various verified sources, that will be used in by different applications as well as a research tool for investigative journalists for years to come.

zarino commented 9 years ago

Thanks for the feedback @kaerumy. You didn't quite answer my questions, but can I assume that your answers would be..?

Is that accurate?

kaerumy commented 9 years ago

Yes correct on both.

zarino commented 9 years ago

For mySociety peeps – here's a run-down of my work on this so far:

https://docs.google.com/a/mysociety.org/presentation/d/1msmugkIGh3i1v25gYoq4DNULwyhcd7mGHtPy05u0FwQ/edit

akuckartz commented 9 years ago

This is about provenance and could use PROV-O (http://www.w3.org/TR/prov-o/).

akuckartz commented 9 years ago

PROV Primer: http://www.w3.org/TR/2013/NOTE-prov-primer-20130430/

chrismytton commented 9 years ago

Just a quick update on where we're at with this: We haven't forgotten about this ticket! We've done some work towards implementing it but it became clear along the way that the implementation is too coupled to the UI.

For the time being we've put this ticket on hold while we focus on decoupling the API and the UI, which is being tracked in #837.

Once we've made some progress with the API and UI decoupling then I want to come back to this ticket and hopefully the implementation will be much cleaner/clearer.

kaerumy commented 8 years ago

We've implemented this feature in our replacement version of Popit here https://github.com/Sinar/popit_ng