ThreeSixtyGiving / datatester

Scripts to assess the quality of data from http://data.threesixtygiving.org
MIT License

metadata - filename ? #21

Closed stevieflow closed 6 years ago

stevieflow commented 6 years ago

I don't think we output the actual filename in the metadata - it's part of the download URL - eg:

http://abcharitabletrust.org.uk/data/abct-data-february-2018.xlsx

It could be worth us outputting the filename specifically - especially as filenames sit at the end of varying URL patterns, which makes them difficult for some users to gather
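For illustration, a minimal sketch of pulling the filename out of a download URL, assuming the filename is simply the last path segment. The helper name `filename_from_url` is hypothetical and not part of datatester:

```python
from urllib.parse import unquote, urlsplit


def filename_from_url(url: str) -> str:
    """Return the last path segment of a URL, or '' if the path ends in '/'."""
    # urlsplit separates the path from any query string or fragment,
    # so "?download=1" style suffixes do not end up in the filename.
    path = urlsplit(url).path
    # Take everything after the final slash and decode %-escapes.
    return unquote(path.rsplit("/", 1)[-1])


print(filename_from_url(
    "http://abcharitabletrust.org.uk/data/abct-data-february-2018.xlsx"
))
```

This only reflects what the URL says; it makes no claim about the name the file has once downloaded.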

robredpath commented 6 years ago

I'm not sure if anyone currently does this, but the file name at the end of the URL isn't necessarily the name of the file that gets downloaded to your computer. For example - GrantNav's http://grantnav.threesixtygiving.org/api/grants.json will download a file named with the format grants-TIMESTAMP.json
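One common way a server sets a download name that differs from the URL is the `Content-Disposition` response header. Whether GrantNav uses that header is an assumption here, and the header value below is a placeholder rather than a real response. A rough sketch of preferring the header and falling back to the URL, with a hypothetical helper `download_name`:

```python
from email.message import Message
from typing import Optional
from urllib.parse import urlsplit


def download_name(url: str, content_disposition: Optional[str] = None) -> str:
    """Guess the saved filename: header value first, URL path second."""
    if content_disposition:
        # Reuse the stdlib header parser instead of splitting by hand.
        msg = Message()
        msg["Content-Disposition"] = content_disposition
        name = msg.get_filename()
        if name:
            return name
    # No usable header: fall back to the last path segment of the URL.
    return urlsplit(url).path.rsplit("/", 1)[-1]


url = "http://grantnav.threesixtygiving.org/api/grants.json"
# Placeholder header value, following the grants-TIMESTAMP.json format above.
print(download_name(url, 'attachment; filename="grants-TIMESTAMP.json"'))
print(download_name(url))
```

The two calls give different names for the same URL, which is the mismatch described above.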

What would be the use case for knowing the filename, @stevieflow ?

stevieflow commented 6 years ago

I think the use case would be something like:

As a data user I want to check the Titles and Filenames of datasets, in case there is any extra information I can use.

It's a flimsy case, I agree - example could be:

@drkane might have ideas

robredpath commented 6 years ago

@stevieflow - I'm not sure that's something we want to encourage! Anything useful should be in the metadata - especially as the filename is lost if someone converts the data.

drkane commented 6 years ago

I'd agree - doesn't seem helpful to put the filename in the metadata when it could so easily change.

stevieflow commented 6 years ago

OK no problem - we can forget this!

@KDuerden and I spoke about how filename might be useful, in terms of any potential information that could be gleaned. As with https://github.com/ThreeSixtyGiving/registry/issues/27 I guess our first tactic could be to look at some guidance in terms of naming conventions, anyway.