Open cmsdroff opened 9 years ago
data package at https://github.com/warrantgroup/IMO-Vessel-Codes.git
Great. Could you do the quality assurance step detailed at:
http://data.okfn.org/doc/core-data-curators#3-quality-assurance
Basically post a validation and view link for the data package.
@rgrp valid and displays correctly
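For anyone wanting a quick local sanity check before posting the validation link, a minimal sketch follows. It is not the official validator: the keys it checks (`name`, `resources`, license info) are an assumption about what matters for a core dataset, and the file path in the usage example is hypothetical.

```python
def check_descriptor(descriptor):
    """Return a list of problems found in a datapackage.json-style dict.

    An empty list means these basic checks passed. This is only a rough
    local check, not a replacement for the real validation step.
    """
    problems = []

    # Assumed minimum: a package name.
    if "name" not in descriptor:
        problems.append("missing required key: name")

    # Assumed minimum: at least one resource that points at some data.
    resources = descriptor.get("resources")
    if not isinstance(resources, list) or not resources:
        problems.append("resources must be a non-empty list")
    else:
        for i, res in enumerate(resources):
            has_source = isinstance(res, dict) and any(
                key in res for key in ("path", "url", "data")
            )
            if not has_source:
                problems.append(f"resource {i} has no path, url, or data")

    # License information may appear under either key.
    if "licenses" not in descriptor and "license" not in descriptor:
        problems.append("no license information")

    return problems
```

Usage would look something like `check_descriptor(json.load(open("datapackage.json")))`, printing whatever problems come back.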
My only question is about the license. The data is scraped from the mentioned sources using a script. The last package we published, the UN CEFACT Package Codes, went out under the PDL license; would this be the same?
Looks great and ready to go.
Re license: I suggest following the approach in http://data.okfn.org/doc/publish-faq#-strong-license-strong-
Specifically, I would apply the PDDL license but note the point about scraping. This goes in a section called "License" (see the FAQ for details).
Thanks @rgrp, I've applied the PDDL license and revised the README.md file to reflect the formatting suggestions.
We will scrape the full data this week and then let you know. I guess we are then ready to merge?
@cmsdroff that sounds right. My one other suggestion is that if you have scraping scripts you may want to add them to the repo in the scripts/ directory and add a README.md in the scripts directory detailing how to use them.
@cmsdroff how are you folks doing here?
We have most of the data. The site we scraped from implemented some restrictions to try to prevent scraping. We will reattempt on Friday with a workaround and, if that fails, will grab the data from one of the other sites.
Keep you posted Friday.
@cmsdroff This seemed almost done. What is the current status?
We will have this in January. We are working on a different data source, as we need it for our internal software. Will keep you posted when it is ready to transfer.
@cmsdroff any update here? The dataset here https://github.com/warrantgroup/IMO-Vessel-Codes looks good. Can we migrate it across?
We should just copy this over to core datasets for now and publish on datahub.io
@rgrp as discussed on issue #108 moved to separate issue for packaging.
Data set is the IMO Vessel Codes for all vessel types that have an IMO code.
We have scraped just under 10,000 data items and will do a full scrape just before publishing. The data provided so far is there to confirm we meet the packaging requirements, as a full scrape will take a couple of hours due to the amount of data and the API limits in place.
Can you confirm all is OK? We will then upload the final data before you merge.
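On the point about the full scrape taking a couple of hours because of API limits: a minimal sketch of how the requests could be spaced out is below. It is not the actual scraping script; `scrape_all`, `fetch`, and `min_interval` are hypothetical names, and the real limits of the source site are not known here.

```python
import time


def scrape_all(ids, fetch, min_interval=1.0, sleep=time.sleep, clock=time.monotonic):
    """Call fetch(item_id) for each id, keeping calls at least min_interval seconds apart.

    `fetch` is whatever function retrieves one record (hypothetical here).
    `sleep` and `clock` are injectable so the pacing can be tested without waiting.
    """
    results = {}
    last_call = None
    for item_id in ids:
        if last_call is not None:
            # Wait out whatever remains of the interval since the previous call.
            wait = min_interval - (clock() - last_call)
            if wait > 0:
                sleep(wait)
        last_call = clock()
        results[item_id] = fetch(item_id)
    return results
```

With roughly 10,000 items and a one-second interval this would run for close to three hours, which is in line with the estimate above.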