MapofLife / MOL

Integrating information about species distributions in an effort to support global understanding of the world's biodiversity.
http://mol.org
BSD 3-Clause "New" or "Revised" License
26 stars 2 forks source link

English names in search #59

Closed eightysteele closed 12 years ago

eightysteele commented 12 years ago

Support English names in search box.

walterj commented 12 years ago

Should we put heads together and develop a global taxonomy list with misc. fields (Class, Order, Family, English, Scientific, ..., IUCN status) that for now (beta release) may be static but later dynamic? It would be used for search, metadata (i.e. info about a species when mapped), and species list result annotation.

eightysteele commented 12 years ago

Well, so I think that all taxonomic information needs to be part of our MOL schema (instead of as a separate table) so that it's encoded directly with every point and polygon record. It should include Class, Order, Family, English, Scientific, IUCN status, and any other columns needed. @gaurav, are most of these columns already part of the MOL schema?

walterj commented 12 years ago

" ... encoded directly with every point and polygon record. It should include Class, Order, Family, English, Scientific, IUCN status, and any other columns needed ".

The limitation here is that some fields change with time. E.g. we may swap in/out taxonomies, or, most pertinently, want to update the IUCN status as it changes every year. We may want to add a field (e.g. Spanish name) down the road. What is the best way forward here? Perhaps best to discuss over skype.

eightysteele commented 12 years ago

The limitation here is that some fields change with time.

Yeah, this is totally fine. I don't see this as a limitation since we can update taxonomy and IUCN columns using the CartoDB SQL API. So I think we're set here if we just add these columns to our schema. Good?

gaurav commented 12 years ago

I'll check this out tomorrow, but I don't think all those fields are in the schema yet, and putting them in after uploading the polygons might cause some headaches. I'll report back on this once I have a clearer idea on what's already uploaded, and how easy it might be to add in the data we need.

walterj commented 12 years ago

" uploading the polygons might cause some headaches."

If that is the case, we might require a different solution, as we will want to add and update select fields after upload.

-----Original Message----- From: Gaurav Vaidya [mailto:reply@reply.github.com] Sent: Wednesday, February 15, 2012 4:27 AM To: Jetz, Walter Subject: Re: [MOL] English names in search (#59)

I'll check this out tomorrow, but I don't think all those fields are in the schema yet, and putting them in after uploading the polygons might cause some headaches. I'll report back on this once I have a clearer idea on what's already uploaded, and how easy it might be to add in the data we need.


Reply to this email directly or view it on GitHub: https://github.com/MapofLife/MOL/issues/59#issuecomment-3977295

eightysteele commented 12 years ago

putting them in after uploading the polygons might cause some headaches.

It will only be a headache now because we haven't extended loader.py to handle table updates yet. @gaurav, the easiest solution in the short term might be updating the schema, dropping the polygons table, and re-uploading them.

gaurav commented 12 years ago

@eightysteele: Yup, agreed.

gaurav commented 12 years ago

So the latest here is that we're waiting for @jmalczyk to get the taxonomic information from @walterj so we can figure out how to incorporate that. Right?

walterj commented 12 years ago

Sent to to @jmalczyk on Thursday. I think he has been working on it.

eightysteele commented 12 years ago

@gaurav - Yep. Once we get the English names, we'll store them in the VernacularName column in our schema. Make sense?

gaurav commented 12 years ago

Okay, great! Yes, VernacularName is definitely the right place to store them (unless there's a case where we have more than one vernacular name for a species, in which case ... maybe we can store them, semicolon-delimited, into the VernacularName field?). In any case, it looks like the right way to proceed might be:

  1. Update vernacular names in mol_rangemaps and the new jetz/birds dataset.
  2. Fix config.yaml in mol_rangemaps so it uses either SEASONAL or OccCode in setting Seasonality.
  3. Upload the jetz/birds dataset from Yale (with any needed help from me).
  4. Delete the bird rangemaps from mol_rangemaps.
  5. Reupload mol_rangemaps (minus bird data, which is in another "collection" now, and with the fixed Seasonality)
  6. Once that's done, we can move on to the long-languishing ecoregion/assemblance checklist data.

Sound good?

eightysteele commented 12 years ago

See the email that just went out which includes my thoughts on the above. @walter and @jmalczyk, are you seeing multiple English names for a species or no?

walterj commented 12 years ago

@walter and @jmalczyk, are you seeing multiple English names for a species or no?

Do you mean whether we would like to upload and have in CartoDB multiple English names per species? Not at this point I would say. Later we may add such fields as 'Alternate name' or 'Spanish name' etc., but not for first release for sure! Does this answer the question?

eightysteele commented 12 years ago

@walterj - Yeah man, that answers it! Awesome.