medea-project / ipcc-fact-checking

GOAL: to establish a line of sight between data and its source
Creative Commons Attribution 4.0 International
1 stars 2 forks source link

Do not separate fields unless requested #4

Closed eric-brechemier closed 10 years ago

eric-brechemier commented 10 years ago

In Appendix 3 of 1994 Special Report, the institution and country should be collected as a single field "Institution, Country", as specified in the initial template, not two separate fields "Institution" and "Country", as modified in your submission.

The separation will be done automatically as a post-processing step in a program.

eric-brechemier commented 10 years ago

Same issue in Annex II of 2000 Special Report "Emissions": please keep Institution, City, Country as a single field, entitled "City, Country" in the initial template.

eric-brechemier commented 10 years ago

@monlafon You can take as reference the work you have done on page 5 of 2000 Special Report "Emissions", where you kept "Name (Country)" in a single column, as expected.

eric-brechemier commented 10 years ago

@apoloniarakow Same issue in Annex A of 1999 Special Report "Aviation":

https://github.com/medea-project/ipcc-fact-checking/blob/master/ipcc.ch/1999-ipcc-sr-aviation/annexA/data.csv

Institution and country should be left as a single column, like in the initial template: "Institution, Country".

monlafon commented 10 years ago

Ok I am trying to merge the cells together to put the categories back again but it doesn't seem to work and here is the picture. Can you tell me how to fix it back to the original? screen shot 2014-09-24 at 00 46 14

eric-brechemier commented 10 years ago

Here is how I would do it:

  1. in an empty column (e.g. F in Appendix 3 of 1994 SR), enter the following formula in the first cell (F1):

    =CONCATENATE(D1,", ",E1)

    where D1 and E1 are the first cells of the previous columns (HINT: you can select the cells with a click instead of typing their name).

  2. copy the formula, select all cells of the column up to the last row of data, and paste the formula
  3. fix issues that you notice at this point in the concatenated text, by editing the source columns (the concatenated text cannot be edited at this point):
    • remove extra space in " Country"
    • remove duplicate country at end of institutions in last rows
    • ...
    • remove the formula from empty lines, which otherwise end up as ", "
  4. you can now copy the concatenated text to a new column. You need to use the "Paste Special..." command to copy the text instead of the formula:
    • select the column F
    • copy
    • select column G
    • in the menu, select Edit > "Paste Special..."
    • Text must be checked, Formulas must be unckecked
  5. at this point, you can keep the new column G, and delete the source columns D, E and F.
monlafon commented 10 years ago

screen shot 2014-09-24 at 13 14 17 SORRY I DO THIS BUT IT DOESN'T WORK, MAYBE I CAN CALL YOU ABOUT THIS LATER THIS AFTERNOON, I WILL PROCEED TO OTHER ISSUES

eric-brechemier commented 10 years ago

Sorry, I tried the formula in LibreOffice, which accepts commas where semicolons are expected in OpenOffice:

=CONCATENATE(D1;", ";E1)
monlafon commented 10 years ago

@eric-brechemier let me know if In Appendix 3 of 1994 Special Report and in Annex II of 2000 Special Report "Emissions" the issues are fixed and I will move on

eric-brechemier commented 10 years ago

@monlafon No, sadly: the text should contain commas , not semicolons ; like currently seen here: "National Center for Atmospheric Research; USA"

https://github.com/medea-project/ipcc-fact-checking/blob/master/ipcc.ch/1994-ipcc-sr-climate-change/appendix3-contributors/data.csv#L2

monlafon commented 10 years ago

Ok I think now they are fixed I hope!

eric-brechemier commented 10 years ago

@monlafon, yes, thanks.

monlafon commented 10 years ago

=)