rugk / crops-parser

🌱🍎🍆 A shell script to parse the data by the Food and Agriculture Organization of the United Nations on crops/fruits.
Other
15 stars 4 forks source link

Duplicate entries & lines for South Sudan #54

Closed sashazykov closed 6 years ago

sashazykov commented 6 years ago

https://github.com/rugk/crops-parser/blob/891c01cb05cdf85d005a2e77dca78f2b5dc7c088/result/mostTonnesHarvest_2014_OSMonly.yml#L180

https://github.com/rugk/crops-parser/blob/891c01cb05cdf85d005a2e77dca78f2b5dc7c088/result/mostTonnesHarvest_2014_OSMonly.yml#L181

There are two countries with SS code in the list and one of them has many duplicates (lemon, lime, etc)

rugk commented 6 years ago

Oh, you are right. That is interesting, specially as it is not even only in the combined "2013+2014" data set…

Let's tackle the duplicate lines problem, first. Maybe the other problem is also solved by it.

rugk commented 6 years ago

So in the ISO South Sudan,SS,728 is only listed once…

And I also noticed my filter for "0 data" was not enough, because in South Sudan you actually cannot find lemons or limes. At least, there is no data…

rugk commented 6 years ago

Found the bug. It confused "Sudan" with "South Sudan". Will be fixed with next commit, just regenerating data…