CodeforNepal / nepalmap_app

An application that maps census and other official data for Nepal to make data more easily accessible and understandable to the public. Want to help us? Check out the Wiki.
https://nepalmap.org
MIT License

Add agriculture production data #214

Open ravinepal opened 6 years ago

ravinepal commented 6 years ago

The data is available here: https://github.com/Code4Nepal/data/tree/master/datasets/agriculture

Source: Ministry of Agriculture Development, Nepal (PDF)

Here's a guide on how to visualize data on NepalMap.

cliftonmcintosh commented 6 years ago

Please note that the data sets sometimes include things other than districts, like "C.REGION". Non-district data should be removed when importing.
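A minimal sketch of that filtering step, assuming the rows have already been read into dicts with a hypothetical `district` key. Only "C.REGION" is named in this thread, so the label set would need to be extended after inspecting each file.

```python
# Labels for aggregate (non-district) rows. Only "C.REGION" comes from the
# thread; extend this set after inspecting the actual data files.
NON_DISTRICT_LABELS = {"C.REGION"}


def drop_non_district_rows(rows, name_key="district"):
    """Return only the rows whose name is not a known aggregate label."""
    kept = []
    for row in rows:
        name = row[name_key].strip().upper()
        if name in NON_DISTRICT_LABELS:
            continue  # regional subtotal, not a district
        kept.append(row)
    return kept


# Illustrative values only, not taken from the real data set.
rows = [
    {"district": "Ilam", "value": 10},
    {"district": "C.REGION", "value": 250},
    {"district": "Jhapa", "value": 20},
]
districts_only = drop_non_district_rows(rows)
```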

Please also note that a district with no value can legitimately be given zero for a data point only if we are confident that the district really has none of the thing being counted. For some data we cannot be confident of that, and using zero for the missing districts may NOT be appropriate. For example, the tea data has an "other" category. It is clearly incorrect to say that all the unlisted districts have zero tea production, because the statistics tell us that tea is produced in "other" districts. I am not sure how we should handle this case.
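One possible way to keep "no data" distinct from "none produced" is sketched below. It is not a decision on how NepalMap should handle this case. The function name, the `other` key, and the figures are all hypothetical.

```python
def fill_missing_districts(values, all_districts, assume_zero=False):
    """Return a value for every district in all_districts.

    Missing districts get 0 only when assume_zero is True. Otherwise they
    get None, so "no data" is not silently shown as "none produced".
    """
    default = 0 if assume_zero else None
    return {name: values.get(name, default) for name in all_districts}


# Illustrative figures only, not taken from the actual tea data.
all_districts = ["Ilam", "Jhapa", "Kathmandu"]
tea = {"Ilam": 100, "Jhapa": 200, "other": 50}

# An "other" row means unlisted districts do produce tea, so zero is wrong.
has_other = "other" in tea
filled = fill_missing_districts(tea, all_districts, assume_zero=not has_other)
```

With an "other" row present, the unlisted district comes back as `None` rather than `0`, and the "other" total is left out of the per-district result.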

wizofe commented 6 years ago

Hi, I am interested in contributing to the project. Could you tell me exactly what help you need? Do you need to create a database, or stream the data through the main program?

I would love to be active in your project : )

bbulpett commented 6 years ago

I am attempting a solution for #214 (Add agriculture production data). I will submit a pull request for approval when it is ready. Thank you.

cliftonmcintosh commented 6 years ago

There is currently no agriculture section in NepalMap. With the first agriculture data integration, the agriculture section will need to be created, much like the existing sections on Demographics, Forest and Land Use, Disasters, etc.

cliftonmcintosh commented 6 years ago

@wizofe and @bbulpett

There are several data sets for agriculture. Please submit one pull request per data set. Please also consider "claiming" a specific data set so that other people will know what is already being worked on.

cliftonmcintosh commented 6 years ago

As I mentioned earlier, some data sets may not be complete because we lack data for some districts. They should not be included without further analysis. Here is a list of data sets that appear to be complete enough to work on:

Please note these should be considered valid data sets only if they have data for all 75 districts.

Here is a list of data sets that appear to be incomplete and, in my opinion, should not be worked on without further evaluation.

Some of these may be valid but we should verify that the lack of data really means that the missing districts have zero of those things.
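The 75-district rule could be checked mechanically before a data set is claimed. This is a rough sketch, assuming rows are dicts with a hypothetical `district` key and that non-district rows have already been removed.

```python
EXPECTED_DISTRICT_COUNT = 75  # figure stated in this thread


def is_complete(rows, name_key="district"):
    """True when the rows cover exactly the expected number of districts."""
    names = {row[name_key].strip().lower() for row in rows}
    return len(names) == EXPECTED_DISTRICT_COUNT


# Placeholder names stand in for the real district list.
sample = [{"district": f"District {i}"} for i in range(75)]
complete = is_complete(sample)
incomplete = is_complete(sample[:74])
```

A count check only shows that a data set has the right number of distinct names. Comparing against the real list of districts would also catch misspellings.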

bbulpett commented 6 years ago

Thank you @cliftonmcintosh for the explanation. Setting up my dev environment now. Will begin by adding Agriculture section. I will then start work on the Egg production data set.

wizofe commented 6 years ago

@cliftonmcintosh @bbulpett I am going to do the Milk Animals and Milk Production.

cliftonmcintosh commented 6 years ago

Thanks, @wizofe and @bbulpett

cliftonmcintosh commented 6 years ago

@bbulpett and @wizofe

It is perfectly fine to submit the work in steps. For example, you could submit a PR with just the SQL files for the statistics in your data sets, like this one for forests.

Also please note that your data set may contain more than one data point, and each one would require its own integration into NepalMap. For example, the egg production data set is probably two separate data sets, one for number of laying animals by type (chicken versus duck) and another for eggs laid by type (chicken eggs versus duck eggs). So there would be an egg-laying animal table and an eggs table. The data on milk animals looks similar. It's likely there will be two tables, one for the type of animals, another for the amount of milk.

If you choose one to start with, please choose the actual eggs and the actual milk.
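The split described above might look roughly like this. The column names (`laying_hens`, `hen_eggs`, and so on) are hypothetical and are not the actual headers in the data set.

```python
def split_egg_rows(rows):
    """Split combined rows into a laying-animal table and an eggs table."""
    animals, eggs = [], []
    for row in rows:
        district = row["district"]
        # One record per district and animal type.
        animals.append({"district": district, "type": "hen", "total": row["laying_hens"]})
        animals.append({"district": district, "type": "duck", "total": row["laying_ducks"]})
        # One record per district and egg type.
        eggs.append({"district": district, "type": "hen", "total": row["hen_eggs"]})
        eggs.append({"district": district, "type": "duck", "total": row["duck_eggs"]})
    return animals, eggs


# Illustrative values only.
combined = [
    {"district": "Ilam", "laying_hens": 500, "laying_ducks": 40,
     "hen_eggs": 60, "duck_eggs": 5},
]
animal_table, egg_table = split_egg_rows(combined)
```

Each resulting list would then feed its own table, so the eggs table can be integrated first and the laying-animal table later.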

cliftonmcintosh commented 6 years ago

@nikeshbalami

What is the unit for milk in the milk data? Litres?

nikeshbalami commented 6 years ago

Hi @cliftonmcintosh, the unit is Mt.

cliftonmcintosh commented 6 years ago

@nikeshbalami

What is an "Mt."?

nikeshbalami commented 6 years ago

It's a "Metric Ton (Mt.)" @cliftonmcintosh

cliftonmcintosh commented 6 years ago

Thanks

Bezzy1999 commented 6 years ago

Hi @cliftonmcintosh I took the liberty of adding the meat production data in https://github.com/Code4Nepal/nepalmap_app/pull/217 since it wasn't claimed by anyone else.

cliftonmcintosh commented 6 years ago

@nikeshbalami and @ravinepal

I'm working on the egg data, and it seems like it must be incorrect. The number of hens and ducks is much, much higher than the number of eggs laid. For example, there are over 12 million laying hens but only about 1.3 million hen eggs laid. That means there is only one egg for every ten hens. That seems crazy. There's no way anyone would have ten hens and only expect one egg a year out of those ten hens. I grew up with chickens, and if we were in that situation, we would just kill them all and eat them. Can you help me understand the data? Is it just messed up?

cliftonmcintosh commented 6 years ago

@nikeshbalami and @ravinepal Here is the problem: The egg numbers are for thousands, so "25" means "25000".

See page 48 of the PDF report

[image: eggs-by-thousand]
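A quick sanity check shows the effect of the unit, using the rounded figures quoted in this thread (over 12 million laying hens, about 1.3 million in the egg column). The function name is hypothetical.

```python
EGG_UNIT = 1000  # the report lists eggs in thousands


def eggs_per_hen(laying_hens, hen_eggs_reported, unit=EGG_UNIT):
    """Eggs per laying hen after scaling the reported egg figure."""
    return hen_eggs_reported * unit / laying_hens


# Rounded figures from the thread, not exact values from the data set.
raw_ratio = eggs_per_hen(12_000_000, 1_300_000, unit=1)   # unit ignored
scaled_ratio = eggs_per_hen(12_000_000, 1_300_000)        # unit applied
```

Without the unit the ratio is about 0.11 eggs per hen. With it, the ratio is about 108 eggs per hen, which is far more plausible.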

nikeshbalami commented 6 years ago

Thanks @cliftonmcintosh, and sorry: I forgot to add the "Unit" to all the datasets, which created this problem. I will take care of it from now on while scraping data.

cliftonmcintosh commented 6 years ago

@nikeshbalami

No worries. Thanks for the response.