Closed rbroth closed 2 years ago
Does anyone know if these would count as duplicates?
Cereals Cereals and their products Cereals, other and products
@bgsandan @rbroth @LuciaSegovia
Also:
Eggs and products Eggs and their products
Fruits and their products Fruits and vegetables Fruits, other and products
Vegetables and their products Vegetables, other and products
@andy-bevan, are those food_groups? or food_names?
These are food groups.
These are food groups.
I think I know where the duplication is coming from. It is from an old version of a FCT that we generated to match the Household Survey data in Malawi (IHS4). I was used before we agreed on the composition matching hierarchy. It's this one in here I hope it helps :)
FCT that we generated to match the Household Survey data in Malawi (IHS4)
Can we get rid of that one?
@rbroth I think we should, it may cause some missing values, if the FCT cascade is not fully implemented. But I think it would be good to remove it :)
The file mentioned above does not seem to be used directly by any of the loading processes. The food groups are loaded like:
So if we update the ids in the CSVs we would remove the duplicates...?
@LuciaSegovia does this make sense?
Example - Initial hardcoded values:
(0,0,'other') , (1,1, 'Cereals and their products') , (101,1,'Rice and rice-based products') , (102,1,'Maize and maize-based products') , (103,1,'Wheat and wheat-based products')
Example values from CSV files:
2006 Cereals 2019 Cereals, other and products 2006 2093 Wheat and products 2006 2043 Maize and products (including white maize) 2006 2044 Maize germ oil and products 2006
@bgsandan - is this still even an issue now that the initial duplicates have been removed?
I am sorry @andy-bevan, but I am not sure if I am following what is going on. Is this an issue of misalignment between IMPACT model and food composition data food groups names?
No worries - lets see if Andy can shed any light...?
I am sorry @andy-bevan, but I am not sure if I am following what is going on. Is this an issue of misalignment between IMPACT model and food composition data food groups names?
@andy-bevan - Sorry just catching up on this. So when you say "is this still even an issue now that the initial duplicates have been removed?" does that mean that the initial issue with multiple versions of essentially the same string with differences in capitalisation in the treemap (as above screenshot) no longer occurs?
Re: IMPACT and FCT food groups being different. Not a deal breaker for closing this issue but we should look to consolidate these (i.e. get the impact groups to match to existing FCT groups). That can be a separate/new issue though
Yes to the first question - those obvious duplicate/typos are resolved. I'll close the issue.
In this instance, 'Vegetables, other...' versus 'vegetables, other....'. Porbably to do with capitalisation
https://preview.micronutrient.support/quick-maps/diet/baseline?country-id=MWI&mnd-id=Ca&measure=diet&age-gender-group-id=WR
A