Closed BigDatalex closed 2 years ago
Just mentioning this so that we don't miss it: if #80 is merged, we should update this one accordingly. 👍🏼
Looking forward, pretty nice improvements!
Thanks a lot for your commits too! I will update this according to #80 and test the spiders. I'll ping you when we are ready to try the notebook 👍
Maybe also check the documentation and add the naming scheme of the files and the names of the spiders.
From my POV we are now ready to test the notebook - the spiders were running fine with the latest changes! 👍
LGTM, let's merge when linting is fixed. 👍🏼
Awesome! @se-jaeger the mypy linting is on you, right?
i did say i was looking into it. unfortunately github doesnt update the conversation unless you reload the page so i was replying to a 2hr old comment..
I documented the changes from @en-GB here: https://github.com/calgo-lab/green-db/issues/74#issuecomment-1177592419 and reverted the commits in this PR. @se-jaeger can you please approve the changes once again, so that we can finally merge this one :sweat_smile:
This PR adds the following information to the `green-db` table and the `scraping` table:

- `source`: string (so far the same as merchant, but will be needed when including e.g. project-cece)
- `country`: string (ISO 3166-1 alpha-2 codes)
- `gender`: according to GS1 GPC Attribute 20000366, one of: FEMALE, MALE, UNISEX, UNCLASSIFIED, UNIDENTIFIED
- `consumer_lifestage` (age): according to GS1 GPC Attribute 20000045, one of: ADULT, ALL AGES, BABY/INFANT, CHILD 1-2 YEARS, CHILD 2 YEARS ONWARDS, UNCLASSIFIED, UNIDENTIFIED

In addition, the following fields of the `green-db` table are changed:

- `color`: changed to `colors` and stores values in an array
- `size`: changed to `sizes` and stores values in an array

Apart from this, I did some filename and method refactorings to include the country information in their names.
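
For readers skimming the thread, the schema changes described in this PR could be sketched roughly as follows. This is only an illustration, not the code from the PR: the class names `GenderType`, `ConsumerLifestageType` and `ProductRow`, the optional defaults, and the simple shape check on `country` are assumptions made for the example. Only the field names and the allowed values come from the description.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List, Optional


class GenderType(str, Enum):
    # Values as listed in the PR description (GS1 GPC Attribute 20000366).
    FEMALE = "FEMALE"
    MALE = "MALE"
    UNISEX = "UNISEX"
    UNCLASSIFIED = "UNCLASSIFIED"
    UNIDENTIFIED = "UNIDENTIFIED"


class ConsumerLifestageType(str, Enum):
    # Values as listed in the PR description (GS1 GPC Attribute 20000045).
    ADULT = "ADULT"
    ALL_AGES = "ALL AGES"
    BABY_INFANT = "BABY/INFANT"
    CHILD_1_2_YEARS = "CHILD 1-2 YEARS"
    CHILD_2_YEARS_ONWARDS = "CHILD 2 YEARS ONWARDS"
    UNCLASSIFIED = "UNCLASSIFIED"
    UNIDENTIFIED = "UNIDENTIFIED"


@dataclass
class ProductRow:
    """Hypothetical row model; the real table has more columns."""

    source: str    # so far the same as merchant
    merchant: str
    country: str   # ISO 3166-1 alpha-2 code, e.g. "DE"
    # Assumption: both classifications may be missing for a product.
    gender: Optional[GenderType] = None
    consumer_lifestage: Optional[ConsumerLifestageType] = None
    # Formerly `color` and `size`; now arrays of values.
    colors: List[str] = field(default_factory=list)
    sizes: List[str] = field(default_factory=list)

    def __post_init__(self) -> None:
        # Shape check only (two uppercase letters); this does not verify
        # that the code is actually assigned in ISO 3166-1.
        c = self.country
        if len(c) != 2 or not c.isalpha() or not c.isupper():
            raise ValueError(f"expected an ISO 3166-1 alpha-2 code, got {c!r}")
```

With such a model, a product offered in several colors is stored as `colors=["black", "white"]` instead of a single string, and the enum constructors reject values outside the GS1 lists.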