HumanExposure / ChemicalExposure-SSC

Data Upload: CDR Functional Use #1205

Open Sakshi-Handa opened 5 days ago

Sakshi-Handa commented 5 days ago

This ticket is to upload CDR Functional Use data into Factotum. Factotum has a CDR data source: https://ccte-factotum.epa.gov/datasource/52/ Three data groups of type Functional Use will be added:

  1. CDR 2020 Industrial Uses
  2. CDR 2020 Consumer and Commercial Uses
  3. CDR 2020 Manufacturing-Import Information

Each data document page will have one chemical card with the reported functional uses. The associated data document file will be an Excel file rather than a PDF.

The CDR files and supplemental data are in zip files on the L Drive: "L:\Lab\HEM\Factotum\CDR_2020"

See additional notes from Katherine: Each of these three data groups will have the original CDR file attached as a supplemental file for the documents in the group. Each document within a group will be an Excel file of unique chemical identifier and reported use pairs. An extra sheet will be added to each Excel file explaining how the file was created and giving the code used to create it. A breakdown of the proposed data groups is in the table below.

CDR Sector | Supplemental File Name | Number of Documents | Size of Documents (as zip)
-- | -- | -- | --
Industrial | 2020 CDR Industrial Processing and Use.xlsx | 7,969 | 59 MB
Consumer/Commercial | 2020 CDR Consumer and Commercial Use.xlsx | 4,577 | 35 MB
Manufacturing/Import | 2020 CDR Manufacturing-Import Information.xlsx | 444 | 3 MB
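
For reference, the document layout described in the notes above (one file per chemical, holding unique chemical identifier and reported use pairs) might be sketched roughly as below. This is not the actual extraction code: the function `group_use_pairs`, the field names `chemical_id` and `functional_use`, and the sample rows are all hypothetical placeholders, and the sketch assumes one document per unique chemical identifier. Reading the CDR workbook and writing the Excel files are left out.

```python
from collections import defaultdict


def group_use_pairs(rows, id_field="chemical_id", use_field="functional_use"):
    """Group rows into one document per chemical, keeping unique (id, use) pairs.

    The field names are hypothetical; the real CDR column names may differ.
    """
    documents = defaultdict(list)
    seen = set()
    for row in rows:
        chem = (row.get(id_field) or "").strip()
        use = (row.get(use_field) or "").strip()
        if not chem or not use:
            continue  # skip rows with no identifier or no reported use
        pair = (chem, use)
        if pair in seen:
            continue  # drop repeated chemical/use pairs
        seen.add(pair)
        documents[chem].append(pair)
    return dict(documents)


# Placeholder rows standing in for records read from a CDR file.
rows = [
    {"chemical_id": "CHEM-A", "functional_use": "use 1"},
    {"chemical_id": "CHEM-A", "functional_use": "use 2"},
    {"chemical_id": "CHEM-A", "functional_use": "use 1"},  # duplicate pair
    {"chemical_id": "CHEM-B", "functional_use": "use 1"},
    {"chemical_id": "CHEM-B", "functional_use": ""},  # no reported use
]

documents = group_use_pairs(rows)
for chem, pairs in documents.items():
    print(chem, len(pairs))
```

Each key of `documents` would then correspond to one data document, with its list of pairs written to that document's Excel file.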

Sakshi-Handa commented 5 days ago

Is there a script associated with the extraction on GitHub? I will need the link to register it.