frictionlessdata / datapackage-pipelines

Framework for processing data packages in pipelines of modular components.
https://frictionlessdata.io/
MIT License
119 stars 32 forks source link

Loading file with capital letters #197

Open gperonato opened 3 years ago

gperonato commented 3 years ago

When loading a file containing capital letters, an error is raised, as there is a validation problem. I have tried to correct this with update_packageand update_resource, but the same error occurs when the first processor is called. Is there a way to change this behavior? I cannot change the name of an online dataset. The same dataset does not cause any error in dataflows. Thanks, Giuseppe

Python 3.82 MacOs 10.15.7

akariv commented 3 years ago

Hey Giuseppe - do you mean that the file name contains uppercase letters, or that the uppercase letters are in a different location? If possible, could you share your pipeline-spec file and some sample of your source file?

On Wed, Nov 11, 2020 at 4:22 PM Giuseppe Peronato notifications@github.com wrote:

When loading a file containing capital letters, an error is raised, as there is a validation error. I have tried to correct this with update_package and update_resource, but the same error occurs when the first processor is called. Is there a way to change this behavior? I cannot change the name of an online dataset. The same dataset does not cause any error in dataflows. Thanks, Giuseppe

Python 3.82 MacOs 10.15.7

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/frictionlessdata/datapackage-pipelines/issues/197, or unsubscribe https://github.com/notifications/unsubscribe-auth/AACAY5NIH25GPXTDK35EXO3SPKMZZANCNFSM4TSBRSQQ .

gperonato commented 3 years ago

Hi Adam, yes sorry, I meant uppercase letters, which do not match the validation '^([-a-z0-9._/])+$'. Here is my pipeline-spec with the link to the source file:

JRC_GEOPP_DB_3:
  pipeline:
  - run: load
    parameters:
      from: http://cidportal.jrc.ec.europa.eu/ftp/jrc-opendata/JRCOD/RES-DATA/10001/LATEST/JRC-GEOPP-DB.csv
  - run: dump.to_path
    parameters:
      out-path: JRC_GEOPP_DB

Thank you, Giuseppe