knowledgesystems / cmo-pipelines

GNU Affero General Public License v3.0
3 stars 10 forks source link

Add python3 validation utils #1185

Closed callachennault closed 2 months ago

callachennault commented 3 months ago

This PR adds a script validation_utils_py3.py to use pandas to validate arbitrary study/data source files. Using pandas and the defined class structure should increase the speed of file processing and I/O. As of this PR, only one file validator is implemented for the CDM clinical sample file. Additional file validators can be added to this structure.