sdruskat / cff-in-the-wild

Analysis of Citation File Format files on GitHub
Apache License 2.0

Write a parser for CFF files #5

Open sdruskat opened 2 years ago

sdruskat commented 2 years ago

CFF is YAML, and can be validated against a JSON schema.

This could result in a function that takes a file path and returns a Python dict (or similar) for use in further analysis?
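A minimal sketch of what such a function might look like. The names `parse_cff`, `missing_required_keys` and the `load` parameter are hypothetical and not part of this repository. By default the sketch assumes PyYAML is installed and uses `yaml.safe_load`. The required-key check is only a stand-in for real JSON schema validation, and the key list assumes CFF 1.2.0.

```python
from pathlib import Path
from typing import Any, Callable, Optional

# Assumption: these are the keys the CFF 1.2.0 schema marks as required.
# Other CFF versions may require different keys.
REQUIRED_KEYS = ("cff-version", "message", "title", "authors")


def parse_cff(path, load: Optional[Callable[[str], Any]] = None) -> dict:
    """Read one CFF file and return its content as a dict.

    `load` is a hypothetical hook for swapping the YAML parser;
    it defaults to PyYAML's safe_load.
    """
    if load is None:
        import yaml  # PyYAML (third-party), imported lazily

        load = yaml.safe_load
    text = Path(path).read_text(encoding="utf-8")
    data = load(text)
    if not isinstance(data, dict):
        raise ValueError(f"{path}: top level of a CFF file must be a mapping")
    return data


def missing_required_keys(data: dict) -> list:
    """Stand-in for schema validation: list required keys that are absent."""
    return [key for key in REQUIRED_KEYS if key not in data]
```

Calling `parse_cff("CITATION.cff")` would then give a plain dict, and `missing_required_keys` gives a rough first signal about invalid files. Full validation against the published CFF JSON schema would need a schema validator on top of this.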

samharrison7 commented 2 years ago

Started with a function to read all CFF files in a directory in PR https://github.com/sdruskat/cff-in-the-wild/pull/20

sdruskat commented 2 years ago

Tentatively, fixing this would include refactoring https://github.com/sdruskat/cff-in-the-wild/blob/develop/code/analysis/python/python/read_cff_files.py#L17-L52 to extract the logic for single files into its own function, with the main function calling a loop function that in turn calls this new function, right?
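Roughly, the three-layer split described above could be sketched like this. It is not the actual content of `read_cff_files.py`. The function names, the `load` parameter and the `*.cff` glob pattern are assumptions for illustration, and the default parser again assumes PyYAML.

```python
from pathlib import Path
from typing import Any, Callable, Dict, Iterator, Optional, Tuple


def read_cff_file(path: Path, load: Callable[[str], Any]) -> Any:
    """Single-file step: read and parse one CFF file."""
    return load(path.read_text(encoding="utf-8"))


def iter_cff_files(
    directory: Path, load: Callable[[str], Any]
) -> Iterator[Tuple[Path, Any]]:
    """Loop step: yield (path, parsed content) for each *.cff file below directory."""
    for path in sorted(directory.rglob("*.cff")):
        yield path, read_cff_file(path, load)


def main(directory: str, load: Optional[Callable[[str], Any]] = None) -> Dict[str, Any]:
    """Entry point: collect all parsed files into a dict keyed by file path."""
    if load is None:
        import yaml  # PyYAML (third-party), imported lazily

        load = yaml.safe_load
    return {str(path): data for path, data in iter_cff_files(Path(directory), load)}
```

With this shape the single-file function can be tested and reused on its own, and error handling for unparsable files could later be added in the loop function without touching the other two.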