I've been tasked with going through some of the public documentation from the perspective of a new DCC, and bolstering it for understandability where necessary. Specifically in the KF ETL documentation, I have a few lingering questions, and Owen identified you as potentially a good contact to answer some questions. Ultimately these answers will help to bolster some of the existing documentation.
I can't find the instructions for DCCs uploading C2M2-transformed metadata to the CFDE. - this seems vital to include in an ETL walk-through. Is there a good source of documentation I can summarize for use in this recipe?
I see a handful of formats discussed - (BDBags, Frictionless, RO-Crate) - does the CFDE have a singular format it desires for upload, or is it up to the DCCs on how to format it?
The term-scanner script and gold tables are on a private repo. Currently, copies are housed on a personal Github - along with the R script that is necessary for ETL - which is referenced in the current public documentation. Is there a more official (but public) repo we should be hosting these vital documents on so that they are more accessible?
Thank you for any help and please don't hesitate to reach out if I can clarify these questions.
Hi @abradyIGS,
I've been tasked with going through some of the public documentation from the perspective of a new DCC, and bolstering it for understandability where necessary. Specifically in the KF ETL documentation, I have a few lingering questions, and Owen identified you as potentially a good contact to answer some questions. Ultimately these answers will help to bolster some of the existing documentation.
I can't find the instructions for DCCs uploading C2M2-transformed metadata to the CFDE. - this seems vital to include in an ETL walk-through. Is there a good source of documentation I can summarize for use in this recipe?
I see a handful of formats discussed - (BDBags, Frictionless, RO-Crate) - does the CFDE have a singular format it desires for upload, or is it up to the DCCs on how to format it?
The term-scanner script and gold tables are on a private repo. Currently, copies are housed on a personal Github - along with the R script that is necessary for ETL - which is referenced in the current public documentation. Is there a more official (but public) repo we should be hosting these vital documents on so that they are more accessible?
Thank you for any help and please don't hesitate to reach out if I can clarify these questions.