IBM / datascienceontology

Data Science Ontology
https://www.datascienceontology.org
Creative Commons Attribution 4.0 International
36 stars 14 forks source link

High-level, informal concepts of data science #12

Open epatters opened 5 years ago

epatters commented 5 years ago

The ontology should, perhaps, include high-level concepts of data science, such as "data cleaning/preprocessing", "inference", and "evaluation". The usefulness of such concepts is obvious, but there are several difficulties. Unlike the concepts currently in the ontology, these high-level concepts are

  1. informal and imprecise, i.e., do not admit a clean mathematical description
  2. usually present only implicitly in code or natural text, i.e., must be either inferred using NLP methods or manually annotated by the data analysis author

How to proceed is an open question.