-
# Problem
According to the spec definition of DataQualityMetricsInputDatasetFacet (https://github.com/OpenLineage/OpenLineage/blob/main/spec/facets/DataQualityMetricsInputDatasetFacet.json), the name…
-
### What
- Create a quality facet for products that should not be in French, based on their brand.
- E.g., Conad is an Italian brand, and its products are never in French.
- We see however that products a…
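
A minimal sketch of how such a brand-based check could look, assuming each product is a plain dict. The `brand` and `lang` keys, the `UNEXPECTED_LANGS_BY_BRAND` mapping, and the tag format are all hypothetical names chosen for illustration, not the project's actual data model:

```python
# Hypothetical mapping: brand -> languages its products are never expected in.
UNEXPECTED_LANGS_BY_BRAND = {"conad": {"fr"}}


def unexpected_language_tags(product):
    """Return quality tags for a product dict.

    Assumes hypothetical 'brand' and 'lang' keys; missing values yield no tag.
    """
    brand = (product.get("brand") or "").strip().lower()
    lang = (product.get("lang") or "").strip().lower()
    if lang and lang in UNEXPECTED_LANGS_BY_BRAND.get(brand, set()):
        return [f"unexpected-language-{lang}-for-brand-{brand}"]
    return []


if __name__ == "__main__":
    print(unexpected_language_tags({"brand": "Conad", "lang": "fr"}))
```

Keeping the brand rules in a lookup table rather than in code means new brands can be added without touching the check itself.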
-
- We need quality facets for the new packagings data structures introduced in #219
- In particular, we need to know which products do not have complete packaging data.
- In order to determine if th…
-
### ... so that the data set is trusted
### Identify and filter incomplete records:
- no value supplied in core elements: eventDate, scientificName, decimalLatitude, decimalLongitude
htt…
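
The completeness check described above could be sketched roughly as follows. This assumes records arrive as dicts keyed by the four core element names listed above, and that "no value supplied" means a missing key, `None`, or a blank string; the function names are hypothetical:

```python
CORE_FIELDS = ("eventDate", "scientificName", "decimalLatitude", "decimalLongitude")


def is_missing(value):
    """Treat None and empty or whitespace-only strings as 'no value supplied'."""
    return value is None or (isinstance(value, str) and value.strip() == "")


def split_incomplete(records, core_fields=CORE_FIELDS):
    """Split record dicts into (complete, incomplete) lists.

    A record is incomplete if any core field is absent or has no value.
    """
    complete, incomplete = [], []
    for record in records:
        missing = [f for f in core_fields if is_missing(record.get(f))]
        (incomplete if missing else complete).append(record)
    return complete, incomplete
```

Note that a coordinate of `0.0` counts as a supplied value here; only absent or blank entries are treated as missing.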
-
Verify, and if necessary, add translation descriptions for dataQuality (`metadata > dataQuality > report`) report elements:
- [x] type
- [ ] standaloneQualityReportDetails
- [ ] qualityMeas…
-
DataQuality is a great project in the field of data quality, and I think a good way to increase the reach of our two projects is to integrate DataQuality with DataSphere Studio.
**What is Dat…
-
There is a big overlap. The DMP covers everything that is discussed later in the course.
Idea:
- make it shorter (what is dmp, who, when, why, budget planning, 1 slide on content, dissemin…
-
# Description
Values of the DataQuality sections (lineage, specificationtitle, ...) of the metadata are not saved to the database. The repository uses PostgreSQL. We use it in a production environment https://r…
-
I think it would be useful to add an optional parameter ["PathRejects"] to write out the rows rejected during deduplication, in case we need to analyse data quality when the source contains duplicated rows.
And also…
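
As a rough illustration of the requested behaviour (not the tool's actual API), deduplication with an optional rejects file might look like this. The function name `deduplicate`, its `key_fields` argument, and the snake_case `path_rejects` parameter are hypothetical stand-ins for the proposed "PathRejects" option, and rows are assumed to be dicts sharing the same columns:

```python
import csv


def deduplicate(rows, key_fields, path_rejects=None):
    """Keep the first row per key; optionally write dropped duplicates to a CSV.

    `path_rejects` is a hypothetical stand-in for the proposed "PathRejects".
    Assumes every row is a dict with the same set of columns.
    """
    seen = set()
    kept, rejects = [], []
    for row in rows:
        key = tuple(row[f] for f in key_fields)
        if key in seen:
            rejects.append(row)  # duplicate of an earlier row
        else:
            seen.add(key)
            kept.append(row)
    # Only write a rejects file when a path is given and duplicates exist.
    if path_rejects is not None and rejects:
        with open(path_rejects, "w", newline="", encoding="utf-8") as fh:
            writer = csv.DictWriter(fh, fieldnames=list(rejects[0].keys()))
            writer.writeheader()
            writer.writerows(rejects)
    return kept, rejects
```

Leaving `path_rejects` at its default of `None` keeps the current behaviour, so the option stays purely opt-in.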
-
The SMAP tutorial 2.0 [read_and_plot_smap_data](notebooks/SMAP/02_read_and_plot_smap_data_rendered.ipynb) uses `h5py` and `numpy`. The whole notebook could be simplified and streamlined by using `xarr…