issues
search
awslabs
/
deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Apache License 2.0
3.27k
stars
536
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Skip SparkTableMetricsRepositoryTest test when SupportsRowLevelOperations is not available
#536
eycho-am
closed
7 months ago
0
Feature: Add Row Level Result Treatment Options for Miminum and Maximum
#535
eycho-am
closed
7 months ago
0
Performance impact when trying to generate profiling report for more than 200 columns
#534
eapframework
opened
7 months ago
2
containsCreditCardNumber analyser constraint doesnt support for JCB credit card
#533
kakampassi
opened
7 months ago
0
Feature: Add Row Level Result Treatment Options for Uniqueness and Completeness
#532
eycho-am
closed
7 months ago
0
Anomaly checks when fails
#531
dinjazelena
opened
8 months ago
0
[FEATURE] Filter condition is ignored when filtering records based on row-level checks
#530
eapframework
opened
8 months ago
6
support col match and change to DatasetMatch
#529
VenkataKarthikP
closed
7 months ago
2
[FEATURE] Supporing Aggregation metrics for a group
#528
theajay87
opened
9 months ago
0
numerical statistical indicators have lost precision
#527
huwujingling
opened
9 months ago
0
add data synchronization test to verification Suite.
#526
VenkataKarthikP
closed
9 months ago
1
Exposing Helpful Anomaly Detection Metadata from Anomaly Strategies (ie Anomaly Thresholds)
#525
arsenalgunnershubert777
opened
9 months ago
7
fix ratio in constraint_message
#524
Aigul9
opened
10 months ago
0
Compliance calculation result
#523
vaishnavibv13
opened
10 months ago
1
Is Redshift supported as a data source?
#522
jbleduigou
opened
10 months ago
0
[FEATURE] Exposing Anomaly Strategy Calculation Thresholds for Users
#521
arsenalgunnershubert777
opened
10 months ago
0
Updating release version in pom.xml
#520
rdsharma26
closed
11 months ago
0
[BUG] Row based output incorrect when using satisfies check and assertion with upper bound < 1
#519
arsenalgunnershubert777
closed
6 months ago
3
MetricsRepository using Spark tables as the data source
#518
VenkataKarthikP
closed
10 months ago
4
Verify that non key columns exist in each dataset
#517
rdsharma26
closed
11 months ago
0
Test that exceptions within a check's constraints do not affect other…
#516
tylermcdaniel0
closed
11 months ago
0
[Data Synchronization/Matching] Delegate to Spark for checking existence of columns in the given dataframes
#515
rdsharma26
closed
11 months ago
0
Add Spark 3.5 support
#514
jhchee
closed
7 months ago
1
Update minor version for Spark 3.4 maven release
#513
eycho-am
closed
11 months ago
0
Creation of Exact Quantile Check
#512
jmilis2000
closed
11 months ago
2
Fix CustomSQL test syntax
#511
eycho-am
closed
11 months ago
0
Fail when CustomSql has syntax errors
#510
mentekid
closed
11 months ago
1
Custom SQL Analyzer
#509
mentekid
closed
12 months ago
0
Allow all DQ constraints to be generated from an Analyzer
#508
mentekid
closed
12 months ago
0
[FEATURE] Add support for Spark 3.5
#507
jhchee
closed
7 months ago
1
checks that 95% of entire table satisfy multiple conditions over different columns
#506
baljijit
closed
1 year ago
1
Add Spark 3.4 support
#505
jhchee
closed
1 year ago
2
Getting Error name 'isComplete' is not defined while running deequ code in Azure Databricks
#504
dilkushpatel
closed
1 year ago
4
Issue: Anomaly Detection - HoltWinters fail with SBT
#503
pawelpinkos
closed
1 week ago
0
[FEATURE] Add spark table metric repository
#502
charlieyou
opened
1 year ago
4
Incorporate referential integrity and data synchronization checks into Deequ's VerificationSuite
#501
rdsharma26
opened
1 year ago
6
[BUG] Unable to serialize Histogram with binningUdf when using them with useRepository
#500
psyking841
opened
1 year ago
0
Convert codebase to scala 2.13
#499
samidalouche
opened
1 year ago
1
Update release version to 2.0.4-spark-3.3
#498
eycho-am
closed
1 year ago
1
Is this library can be used with other Technolgy rather than Spark, such as Flink for example?
#497
abeermohamed1
closed
1 year ago
2
Support for Snowflake Connector's query pushdown
#496
Ioankall
opened
1 year ago
1
java.lang.NoSuchMethodError: org.apache.spark.sql.catalyst.expressions.aggregate.AggregateFunction.toAggregateExpression(Z)Lorg/apache/spark/sql/catalyst/expressions/aggregate/AggregateExpression;
#495
DivyangPatelIITD
opened
1 year ago
0
[FEATURE] Extract failing reason when filtering records based on row-level checks
#494
eapframework
closed
1 year ago
0
Replace Spark SQL isNull check with Spark Scala based DSL
#493
rdsharma26
closed
1 year ago
0
Updated the Categorical range constraint suggestions to use a new class called ConstraintSuggestionWithValue
#492
rdsharma26
closed
1 year ago
0
Added Uniqueness constraint suggestion to the list of EXTENDED suggestions
#491
rdsharma26
closed
1 year ago
0
Enhanced constraint suggestions
#490
rdsharma26
closed
1 year ago
0
Addition of HasMax/HasMin/HasStandardDeviation/HasMean constraint suggestions
#489
rdsharma26
closed
1 year ago
0
Adding the custom constraints
#488
DivyangPatelIITD
opened
1 year ago
1
Incremental profiling to be merged with older result
#487
nihal-laliwala-a
opened
1 year ago
0
Previous
Next