issues
search
awslabs
/
deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Apache License 2.0
3.32k
stars
539
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
[Min/Max] Apply filtered row behavior at the row level evaluation
#543
rdsharma26
closed
8 months ago
0
Java null pointer issue , while creating sparksession , with deequ jar
#542
koustreak
opened
8 months ago
0
How to use Deequ to implement a custom return result set and return the correct and incorrect number of each check result
#541
yeyeywye123
opened
8 months ago
0
[FEATURE] Cross-building via Mill
#540
nightscape
opened
9 months ago
5
Is AggregateMatch type check supported in the library?
#539
chaurasiya
opened
9 months ago
1
Fix bug in MinLength and MaxLength when NullBehavior.EmptyString
#538
eycho-am
closed
9 months ago
0
Add analyzerOption to add filteredRowOutcome for isPrimaryKey Check
#537
eycho-am
closed
9 months ago
0
Skip SparkTableMetricsRepositoryTest test when SupportsRowLevelOperations is not available
#536
eycho-am
closed
9 months ago
0
Feature: Add Row Level Result Treatment Options for Miminum and Maximum
#535
eycho-am
closed
9 months ago
0
Performance impact when trying to generate profiling report for more than 200 columns
#534
eapframework
opened
9 months ago
2
containsCreditCardNumber analyser constraint doesnt support for JCB credit card
#533
kakampassi
opened
9 months ago
0
Feature: Add Row Level Result Treatment Options for Uniqueness and Completeness
#532
eycho-am
closed
9 months ago
0
Anomaly checks when fails
#531
dinjazelena
opened
10 months ago
0
[FEATURE] Filter condition is ignored when filtering records based on row-level checks
#530
eapframework
opened
10 months ago
6
support col match and change to DatasetMatch
#529
VenkataKarthikP
closed
9 months ago
2
[FEATURE] Supporing Aggregation metrics for a group
#528
theajay87
opened
10 months ago
0
numerical statistical indicators have lost precision
#527
huwujingling
opened
10 months ago
0
add data synchronization test to verification Suite.
#526
VenkataKarthikP
closed
10 months ago
1
Exposing Helpful Anomaly Detection Metadata from Anomaly Strategies (ie Anomaly Thresholds)
#525
arsenalgunnershubert777
opened
11 months ago
8
fix ratio in constraint_message
#524
Aigul9
opened
11 months ago
0
Compliance calculation result
#523
vaishnavibv13
opened
11 months ago
1
Is Redshift supported as a data source?
#522
jbleduigou
opened
12 months ago
0
[FEATURE] Exposing Anomaly Strategy Calculation Thresholds for Users
#521
arsenalgunnershubert777
opened
1 year ago
0
Updating release version in pom.xml
#520
rdsharma26
closed
1 year ago
0
[BUG] Row based output incorrect when using satisfies check and assertion with upper bound < 1
#519
arsenalgunnershubert777
closed
7 months ago
3
MetricsRepository using Spark tables as the data source
#518
VenkataKarthikP
closed
12 months ago
4
Verify that non key columns exist in each dataset
#517
rdsharma26
closed
1 year ago
0
Test that exceptions within a check's constraints do not affect other…
#516
tylermcdaniel0
closed
1 year ago
0
[Data Synchronization/Matching] Delegate to Spark for checking existence of columns in the given dataframes
#515
rdsharma26
closed
1 year ago
0
Add Spark 3.5 support
#514
jhchee
closed
9 months ago
1
Update minor version for Spark 3.4 maven release
#513
eycho-am
closed
1 year ago
0
Creation of Exact Quantile Check
#512
jmilis2000
closed
1 year ago
2
Fix CustomSQL test syntax
#511
eycho-am
closed
1 year ago
0
Fail when CustomSql has syntax errors
#510
mentekid
closed
1 year ago
1
Custom SQL Analyzer
#509
mentekid
closed
1 year ago
0
Allow all DQ constraints to be generated from an Analyzer
#508
mentekid
closed
1 year ago
0
[FEATURE] Add support for Spark 3.5
#507
jhchee
closed
9 months ago
1
checks that 95% of entire table satisfy multiple conditions over different columns
#506
baljijit
closed
1 year ago
1
Add Spark 3.4 support
#505
jhchee
closed
1 year ago
2
Getting Error name 'isComplete' is not defined while running deequ code in Azure Databricks
#504
dilkushpatel
closed
1 year ago
4
Issue: Anomaly Detection - HoltWinters fail with SBT
#503
pawelpinkos
closed
2 months ago
0
[FEATURE] Add spark table metric repository
#502
charlieyou
opened
1 year ago
4
Incorporate referential integrity and data synchronization checks into Deequ's VerificationSuite
#501
rdsharma26
opened
1 year ago
6
[BUG] Unable to serialize Histogram with binningUdf when using them with useRepository
#500
psyking841
opened
1 year ago
0
Convert codebase to scala 2.13
#499
samidalouche
opened
1 year ago
1
Update release version to 2.0.4-spark-3.3
#498
eycho-am
closed
1 year ago
1
Is this library can be used with other Technolgy rather than Spark, such as Flink for example?
#497
abeermohamed1
closed
1 year ago
2
Support for Snowflake Connector's query pushdown
#496
Ioankall
opened
1 year ago
1
java.lang.NoSuchMethodError: org.apache.spark.sql.catalyst.expressions.aggregate.AggregateFunction.toAggregateExpression(Z)Lorg/apache/spark/sql/catalyst/expressions/aggregate/AggregateExpression;
#495
DivyangPatelIITD
opened
1 year ago
0
[FEATURE] Extract failing reason when filtering records based on row-level checks
#494
eapframework
closed
1 year ago
0
Previous
Next