issues
search
awslabs
/
deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Apache License 2.0
3.18k
stars
517
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Optional specification of instance name in CustomSQL analyzer metric.
#569
tylermcdaniel0
closed
1 month ago
0
Add DateTimeMetric, Analyzer and Example
#568
zeotuan
opened
1 month ago
0
Adding Wilson Score Confidence Interval Strategy
#567
zeotuan
closed
1 month ago
5
Why is `Distance` not an analyzer?
#566
CarterFendley
opened
2 months ago
0
[BUG] Row-level filtering marking the records as pass when null values are present in the column
#565
eapframework
opened
2 months ago
0
Configurable RetainCompletenessRule
#564
zeotuan
closed
2 months ago
1
[FEATURE] Support Wilson Score Interval for RetainCompletenessRule
#563
zeotuan
closed
1 month ago
0
Update Breeze version for spark 3.3
#562
zeotuan
opened
2 months ago
0
Updated version in pom.xml to 2.0.7-spark-3.5
#561
rdsharma26
closed
2 months ago
0
Updated version in pom.xml to 2.0.7-spark-3.4
#560
rdsharma26
closed
2 months ago
0
Updated version in pom.xml to 2.0.7-spark-3.3
#559
rdsharma26
closed
2 months ago
0
Updated version in pom.xml to 2.0.7-spark-3.2
#558
rdsharma26
closed
2 months ago
0
Updated version in pom.xml to 2.0.7-spark-3.1
#557
rdsharma26
closed
2 months ago
0
[FEATURE] Extend RatioOfSums to support other aggregations
#556
mentekid
opened
2 months ago
0
Column Count Analyzer and Check
#555
mentekid
closed
2 months ago
0
Question: DQ over time
#554
jonathanapp
opened
2 months ago
0
Fix for satisfies row level results bug
#553
rdsharma26
closed
3 months ago
0
New analyzer, RatioOfSums
#552
scott-gunn
closed
2 months ago
5
Support for Custom SQL Execution in Deequ Library
#551
skarthikbigdata
opened
3 months ago
0
Added RatioOfSums analyzer and tests
#550
scott-gunn
closed
3 months ago
0
Custom user analyzers
#549
sonofagunn
opened
3 months ago
0
[FEATURE] Can we enhance `VerificationSuite` to supports more than one Dataframe?
#548
Sat30
opened
3 months ago
0
[MinLength/MaxLength] Apply filtered row behavior at the row level evaluation
#547
rdsharma26
closed
3 months ago
1
Anomaly Detection: Add Daily Season with Hourly Interval to HoltWinter
#546
zeotuan
closed
3 months ago
5
Fix Breeze dependency conflict in Anomaly Detection Spark 3.4+
#545
zeotuan
closed
2 months ago
6
[BUG] Spark 3.4 and Deequ breeze version conflict
#544
zeotuan
closed
4 months ago
1
[Min/Max] Apply filtered row behavior at the row level evaluation
#543
rdsharma26
closed
3 months ago
0
Java null pointer issue , while creating sparksession , with deequ jar
#542
koustreak
opened
4 months ago
0
How to use Deequ to implement a custom return result set and return the correct and incorrect number of each check result
#541
yeyeywye123
opened
4 months ago
0
[FEATURE] Cross-building via Mill
#540
nightscape
opened
4 months ago
5
Is AggregateMatch type check supported in the library?
#539
chaurasiya
opened
4 months ago
1
Fix bug in MinLength and MaxLength when NullBehavior.EmptyString
#538
eycho-am
closed
4 months ago
0
Add analyzerOption to add filteredRowOutcome for isPrimaryKey Check
#537
eycho-am
closed
4 months ago
0
Skip SparkTableMetricsRepositoryTest test when SupportsRowLevelOperations is not available
#536
eycho-am
closed
4 months ago
0
Feature: Add Row Level Result Treatment Options for Miminum and Maximum
#535
eycho-am
closed
4 months ago
0
Performance impact when trying to generate profiling report for more than 200 columns
#534
eapframework
opened
4 months ago
2
containsCreditCardNumber analyser constraint doesnt support for JCB credit card
#533
kakampassi
opened
4 months ago
0
Feature: Add Row Level Result Treatment Options for Uniqueness and Completeness
#532
eycho-am
closed
4 months ago
0
Anomaly checks when fails
#531
dinjazelena
opened
5 months ago
0
[FEATURE] Filter condition is ignored when filtering records based on row-level checks
#530
eapframework
opened
5 months ago
5
support col match and change to DatasetMatch
#529
VenkataKarthikP
closed
4 months ago
2
[FEATURE] Supporing Aggregation metrics for a group
#528
theajay87
opened
6 months ago
0
numerical statistical indicators have lost precision
#527
huwujingling
opened
6 months ago
0
add data synchronization test to verification Suite.
#526
VenkataKarthikP
closed
5 months ago
1
Exposing Helpful Anomaly Detection Metadata from Anomaly Strategies (ie Anomaly Thresholds)
#525
arsenalgunnershubert777
opened
6 months ago
7
fix ratio in constraint_message
#524
Aigul9
opened
7 months ago
0
Compliance calculation result
#523
vaishnavibv13
opened
7 months ago
1
Is Redshift supported as a data source?
#522
jbleduigou
opened
7 months ago
0
[FEATURE] Exposing Anomaly Strategy Calculation Thresholds for Users
#521
arsenalgunnershubert777
opened
7 months ago
0
Updating release version in pom.xml
#520
rdsharma26
closed
7 months ago
0
Next