awslabs/deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Apache License 2.0
3.32k stars
539 forks
issues
Exposing Helpful Anomaly Detection Metadata from Anomaly Strategies (ie Anomaly Check Range/Thresholds) through backwards compatible function
#593
arsenalgunnershubert777
opened
2 weeks ago
4
[BUG] Deequ 2.0.7 - Spark CodeGenerator ERROR - Expression is not an rvalue
#592
pawelpinkos
opened
1 month ago
2
Add commits from master branch to release/2.0.8-spark-3.1
#591
eycho-am
closed
1 month ago
0
Add commits from master branch to release/2.0.8-spark-3.2
#590
eycho-am
closed
1 month ago
0
Add commits from master branch to release/2.0.8-spark-3.3
#589
eycho-am
closed
1 month ago
0
Add commits from master branch to release/2.0.8-spark-3.4
#588
eycho-am
closed
1 month ago
0
Add commits from master branch to release/2.0.8-spark-3.5
#587
eycho-am
closed
1 month ago
0
feature/replace-rdd
#586
shriyavanvari
closed
1 month ago
0
Make addAnomalyCheck not requiring State type
#585
zeotuan
opened
2 months ago
0
Refactor add anomaly check
#584
zeotuan
closed
2 months ago
0
[FEATURE] Improve performance of KLLSketch and DataType Analyzer
#583
zeotuan
opened
2 months ago
0
Fix row-level results implementation for Spark versions <3.3
#582
marcantony
closed
2 months ago
1
Backport breeze version upgrade to spark 3.4
#581
zeotuan
closed
1 month ago
2
Implement Features Per Issue 579
#580
jasonhorner
opened
2 months ago
2
[FEATURE] Extend PatternMatch Class to Implement US Postal Code and Phone Number
#579
jasonhorner
opened
2 months ago
0
Updated version in pom.xml to 2.0.8-spark-3.5
#578
mentekid
closed
1 month ago
0
Fix performance of building row-level results
#577
marcantony
closed
2 months ago
6
[BUG] Performance for building row-level results scales poorly with number of checks
#576
marcantony
closed
2 months ago
1
Release cadence?
#575
marcantony
closed
1 month ago
2
fix typo
#574
bojackli
closed
2 months ago
0
Bugfix- isNonNegative and isPositive checks
#573
SagarMoghe
opened
3 months ago
1
CustomAggregator
#572
joshuazexter
closed
3 months ago
2
Add support for Conditional Aggregation Analyzer
#571
joshuazexter
closed
3 months ago
0
Add support for EntityTypes dqdl rule
#570
joshuazexter
closed
4 months ago
0
Optional specification of instance name in CustomSQL analyzer metric.
#569
tylermcdaniel0
closed
6 months ago
0
Add DateTimeMetric, Analyzer and Example
#568
zeotuan
opened
6 months ago
1
Adding Wilson Score Confidence Interval Strategy
#567
zeotuan
closed
6 months ago
5
Why is `Distance` not an analyzer?
#566
CarterFendley
opened
6 months ago
0
[BUG] Row-level filtering marking the records as pass when null values are present in the column
#565
eapframework
opened
7 months ago
0
Configurable RetainCompletenessRule
#564
zeotuan
closed
6 months ago
1
[FEATURE] Support Wilson Score Interval for RetainCompletenessRule
#563
zeotuan
closed
6 months ago
0
Update Breeze version for spark 3.3
#562
zeotuan
closed
1 month ago
0
Updated version in pom.xml to 2.0.7-spark-3.5
#561
rdsharma26
closed
7 months ago
0
Updated version in pom.xml to 2.0.7-spark-3.4
#560
rdsharma26
closed
7 months ago
0
Updated version in pom.xml to 2.0.7-spark-3.3
#559
rdsharma26
closed
7 months ago
0
Updated version in pom.xml to 2.0.7-spark-3.2
#558
rdsharma26
closed
7 months ago
0
Updated version in pom.xml to 2.0.7-spark-3.1
#557
rdsharma26
closed
7 months ago
0
[FEATURE] Extend RatioOfSums to support other aggregations
#556
mentekid
opened
7 months ago
0
Column Count Analyzer and Check
#555
mentekid
closed
7 months ago
0
Question: DQ over time
#554
jonathanapp
opened
7 months ago
0
Fix for satisfies row level results bug
#553
rdsharma26
closed
7 months ago
0
New analyzer, RatioOfSums
#552
scott-gunn
closed
7 months ago
5
Support for Custom SQL Execution in Deequ Library
#551
skarthikbigdata
opened
7 months ago
0
Added RatioOfSums analyzer and tests
#550
scott-gunn
closed
8 months ago
0
Custom user analyzers
#549
sonofagunn
opened
8 months ago
0
[FEATURE] Can we enhance `VerificationSuite` to supports more than one Dataframe?
#548
Sat30
opened
8 months ago
0
[MinLength/MaxLength] Apply filtered row behavior at the row level evaluation
#547
rdsharma26
closed
8 months ago
1
Anomaly Detection: Add Daily Season with Hourly Interval to HoltWinter
#546
zeotuan
closed
8 months ago
5
Fix Breeze dependency conflict in Anomaly Detection Spark 3.4+
#545
zeotuan
closed
7 months ago
8
[BUG] Spark 3.4 and Deequ breeze version conflict
#544
zeotuan
closed
8 months ago
1