issues
MrPowers/mack
Delta Lake helper methods in PySpark
https://mrpowers.github.io/mack/
MIT License
303 stars, 44 forks
Remove unused code | #84 | robertkossendey | closed | 1 year ago | 1
Fix append without duplicates | #83 | robertkossendey | closed | 1 year ago | 1
Add LICENSE | #82 | MrPowers | closed | 1 year ago | 2
Duplication allowed in append_without_duplicates when it comes in the input dataframe | #81 | brayanjuls | closed | 1 year ago | 1
Add confessionsofadataguy blog post to README | #80 | MrPowers | closed | 1 year ago | 1
Type 3 SCD Upsert Abstraction | #79 | MrPowers | opened | 1 year ago | 3
Remove code not being used in drop_duplicates_pkey | #78 | brayanjuls | closed | 1 year ago | 2
kB convert with 1024 | #77 | betizad | closed | 1 year ago | 8
Order columns for Data Skipping | #76 | robertkossendey | opened | 1 year ago | 14
Ordering columns better for data skipping | #75 | MrPowers | opened | 1 year ago | 2
Datavault Raw Vault loading | #74 | MiguelElGallo | opened | 1 year ago | 1
Add maintainers to the README | #73 | MrPowers | closed | 1 year ago | 0
Docker image for contributors | #72 | Triamus | opened | 1 year ago | 11
Add better contributing instructions | #71 | MrPowers | closed | 1 year ago | 3
Promote functionality of project on social media | #70 | MrPowers | opened | 1 year ago | 15
correcting function name in readme docs | #69 | Triamus | closed | 1 year ago | 2
Add a "constraint append" that appends the valid rows and puts the invalid rows in another table | #68 | MrPowers | closed | 1 year ago | 6
Change SCD functions to take DeltaTable instead of Path | #67 | PadenZach | closed | 1 year ago | 1
Small function name error in README docs | #66 | Triamus | closed | 1 year ago | 4
Allow the use of Catalog Providers on sc2 upserts. | #65 | PadenZach | closed | 1 year ago | 3
Feature Request: Flatten Nested Schema | #64 | gardnmi | closed | 1 year ago | 1
Add show_delta_file_sizes | #63 | robertkossendey | closed | 1 year ago | 5
Add a show_delta_file_sizes method | #62 | MrPowers | closed | 1 year ago | 0
added type hints and docstrings | #61 | squerez | closed | 1 year ago | 7
humanize delta_file_sizes | #60 | christophergrant | closed | 1 year ago | 4
Feature/dlq | #59 | souvik-databricks | opened | 1 year ago | 4
get all composite key candidates function added | #58 | souvik-databricks | opened | 1 year ago | 5
Refactor find composite key | #57 | MrPowers | closed | 1 year ago | 0
find_composite_key_candidates should allow users to return all possible candidate keys | #56 | MrPowers | opened | 1 year ago | 1
Make with_md5_cols example more minimalistic | #55 | MrPowers | closed | 1 year ago | 2
Make actions fail | #54 | robertkossendey | closed | 1 year ago | 0
Brainstorm data quality features | #53 | robertkossendey | opened | 1 year ago | 6
Refactor validate_append | #52 | MrPowers | closed | 1 year ago | 1
Add validate append method | #51 | robertkossendey | closed | 1 year ago | 1
Flake8 does not fail | #50 | robertkossendey | closed | 1 year ago | 3
Parameter names for append_without_duplicates function | #49 | MrPowers | opened | 1 year ago | 1
Brainstorm correct ways to include PySpark & Delta dependencies in pyproject.toml file | #48 | MrPowers | opened | 1 year ago | 4
Add type hints for every public facing API function | #47 | MrPowers | closed | 1 year ago | 3
Test every edge case | #46 | MrPowers | opened | 1 year ago | 2
Set shuffle partitions lower to speed up test suite | #45 | MrPowers | closed | 1 year ago | 1
Add with_md5_cols and find_composite_key_candidates functions | #44 | souvik-databricks | closed | 1 year ago | 5
Brainstorm middle ground type of schema evolution | #43 | MrPowers | closed | 1 year ago | 6
Spelling mistake | #42 | robertkossendey | closed | 1 year ago | 1
Rename is_composite_key | #41 | robertkossendey | closed | 1 year ago | 0
Possibly rename is_composite_key function | #40 | MrPowers | closed | 1 year ago | 1
Deprecate mack validation error | #39 | robertkossendey | closed | 1 year ago | 1
Create better publishing process | #38 | MrPowers | opened | 1 year ago | 1
Raise exceptions better | #37 | MrPowers | closed | 1 year ago | 1
Add is unique col method | #36 | robertkossendey | closed | 1 year ago | 5
Add some code philosophy discussion points | #35 | MrPowers | closed | 1 year ago | 3