issues
search
CoxAutomotiveDataSolutions
/
waimak
Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.
Apache License 2.0
75
stars
16
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Changed delimiter for Deequ genericSQLCheck
#98
vavison
closed
4 years ago
1
Generic sql deequ checks with commas in do not work
#97
vavison
closed
4 years ago
0
Create directory before doing move in writeAsNamedFiles
#96
vavison
closed
4 years ago
1
writeAsNamedFiles should create the output directory if it does not exist
#95
vavison
closed
4 years ago
0
Make Storage Layer implementation consistent on S3
#94
alexjbush
opened
5 years ago
0
Feature/spark cache action
#93
vavison
closed
5 years ago
0
Feature/data quality monitoring
#92
vavison
closed
5 years ago
1
Remove modules and update Spark 2.12 version
#91
alexjbush
closed
5 years ago
1
Write as named file(s)
#90
alexjbush
closed
5 years ago
0
Feature/property retry
#89
alexjbush
closed
5 years ago
0
Feature/config driven extension
#88
alexjbush
closed
5 years ago
1
Retry on property value getters
#87
alexjbush
closed
5 years ago
0
Rework the api again
#86
alexjbush
closed
5 years ago
1
Feature/generalise commit and extensions
#85
alexjbush
closed
5 years ago
1
Feature/using f bounded types
#84
vavison
closed
5 years ago
1
Feature/hadoop db connectors for complex types
#83
vavison
closed
5 years ago
1
Feature/multiple environments in environment action
#82
vavison
closed
5 years ago
0
Feature/max executors for waimak apps
#81
vavison
closed
5 years ago
1
Fix for parallel scheduler hanging on certain types of exceptions
#80
alexjbush
closed
5 years ago
1
Feature/issue 74 execute action
#79
alexjbush
closed
5 years ago
1
Feature/move to codecov
#78
alexjbush
closed
5 years ago
1
Issue 75 - Configurable Spark prefix in case class configuration parser
#77
alexjbush
closed
5 years ago
2
Track lineage in Waimak
#76
alexjbush
opened
5 years ago
1
Make the spark prefix configurable for Case Class config parser
#75
alexjbush
closed
5 years ago
1
Add execute action/function to DataFlow/SparkDataFlow
#74
alexjbush
closed
5 years ago
0
Feature/single compaction stage
#73
vavison
closed
5 years ago
1
Clean up temporary folder by default after flow execution
#72
alexjbush
closed
5 years ago
1
Simplify compaction stages
#71
alexjbush
closed
5 years ago
0
Waimak tmp cleanup flag
#70
alexjbush
closed
5 years ago
0
Feature/allow recreation of storage table metadata
#69
vavison
closed
5 years ago
0
Change force recreate tables flag to waimak configuration parameter
#68
alexjbush
closed
5 years ago
1
Feature/experimental module
#67
vavison
closed
5 years ago
1
Feature/optionally remove storage history on compaction
#66
vavison
closed
5 years ago
1
Force recreate tables should be a Waimak flag rather than connector flag
#65
alexjbush
closed
5 years ago
0
Allow tables in the storage layer to be marked in such a way that they will be deduplicated on compaction with no history retained
#64
vavison
closed
5 years ago
0
Auto-detect schema change a recreate tables for HadoopConnectors
#63
alexjbush
opened
5 years ago
1
Feature/generic properties providers
#62
alexjbush
closed
5 years ago
1
Clean-up strategy should error if no folders exist after committing
#61
alexjbush
opened
5 years ago
0
Rework storage API
#60
alexjbush
closed
5 years ago
2
Consider average row size for compaction and fix recompactAll behaviour
#59
alexjbush
closed
5 years ago
1
Added test jar sources to generated artifacts
#58
alexjbush
closed
5 years ago
1
Issues #53 Allow repartition by integer value
#57
alexjbush
closed
5 years ago
1
Exposed uri used on the flow in the Spark context
#56
alexjbush
closed
5 years ago
1
ParquetDataCommitter fails when performing listStatus on the output directory if it does not exist
#55
vavison
closed
5 years ago
1
Only cache labels if they are actually used by more than one action further down the flow
#54
vavison
closed
5 years ago
1
Allow repartition by non-named partition for ParquetDataCommiter
#53
alexjbush
closed
5 years ago
0
Create a generic Main method for Waimak
#52
alexjbush
closed
5 years ago
0
Rework storage API
#51
alexjbush
closed
5 years ago
0
Test sources are missing from Maven
#50
alexjbush
closed
5 years ago
0
Implement an approach to filter existing flows so a subflow could be executed
#49
alexjbush
opened
5 years ago
0
Previous
Next