issues
search
lisad
/
phaser
The missing layer for complex data batch integration pipelines
MIT License
9
stars
1
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Removes ReshapePhase and replaces it with renumber attribute
#128
lisad
closed
6 months ago
0
Support DVC format diffs? data people already familiar with
#127
lisad
opened
6 months ago
1
Adds row number validation when we recordize data.
#126
lisad
closed
6 months ago
1
Adds a new builtin step - filter_rows.
#125
lisad
closed
6 months ago
0
Write a filter method as a builtin step that takes a lambda
#124
lisad
closed
6 months ago
0
Have a way to suppress drop-row messages by step
#123
lisad
closed
5 months ago
1
ReshapePhase needs columns for parsing types
#122
lisad
closed
5 months ago
1
Add some kind of command line control over running just one phase?
#121
lisad
opened
6 months ago
1
Phase save errors bubble up - including error if no rows return
#120
lisad
closed
6 months ago
0
Changes the column error handling keyword to be consistent
#119
lisad
closed
6 months ago
0
Default to dropping rows with only whitespace
#118
lisad
closed
5 months ago
1
Moves previous run files to another directory - seems most useful
#117
lisad
closed
6 months ago
0
Enables document generation on Read the Docs
#116
jeffkole
closed
6 months ago
0
Use proper python logging and make it configurable
#115
lisad
opened
6 months ago
0
Proof of concept of using Sphinx to generate docs
#114
jeffkole
closed
6 months ago
1
Handles all the LMDTODOs
#113
lisad
closed
7 months ago
0
Fixes tests by mirroring run command behavior
#112
jeffkole
closed
7 months ago
0
Adds error_policy to pipeline, to CLI, and tests it on pipeline
#111
lisad
closed
7 months ago
0
Set pytest as a dev dependency rather than a hard dependency
#110
jeffkole
closed
1 month ago
1
Reduce dependency on Pandas
#109
jeffkole
closed
1 month ago
0
Error handling in pipeline, better reporting of which phase
#108
lisad
closed
7 months ago
0
Adds support for sources in CLI
#107
jeffkole
closed
7 months ago
0
Adds test for unicode support end-to-end
#106
jeffkole
closed
7 months ago
0
Fix csv parsing errors that occur with small files
#105
jeffkole
closed
5 days ago
1
Refactors Context, especially to index and store evens differently
#104
lisad
closed
7 months ago
0
Adds helper features for manipulating outputs and sources
#103
jeffkole
closed
7 months ago
1
Adds row to exception if exception is thrown in a batch test
#102
lisad
closed
7 months ago
0
Move error policy to running Pipeline, not as a baked-in property of a Phase
#101
lisad
closed
7 months ago
1
Refactor that moves process_exception to Context, and constants to a constants file
#100
lisad
closed
7 months ago
0
Reporting tracebacks
#99
lisad
closed
7 months ago
0
Adds a documentation section on running phaser, and fixes docs for cli
#98
lisad
closed
7 months ago
1
We should create run-id subdirectories inside the working directory
#97
lisad
closed
6 months ago
2
Dataframe step can pass row numbers and preserve, or not
#96
lisad
closed
7 months ago
0
Adds row number continuation to batch step
#95
lisad
closed
7 months ago
1
Row numbers stay consistent
#94
lisad
closed
7 months ago
0
Add helper features for manipulating outputs and sources
#93
jeffkole
closed
7 months ago
8
Pipes extra outputs to extra sources
#92
jeffkole
closed
7 months ago
0
Makes ReshapePhase more like Phase - replaces DataFramePhase with dataframe_step
#91
lisad
closed
8 months ago
1
Testing batch step more: fixed how it works when the batch step forgets to declare context param
#90
lisad
closed
8 months ago
1
Log correct row number with header consistency check
#89
jeffkole
opened
8 months ago
1
Add CLI support for additional sources
#88
jeffkole
closed
7 months ago
0
Capture traceback information for errors as they occur
#87
jeffkole
closed
7 months ago
0
Fixes a bug caused by DataFrames
#86
jeffkole
closed
8 months ago
0
Adds more None and null and empty value handling and removes another pandas import
#85
lisad
closed
8 months ago
0
Adds more functionality and tests to the method that canonicalizes field names
#84
lisad
closed
8 months ago
0
Adds a test where I had a comment to remember to do one
#83
lisad
closed
8 months ago
0
clevercsv puts "NULL" in columns that do not have values
#82
jeffkole
closed
8 months ago
0
Adds saving CSV via clevercsv, fixes tests, adds test to make sure we don't save NaN
#81
lisad
closed
8 months ago
1
DataFramePhase must be subclassed ... seems annoying
#80
lisad
closed
7 months ago
0
Add tests to confirm that unicode characters roundtrip from file to pipeline to phase back to file
#79
lisad
closed
7 months ago
1
Previous
Next