-
Extend the Flask API to read from a .jsonl file of scraped Officers data and insert it into the Postgres DB.
**Is your feature request related to a problem? Please describe.**
As part of our data …
-
It's possible to write rudimentary tests in DAX Studio using a combinations of VAR for ExpectedValues, CalculatedValues
And then check values match using an IF equality check.
It would be good i…
-
This notebook covers Phase 1 of the streaming data pipeline:
- https://nbviewer.jupyter.org/github/SuperCowPowers/bat/blob/master/notebooks/Zeek_to_Kafka.ipynb
Lets write a second notebook that co…
-
We analyze zipped ETL trace files which contain included NGEN pdbs in an automated pipeline. In order to give the location info of those NGEN pdbs to SymbolReader we use the source server style syntax…
-
Hello team,
I'd like to try the kafka-connect-sap to setup an ETL pipeline to extract data from SAP HANA, while the transactional data in SAP HANA is used schema-based multi-tenancy, which means ea…
-
I have a use case where I'm picking up CSV over SFTP (using a library called `pysftp`) and running it through a pipeline. Initially I thought it would look something like this:
```
import petl as …
-
At this time they are sent via array in the DAG params (e.g. `"schema_fields_array": "['field1', 'field2']"`) and everything is treated as `text` type.
What are good alternatives?
Questions:
- Modi…
-
The documentation, explaining the ETL process for images (https://deeplearning4j.org/simple-image-load-transforml) could be improved regarding describing the transformation possibilities:
"The pipe…
-
Hi,
I am getting the following error while running the command bash run.sh
Cleaning working and output directories...
CommandException: "rm" command does not support "file://" URLs. Did you mean…
-
Possible solutions:
- https://github.com/misterion/ko-process
- https://github.com/videlalvaro/php-amqplib