-
## Description
Not all user input stored in the database will come from tickboxes; some entries will be free text (e.g. the user's emotions). We will need to clean these free-text inputs on user emotions.
Possible o…
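
As a rough illustration of what cleaning a free-text entry could look like, here is a minimal sketch in Python. The function name `clean_text_input` and the `max_length` limit are hypothetical and not part of the existing code; the steps shown (Unicode normalization, dropping control characters, collapsing whitespace, truncating) are only one possible choice.

```python
import re
import unicodedata


def clean_text_input(raw, max_length=500):
    """Normalize a free-text entry before storing it (hypothetical helper)."""
    # Normalize Unicode so visually identical strings compare equal.
    text = unicodedata.normalize("NFKC", raw)
    # Drop control and format characters, but keep ordinary whitespace.
    text = "".join(
        ch for ch in text
        if ch.isspace() or unicodedata.category(ch)[0] != "C"
    )
    # Collapse runs of whitespace into single spaces and trim the ends.
    text = re.sub(r"\s+", " ", text).strip()
    # Truncate to an assumed maximum length.
    return text[:max_length]
```

This only tidies the text itself. Values should still be passed to the database through parameterized queries rather than string concatenation.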
-
### Describe the feature
In certain contexts an `ExEx` requires access to comprehensive data from all pipeline stages including hashed state, merkle, transaction lookup and history. Currently the pip…
-
# Description
Add capital planning to the ETL
## Acceptance criteria
- [ ] Capital planning data is loaded into db for NYCPlanning/ae-zoning-api
-
### Motivation
We have now verified that the basic lake functionality is working as expected.
We now want to verify the data quality and completeness.
This means that additional SQL queries a…
-
## Summary
There should be a Prometheus metric showing whether there is at least one ETL node alive.
## Motivation
Such a metric would help create an alert if all ETL nodes Clio…
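
To make the requested behaviour concrete, here is a minimal Python sketch of the gauge logic, rendered in the Prometheus text exposition format. It is not tied to the project's actual implementation: the metric name `etl_any_node_alive`, the function names, and the `node_states` mapping are all hypothetical.

```python
def etl_alive_gauge(node_states):
    """Return 1 if at least one ETL node is alive, otherwise 0.

    node_states maps a (hypothetical) node identifier to True/False.
    """
    return 1 if any(node_states.values()) else 0


def render_metric(node_states, name="etl_any_node_alive"):
    """Render the gauge in the Prometheus text exposition format."""
    lines = [
        f"# HELP {name} 1 if at least one ETL node is alive, 0 otherwise.",
        f"# TYPE {name} gauge",
        f"{name} {etl_alive_gauge(node_states)}",
    ]
    return "\n".join(lines) + "\n"
```

An alert rule could then fire whenever the gauge stays at 0 for longer than a chosen interval.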
-
### Background / motivation
To integrate DuckDB ([#685](https://github.com/oceanprotocol/pdr-backend/issues/685)), the user should be able to follow the README.md, get their lake set up, and complete a…
-
How feasible is it to allow users to define their own custom functions to achieve complex ETL tasks?
Pandas provides a [DataFrame.eval()](https://pandas.pydata.org/docs/reference/api/pandas.DataFram…
-
This can be closed when the Python file can return structured data from the unstructured data sources below:
https://www.weather.gov/nwr/station_search
https://radio-locator…
-
**Is your feature request related to a problem?**
Apache Iceberg is designed for managing large analytic tables in a scalable and performant way, using features like schema evolution, partitioning,…
-
## User story
1. As a Machine Learning engineer
2. I want/need to refactor the existing scripts
3. So that we can fix the bugs and improve performance
## Acceptance criteria
* Examine the existing s…