-
CoreNLP version 4.5.0 using `pos lemma depparse`. I run the pipeline within Spark (Scala). I lazy initialise the CoreNLP pipeline and I broadcast the pipeline to each executor using lazy instantiation…
-
https://docs.openrefine.org/technical-reference/openrefine-api has limited operations on projects: create (and upload inputs), apply operations, export data, delete.
We use OntoRefine to automate k…
-
Cloud Native Postgres currently relies only on the archive log to synchronize those standby servers that have fallen out of sync. However, for some high workload scenarios, this can be inefficient. We…
-
As with the 2021 release of Visual Behavior ophys data, the April release of Visual Behavior Neuropixels data will have two types of files: NWB files with Neuropixels (ecephys) data and behavior-only …
-
From discussion here https://github.com/owid/etl/pull/144#pullrequestreview-937631199
> Thanks @Marigold ! The data, metadata and pipeline look good. Just two comments (that should not affect this …
-
Requisitos:
● Conhecimento em banco de dados NoSQL e SQL;
● Computação em nuvem como AWS e GCP;
● Arquitetura e estruturas de dados;
● Modelagem dimensional de dados;
● Experiência em pipelines d…
-
### Apache Airflow version
2.2.5 (latest released)
### What happened
I have defined a DAG
```python
default_args = {
'owner': 'airflow',
'depends_on_past': False,
'email': ['info…
-
[Software Engineer Fullstack with English] - Remoto - PJ - USD 5 K - USD 6 K - Brazil Residents.
RESPONSIBILITIES
Strategically define, design, implement, deliver and operate software applicat…
-
## What You'll get
### Salary Expectation
- $120K - $200K dependent on experience and ability - mid/senior/principle levels
- Full Time
### Benefits
- Health Insurance.
- Financi…
-
### Feature description
Integrate an open source workflow manager, perhaps [Dagster.io](https://dagster.io/) that would allow the user to build and deploy ETL and computational pipelines easily.
###…