OHDSI / WhiteRabbit

WhiteRabbit is a small application that can be used to analyse the structure and contents of a database as preparation for designing an ETL. It comes with RabbitInAHat, an application for interactive design of an ETL to the OMOP Common Data Model with the help of the the scan report generated by White Rabbit.
http://ohdsi.github.io/WhiteRabbit
Apache License 2.0
178 stars 88 forks source link

Using Synapse with WhiteRabbit #396

Open solmazeradat opened 10 months ago

solmazeradat commented 10 months ago

Hi,

Hope you are well.

I posted a similar post on the use of Databricks, Spark and Snowflake with whiteRabbit here.

We are looking at building a pipeline where the data volume/size is of the order of terabits. We want to ensure both the source data as well as the CDM data is compatible with analytical tools for big data as well as the OHDSI analytical tool kit.

Since the scanReport from WhiteRabbit tool is integral to the mapping process, wanted to check if WhiteRabbit supports the use of any of the following data databases in Synapse :

Many thanks, Solmaz

janblom commented 9 months ago

Hi,

I presume you are referring to the Azure Synapse Analytics platform, and I also assume that the Azure database connection option that is currently present in WhiteRabbit does not work with Synapse.

There is currently no support in WhiteRabbit for Synapse, and there are also no plans to implement this, as far as I know.

There are a few options though:

Best regards, Jan Blom

pasirikala commented 9 months ago

This is not worth much but the Synapse Analytics dedicated instance database "kind of works" with the current WhiteRabbit:

Disclaimer: Some scan results work perfectly, others look ok but aren't accepted by Rabbit in a Hat (details are a bit foggy at this moment. Seems to be related to "<=" that is generated into the Fraction unique-column).