BirdiD / BirdiDQ

BirdiDQ combines the open-source Python library Great Expectations with natural language queries to identify and report data quality issues, all at your fingertips.
Apache License 2.0

BirdiDQ - About 🔍

BirdiDQ is an intuitive, user-friendly data quality application that lets you run data quality checks with the open-source Python library Great Expectations (GE) using natural language queries. Type in your request, and BirdiDQ generates the appropriate GE method, runs the quality check, and returns the results along with the data docs you need.

Demo Video
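The flow described above (a request becomes a GE method call, which is then run against the data) can be sketched roughly as follows. This is a minimal illustration, not BirdiDQ's actual code: `ToyValidator` and `run_generated_expectation` are hypothetical names, the LLM step is replaced by a hard-coded string, and the toy validator only imitates the shape of a real Great Expectations result. The expectation name `expect_column_values_to_not_be_null` is a real GE expectation.

```python
import ast


class ToyValidator:
    """Hypothetical stand-in for a Great Expectations validator (illustration only)."""

    def __init__(self, rows):
        self.rows = rows  # list of dicts, one per record

    def expect_column_values_to_not_be_null(self, column):
        # Count rows where the column is missing or None.
        unexpected = sum(1 for row in self.rows if row.get(column) is None)
        return {"success": unexpected == 0, "unexpected_count": unexpected}


def run_generated_expectation(validator, generated_call):
    """Parse a generated call string and dispatch it to the validator.

    The string is parsed with `ast` rather than `eval`, so only a single
    call with literal keyword arguments is accepted.
    """
    node = ast.parse(generated_call.strip(), mode="eval").body
    if not isinstance(node, ast.Call) or not isinstance(node.func, ast.Name):
        raise ValueError("expected a single call such as expect_...(column='x')")
    if node.args or any(kw.arg is None for kw in node.keywords):
        raise ValueError("only explicit keyword arguments are supported")
    name = node.func.id
    method = getattr(validator, name, None)
    if not name.startswith("expect_") or method is None:
        raise ValueError(f"unknown expectation method: {name}")
    # literal_eval accepts only literals (strings, numbers, lists, ...).
    kwargs = {kw.arg: ast.literal_eval(kw.value) for kw in node.keywords}
    return method(**kwargs)


# Assumed LLM output for the request "the age column should never be empty".
generated = 'expect_column_values_to_not_be_null(column="age")'
rows = [{"name": "a", "age": 31}, {"name": "b", "age": None}]
result = run_generated_expectation(ToyValidator(rows), generated)
print(result)
```

Parsing the generated text instead of evaluating it directly is one possible design choice here: it keeps model output from running arbitrary code, at the cost of supporting only simple keyword-argument calls.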

Note

BirdiDQ is an open-source project that is still under development. Contributions are welcome!

Features

Tech Stack

This app is an LLM-powered app built using:

Installation instructions

To run BirdiDQ, you need to perform the following steps:

  1. Clone the repository locally:

    git clone https://github.com/BirdiD/BirdiDQ.git
  2. (Recommended) Create a virtual environment and activate it:

    python3 -m venv bir_env
    source bir_env/bin/activate
  3. Install the required dependencies:

    pip install -r requirements.txt
  4. Run the app:

    streamlit run great_expectations/app.py

Note: BirdiDQ can use OpenAI's ChatGPT or Falcon LLM to convert natural language descriptions to expectations. If you plan to use Falcon, consider using PyTorch with GPU support for better performance. To install PyTorch with CUDA support, follow the instructions for your operating system on the PyTorch website.
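To confirm that a PyTorch install actually sees the GPU before running Falcon locally, a quick check along these lines may help. This is a small sketch, not part of BirdiDQ: `pick_device` is a hypothetical helper name, and it falls back to the CPU when PyTorch is not installed at all. `torch.cuda.is_available()` is the standard PyTorch call for this check.

```python
import importlib.util


def pick_device():
    """Return "cuda" if PyTorch is installed and sees a CUDA GPU, else "cpu"."""
    if importlib.util.find_spec("torch") is None:
        return "cpu"  # PyTorch is not installed in this environment
    import torch  # imported lazily so the check also runs without PyTorch

    return "cuda" if torch.cuda.is_available() else "cpu"


print(pick_device())
```

If this prints `cpu` on a machine with an NVIDIA GPU, the installed PyTorch build is probably a CPU-only one.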

System requirements for local Falcon LLM usage

Falcon 7b is an open-source large language model (LLM) that BirdiDQ can use to convert natural language descriptions to Great Expectations expectations. To use the current fine-tuned Falcon 7b, your system must meet the following minimum requirements:

Example Queries

Here are some example queries you can try with BirdiDQ:

BirdiDQ Integration Stack

BirdiDQ integrates, connects, and works with a range of tools and services.