spiceai / spiceai

A self-hostable CDN for databases. Spice provides a unified SQL query interface and portable runtime to locally materialize, accelerate, and query datasets across databases, data warehouses, and data lakes.
https://docs.spiceai.org
Apache License 2.0
1.86k stars 73 forks source link

Enhancement: MotherDuck Data Connector #1159

Open digadeesh opened 5 months ago

digadeesh commented 5 months ago

Goal-State/What/Result

MotherDuck can be used as a data connector with the Spice OSS

Milestones:

Why/Purpose

MotherDuck is a cloud based, serverless compute option integrated with DuckDB. Spice runtime users can seamlessly leverage their MotherDuck compute services in Spice.

By When

Issue/Spec written and reviewed: 05/14

M1: TBD Done-Done: TBD

Done-Done

The Algorithm

Spec

M1 MotherDuck Data Connector Alpha

- from: motherduck:database.schema.table
  name: my_table
  params: 
    motherduck_token: <token>
spice login motherduck <token>

Token from secrets is used in this case.

- from: motherduck:database.schema.table
  name: my_table

How/MAP

Implementation (issues)

QA

The following query must be used for manual QA to confirm Data Connector performance.

SELECT
regexp_extract(url, 'http[s]?://([^/]+)/', 1) AS domain,
count(*) AS count
FROM hacker_news
WHERE url IS NOT NULL AND regexp_extract(url, 'http[s]?://([^/]+)/', 1) != ''
GROUP BY domain
ORDER BY count DESC
LIMIT 20;
phillipleblanc commented 4 months ago

@sgrebnov to get PM/UX review sign off Goal State: Done-Done before next milestone.