lancedb / lance

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..
https://lancedb.github.io/lance/
Apache License 2.0
3.95k stars 220 forks source link

Scanner tech debt #1992

Closed chebbyChefNEQ closed 8 months ago

chebbyChefNEQ commented 8 months ago

Problems:

eddyxu commented 8 months ago

I think we should also consider what logic should be in lance and what should be in lancedb (as vectordb).

Lance itself as format could be more generic.

wjones127 commented 8 months ago

We already have an issue here: https://github.com/lancedb/lance/issues/1782

I've folded these items into that issue.