github / release-radar

Repo for submission of projects to consider for the GitHub Release Radar 📡
https://releaseradar.github.com
Creative Commons Zero v1.0 Universal
308 stars 22 forks source link

[Release Radar Request] DataChain - open source Python library for processing and curating unstructured data at scale #224

Closed jendefig closed 1 month ago

jendefig commented 3 months ago

Open Source Project name

DataChain

What is your project?

DataChain is an open-source Python library for processing and curating unstructured data at scale.

🤖 AI-Driven Data Curation: Use local ML models or LLM APIs calls to enrich your data. 🚀 GenAI Dataset scale: Handle tens of millions of multimodal files. 🐍 Python-friendly: Use strictly-typed Pydantic objects instead of JSON.

DataChain supports parallel processing, parallel data downloads, and out-of-memory computing. It excels at optimizing offline batch operations.

The typical use cases include Computer Vision data curation, LLM analytics, and validation of multimodal AI applications.

Version

0.2.12 (debut to the public)

Date

July 23, 2024

Description of breaking changes

nil

GitHub Repo

https://github.com/iterative/datachain

Website

https://dvc.ai

Link to changelog

https://github.com/iterative/datachain/releases/tag/0.2.12

Social media

https://x.com/DVCorg/status/1815735081551729119

Anything else to add?

This release is the debut of this open-source tool to the community!

mishmanners commented 1 month ago

Thanks for submitting, we loved your project. The Release Radar features major version releases. Please submit again when you have version 1.0 and we'd love to consider your project for feature 😄

jendefig commented 1 month ago

Thank you!  Will do!

Jeny De Figueiredo

Community Manager

@. ( @. )

( https://calendly.com/jeny-dvc/30min ) ( https://github.com/iterative ) ( https://www.linkedin.com/company/iterative-ai/ )

( https://dvc.ai/ )

Sent via Superhuman ( @.*** )

On Wed, Sep 04, 2024 at 5:05 PM, mishmanners < @.*** > wrote:

Thanks for submitting, we loved your project. The Release Radar features major version releases. Please submit again when you have version 1.0 and we'd love to consider your project for feature 😄

— Reply to this email directly, view it on GitHub ( https://github.com/github/release-radar/issues/224#issuecomment-2330349106 ) , or unsubscribe ( https://github.com/notifications/unsubscribe-auth/AIFTMLZED6GU7CLZO2MIBRTZU6N35AVCNFSM6AAAAABLNBBCE6VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDGMZQGM2DSMJQGY ). You are receiving this because you authored the thread. Message ID: <github/release-radar/issues/224/2330349106 @ github. com>