activeloopai / deeplake

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
https://activeloop.ai
Mozilla Public License 2.0
7.88k stars 607 forks source link

Fix nan case for ingest_dataframe #2800

Closed istranical closed 3 months ago

istranical commented 3 months ago

🚀 🚀 Pull Request

Impact

Description

-- Replaces nans in the dataframe so ingestion does not fail for cases when there are strings and nans in the same column. In general, since Deep Lake doesn't support nan explicitly, it's best to replace these values with None in all cases, even if it could work for arrays in some cases.

Things to be aware of

Things to worry about

-- How much is the slowdown for replacing nans in large dataframes

Additional Context

CLAassistant commented 3 months ago

CLA assistant check
All committers have signed the CLA.

codecov[bot] commented 3 months ago

Codecov Report

All modified and coverable lines are covered by tests :white_check_mark:

Project coverage is 83.96%. Comparing base (5004257) to head (70a84a5).

Additional details and impacted files ```diff @@ Coverage Diff @@ ## main #2800 +/- ## ========================================== - Coverage 84.50% 83.96% -0.55% ========================================== Files 235 235 Lines 26527 26528 +1 ========================================== - Hits 22417 22273 -144 - Misses 4110 4255 +145 ``` | [Flag](https://app.codecov.io/gh/activeloopai/deeplake/pull/2800/flags?src=pr&el=flags&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=activeloopai) | Coverage Δ | | |---|---|---| | [unittests](https://app.codecov.io/gh/activeloopai/deeplake/pull/2800/flags?src=pr&el=flag&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=activeloopai) | `83.96% <100.00%> (-0.55%)` | :arrow_down: | Flags with carried forward coverage won't be shown. [Click here](https://docs.codecov.io/docs/carryforward-flags?utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=activeloopai#carryforward-flags-in-the-pull-request-comment) to find out more.

:umbrella: View full report in Codecov by Sentry.
:loudspeaker: Have feedback on the report? Share it here.

sonarcloud[bot] commented 3 months ago

Quality Gate Passed Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
100.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarCloud