issues
search
vaexio
/
vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
https://vaex.io
MIT License
8.22k
stars
590
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Fix pip install in CONTRIBUTING
#2261
franz101
closed
1 year ago
2
[BUG-REPORT] Export to parquet fails when first row of virtual column is None
#2260
vladmihaisima
opened
1 year ago
8
[FEATURE-REQUEST] vaex.agg.cpercent
#2259
bls-lehoai
closed
1 year ago
2
[feature-request] how to use Median, mode with Group by ?
#2258
oan-Dev05
opened
1 year ago
2
[BUG-REPORT] Requesting a vaex-dataset that contains both numeric and categorical columns
#2257
abf7d
opened
1 year ago
1
Update fits.py
#2254
balbinot
closed
1 year ago
2
[BUG-REPORT] Export to csv overrides mode and header settings
#2253
charvolant
opened
1 year ago
1
Slow groupby after adding column from array
#2252
statsrunner
opened
1 year ago
1
Fix: allow single columns or expressions in materialize
#2249
JovanVeljanoski
closed
1 year ago
0
[BUG-REPORT] unexpected behavior of `agg={"count": vaex.agg.count()}`
#2248
cgjosephlee
closed
1 year ago
2
[BUG-REPORT] materialize do not support column name length greater than 1
#2247
cgjosephlee
closed
1 year ago
2
[BUG-REPORT] Aggregation using "list" eats all memory and crashes kernel
#2246
LiamNiisan
closed
1 year ago
4
Fix: export to hdf5 when there are numeric missing values
#2245
JovanVeljanoski
closed
1 year ago
1
Fix: filtering with an arrow string scalar
#2244
JovanVeljanoski
closed
1 year ago
0
docs: update open_many docstring
#2243
JovanVeljanoski
closed
1 year ago
0
[BUG-REPORT] String comparison failed with `pyarrow.StringScalar` type
#2242
cgjosephlee
opened
1 year ago
1
Complete documentation for vaex.open_many()
#2241
joseberlines
opened
1 year ago
1
How to increase join performance
#2239
hermidalc
closed
1 year ago
1
How to get join to use less memory
#2238
hermidalc
opened
1 year ago
2
[BUG-REPORT] min aggregation returns inf when sorting and delay Flags are set to True
#2235
vignesh-bungee
opened
1 year ago
2
[FEATURE-REQUEST] The join "how" doesn't support "cross" option
#2234
hewittzgh
closed
1 year ago
2
Fix: `extract()` on an empty dataframe due to a filter
#2233
JovanVeljanoski
closed
1 year ago
1
[BUG-REPORT] to_pandas_df throws exception on filtered data frame on virtual column, with empty result
#2232
vladmihaisima
closed
1 year ago
8
[BUG-REPORT] Vaex uses more threads than it should / documentation is incomplete
#2231
vladmihaisima
closed
1 year ago
1
[BUG-REPORT] median_approx and percentile_approx return nan instead of actual value
#2230
abianco88
opened
1 year ago
17
Exporting to arrow seems to create corrupted or invalid output
#2228
alvations
closed
1 year ago
2
[BUG-REPORT] `.join()` throws `pyarrow.lib.ChunkedArray' object has no attribute 'view'`
#2227
jmakov
opened
1 year ago
0
[FEATURE-REQUEST] Import and version checking improvements #2211
#2226
franz101
closed
1 year ago
2
Docs: update i/o guide for with the new CSV methods backed by arrow.
#2225
JovanVeljanoski
closed
1 year ago
1
Lazy CSV reading improvement: auto-detect types
#2224
JovanVeljanoski
closed
1 year ago
0
Docs: Add missing docstrings to `df.export_csv` and `df.export_csv_arrow`
#2223
JovanVeljanoski
closed
1 year ago
2
Feat: better support for JSON i/o
#2222
JovanVeljanoski
opened
1 year ago
0
fix: support arrow datasets containing multiple fragment without row_groups
#2221
maartenbreddels
closed
1 year ago
0
Add support for the arrow export csv backend in export_many
#2220
JovanVeljanoski
closed
1 year ago
1
[BUG-REPORT] `vaex.from_arrow_dataset` throws`'pyarrow._dataset.FileFragment' object has no attribute 'row_groups'`
#2219
jmakov
closed
1 year ago
0
fix: convert options for lazy from_csv_arrow did not work
#2218
maartenbreddels
closed
1 year ago
0
[BUG-REPORT] Cannot export large_string type to arrow file
#2217
Ben-Epstein
closed
1 year ago
2
[BUG-REPORT] dataframes with columns of large strings cannot be concatenated
#2216
Ben-Epstein
opened
1 year ago
5
Fix for the vaex-viz CI issue due to the latest matplotlib (3.6.0)
#2215
JovanVeljanoski
closed
1 year ago
1
[Question] How to strip blank,new line from data when importing CSV?
#2213
lehoai
closed
1 year ago
0
[FEATURE-REQUEST] Support Google Colab installation without runtime restart
#2211
franz101
closed
1 year ago
6
Export_hdf5 with an error: ValueError: No memory tracker found with name default
#2210
wybert
opened
1 year ago
5
Fix: names with math symbols should work consistently
#2208
JovanVeljanoski
closed
1 year ago
0
[BUG-REPORT] df.limits breaks in unittests with RuntimeError: stride is not equal to 1
#2207
abf7d
opened
1 year ago
2
Fix Typo in eq2gal
#2206
TomCallingham
closed
1 year ago
2
Add a `__dataframe__` method to `_VaexDataFrame`
#2205
rgommers
closed
1 year ago
7
[BUG-REPORT] Error converting from csv file to hdf5 file with
#2202
zhiyongm
opened
1 year ago
1
[BUG-REPORT] Extreme CPU and memory usage for df.apply
#2201
piri-p
opened
1 year ago
1
[BUG-REPORT] cannot request duplicate column names before export to arrow or parquet
#2200
Ben-Epstein
opened
1 year ago
0
Fix: nunique agg for numeric with selections
#2199
JovanVeljanoski
closed
1 year ago
0
Previous
Next