dask-dataframes Search Results

1000+ results for dask-dataframes


apache/arrow #14025

Is it possible to provide a partial schema to pyarrow to_par…

I know that one particular column will always be a list of floats, so I do: ``` import pyarrow as pa schema = {"col1": pa.list_(int64())} df.to_parquet(schema=schema) ``` however, it seems …

hungcs updated 2 years ago
5
dmlc/xgboost #7454

`AttributeError` with fitting model on Dask Array backed by …

I came across a use case where attempting to fit a `DaskXGBClassifier` on a Dask Array whose partitions are `scipy.sparse.csr_matrix`s (as is returned by Dask-ML's `HashingVectorizer`) results in a `A…

jrbourbeau updated 2 years ago
4
dask/dask #7475

Array: Slightly More Complex Chunks

Hi, I am reading through the chunking options in - https://docs.dask.org/en/latest/array-chunks.html - https://docs.dask.org/en/latest/array-api.html?highlight=from_array#other-functions I wan…

ax3l updated 2 years ago
11
jmcarpenter2/swifter #143

Progress Bar doesn't seem to be working

Hi! I've been using swifter for a while as I'm working on an ETL process where I need to handle huge dataframes. I was used to seeing the progress bar when I used swifter.apply(), but it hasn't …

santiarcar updated 2 years ago
4
pola-rs/polars #6395

Pyarrow filter not pushed to scan_ds if datatype is a string

### Polars version checks - [X] I have checked that this issue has not already been reported. - [X] I have confirmed this bug exists on the [latest version](https://pypi.org/project/polars/) of …

dominikpeter updated 1 year ago
18
catalyst-cooperative/pudl-catalog #29

Improve catalog-level CI tests

The [CarbonPlan data catalog repo](https://github.com/carbonplan/data) provides [some examples of tests](https://github.com/carbonplan/data/blob/main/carbonplan_data/tests/__init__.py) that could appl…

zaneselvans updated 2 years ago
1
rapidsai/cudf #11382

Old issue( std::bad_alloc: CUDA error at: /workspace/.conda-…

Hi, I'm getting the error given below and using the WSL2 Ubuntu 20.04 instance on Windows 11 Preview. RuntimeError: CUDA error encountered at: /workspace/.conda-bld/work/cpp/src/bitmask/null_mas…

Shafi2016 updated 2 years ago
14
dask/distributed #6899

scheduler.get_comm_cost a significant portion of runtime in …

I've been profiling distributed workflows in an effort to understand where there are potential performance improvements to be made (this is ongoing with @gjoseph92 amongst others). I'm particularly in…

wence- updated 2 years ago
10
coiled/dask-community #1085

[Discourse] Larger than memory CSV processing

Hi all, I wanted to ask for some help to understand how to work with dataframes/CSVs larger than RAM/memory limit. What I want to do is being able to read a large CSV with a set `memory_limit` in th…

github-actions[bot] updated 2 years ago
1
coiled/feedback #171

Scheduler dying during task runs

A customer is experiencing the scheduler dying after running tasks successfully for a while (possibly a deadlock) Example cluster that died https://cloud.coiled.io/julianfb51/clusters/36106/2/detail…

shughes-uk updated 2 years ago
12
