-
### Description
Make `ray.data.from_arrow` efficiently support iterators of Arrow tables. Currently, `from_arrow` loads all of the data into memory.
### Use case
If your data can't fit in memor…
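The memory concern can be sketched with plain Python stand-ins (the `table_batches` generator and its row tuples below are hypothetical placeholders for an iterator of pyarrow Tables, not Ray or Arrow APIs):

```python
# Hedged sketch: eagerly materializing an iterator of tables (analogous to
# current from_arrow behavior) vs. consuming it one batch at a time
# (the behavior this issue requests).

def table_batches(n_batches, rows_per_batch):
    """Yield one 'table' (here just a list of row tuples) at a time."""
    for b in range(n_batches):
        yield [(b, r) for r in range(rows_per_batch)]

# Eager: everything is resident in memory at once.
eager = list(table_batches(3, 4))
assert len(eager) == 3

# Lazy: peak memory is a single batch, which is what makes
# larger-than-memory inputs feasible.
total_rows = 0
for batch in table_batches(3, 4):
    total_rows += len(batch)
assert total_rows == 12
```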
-
### System Info
- `transformers` version: 4.41.2
- Platform: Linux-5.10.0-1.0.0.28-x86_64-with-glibc2.31
- Python version: 3.10.14
- Huggingface_hub version: 0.24.6
- Safetensors version: 0.4.4…
-
I have tried using both dense_hash_map and sparse_hash_map for inserting millions of entries into the hash table in a distributed environment, where each process holds its own private hash table. Unfo…
-
**Describe the bug**
With [PR](https://github.com/rapidsai/cuml/pull/4300) we enabled training single-GPU cuML models using Dask DataFrames and Series, but we use `compute` there, which brings data t…
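A minimal stand-in for what `compute` does here, using plain Python lists in place of Dask partitions (the `compute` function below is a hypothetical illustration, not the Dask API):

```python
# Hedged sketch: distributed partitions are concatenated into one local
# object. This is the memory hazard the issue describes when the combined
# data exceeds what a single GPU (or process) can hold.
partitions = [[1, 2], [3, 4], [5, 6]]  # stand-ins for Dask partitions

def compute(parts):
    out = []
    for p in parts:
        out.extend(p)  # everything lands in one process's memory
    return out

assert compute(partitions) == [1, 2, 3, 4, 5, 6]
```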
-
Flamingo model initialized with 23461888 trainable parameters
╭───────────────────── Traceback (most recent call last) ──────────────────────╮
│ /home/jovyan/taoheng/work/Multimodal-GPT/mmgpt/tr…
-
At present, `DiskDataset` is our workhorse class for large datasets. This class is pretty nicely optimized with a cache and everything, and I've been able to use it on 50GB datasets without too much t…
-
Currently you can link to an anchor tag _within_ a `Doc` term with the anchor tag link syntax, but we've encountered a few cases where it would be useful to link to an anchor tag within a different `D…
-
Subsequent Blockwise layers are currently fused into a single layer. This reduces the number of tasks and the scheduling overhead, and is generally a good thing to do. Currently, the fused output does not gener…
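The fusion idea can be illustrated with plain functions standing in for per-block tasks (this is a conceptual sketch, not Dask's actual graph machinery):

```python
# Hedged sketch of blockwise fusion: two per-block operations are collapsed
# into one composed task, halving the number of tasks in the graph.
def inc(x):
    return x + 1

def double(x):
    return x * 2

blocks = [1, 2, 3]

# Unfused: one task per operation per block (6 tasks total).
unfused = [double(inc(b)) for b in blocks]

# Fused: a single composed task per block (3 tasks total), same result.
def fused(b):
    return double(inc(b))

fused_out = [fused(b) for b in blocks]

assert unfused == fused_out == [4, 6, 8]
```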
-
Thanks for your code.
However, when I run "CUDA_VISIBLE_DEVICES=0 python -m torch.distributed.launch --master_port 10025 --nproc_per_node=1 tools/relation_train_net.py --config-file "configs/SHA_GCL_…
-
### Missing functionality
# Support for the Modin framework to make EDA on larger datasets much faster
I would like to request the addition of support for the Modin framework to our EDA tools. As …
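One way such support is often wired in is a backend-selection shim that prefers Modin when it is installed and falls back to pandas otherwise. The `pick_backend` helper below is a hypothetical sketch of that pattern; `modin.pandas` and `pandas` are the real package names, but whether they import depends on the environment:

```python
import importlib

# Hedged sketch: return the first importable dataframe backend,
# preferring Modin's drop-in pandas API.
def pick_backend(preferred=("modin.pandas", "pandas")):
    for name in preferred:
        try:
            return importlib.import_module(name)
        except ImportError:
            continue
    raise ImportError("no dataframe backend available")
```

Because Modin mirrors the pandas API, code written against the returned module (e.g. `pd = pick_backend(); pd.read_csv(...)`) would not need to change between backends.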