large-scale-data-processing Search Results

1000+ results
for large-scale-data-processing

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

e-mission/op-admin-dashboard #145

Collect data on performance improvements to the dashboard

Plan discussed at meeting We will use the following environments: - stage - ccebikes (large: 86k trips, 108 users) - smart commute (medium, long-term: 18k trips, 21 users) - STM community (smal…

shankari updated 1 week ago
52
sgkit-dev/vcf-zarr-publication #161

Python Tensorstore af-dist implementation

The C++ af-dist implementation added in #157 (and notes in #160) is surprisingly slow. In some informal tests it looks like it's about 3X slower than Python Zarr, just retrieving the chunks. There's …

jeromekelleher updated 3 days ago
41
mosaicml/llm-foundry #870

How to support multi-threaded parallel data preprocessing?

I want to pretrain an LLM with 2T tokens using llm-foundry. But before training, the data processing time is too long. Is there any way to accelerate it?

YixinSong-e updated 9 months ago
11
nextcloud/android #8154

'Processing' indicator for AutoUpload

### Is your feature request related to a problem? Please describe. When you tell the NextCloud app to AutoUpload a camera dir with >3000 images in it, the app needs to process each of the images befo…

jeffWelling updated 11 months ago
7
ModelAtlasofTheEarth/model_submission #24

Flexural isostatic response of continental-scale deltas to c…

### -> submitter ORCID (or name) 0000-0002-1270-4377 ### -> slug polanco-2024-deltas ### -> license CC-BY-4.0 ### -> alternative license URL _No response_ ### -> model category…

saraemp updated 6 months ago
15
JackKelly/light-speed-io #10

High performance cloud object storage (for reading chunked m…

Some notes (in no particular order) about speeding up reading many chunks of data from cloud storage. ## General info about cloud storage buckets In general, cloud storage buckets are highly dis…

JackKelly updated 4 months ago
11
ballerina-platform/ballerina-library #5034

Getting two string[] for some CSV row when accessing through…

**Description:** Currently we get two string[] for some CSV row when we access through byte[] stream. **Describe your problem(s)** We need to access a CSV file through byte[] stream and convert…

Chuhaa updated 1 year ago
10
LBL-EESA/fastkde #37

Separate plotting functionality from kde estimation for bett…

It would be nice to not have matplotlib so tightly integrated into the plot.py functionality so that we can exersize tighter control over the output graphs. Doing this would also result in faster adop…

k-a-mendoza updated 1 month ago
7
winsiderss/systeminformer #1386

DPI Awareness Support

The recent addition of V2 DPI awareness introduced a few issues: - [x] Toolbar (fixed) - [x] Searchbox (fixed) - [x] Menu Icons (fixed) - [x] Tray Icons (fixed) - [x] Process tree icons corrupt…

dmex updated 1 week ago
23
dedupeio/dedupe #1024

What is the system core and memory benchmark for dedupe libr…

I am using python [dedupe ](https://github.com/dedupeio/dedupe-examples) library for large dataset. Initially we tested with 3K dataset and it could run with "AWS Sagemaker instance -> ml.r5.4xlarge 1…

sarbaniAi updated 2 years ago
12

上一页 1...94 95 96 97 98 99 100...100 下一页

1000+ results for large-scale-data-processing

1000+ results
for large-scale-data-processing