-
**Issues**
The website has a 'teach performance' page dedicated to teaching performance engineering, and a 'resources' page dedicated to software that uses OpenCilk, but it lacks a page dedicated to …
-
Thanks to @friedmud for pointing this out last night.
I've never seen any noticeable wait in parallel_sync(), but that's probably because I'm running on hundreds of processors, not tens of thousand…
-
Based on discussions in #1, we will cover the optimization of parallel code in Julia after #4. Here are a few suggested topics:
- Built-in Julia threading (`@threads`, etc.)
- Convenience packages…
-
**Background**:
In my problem, I need to compute the cost over the whole global batch, i.e., the batch-correlated cost instead of the average of sample-independent losses (like L1), for example, the…
-
The MapReduce design pattern is designed to process large volumes of data in a distributed and parallel manner, improving scalability and performance by utilizing multiple processing nodes. Originatin…
-
Hi,
The uncertainpy calculation quickly goes out of control with increasing no. of uncertain parameters mainly in terms of memory requirement for any personal machines. I could be wrong but don't thi…
-
- Abstract (2-3 lines)
As a Data Scientist, we face few challenges while dealing with large volume of data:
1. Popular Python libraries like NumPy & Pandas are not designed to scale beyond singl…
-
When using the `fullydistributed::Triangulation` in combination with a `TransfiniteInterpolationManifold`, an error occurs during the triangulation creation from a description generated by `Triangulat…
-
- [ ] move to implementation to __step-40__
- [ ] test if all 60 processors are used by program
- [ ] measure performance acceleration, and make similar plots as in step-40
-
**Is your feature request related to a problem? Please describe.**
Current MNMG RF is more like a model-parallel approach. We distribute the data among the workers and also distribute the work of bui…