-
Hi folks,
I'm in the process of building a storage system and evaluating PhotonLib. Same architectural decisions on the system:
1. Thread per core, no context switching is desired
2. Share noth…
-
We need to do this asap otherwise it will be a lot of effort.
There are a lot of perf gains to be had here - changing `Arc` to `Rc` and `Mutex` to `RefCell`. Even quinn is a bit faster when used in…
-
The current system runs on thread per hyper-core. But this hasn't been benchmarked for a long time and the system has changed substantially since then. We should benchmark this when we have a reasonab…
-
On my machine it takes about 18 minutes for wamrc to compile this wasm file:
https://github.com/bnason-nf/karting/raw/main/wgpukarting.wasm.gz
My command line is:
`wamrc --target=x86_64 -o wg…
-
tokio tracing is slow as requires `Sync` and uses slow primitives
Can take inspiration from https://github.com/tikv/minitrace-rust
And optimize further for thread-per-core.
Actually minitrace is…
-
Currently libaio-based IO may block in `io_submit(2)` even if `RWF_NOWAIT` is passed if we hit the `nr_requests` limit of requests enqueued in the block layer. This is a death sentence for async threa…
-
I was looking into the different options for parallelizing WRF, always find it quite confusing.
From what I understand now, it works like this:
- MPI is used for multiprocessing (DMPAR).
Th…
-
## Issue
When trying to limit the building threads with either `colcon build --parallel-workers 2` or `colcon build --executor sequential`, the underlying `make` command is still called with the nu…
-
Good day.
After some non-intensive load, my tests were deadlocked.
I have made a thread dump and debugged a bit and have found the following:
Thread dump https://gist.github.com/andrii0lomakin/9c84…
-
1. Please describe the issue:
Floating point computing capacity not match with Orin-x's datasheet
2. Detailed steps on how to reproduce the issue:
Run cuda sample `cudaTensorCoreGemm`
```
Initi…