-
I just noticied that the JSC easyconfigs set a number of additional options for Clang 9.0.0:
```
configopts = '-DCLANG_OPENMP_NVPTX_DEFAULT_ARCH=sm_37'
configopts += ' -DLIBOMPTARGET_NVPTX_COMPUTE_…
-
### Description
Approximately 15% performance regression in transaction handling is caused by enabling `streamMode = BOTH` in #16167.
The wall clock time cost is attributed to `Hedera.manageBlockEnd…
-
When running the test-suite after a 3 stage build, if the `offload` runtime is enabled (tested with host offloading only), all test related to offloading fails with error of the type described in:
…
-
# Summary
I am currently evaluating argo-workflows a goto solution for scheduling tasks for my company. So far we really like it featurewise and we thing it is really good fit 👍
Problem is that …
-
### Is there an existing issue for this problem?
- [X] I have searched the existing issues
### Operating system
Windows
### GPU vendor
Nvidia (CUDA)
### GPU model
RTX 3060
### GPU VRAM
12GB
…
-
Building an OpenMP hello world fails to link, when Optimization is turned on:
```c
#include
#include
int main(int argc, char** argv){
printf("Devices: %i\n", omp_get_num_devices());
int a[…
-
### Background and motivation
`memmove` and `memset` are the most common operations from the earliest days of computing to today.
They are included in most practical programming languages, except as…
-
```
➜ go-ggllm.cpp git:(master) ✗ CGO_LDFLAGS="-lcublas -lcudart -L/usr/local/cuda/lib64/" LIBRARY_PATH=$PWD C_INCLUDE_PATH=$PWD go run ./examples -m "/media/NAS/MLModels/02_LLMs/falcon-40b-instruct…
-
Existing approach is that if users specify `offloadNone()` execution strategy on the builder, we respect that and disable offloading. However, if they have a filter that for sure has blocking code, it…
-
### Describe the feature request
[As per this Huggingface article:](https://huggingface.co/docs/accelerate/usage_guides/big_modeling)
It has a way to load very large models in limited GPU by divin…