issues
search
tugrul512bit
/
Cekirdekler
Multi-device OpenCL kernel load balancer and pipeliner API for C#. Uses shared-distributed memory model to keep GPUs updated fast while using same kernel on all devices(for simplicity).
GNU General Public License v3.0
93
stars
10
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Mandelbrot benchmark's or other test's source
#56
PascalSoftwares
opened
2 years ago
0
How to share Big Array, like a lookup table among various kernel calls
#55
rajxabc
opened
5 years ago
6
Any of the opencl 2 version does not work
#54
rajxabc
opened
5 years ago
38
Is there an example of generating a Unity Texture?
#53
mfagerlund
opened
5 years ago
4
Can you set pipeline mode for each device separately?
#52
jinxiu0406
closed
6 years ago
5
1D NBODY scores
#51
cmisztur
opened
7 years ago
9
add task types to control pool behavior (sync, broadcast task, shutdown devices)
#50
tugrul512bit
closed
7 years ago
0
add duplicated compute option to device pool / task pool / task for initializing same buffer on all devices
#49
tugrul512bit
closed
7 years ago
0
add callback option to ClTask
#48
tugrul512bit
closed
7 years ago
0
add multiple opencl-kernel instances for different compute-id values, for tiled computing, in task pool, with device pool
#47
tugrul512bit
closed
7 years ago
0
array.nextParam(array2).task() ---> creates ClTask to compute later in pool, with all the fields set at that time but with the latest array data
#46
tugrul512bit
closed
7 years ago
0
add "batch mode compute"(pool of devices for pool of kernels) with multiple devices where each compute() is computed by 1 device only, with greedy scheduling
#45
tugrul512bit
closed
7 years ago
0
single device pipeline: kernel repeat option
#44
tugrul512bit
opened
7 years ago
0
single device pipeline: overlapping regions percentage in total latency
#43
tugrul512bit
opened
7 years ago
0
clNumberCruncher.enqueueModeAsyncEnable to enqueue different kernels and arrays concurrently
#42
tugrul512bit
closed
7 years ago
0
ClArray.async to make an array copy operation done on another commandQueue(concurrently)
#41
tugrul512bit
closed
7 years ago
1
ClArray.name to bind an array to a kernel parameter with exact spelling
#40
tugrul512bit
opened
7 years ago
1
Read-only and write-only flags for ClArray
#39
tugrul512bit
closed
7 years ago
2
Enqueue mode with single gpu (and for device to device pipeline) ---- lower latency per command
#38
tugrul512bit
closed
7 years ago
3
nonPartialWrite capability for buffers
#37
tugrul512bit
closed
7 years ago
3
Device to device pipeline: enable mixed ordering of kernel arrays (in kernel function definition)
#36
tugrul512bit
opened
7 years ago
0
Device to device pipeline: optimize single stage multiple kernel compute with less synchronizations
#35
tugrul512bit
closed
7 years ago
0
Device to device pipeline: balancing load (kernel names) between neighboring stages
#34
tugrul512bit
opened
7 years ago
0
[canceled]Dynamic device to device pipeline
#33
tugrul512bit
closed
7 years ago
0
Image decode+resize+multiple_encode pipeline
#32
tugrul512bit
opened
7 years ago
0
Complete device to device pipeline stage initialization kernel execution
#31
tugrul512bit
closed
7 years ago
0
Some helper methods into ClNumberCruncher
#30
tugrul512bit
closed
7 years ago
0
add struct array support with byte-length descriptors for Unity's Vector3-Vector2 arrays
#29
tugrul512bit
closed
7 years ago
0
kernel repeat count number and repeat-end function name(kernel) with 64 global size(auto) for each repeat
#28
tugrul512bit
closed
7 years ago
0
add built-in matrix multiplication with sizes between 2x2 and 8192x8192
#27
tugrul512bit
opened
7 years ago
0
nbody(benchmark based) device selection disposes shared platform
#26
tugrul512bit
closed
7 years ago
0
English language translation of cluster-computing related classes(multi-pc centered-control)
#25
tugrul512bit
closed
7 years ago
0
Add device limits stress testing to have numbers used later in production or alarming when approaching limits.
#24
tugrul512bit
opened
7 years ago
0
add built-in image-resizing method for png,gif and jpeg
#23
tugrul512bit
opened
7 years ago
0
Add built-in jpeg,gif,png decompression-recompression methods
#22
tugrul512bit
opened
7 years ago
0
Add speed-ratio indicator between devices after 10-20 iterations
#21
tugrul512bit
opened
7 years ago
0
Arrays: bounds check before compute.
#20
tugrul512bit
closed
7 years ago
0
For explicit device selection, ClNumberCruncher still expects number of cores and gpus
#19
tugrul512bit
closed
7 years ago
0
inhibit use of ClDevice constructor
#18
tugrul512bit
closed
7 years ago
0
Workitems: Grain size - local size - global size: bounds check
#17
tugrul512bit
closed
7 years ago
0
Nbody benchmark-based explicit device selection
#16
tugrul512bit
closed
7 years ago
0
Explicit device selection disposes handles twice, giving error
#15
tugrul512bit
closed
7 years ago
0
C++ array wrapper re-creating(and computing) in loop throws error(CL_INVALID_MEM_OBJECT) but works for prepared N-array of C++ arrays
#14
tugrul512bit
closed
7 years ago
0
Disposing unused buffers with warning message
#13
tugrul512bit
opened
7 years ago
0
Redefine properties that are with underscores, to have a proper naming
#12
tugrul512bit
closed
7 years ago
0
Force multiple-of-64 for array size when using streaming and C++ arrays (cl_mem_use_host_ptr)
#11
tugrul512bit
opened
7 years ago
0
Hide Unnecessary Methods and Classes
#10
tugrul512bit
closed
7 years ago
1
Explicit Pipelining
#9
tugrul512bit
closed
7 years ago
0
Explicit Device to Device Pipelining
#8
tugrul512bit
closed
7 years ago
0
Lazy compute
#7
tugrul512bit
closed
7 years ago
0
Next