tugrul512bit Cekirdekler issues

tugrul512bit / Cekirdekler

Multi-device OpenCL kernel load balancer and pipeliner API for C#. Uses shared-distributed memory model to keep GPUs updated fast while using same kernel on all devices(for simplicity).

GNU General Public License v3.0

93 stars 10 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Mandelbrot benchmark's or other test's source

#56 PascalSoftwares opened 2 years ago
0
How to share Big Array, like a lookup table among various kernel calls

#55 rajxabc opened 5 years ago
6
Any of the opencl 2 version does not work

#54 rajxabc opened 5 years ago
38
Is there an example of generating a Unity Texture?

#53 mfagerlund opened 5 years ago
4
Can you set pipeline mode for each device separately?

#52 jinxiu0406 closed 6 years ago
5
1D NBODY scores

#51 cmisztur opened 7 years ago
9
add task types to control pool behavior (sync, broadcast task, shutdown devices)

#50 tugrul512bit closed 7 years ago
0
add duplicated compute option to device pool / task pool / task for initializing same buffer on all devices

#49 tugrul512bit closed 7 years ago
0
add callback option to ClTask

#48 tugrul512bit closed 7 years ago
0
add multiple opencl-kernel instances for different compute-id values, for tiled computing, in task pool, with device pool

#47 tugrul512bit closed 7 years ago
0
array.nextParam(array2).task() ---> creates ClTask to compute later in pool, with all the fields set at that time but with the latest array data

#46 tugrul512bit closed 7 years ago
0
add "batch mode compute"(pool of devices for pool of kernels) with multiple devices where each compute() is computed by 1 device only, with greedy scheduling

#45 tugrul512bit closed 7 years ago
0
single device pipeline: kernel repeat option

#44 tugrul512bit opened 7 years ago
0
single device pipeline: overlapping regions percentage in total latency

#43 tugrul512bit opened 7 years ago
0
clNumberCruncher.enqueueModeAsyncEnable to enqueue different kernels and arrays concurrently

#42 tugrul512bit closed 7 years ago
0
ClArray.async to make an array copy operation done on another commandQueue(concurrently)

#41 tugrul512bit closed 7 years ago
1
ClArray.name to bind an array to a kernel parameter with exact spelling

#40 tugrul512bit opened 7 years ago
1
Read-only and write-only flags for ClArray

#39 tugrul512bit closed 7 years ago
2
Enqueue mode with single gpu (and for device to device pipeline) ---- lower latency per command

#38 tugrul512bit closed 7 years ago
3
nonPartialWrite capability for buffers

#37 tugrul512bit closed 7 years ago
3
Device to device pipeline: enable mixed ordering of kernel arrays (in kernel function definition)

#36 tugrul512bit opened 7 years ago
0
Device to device pipeline: optimize single stage multiple kernel compute with less synchronizations

#35 tugrul512bit closed 7 years ago
0
Device to device pipeline: balancing load (kernel names) between neighboring stages

#34 tugrul512bit opened 7 years ago
0
[canceled]Dynamic device to device pipeline

#33 tugrul512bit closed 7 years ago
0
Image decode+resize+multiple_encode pipeline

#32 tugrul512bit opened 7 years ago
0
Complete device to device pipeline stage initialization kernel execution

#31 tugrul512bit closed 7 years ago
0
Some helper methods into ClNumberCruncher

#30 tugrul512bit closed 7 years ago
0
add struct array support with byte-length descriptors for Unity's Vector3-Vector2 arrays

#29 tugrul512bit closed 7 years ago
0
kernel repeat count number and repeat-end function name(kernel) with 64 global size(auto) for each repeat

#28 tugrul512bit closed 7 years ago
0
add built-in matrix multiplication with sizes between 2x2 and 8192x8192

#27 tugrul512bit opened 7 years ago
0
nbody(benchmark based) device selection disposes shared platform

#26 tugrul512bit closed 7 years ago
0
English language translation of cluster-computing related classes(multi-pc centered-control)

#25 tugrul512bit closed 7 years ago
0
Add device limits stress testing to have numbers used later in production or alarming when approaching limits.

#24 tugrul512bit opened 7 years ago
0
add built-in image-resizing method for png,gif and jpeg

#23 tugrul512bit opened 7 years ago
0
Add built-in jpeg,gif,png decompression-recompression methods

#22 tugrul512bit opened 7 years ago
0
Add speed-ratio indicator between devices after 10-20 iterations

#21 tugrul512bit opened 7 years ago
0
Arrays: bounds check before compute.

#20 tugrul512bit closed 7 years ago
0
For explicit device selection, ClNumberCruncher still expects number of cores and gpus

#19 tugrul512bit closed 7 years ago
0
inhibit use of ClDevice constructor

#18 tugrul512bit closed 7 years ago
0
Workitems: Grain size - local size - global size: bounds check

#17 tugrul512bit closed 7 years ago
0
Nbody benchmark-based explicit device selection

#16 tugrul512bit closed 7 years ago
0
Explicit device selection disposes handles twice, giving error

#15 tugrul512bit closed 7 years ago
0
C++ array wrapper re-creating(and computing) in loop throws error(CL_INVALID_MEM_OBJECT) but works for prepared N-array of C++ arrays

#14 tugrul512bit closed 7 years ago
0
Disposing unused buffers with warning message

#13 tugrul512bit opened 7 years ago
0
Redefine properties that are with underscores, to have a proper naming

#12 tugrul512bit closed 7 years ago
0
Force multiple-of-64 for array size when using streaming and C++ arrays (cl_mem_use_host_ptr)

#11 tugrul512bit opened 7 years ago
0
Hide Unnecessary Methods and Classes

#10 tugrul512bit closed 7 years ago
1
Explicit Pipelining

#9 tugrul512bit closed 7 years ago
0
Explicit Device to Device Pipelining

#8 tugrul512bit closed 7 years ago
0
Lazy compute

#7 tugrul512bit closed 7 years ago
0