tugrul512bit Cekirdekler issues

tugrul512bit / Cekirdekler

Multi-device OpenCL kernel load balancer and pipeliner API for C#. Uses shared-distributed memory model to keep GPUs updated fast while using same kernel on all devices(for simplicity).

GNU General Public License v3.0

93 stars 9 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Mandelbrot benchmark's or other test's source

#56 PascalSoftwares opened 1 year ago
0
How to share Big Array, like a lookup table among various kernel calls

#55 rajxabc opened 5 years ago
6
Any of the opencl 2 version does not work

#54 rajxabc opened 5 years ago
38
Is there an example of generating a Unity Texture?

#53 mfagerlund opened 5 years ago
4
Can you set pipeline mode for each device separately?

#52 jinxiu0406 closed 5 years ago
5
1D NBODY scores

#51 cmisztur opened 7 years ago
9
add task types to control pool behavior (sync, broadcast task, shutdown devices)

#50 tugrul512bit closed 7 years ago
0
add duplicated compute option to device pool / task pool / task for initializing same buffer on all devices

#49 tugrul512bit closed 7 years ago
0
add callback option to ClTask

#48 tugrul512bit closed 7 years ago
0
add multiple opencl-kernel instances for different compute-id values, for tiled computing, in task pool, with device pool

#47 tugrul512bit closed 7 years ago
0
array.nextParam(array2).task() ---> creates ClTask to compute later in pool, with all the fields set at that time but with the latest array data

#46 tugrul512bit closed 7 years ago
0
add "batch mode compute"(pool of devices for pool of kernels) with multiple devices where each compute() is computed by 1 device only, with greedy scheduling

#45 tugrul512bit closed 7 years ago
0
single device pipeline: kernel repeat option

#44 tugrul512bit opened 7 years ago
0
single device pipeline: overlapping regions percentage in total latency

#43 tugrul512bit opened 7 years ago
0
clNumberCruncher.enqueueModeAsyncEnable to enqueue different kernels and arrays concurrently

#42 tugrul512bit closed 7 years ago
0
ClArray.async to make an array copy operation done on another commandQueue(concurrently)

#41 tugrul512bit closed 7 years ago
1
ClArray.name to bind an array to a kernel parameter with exact spelling

#40 tugrul512bit opened 7 years ago
1
Read-only and write-only flags for ClArray

#39 tugrul512bit closed 7 years ago
2
Enqueue mode with single gpu (and for device to device pipeline) ---- lower latency per command

#38 tugrul512bit closed 7 years ago
3
nonPartialWrite capability for buffers

#37 tugrul512bit closed 7 years ago
3
Device to device pipeline: enable mixed ordering of kernel arrays (in kernel function definition)

#36 tugrul512bit opened 7 years ago
0
Device to device pipeline: optimize single stage multiple kernel compute with less synchronizations

#35 tugrul512bit closed 7 years ago
0
Device to device pipeline: balancing load (kernel names) between neighboring stages

#34 tugrul512bit opened 7 years ago
0
[canceled]Dynamic device to device pipeline

#33 tugrul512bit closed 7 years ago
0
Image decode+resize+multiple_encode pipeline

#32 tugrul512bit opened 7 years ago
0
Complete device to device pipeline stage initialization kernel execution

#31 tugrul512bit closed 7 years ago
0
Some helper methods into ClNumberCruncher

#30 tugrul512bit closed 7 years ago
0
add struct array support with byte-length descriptors for Unity's Vector3-Vector2 arrays

#29 tugrul512bit closed 7 years ago
0
kernel repeat count number and repeat-end function name(kernel) with 64 global size(auto) for each repeat

#28 tugrul512bit closed 7 years ago
0
add built-in matrix multiplication with sizes between 2x2 and 8192x8192

#27 tugrul512bit opened 7 years ago
0
nbody(benchmark based) device selection disposes shared platform

#26 tugrul512bit closed 7 years ago
0
English language translation of cluster-computing related classes(multi-pc centered-control)

#25 tugrul512bit closed 7 years ago
0
Add device limits stress testing to have numbers used later in production or alarming when approaching limits.

#24 tugrul512bit opened 7 years ago
0
add built-in image-resizing method for png,gif and jpeg

#23 tugrul512bit opened 7 years ago
0
Add built-in jpeg,gif,png decompression-recompression methods

#22 tugrul512bit opened 7 years ago
0
Add speed-ratio indicator between devices after 10-20 iterations

#21 tugrul512bit opened 7 years ago
0
Arrays: bounds check before compute.

#20 tugrul512bit closed 7 years ago
0
For explicit device selection, ClNumberCruncher still expects number of cores and gpus

#19 tugrul512bit closed 7 years ago
0
inhibit use of ClDevice constructor

#18 tugrul512bit closed 7 years ago
0
Workitems: Grain size - local size - global size: bounds check

#17 tugrul512bit closed 7 years ago
0
Nbody benchmark-based explicit device selection

#16 tugrul512bit closed 7 years ago
0
Explicit device selection disposes handles twice, giving error

#15 tugrul512bit closed 7 years ago
0
C++ array wrapper re-creating(and computing) in loop throws error(CL_INVALID_MEM_OBJECT) but works for prepared N-array of C++ arrays

#14 tugrul512bit closed 7 years ago
0
Disposing unused buffers with warning message

#13 tugrul512bit opened 7 years ago
0
Redefine properties that are with underscores, to have a proper naming

#12 tugrul512bit closed 7 years ago
0
Force multiple-of-64 for array size when using streaming and C++ arrays (cl_mem_use_host_ptr)

#11 tugrul512bit opened 7 years ago
0
Hide Unnecessary Methods and Classes

#10 tugrul512bit closed 7 years ago
1
Explicit Pipelining

#9 tugrul512bit closed 7 years ago
0
Explicit Device to Device Pipelining

#8 tugrul512bit closed 7 years ago
0
Lazy compute

#7 tugrul512bit closed 7 years ago
0