Closed fumik closed 5 years ago
Inter-clusters events handlers, processInterClustersOutgoingSpikes and processInterClustersIncomingSpikes, are called when a single clusters simulation is running. These methods are called once per every synaptic transmission delay period (about 10 simulation steps). One processInterClustersIncomingSpikes method calls 2 cudaMalloc API and 3 cudaMemcpy (D->H) API, and one processInterClustersIncomingSpikes method calls 1 cudaMalloc API and 2 cudaMemcpy (D->H, H->D) ,API.
Skipping these methods when single cluster simulation is running results in 6~7% performance improvement (100x100 neurons, connected, 100sx2 epochs).
What kind of issue is this?
What is affected by this?
How do we replicate the issue/how would it work?
Expected behavior (i.e. solution or outline of what it would look like)
Other Comments