I've already learnt about the Parallel Sum Reduction technique. However, I want to know if it is possible to add up different values from different Threads to a same __global variable like this :
float a = ...; // Assign different a values for each Thread
Gvar[1] += a; // Do the sum simultaneously to the same global variable index
Thanks