I am implementing a separation and subjugation approach to a convex hull in CUDA. This is my approach: Below:
But it becomes too complicated, and I feel that I am not using CUDA parallel power, because at each level of the tree I create N / 2 ^ i threads, the complexity of which is O (N) when combining all adjacent cases at this level, Therefore, the network complexity is still O (N logN).
Can you tell me how to do it better or give an alternative parallel lead algorithm for the convex hull (it would be great if I can get an algorithm for the parallel version of the Graham scan)?
source share