This is a stereo tracking issue and one way to merge n. Different images will use some local functions (SIFT, SURF, FAST, etc.). The OpenCV library has an already discovered SURF detector. You may need to use C or C ++ for real-time processing.
source
share