Real-time Template Compatibility - OpenCV, C ++

Question

Real-time Template Compatibility - OpenCV, C ++

I am trying to implement real-time tracking using patterns. I want to update the template with every frame. The main changes that I made:

1) divided the correspondence of the template and minmaxLoc into separate modules, namely: TplMatch () and minmax () , respectively.

2) Inside the track () function, the select_flag parameter always remains true, so the new template is copied to "myTemplate" with each iteration.

3) The last 3 lines of the track () function should update the template (roiImg).

4) In addition, I deleted any arguments to the track () function, since img and roiImg > are global variables and therefore you do not need to pass them to functions.

Below is the code:

#include <iostream> #include "opencv2/opencv.hpp" #include <opencv2/imgproc/imgproc.hpp> #include <opencv2/highgui/highgui.hpp> #include <opencv2/objdetect/objdetect.hpp> #include <sstream> using namespace cv; using namespace std; Point point1, point2; /* vertical points of the bounding box */ int drag = 0; Rect rect; /* bounding box */ Mat img, roiImg; /* roiImg - the part of the image in the bounding box */ int select_flag = 0; bool go_fast = false; Mat mytemplate; ///------- template matching ----------------------------------------------------------------------------------------------- Mat TplMatch( Mat &img, Mat &mytemplate ) { Mat result; matchTemplate( img, mytemplate, result, CV_TM_SQDIFF_NORMED ); normalize( result, result, 0, 1, NORM_MINMAX, -1, Mat() ); return result; } ///------- Localizing the best match with minMaxLoc ------------------------------------------------------------------------ Point minmax( Mat &result ) { double minVal, maxVal; Point minLoc, maxLoc, matchLoc; minMaxLoc( result, &minVal, &maxVal, &minLoc, &maxLoc, Mat() ); matchLoc = minLoc; return matchLoc; } ///------- tracking -------------------------------------------------------------------------------------------------------- void track() { if (select_flag) { roiImg.copyTo(mytemplate); // select_flag = false; go_fast = true; } // imshow( "mytemplate", mytemplate ); waitKey(0); Mat result = TplMatch( img, mytemplate ); Point match = minmax( result ); rectangle( img, match, Point( match.x + mytemplate.cols , match.y + mytemplate.rows ), CV_RGB(255, 255, 255), 0.5 ); std::cout << "match: " << match << endl; /// latest match is the new template Rect ROI = cv::Rect( match.x, match.y, mytemplate.cols, mytemplate.rows ); roiImg = img( ROI ); imshow( "roiImg", roiImg ); //waitKey(0); } ///------- MouseCallback function ------------------------------------------------------------------------------------------ void mouseHandler(int event, int x, int y, int flags, void *param) { if (event == CV_EVENT_LBUTTONDOWN && !drag) { /// left button clicked. ROI selection begins point1 = Point(x, y); drag = 1; } if (event == CV_EVENT_MOUSEMOVE && drag) { /// mouse dragged. ROI being selected Mat img1 = img.clone(); point2 = Point(x, y); rectangle(img1, point1, point2, CV_RGB(255, 0, 0), 3, 8, 0); imshow("image", img1); } if (event == CV_EVENT_LBUTTONUP && drag) { point2 = Point(x, y); rect = Rect(point1.x, point1.y, x - point1.x, y - point1.y); drag = 0; roiImg = img(rect); // imshow("MOUSE roiImg", roiImg); waitKey(0); } if (event == CV_EVENT_LBUTTONUP) { /// ROI selected select_flag = 1; drag = 0; } } ///------- Main() ---------------------------------------------------------------------------------------------------------- int main() { int k; /* ///open webcam VideoCapture cap(0); if (!cap.isOpened()) return 1;*/ ///open video file VideoCapture cap; cap.open( "Megamind.avi" ); if ( !cap.isOpened() ) { cout << "Unable to open video file" << endl; return -1; } /* /// Set video to 320x240 cap.set(CV_CAP_PROP_FRAME_WIDTH, 320); cap.set(CV_CAP_PROP_FRAME_HEIGHT, 240);*/ cap >> img; GaussianBlur( img, img, Size(7,7), 3.0 ); imshow( "image", img ); while (1) { cap >> img; if ( img.empty() ) break; // Flip the frame horizontally and add blur cv::flip( img, img, 1 ); GaussianBlur( img, img, Size(7,7), 3.0 ); if ( rect.width == 0 && rect.height == 0 ) cvSetMouseCallback( "image", mouseHandler, NULL ); else track(); imshow("image", img); // waitKey(100); k = waitKey(75); k = waitKey(go_fast ? 30 : 10000); if (k == 27) break; } return 0; }

Updated template is not tracked. I cannot understand why this is happening, as I update my template (roiImg) with each iteration. The match value from the minmax () function returns the same point (coordinates) every time. Test video is available at: http://www.youtube.com/watch?v=vpnkk7N2E0Q&feature=youtu.be Please study it and forward it ... thanks a lot!

+8

c ++ image-processing opencv computer-vision template-matching

learner Nov 24 '13 at 19:46

source share

3 answers

You can follow the OpenCV tutorial "Template Matching" . Your track function may contain code to search for a template in the current frame; simple code based on matchTemplate and minMaxLoc .

An interesting problem associated with part of your "real time" is to find a match, if any, during the time between the current frame and the next.

Edit :

The following quick and dirty code and video at http://www.youtube.com/watch?v=vpnkk7N2E0Q&feature=youtu.be shows what I mean for tracking.

Since I don't have a webcam, I changed your code a bit to just use the video, this https://code.ros.org/trac/opencv/export/7237/trunk/opencv/samples/cpp/tutorial_code/HighGUI/ video-input-psnr-ssim / video / Megamind.avi

Then I add the track function and some logic to slow down the video until I select ROI, and then play the video at normal speed.

 #include <iostream> #include "opencv2/opencv.hpp" #include <opencv2/imgproc/imgproc.hpp> #include <opencv2/highgui/highgui.hpp> #include <opencv2/objdetect/objdetect.hpp> #include <sstream> using namespace cv; using namespace std; Point point1, point2; /* vertical points of the bounding box */ int drag = 0; Rect rect; /* bounding box */ Mat img, roiImg; /* roiImg - the part of the image in the bounding box */ int select_flag = 0; bool go_fast = false; Mat mytemplate; void track(cv::Mat &img, const cv::Mat &templ, const cv::Rect &r ) { static int n = 0; if (select_flag) { templ.copyTo(mytemplate); select_flag = false; go_fast = true; } cv::Mat result; /// Do the Matching and Normalize matchTemplate( img, mytemplate, result, CV_TM_SQDIFF_NORMED ); normalize( result, result, 0, 1, NORM_MINMAX, -1, Mat() ); /// Localizing the best match with minMaxLoc double minVal; double maxVal; Point minLoc; Point maxLoc; Point matchLoc; minMaxLoc( result, &minVal, &maxVal, &minLoc, &maxLoc, Mat() ); matchLoc = minLoc; rectangle( img, matchLoc, Point( matchLoc.x + mytemplate.cols , matchLoc.y + mytemplate.rows ), CV_RGB(255, 255, 255), 3 ); std::cout << matchLoc << "\n"; } ///MouseCallback function void mouseHandler(int event, int x, int y, int flags, void *param) { if (event == CV_EVENT_LBUTTONDOWN && !drag) { /* left button clicked. ROI selection begins */ point1 = Point(x, y); drag = 1; } if (event == CV_EVENT_MOUSEMOVE && drag) { /* mouse dragged. ROI being selected */ Mat img1 = img.clone(); point2 = Point(x, y); rectangle(img1, point1, point2, CV_RGB(255, 0, 0), 3, 8, 0); imshow("image", img1); } if (event == CV_EVENT_LBUTTONUP && drag) { point2 = Point(x, y); rect = Rect(point1.x, point1.y, x - point1.x, y - point1.y); drag = 0; roiImg = img(rect); } if (event == CV_EVENT_LBUTTONUP) { /* ROI selected */ select_flag = 1; drag = 0; } } ///Main function int main() { int k; /* VideoCapture cap(0); if (!cap.isOpened()) return 1; */ VideoCapture cap; //cap.open("~/Downloads/opencv-2.4.4/samples/cpp/tutorial_code/HighGUI/video-input-psnr-ssim/video/Megamind.avi"); cap.open("./Megamind.avi"); if (!cap.isOpened()) { printf("Unable to open video file\n"); return -1; } /* // Set video to 320x240 cap.set(CV_CAP_PROP_FRAME_WIDTH, 320); cap.set(CV_CAP_PROP_FRAME_HEIGHT, 240); */ cap >> img; imshow("image", img); while (1) { cap >> img; if (img.empty()) break; if (rect.width == 0 && rect.height == 0) cvSetMouseCallback("image", mouseHandler, NULL); else track(img, roiImg, rect); if (select_flag == 1) imshow("Template", roiImg); imshow("image", img); k = waitKey(go_fast ? 30 : 10000); if (k == 27) break; } return 0; }

+2

Alessandro jacopson Nov 24 '13 at 20:00

source share

You can also have a general introduction to the topic starting on this wikipedia page http://en.wikipedia.org/wiki/Video_tracking

0

Alessandro jacopson Mar 18 '14 at 21:06

source share

Alessandro jacopson · Accepted Answer · 2013-12-08T09:41:48+0000

I get your source code from this revision of your question: https://stackoverflow.com/revisions/20180073/3

I made a minimal change to the source code, resulting in the following code:

 #include <iostream> #include "opencv2/opencv.hpp" #include <opencv2/imgproc/imgproc.hpp> #include <opencv2/highgui/highgui.hpp> #include <opencv2/objdetect/objdetect.hpp> #include <sstream> using namespace cv; using namespace std; Point point1, point2; /* vertical points of the bounding box */ int drag = 0; Rect rect; /* bounding box */ Mat img, roiImg; /* roiImg - the part of the image in the bounding box */ int select_flag = 0; bool go_fast = false; Mat mytemplate; ///------- template matching ----------------------------------------------------------------------------------------------- Mat TplMatch( Mat &img, Mat &mytemplate ) { Mat result; matchTemplate( img, mytemplate, result, CV_TM_SQDIFF_NORMED ); normalize( result, result, 0, 1, NORM_MINMAX, -1, Mat() ); return result; } ///------- Localizing the best match with minMaxLoc ------------------------------------------------------------------------ Point minmax( Mat &result ) { double minVal, maxVal; Point minLoc, maxLoc, matchLoc; minMaxLoc( result, &minVal, &maxVal, &minLoc, &maxLoc, Mat() ); matchLoc = minLoc; return matchLoc; } ///------- tracking -------------------------------------------------------------------------------------------------------- void track() { if (select_flag) { //roiImg.copyTo(mytemplate); // select_flag = false; go_fast = true; } // imshow( "mytemplate", mytemplate ); waitKey(0); Mat result = TplMatch( img, mytemplate ); Point match = minmax( result ); rectangle( img, match, Point( match.x + mytemplate.cols , match.y + mytemplate.rows ), CV_RGB(255, 255, 255), 0.5 ); std::cout << "match: " << match << endl; /// latest match is the new template Rect ROI = cv::Rect( match.x, match.y, mytemplate.cols, mytemplate.rows ); roiImg = img( ROI ); roiImg.copyTo(mytemplate); imshow( "roiImg", roiImg ); //waitKey(0); } ///------- MouseCallback function ------------------------------------------------------------------------------------------ void mouseHandler(int event, int x, int y, int flags, void *param) { if (event == CV_EVENT_LBUTTONDOWN && !drag) { /// left button clicked. ROI selection begins point1 = Point(x, y); drag = 1; } if (event == CV_EVENT_MOUSEMOVE && drag) { /// mouse dragged. ROI being selected Mat img1 = img.clone(); point2 = Point(x, y); rectangle(img1, point1, point2, CV_RGB(255, 0, 0), 3, 8, 0); imshow("image", img1); } if (event == CV_EVENT_LBUTTONUP && drag) { point2 = Point(x, y); rect = Rect(point1.x, point1.y, x - point1.x, y - point1.y); drag = 0; roiImg = img(rect); roiImg.copyTo(mytemplate); // imshow("MOUSE roiImg", roiImg); waitKey(0); } if (event == CV_EVENT_LBUTTONUP) { /// ROI selected select_flag = 1; drag = 0; } } ///------- Main() ---------------------------------------------------------------------------------------------------------- int main() { int k; /* ///open webcam VideoCapture cap(0); if (!cap.isOpened()) return 1;*/ ///open video file VideoCapture cap; cap.open( "Megamind.avi" ); if ( !cap.isOpened() ) { cout << "Unable to open video file" << endl; return -1; } /* /// Set video to 320x240 cap.set(CV_CAP_PROP_FRAME_WIDTH, 320); cap.set(CV_CAP_PROP_FRAME_HEIGHT, 240);*/ cap >> img; GaussianBlur( img, img, Size(7,7), 3.0 ); imshow( "image", img ); while (1) { cap >> img; if ( img.empty() ) break; // Flip the frame horizontally and add blur cv::flip( img, img, 1 ); GaussianBlur( img, img, Size(7,7), 3.0 ); if ( rect.width == 0 && rect.height == 0 ) cvSetMouseCallback( "image", mouseHandler, NULL ); else track(); imshow("image", img); // waitKey(100); k = waitKey(75); k = waitKey(go_fast ? 30 : 10000); if (k == 27) break; } return 0; }

The video at https://www.youtube.com/watch?v=rBCopeneCos shows a test of the above program.

I would avoid using a global variable because I think they do not help to understand where the problems lie; in addition, I would also pay attention to the shallow and deep copy for the OpenCV Mat class, since 1 '' wrote in its answer :

The OpenCV Mat class is simply a header for the actual image data that contains a pointer to. operator= copies the pointer (and other information in the header, such as image sizes) so that both mats have the same data. This means that changing data in one mat also changes it in another. This is called a “shallow” copy, since only the top layer (title) is copied, not the lower level (data).
To make a copy of the underlying data (called a “deep copy”), use clone() . You can find information about this on the page that you are attached to.

Change drift: In the comment , Real-time template compatibility - OpenCV, C ++ , learner asks about tracking drift. Looking at the video https://www.youtube.com/watch?v=rBCopeneCos , we see that at the beginning of the video the program tracks the girl’s right eye, and at 0:15 it starts tracking the girl’s eyebrows, at 0:19 it starts tracking eyebrows boy, and he no longer tracks the girl’s eye, for example, at 0:27 he tracks the girl’s right eyebrow, while the right eye is clearly visible in the image,

This drift from eye tracking to eyebrow tracking is normal in simple code, like the one I posted, and the explanation is pretty simple: see the video at https://www.youtube.com/watch?v=sGHEu3u9XvI , the video starts with tracking (contents of the black rectangle) of the game card, then I delete the game card from the scene, and the black tracking rectangle “drifts” in the lower left corner of the scene; because we constantly update the template, and therefore the behavior is correct: the program stops to track the game card and starts tracking the white background, and therefore you have a "drift" ... in other words, your TplMatch() function will always return a valid image result , and your current minmax() implementation will always return a valid minimum.

Real-time Template Compatibility - OpenCV, C ++

More articles: