After you have calibrated your camera, you will have a transition from the image plane to world coordinates. Using this information, you can predict the height of the object you are looking for, of course, at this stage you somehow need to identify the object of interest to you.
In general, this question is too broad and covers many fundamental concepts of computer vision, so consult your favorite tutorial before attacking a problem.
source share