This question may have been answered, but I did not find a simple answer to this question. I created a pipeline using Keras to classify the Simpsons characters ( dataset here ).
I have 20 classes and an image as input, I return the name of the character. It is pretty simple. My dataset contains images with the main character in the image and only the name of the character as a label.
Now I would like to add object detection by asking that I draw a bounding box around the characters in the picture and predict what character it is. I do not want to use a sliding window because it is very slow. So I thought about using a faster RCNN ( github repo ) or YOLO ( github repo ). Should I add bounding box coordinates for each image of my training set? Is there a way to make object detection (and get bounding fields in my test) without specifying coordinates for a set of workouts?
In general, I would like to create a simple object detection model, I do not know if it is possible to create a simpler YOLO or Faster RCNN.
Thanks so much for any help.
deep-learning object-detection classification keras
A. Attia
source share