Hi,
I need some help. I've been trying for a while now to create a working model with my own pictures, and so far there has been a lot of trouble and little progress. The closest I have gotten is this: with the confidence threshold under 0.20 the entire screen is more or less filled with bounding boxes, but as soon as I go above that there is nothing.
Now, to the question:
I was wondering about the images used for training an ssd_mobilenet v1 model in TensorFlow. I read from OpenVINO that their model wants BGR input; is it the same for TF? What “bit depth” (I'm not sure this is the right term) should the images have, e.g. 8 or 24? As far as I understand, they should be in JPG format, right?
I'm trying to trace potential errors, taking it from the top.
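To check the bit depth of my pictures, I could read it straight from the JPEG header with something like the sketch below. It is only a sketch under assumptions: `jpeg_frame_info` is a name I made up, it uses only the Python standard library, it expects a well-formed JPEG whose frame header comes before the scan data, and the `sample` bytes are a synthetic header rather than a real picture. As far as I know, "8 bit" usually means 8 bits per channel, and with 3 color channels that gives 24 bits per pixel, so the two numbers may describe the same image.

```python
import struct

# Start-of-frame (SOF) markers; 0xC4, 0xC8 and 0xCC share the range but are not SOF.
SOF_MARKERS = set(range(0xC0, 0xD0)) - {0xC4, 0xC8, 0xCC}


def jpeg_frame_info(data):
    """Read sample precision, size and channel count from JPEG bytes.

    Hypothetical helper for illustration; assumes a well-formed file.
    """
    if data[:2] != b"\xff\xd8":
        raise ValueError("not a JPEG: missing SOI marker")
    pos = 2
    while pos + 4 <= len(data):
        if data[pos] != 0xFF:
            raise ValueError("expected a marker at offset %d" % pos)
        marker = data[pos + 1]
        if marker == 0xFF:  # optional fill byte before a marker
            pos += 1
            continue
        if marker == 0xDA:  # start of scan reached without a frame header
            break
        (seg_len,) = struct.unpack(">H", data[pos + 2:pos + 4])
        if marker in SOF_MARKERS:
            # SOF payload: precision (1 byte), height (2), width (2), channels (1)
            precision, height, width, channels = struct.unpack(
                ">BHHB", data[pos + 4:pos + 10]
            )
            return {
                "bits_per_channel": precision,
                "channels": channels,
                "bits_per_pixel": precision * channels,
                "width": width,
                "height": height,
            }
        pos += 2 + seg_len  # 2 marker bytes plus the segment (length counts itself)
    raise ValueError("no SOF segment found")


# Synthetic header only (not a decodable image): SOI, a tiny APP0, then SOF0.
sample = (
    b"\xff\xd8"
    + b"\xff\xe0\x00\x04\x00\x00"
    + b"\xff\xc0" + struct.pack(">HBHHB", 17, 8, 300, 300, 3) + bytes(9)
)
info = jpeg_frame_info(sample)
print(info)
```

For a real picture I would pass the file contents instead, e.g. `jpeg_frame_info(open(path, "rb").read())`, where `path` is a placeholder for one of my training images. This only reports what is stored in the file; it says nothing about whether the training pipeline wants RGB or BGR.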
I'm trying to train a model that should detect two classes. As the camera will be mounted and the objects move along only one axis, I'm having trouble getting pictures that are not almost identical to each other. The environment is also, color-wise, almost grayscale by nature, so everything sort of blends together. Does anyone know an object detection model that is good at identifying round shapes in a “gray-on-gray” environment?
Another quick question: would it be OK to post a more specific question here with my settings (command arguments for the scripts, general config settings) and my working procedure for training a model, so that (hopefully) someone could read it and find what I'm doing wrong?
Best,
Martin