There is various way to implement YOLO. I have tried to Pytorch implementation, Darknet Implementation until so far. But we are building a system that interatcts with Flutter Application. So, darkflow is a tensorflow translation of Darknet. I have used Anaconda and jupyer notebook.
In Flutter application, we need to detect a pothole by training a custom object detection.
At the end of the trainning, there is some checkpoints files are created in the /ckpt
directory.
Also, It is convertable file to *.pb
. Also it is convertable to *.lite
which can be used in mobile
side of application.
Python3 or Python2, tensorflow 1.0, numpy, opencv 3. After training part: We need to do:
pip uninstall tensorflow
pip install "tensorflow==1.9.*"
Purpose of doing this is that there is some confilicting dependicies to new tensorflow version. To obtain an easiest and efficient way is upgrading tensorflow version for using tf_convert API which is developed by Google.
I have trained the model on:
- NVDIA 2080 SLI GPU
- 24 Core Processor, its able to run the program with 40 threads.
You can choose one of the following three ways to get started with darkflow. If you are using Python 3 on windows you will need to install Microsoft Visual C++ 14.0. Here you can find installation process, why it is required, references etc or you can try stackoverflow.
-
Just build the Cython extensions in place. NOTE: If installing this way you will have to use
./flow
in the cloned darkflow directory instead offlow
as darkflow is not installed globally.python3 setup.py build_ext --inplace
-
Let pip install darkflow globally in dev mode (still globally accessible, but changes to the code immediately take effect)
pip install -e .
-
Install with pip globally
pip install .
A step by step series of examples that tell you how to get a development env running.
Download the DARKFLOW repository. You can find it under this link. (https://github.com/thtrieu/darkflow)
For the purpose of training custom model. You need the start with initail weights which is obtained by under this link.(https://pjreddie.com/darknet/yolo/) or you can download by bash command.
wget https://pjreddie.com/media/files/yolov3.weights
You can download yolov3 weights in this way.
And repeat
until finished
End with an example of getting some data out of the system or using it for a little demo
To obtain a state-of-art result in Pothole Detection, we need to collect massive dataset. I have collected the dataset from open-source platforms and connections from other companies. I have resized them, I have spent 2 hours to annotate them. Hence, to best of our knowledge, we need to see first preliminary results, so I have annotated the small amount of data and ou can see in the jupyter notebook file, we can find the pothole at given a image with .36 confidence score (labeled and trained on 50 images.).
- To annotate images download labelImg.
- Check this video to learn how to use lebelImg.
- Github repo for labelImg can be found here
The steps below assume we want to use tiny YOLO and our dataset has 1 classes
-
Create a copy of the configuration file
tiny-yolo-voc.cfg
and rename it according to your preferencetiny-yolo-voc-custom.cfg
(It is crucial that you leave the originaltiny-yolo-voc.cfg
file unchanged). Heretiny-yolo-voc-custom.cfg
is for 1 classes, you can change the name as you wish. -
In
tiny-yolo-voc-custom.cfg
, change classes in the [region] layer (the last layer) to the number of classes you are going to train for. In our case, classes are set to 1.... [region] anchors = 1.08,1.19, 3.42,4.41, 6.63,11.38, 9.42,5.11, 16.62,10.52 bias_match=1 classes=1 ## 1 classes coords=4 num=5 softmax=1 ...
-
In
tiny-yolo-voc-custom.cfg
, change filters in the [convolutional] layer (the second to last layer) to num * (classes + 5). In our case, num is 5 and classes are 1 so 5 * (1 + 5) = 30 therefore filters are set to 30.... [convolutional] size=1 stride=1 pad=1 filters=30 ## 5 * (1 + 5) = 30 activation=linear [region] anchors = 1.08,1.19, 3.42,4.41, 6.63,11.38, 9.42,5.11, 16.62,10.52 ...
-
Change
labels.txt
to include the label(s) you want to train on (number of labels should be the same as the number of classes you set intiny-yolo-voc-custom.cfg
file). In my case,labels.txt
will contain 1 labels.pothole
-
Reference the
tiny-yolo-voc-custom.cfg
model when you train.
flow --model cfg/tiny-yolo-voc-custom.cfg --load weights/tiny-yolo-voc.weights --train --annotation train/annot --dataset train/images --gpu 1.0 --epochs 1000
Spesifying the model --model cfg/tiny-yolo-voc-custom.cfg
and the weights --load weights/tiny-yolo-voc.weights
. After that specify the path for the annatations --annotation train/anoot
and images --dataset train/images
. Use --gpu 1.0
to use gpu for speed, if you do not have GPU just don't use this part. You can specify the number of epochs. By default it is 1000. However it can be stopped anytime. I recommend to keep the lose below 1. Darkflow saving training step every 250 iteration under the /ckpt
folder. From my experiments, I have seen the loss at least approximately 1.50, but loading more data it will be less than 1.
## Saving the lastest checkpoint to protobuf file
flow --model cfg/yolo-new.cfg --load -1 --savepb
When saving the .pb file, a .meta file will also be generated alongside it. This .meta file is a JSON dump of everything in the meta dictionary that contains information nessecary for post-processing such as anchors and labels. This way, everything you need to make predictions from the graph and do post processing is contained in those two files - no need to have the .cfg or any labels file tagging along.
In flutter side of our application, we need to obtain tflite format model file. To get a result, I have used Tensorflow API. But in this step you need to download tensorflow 1.9.* version
pip install "tensorflow==1.9.*"
After that run this command to make sure the using tflite_convert command.
tflite_convert --help
Converting the file into tflite format, run this command
tflite_convert \
--graph_def_file=built_graph/tiny-yolo-voc-custom.pb \
--output_file=built_graph/model.lite \
--input_format=TENSORFLOW_GRAPHDEF \
--output_format=TFLITE \
--input_shape=1,416,416,3 \
--input_array=input \
--output_array=output \
--inference_type=FLOAT \
--input_data_type=FLOAT
- Enes Karanfil - Student at TOBB University of Economics and Technology - [email protected]
This project is licensed under the MIT License - see the LICENSE.md file for details
- Paper based research and training process
- YOLOV3 -tiny based-
- Realtime Streaming
- One-pass CNN structure
- RoadPulse
Darkflow