ESP32-CAM-Sign-Recognition

Overview

The aim of this university project is to implement a convolutional neural network (CNN) for gesture recognition on the ESP32-CAM. Keras is used to train the model, and TensorFlow Lite is used to deploy it on a microcontroller such as the ESP32. The model is trained on an American Sign Language data set. Several optimizations were made to improve recognition accuracy in practice: merging labels and artificially augmenting the data set.

CNN architecture

Tensors

Input: 28 × 28 = 784 pixels
Output: 24 labels (merged down to 4 as an optimization)
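
As an illustration of where the 784 inputs and 24 labels come from, here is a minimal sketch. It assumes the usual Sign Language MNIST convention, where label i stands for the i-th letter of the alphabet and J (9) and Z (25) never occur because those signs involve motion. The helper name label_to_letter is hypothetical, and the exact way the project merges the 24 labels down to 4 is not shown here.

```python
import string

# Assumption: Sign Language MNIST labels run from 0 to 25 (A to Z),
# with J (9) and Z (25) absent because those signs require motion.
EXCLUDED = {"J", "Z"}
LETTERS = [c for c in string.ascii_uppercase if c not in EXCLUDED]

INPUT_SHAPE = (28, 28)  # grayscale image, one value per pixel


def label_to_letter(label: int) -> str:
    """Map a dataset label (0..25) to its letter, rejecting J and Z."""
    if not 0 <= label < 26:
        raise ValueError(f"label {label} is out of range")
    letter = string.ascii_uppercase[label]
    if letter in EXCLUDED:
        raise ValueError(f"label {label} ({letter}) is not in the data set")
    return letter


print(INPUT_SHAPE[0] * INPUT_SHAPE[1])  # 784
print(len(LETTERS))                     # 24
# Dataset labels of the four letters used in the results section.
print([string.ascii_uppercase.index(c) for c in "CHLY"])  # [2, 7, 11, 24]
```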

Hidden layers

3 × (Convolution + ReLU + Max-pooling)
1 × Flatten
1 × (Fully-connected + ReLU)
1 × (Fully-connected + Softmax)
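
To make the layer stack above more concrete, the following sketch traces how the tensor shape might evolve from the 28 × 28 input to the Flatten layer. The kernel padding ("same"), the 2 × 2 pooling window and the filter counts (32, 64, 128) are assumptions chosen for illustration; the values used in the notebook may differ.

```python
def conv_same(h, w, channels, filters):
    """Stride-1 convolution with 'same' padding: height and width are kept."""
    return h, w, filters


def max_pool(h, w, channels, size=2):
    """Max-pooling without padding: height and width are floor-divided."""
    return h // size, w // size, channels


def trace_shapes(input_shape=(28, 28, 1), filters=(32, 64, 128)):
    """Return the shape after each Convolution + Max-pooling stage,
    plus the length of the vector produced by Flatten.

    The filter counts are hypothetical, not taken from the project.
    """
    h, w, c = input_shape
    shapes = [input_shape]
    for f in filters:
        h, w, c = conv_same(h, w, c, f)  # ReLU does not change the shape
        h, w, c = max_pool(h, w, c)
        shapes.append((h, w, c))
    return shapes, h * w * c


shapes, flat_size = trace_shapes()
for shape in shapes:
    print(shape)
print("flatten:", flat_size)
```

Under these assumptions the spatial size goes 28 → 14 → 7 → 3, so the first fully-connected layer receives 3 × 3 × 128 = 1152 values, and the final Softmax layer outputs one probability per label.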

How to create the binary model file with TFLite

At the end of training in the notebook, TensorFlow Lite creates ASL_256_lite.tflite. To create the binary model file model_data.cc for the ESP32 (p_det_model.cpp in the source code), enter this command in a terminal opened in the folder that contains the .tflite file:

xxd -i ASL_256_lite.tflite > model_data.cc
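
If xxd is not available (for example on Windows), the same kind of C array can be produced with a few lines of Python. This is a minimal sketch that approximates the layout of xxd -i; the function name to_c_array is hypothetical, and unlike xxd it does not prefix symbol names that start with a digit.

```python
import re


def to_c_array(data: bytes, source_name: str, per_line: int = 12) -> str:
    """Render bytes as a C array in the style of `xxd -i`.

    The symbol name is the file name with every non-alphanumeric
    character replaced by an underscore, as xxd does.
    """
    symbol = re.sub(r"[^0-9A-Za-z]", "_", source_name)
    lines = []
    for start in range(0, len(data), per_line):
        chunk = data[start:start + per_line]
        lines.append("  " + ", ".join(f"0x{byte:02x}" for byte in chunk))
    body = ",\n".join(lines)
    return (
        f"unsigned char {symbol}[] = {{\n{body}\n}};\n"
        f"unsigned int {symbol}_len = {len(data)};\n"
    )


# Demo on placeholder bytes; for the real model, read the file with
# open("ASL_256_lite.tflite", "rb").read() and write the result to model_data.cc.
print(to_c_array(b"TFL3", "ASL_256_lite.tflite"))
```

The generated array and its length variable can then be referenced from the ESP32 source in the same way as the output of xxd.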

Results with C-H-L-Y labels

~100% on the test data (ASL data set), but this figure is biased
~70% on real captures (ESP32-CAM)

Documentation

Board: AI Thinker ESP32-CAM
Data set: MNIST ASL
CNN modeling and training: Keras
CNN post-training implementation on the microcontroller: TensorFlow Lite

Special thanks

TensorFlow Lite (C++ API for ESP32)
