MCUXpresso SDK Documentation

tflm_label_image#

Overview#

This example demonstrates image classification on MCUs with a TensorFlow Lite Micro model. It is based on the TensorFlow Lite Label Image example, adapted to run on MCUs.

In this example, a static test image (“stopwatch”) is evaluated for classification into one of 1000 ImageNet classes.

Features#

  • Model Architecture: Mobilenet V1 0.25 128 quantized convolutional neural network

  • Input: 3-channel color image (128×128 pixels)

  • Output: Classification into 1000 classes

  • Detection Threshold: 23%
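The detection threshold is applied after inference: a label is reported only when its score meets the threshold. A minimal sketch of that logic (hypothetical 4-class scores for illustration; the actual example implements this in C++ via get_top_n.cpp and output_postproc.cpp):

```python
DETECTION_THRESHOLD = 0.23  # 23%, as in this example

def top_prediction(scores, labels, threshold=DETECTION_THRESHOLD):
    """Return (label, score) for the best class, or None if below threshold."""
    best = max(range(len(scores)), key=lambda i: scores[i])
    if scores[best] < threshold:
        return None  # no confident detection
    return labels[best], scores[best]

# Hypothetical 4-class scores (the real model outputs 1000 classes):
labels = ["stopwatch", "clock", "compass", "barometer"]
print(top_prediction([0.87, 0.05, 0.04, 0.04], labels))  # ('stopwatch', 0.87)
```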

Network Structure#

The neural network is based on Mobilenet V1 architecture:

  • Depthwise separable convolutions for efficient computation

  • Quantized model optimized for MCU deployment
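A depthwise separable convolution factors a standard convolution into a per-channel depthwise pass plus a 1×1 pointwise pass, which is what makes Mobilenet V1 cheap enough for MCUs. A rough weight-count comparison for a single 3×3 layer (illustrative channel counts, not taken from this specific model):

```python
def conv_weights(k, c_in, c_out):
    """Weight count of a standard k x k convolution."""
    return k * k * c_in * c_out

def dw_separable_weights(k, c_in, c_out):
    """Weight count of a depthwise (k x k per channel) + pointwise (1 x 1) pair."""
    return k * k * c_in + c_in * c_out

c_in, c_out = 64, 128
print(conv_weights(3, c_in, c_out))          # 73728 weights
print(dw_separable_weights(3, c_in, c_out))  # 8768 weights, ~8.4x fewer
```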

Model Origin#


Project Files#

| File | Description |
|------|-------------|
| main.cpp | Example main function |
| stopwatch.bmp | Static test image (source: Wikimedia Commons) |
| image_data.h | Image converted to C array (RGB values) |
| labels.h | Names of object classes |
| timer.c | Timer source code |
| image/* | Image capture and pre-processing code |
| video/* | Camera and display handling |
| get_top_n.cpp | Top-N results retrieval |
| model.cpp | Model initialization and inference |
| model_mobilenet_ops.cpp | Model operations registration |
| output_postproc.cpp | Output post-processing |
| model_data.h | Model data from .tflite converted to C array |

Project Structure in MCU SDK#

Only primary files are listed here:

Path_to_MCUSDK/mcuxsdk/examples/
├── eiq_examples/
│   ├── common/                # Common utilities for eIQ examples
│   │   ├── image/*            # Image capture and pre-processing module
│   │   └── video/*            # Camera and display handling module
│   └── tflm_label_image/
│       ├── CMakeLists.txt
│       ├── image_data.h
│       ├── labels.h
│       ├── labels.txt
│       ├── main.cpp
│       ├── readme.md
│       └── stopwatch.bmp
└── _boards/
    └── <board_name>/
        └── eiq_examples/
            └── tflm_label_image/
                ├── example_board_readme.md
                └── pcq_npu    # NPU version of the model files
                    ├── mobilenet_v1_0.25_128_quant_int8_npu.tflite
                    ├── model_mobilenet_ops_npu.cpp
                    └── model_data.h

(For the boards without NPU hardware, the CPU version of the model files is located in the pcq/ folder.)

Project Structure in IDE#

(1) Project Structure in MCUXpresso IDE

Only primary files are listed here:

.
├── eiq/
│   ├── neutron/             # Neutron NPU support
│   └── tensorflow-lite/
├── source/                  # Example source files
│   ├── get_top_n.cpp
│   ├── get_top_n.h
│   ├── image.h
│   ├── image_data.h
│   ├── image_utils.h
│   ├── labels.h
│   ├── main.cpp
│   ├── model.cpp
│   ├── model.h
│   ├── model_data.h
│   ├── model_mobilenet_ops_npu.cpp
│   ├── output_postproc.cpp
│   ├── output_postproc.h
│   ├── stopwatch.bmp
│   ├── timer.c
│   └── timer.h
└── doc/                     # Documentation
    └── readme.md

The library files and header files in the eiq/neutron/ folder are Neutron-Software related files.

Users can update the Neutron-Software version in this project by replacing these four files in the eiq/neutron/ folder: NeutronDriver.h, NeutronErrors.h, libNeutronDriver.a, and libNeutronFirmware.a.

(2) Project Structure in MCUXpresso for VS Code

Please select the following two options in the MCUXpresso for VS Code extension settings:

  • [✓] Mcuxpresso > Experimental: Enable Add Files To Project

  • [✓] Mcuxpresso > Experimental: Enable Freestanding Copy Board

After importing a Freestanding project from the MCU SDK, the project structure is as follows (only primary files are listed here):

.
├── CMakeLists.txt
├── labels.h
├── main.cpp
├── stopwatch.bmp
├── image_data.h
├── image/*
├── cm33                   # Neutron-Software related
│   ├── libNeutronDriver.a
│   └── libNeutronFirmware.a
├── driver_include         # Neutron-Software related
│   └── NeutronDriver.h
├── include                # Neutron-Software related
│   └── NeutronErrors.h
├── common
│   ├── timer.c
│   ├── timer.h
│   └── timer_xtensa.h
├── model
│   ├── get_top_n.cpp
│   ├── get_top_n.h
│   ├── model.h
│   ├── output_postproc.cpp
│   └── output_postproc.h
├── pcq_npu
│   ├── mobilenet_v1_0.25_128_quant_int8_npu.tflite
│   ├── model_data.h
│   └── model_mobilenet_ops_npu.cpp
└── tflm
    ├── demo_info.cpp
    └── model.cpp

The library files and header files in the cm33/, driver_include/, and include/ folders are Neutron-Software related files.

Users can update the Neutron-Software version in this project by replacing these four files: NeutronDriver.h, NeutronErrors.h, libNeutronDriver.a, and libNeutronFirmware.a.

(For the boards without NPU hardware, the CPU version of the model file is located in the pcq/ folder.)

Replace the model file#

If you need to replace the model file with your own model, please follow two main steps:

  • Step 1: Modify the model_data.h file.

  • Step 2: Modify the model_mobilenet_ops_npu.cpp file.

Step 1: Modify the model_data.h file.#

(1) Use NPU model

For NPU boards, you can use the Neutron Converter tool to convert your .tflite model into an NPU-optimized version, which simultaneously generates an NPU-specific .tflite file and a corresponding .h header file.

  • You can obtain the Neutron Converter tool from the eIQ Neutron SDK.

  • After downloading and extracting the Neutron SDK Zip package, you can find the Neutron Converter tool at the following path: /eiq-neutron-sdk-linux-x.x.x/bin/neutron-converter.

# Set environment variables
export NEUTRON_SDK_PATH="/path/to/eiq-neutron-sdk-linux-x.x.x"
export LD_LIBRARY_PATH="${NEUTRON_SDK_PATH}/lib:${LD_LIBRARY_PATH}"
export PATH="${NEUTRON_SDK_PATH}/bin:${PATH}"

# Convert for mcxn94x target
neutron-converter --input model_name.tflite --output model_name_mcxn94x.tflite --target mcxn94x --dump-header-file-output

# Convert for imxrt700 target
neutron-converter --input model_name.tflite --output model_name_rt700.tflite --target imxrt700 --dump-header-file-output

Neutron Converter Target Platforms:

| Target | Description | Boards |
|--------|-------------|--------|
| mcxn94x | MCXN94x series | frdmmcxn947, mcxn5xxevk, mcxn9xxevk |
| imxrt700 | i.MX RT700 series | mimxrt700evk |

Modify the model_data.h file in the project:

  • Replace the original model data with the model_data array and model_data_len value from your model’s .h file.

  • Refer to the “Total data” value for graph “main” in the Neutron Converter terminal output and adjust the kTensorArenaSize value in the model_data.h file accordingly. Typically, kTensorArenaSize should be set to 105% or more of the “Total data” value.

# Example terminal output from neutron converter:
# The "Total data" value for graph "main" is 16,512 (bytes), so the kTensorArenaSize should be set to about 17,338 (bytes) (16,512 * 1.05)

Overall statistics for graph "main":
  Operators:
    ...<operator details>...
  Memory:
    Total data    = 16,512 (bytes) (Inputs + Outputs + Intermediate Variable Tensors)
    Total weights = 30,400 (bytes) (Weights)
    Total size    = 46,912 (bytes) (All)
// Modify the model_data.h file:

constexpr int kTensorArenaSize = <Total data * 1.05>;  // TensorArenaSize in (bytes)

static const uint8_t model_data[] __ALIGNED(16) __PLACEMENT = {
...<your model array>...
};

static const unsigned int model_data_len = <your model length>;

If the following error occurs while running the example, kTensorArenaSize is too small. Increase it accordingly.

Failed to resize buffer. Requested:16544, available 15928, missing:616
AllocateTensors() failed
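The 105% rule of thumb can be checked with a quick calculation, using the “Total data” figure from the sample converter output above (rounding up to be safe):

```python
import math

total_data = 16_512  # "Total data" for graph "main", from the converter output
arena = math.ceil(total_data * 1.05)
print(arena)  # 17338 bytes, the suggested kTensorArenaSize
```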

(2) Use CPU model

You can use the following workflow to run a CPU-version model in this example.

The xxd tool can convert your .tflite model file into a .h header file. The command is as follows:

xxd -i model_name.tflite > model_name.h

Alternatively, you can use the Neutron Converter (recommended) to generate the .h file with the --dump-header-file-input option. This converts your input CPU-version .tflite model into a CPU-version .h file.

neutron-converter --input model_name.tflite --output model_name_imxrt700.tflite --target imxrt700 --dump-header-file-input

Modify the model_data.h file in the project: replace the original model data with model_data array and model_data_len from your model’s .h file.

# Modify the model_data.h file:

static const uint8_t model_data[] __ALIGNED(16) __PLACEMENT = {
...<your model array>...
};

static const unsigned int model_data_len = <your model length>;
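If xxd is unavailable, an equivalent header can be produced with a few lines of Python. This is a sketch: the array name and the __ALIGNED/__PLACEMENT decorations follow the snippet above and may need adjusting for your project.

```python
def tflite_to_header(data: bytes, name: str = "model_data") -> str:
    """Render raw .tflite bytes as a C array, in the style of `xxd -i`."""
    hexes = [f"0x{b:02X}" for b in data]
    rows = [", ".join(hexes[i:i + 12]) for i in range(0, len(hexes), 12)]
    body = ",\n  ".join(rows)
    return (
        f"static const uint8_t {name}[] __ALIGNED(16) __PLACEMENT = {{\n"
        f"  {body}\n}};\n"
        f"static const unsigned int {name}_len = {len(data)};\n"
    )

# Demo on a few placeholder bytes; in practice, pass the contents of your
# .tflite file (e.g. open("model_name.tflite", "rb").read()).
demo = tflite_to_header(bytes([0x1C, 0x00, 0x00, 0x00]))
print(demo)
```

Paste the returned text over the original array in model_data.h.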

Step 2: Modify the model_mobilenet_ops_npu.cpp file.#

In this step, refer to the .h file generated by the Neutron Converter in Step 1 and update the registered model operators in the model_mobilenet_ops_npu.cpp file accordingly.

Typically, the .h file output by the Neutron Converter contains content similar to the following:

/*
// Register operators for TFLite Micro.
static tflite::MicroMutableOpResolver<4> s_microOpResolver;
s_microOpResolver.AddQuantize();
s_microOpResolver.AddSoftmax();
s_microOpResolver.AddDequantize();
s_microOpResolver.AddCustom(tflite::GetString_NEUTRON_GRAPH(), tflite::Register_NEUTRON_GRAPH());
*/

Copy the operator registration code from the .h file generated by the Neutron Converter into the model_mobilenet_ops_npu.cpp file.
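Since the registration snippet is embedded as a comment in the generated header, the operator list can also be extracted mechanically; a small sketch assuming the comment format shown above. Note that the MicroMutableOpResolver template argument (here <4>) must be at least the number of registered operators.

```python
import re

# Excerpt of the comment the Neutron Converter emits (as shown above).
header_comment = """
// Register operators for TFLite Micro.
static tflite::MicroMutableOpResolver<4> s_microOpResolver;
s_microOpResolver.AddQuantize();
s_microOpResolver.AddSoftmax();
s_microOpResolver.AddDequantize();
s_microOpResolver.AddCustom(tflite::GetString_NEUTRON_GRAPH(), tflite::Register_NEUTRON_GRAPH());
"""

# Pull out each AddXxx registration in order.
ops = re.findall(r"s_microOpResolver\.Add(\w+)", header_comment)
print(ops)  # ['Quantize', 'Softmax', 'Dequantize', 'Custom']
```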

Image Conversion Script#

The image_data.h file is generated using Python with OpenCV and NumPy:

import cv2
import numpy as np

img = cv2.imread('stopwatch.bmp')
img = cv2.resize(img, (128, 128))
img = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)

with open('image_data.h', 'w') as fout:
    print('#define STATIC_IMAGE_NAME "stopwatch"', file=fout)
    print('static const uint8_t image_data[] = {', file=fout)
    img.tofile(fout, ', ', '0x%02X')
    print('};\n', file=fout)

Note: For more details on model quantization and conversion, refer to the eIQ TensorFlow Lite User’s Guide included in the MCUXpresso SDK package.


Running the Demo#

Run result on MIMXRT700-EVK board with ARM GCC toolchain (NPU version):

Label image example using a TensorFlow Lite Micro model.
Detection threshold: 23%
Model: mobilenet_v1_0.25_128_quant_int8_npu
Core/NPU Frequency: 324 MHz
TensorArena Addr: 0x20000000 - 0x20040000
TensorArena Size: Total 0x40000 (262144 B); Used 0x246c0 (149184 B)
Model Addr: 0x20400000 - 0x2047b760
Model Size: 0x7b760 (505696 B)
Total Size Used: 654880 B (Model (505696 B) + TensorArena (149184 B))

Static data processing:
----------------------------------------
     Inference time: 2759 us
     Detected: stopwatch (87%)
----------------------------------------

Supported Boards with NPU#

  • FRDM-MCXN947

  • MCX-N5XX-EVK

  • MCX-N9XX-EVK

  • MIMXRT700-EVK

Supported Boards with CPU Only#

  • MIMXRT1040-EVK

  • EVKB-IMXRT1050

  • MIMXRT1060-EVKB

  • MIMXRT1060-EVKC

  • EVK-MIMXRT1064

  • MIMXRT1160-EVK

  • MIMXRT1170-EVKB

  • MIMXRT1180-EVK

  • FRDM-IMXRT1186

  • EVK-MIMXRT595

  • EVK-MIMXRT685

  • MIMXRT685-AUD-EVK