Discrepancy in Predictions Between Laptop and STM Microcontroller for 1D CNN Model

khushbu_parmar · ‎2024-12-11

I am working on deploying a 1D CNN model on an STM microcontroller using STM32CubeMX with the X-CUBE-AI package to convert the TensorFlow Lite model into C code.

While testing the model, I noticed significant discrepancies in predictions:

On Laptop: Using the quantized model for inference gives reasonable predictions.
On Microcontroller: Using the same test data, the predictions differ substantially when validating the model on hardware.

Steps Taken:

Ensured the test data was preprocessed identically for both laptop and microcontroller predictions.
Used X-CUBE-AI to generate C code from the quantized TFLite model.
Deployed the generated C code on the microcontroller for validation.

Questions:

What could cause such prediction differences between the laptop and microcontroller?
Could this issue be related to quantization or numerical precision differences during deployment?
Are there any additional steps or configurations in X-CUBE-AI to ensure consistent performance between platforms?

fauvarque.daniel · ‎2024-12-12

The STM32Cube.AI (i.e. X-CUBE-AI) library used on the target has it's own implementation of the neural network kernels so you may see different results between the results when executed in Python versus the target.

In the X-Cube-AI tool you have 2 validate buttons, one to validate on the desktop using the same library as the one used on the target and one validate on target.

The results of the validate on desktop and validate on target should not differ as it is basically the same library. On the target we have some more optimizations based on the processor you use.

If you see a bad COS (less that 0.98) when you validate the model with real data then it is likely to be a bug in the implementation of the kernels that we should look at.

Could you share your model and ideally some input/output data so we can reproduce the issue ?

Thanks in advance

Regards

In order to give better visibility on the answered topics, please click on 'Accept as Solution' on the reply which solved your issue or answered your question.

View solution in original post

fauvarque.daniel · ‎2024-12-12

The STM32Cube.AI (i.e. X-CUBE-AI) library used on the target has it's own implementation of the neural network kernels so you may see different results between the results when executed in Python versus the target.

In the X-Cube-AI tool you have 2 validate buttons, one to validate on the desktop using the same library as the one used on the target and one validate on target.

The results of the validate on desktop and validate on target should not differ as it is basically the same library. On the target we have some more optimizations based on the processor you use.

If you see a bad COS (less that 0.98) when you validate the model with real data then it is likely to be a bug in the implementation of the kernels that we should look at.

Could you share your model and ideally some input/output data so we can reproduce the issue ?

Thanks in advance

Regards

In order to give better visibility on the answered topics, please click on 'Accept as Solution' on the reply which solved your issue or answered your question.

khushbu_parmar · ‎2025-01-05

Thank you for your response. I’d like to clarify my issue further:

I am using a quantized .tflite model for inference. On my laptop, the predictions from this model (using TensorFlow Lite) are reasonable and consistent.
When I use X-CUBE-AI’s Validate on Desktop functionality to test the same model and input data, the predictions differ significantly from those obtained via TensorFlow Lite.
The same issue persists when validating on the target STM32 hardware (Validate on Target).

Steps I’ve taken:

Ensured consistent preprocessing for input data across platforms.
Verified that the quantized .tflite model is correctly converted using X-CUBE-AI.
Cross-checked the scaling and quantization parameters.

Given these discrepancies, I suspect there may be differences in how TensorFlow Lite and STM32Cube.AI implement quantized operations. Could you please confirm whether this is expected behavior or suggest further steps to investigate?

Thank you for your assistance.

Best regards,
Khushbu

fauvarque.daniel · ‎2025-01-06

Yes I confirm that the implementation of the kernels are different.

If you see a significant accuracy drop using X-CUBE-AI then it may be an issue we have to work on.

Regards

In order to give better visibility on the answered topics, please click on 'Accept as Solution' on the reply which solved your issue or answered your question.

khushbu_parmar · ‎2025-01-26

Thank you for your previous response. I trained a more complex and generalized model and achieved satisfactory results when deploying it on the STM32 board using X-CUBE-AI. Thank you for your support.

I am now exploring the possibility of training the model directly on the STM32 board to enable adaptive on-device learning. My model is a 1D Convolutional Neural Network (CNN) designed to predict target features from historical time-series data.

Could you please confirm if any STM32 microcontrollers support on-board training? If so, I would appreciate guidance on the tools, libraries, or methods to enable this functionality. Alternatively, if on-board training is not currently supported, any recommendations for adaptive personalization strategies would also be helpful.

Thank you for your time and assistance. I look forward to your response.

Best regards,
Khushbu

Julian E. · ‎2025-01-27

Hello @khushbu_parmar,

On board training is not supported by X Cube AI.

As you are working with 1D time series data, you may try to use NanoEdge AI Studio instead.

The tool is very easy to use, it will take you just a few minutes to check if you get good performances.

In Nanoedge, you just import data, the tool looks for the best model given your data and then it output a C library with 2 to 3 functions to use.

Anomaly detection support training on device.

Doc:

NanoEdge: AI:NanoEdge AI Studio - stm32mcu
C integration: AI:NanoEdge AI Library for anomaly detection (AD) - stm32mcu
Example: AI:How to Build an Anomaly Detection Project for Predictive Maintenance with NanoEdge AI Studio - stm32mcu

Have a good day,

Julian

In order to give better visibility on the answered topics, please click on 'Accept as Solution' on the reply which solved your issue or answered your question.