NUCLEO-N657X0-Q: NPU-Enabled AI Model Fails to Run and Shows Poor Performance
Hi,
I am currently working with the NUCLEO-N657X0-Q board and the STM32N6570-DK board. I am deploying and testing my machine learning models using STM32Cube AI Studio.
I have noticed something unusual. The model runs successfully on every target board I have tried, with one exception: on the NUCLEO-N657X0-Q it fails to run when the NPU is enabled. I am not sure why this is happening and would appreciate any insights or suggestions from the community.
Another issue is that the NUCLEO-N657X0-Q shows significantly higher inference times than the other boards. Surprisingly, even a Cortex-M7-based board appears to perform better in some cases. This is unexpected given the capabilities of the Cortex-M55 and the NPU.
Has anyone experienced similar behavior with the NUCLEO-N657X0-Q? Are there any known configuration issues, performance settings, memory considerations, or NPU-related limitations that I should check?
Any help or guidance would be greatly appreciated.
Thank you!
