STM32F407 DSP - execution speed, clock and cycle definition
Hi,
I need to calculate DSP speed for STM32F407, but I can't find ARM documentation on DSP extension of Cortex-M4 nor any documentation of DSP extension implementation in STM32F4.
I know that Cortex-M4 have pipeline with 3 stage + branch speculation.
So, if we disregard pipeline, worst case time it takes for one instruction to complete is 4 periods of master clock. Right? On average, due to pipeline, this time approaches to one period of master clock. Right?
But, is the same truth for DSP instructions? - I think that it's not, but I'm not sure.
ARM site only says, quote: "Single cycle dual 16-bit MAC". But what is one cycle in this situation? Is it one cycle of master clock, one DSP instruction cycle or something else?
How is this "Cycle" defined?
Basically, I want to know how much 16bit MAC instructions per second can be executed by STM32F407 working at certain clock (master clock).
By the way, on which bus is DSP extension connected in STM32F407?
Thank you very much.