2025-01-22 02:21 AM
Hello,
We have run into some issues regarding SPI in interrupt mode, especially when building with optimizations turned on. We are running SPI at 21MHz.
The situation:
We transmit one byte using HAL_SPI_Transmit_IT. This immediately triggers a TXE interrupt which puts our one byte into the DR register to be send by the peripheral. The hal interrupt routing checks if any more bytes must be send before calling the user callback to let us know the transmit is finished. All good so far.
We then want to receive 3 bytes using HAL_SPI_Receive_IT. The hal will start by sending out dummy bytes, again the TXE interrupt is called where a byte is put in the DR register. This IRQ returns and we wait for an interrupt for TXE letting the us know we can put the next byte in the DR register. During this second TXE IRQ the first byte is clocked out and now our receive buffer contains a byte, setting the RXNE flag. We finish putting the second transmit byte in the DR register and enter the RXNE IRQ.
This is where it gets tricky, remember that we put the second byte in the DR register, that is being clocked out while we are handling the RXNE IRQ from the first byte. Depending on the timing/speed of the IRQ we might be fast enough to still catch the first byte before it is overwritten by the second.
Proposed fix #1: Do not start sending out more bytes just because the TXE flag is set, the hal should check if the previous byte is successfully handled/read before the next transmit can start. (Described here as well)
The above situation should set the OVR bit to indicate to us that an overrun has been detected. However, as this post describes, the next TXE IRQ actually clears this flag, making the overrun virtually undetectable. The user callback is never called, as the hal rx counter will never reach the desired amount (3) as it only serviced 2 RXNE IRQs.
Proposed fix #2: Check for the OVR flag before servicing TXE interrupts.
Summarized:
If the IRQs cannot be handled quickly enough (either because other interrupts are being handled or because the hal IRQs take way too long), an undetectable overrun can occur, leaving the hal SPI in a waiting state.
Situation is summarized in the picture below.
I'd be happy to provide more details if required.