I'm attempting to optimize the amount of instructions/time it takes to move data from CPU to GPIO. When doing ODR = X, I lose about 40ns, closer to 80 because I need to use 2 ODR writes to different ports and in general ODR output seems to sit at abo...