LD instruction with pre-decrement can load data from program memory since the flash is memory
mapped. Loading data from the data memory takes two clock cycles, and loading from the program
memory takes three clock cycles. But if an interrupt occur (before the last clock cycle) no additional
clock cycles are necessary when loading from the program memory. Hence, the instruction takes
only one clock cycle to execute.
Atmel AVR Instruction Set Manual [OTHER]
Atmel-0856L-AVR-Instruction-Set-Manual_Other-11/2016
114