The hardware executes an instruction for all threads in the same warp, before moving to the next instruction.
- This style of execution, called single-instruction, multiple-thread (SIMT), is motivated by hardware cost constraints, as it allows the cost of fetching and processing an instruction to be amortized over a large number of threads.
- It works well when all threads within a warp follow the same control flow path when working their data.