Martin Harper wrote: > > So there are several stages in the hardware: > 1) load the whole 16 bits from memory or cache This should be "whole 4 bits". I'm an idiot...