Scott Duplichan wrote:
for AMD processors rep movs and rep stos performance is not great, though better than a byte loop.
Intel 386-Pentium is when I last did performance critical code like this. 32-bit mov,mov,add,add,dec ecx,jnz was the standard back then. rep for size optimization, but not performance.
//Peter