448 bytes should be enough, the first 448 or last 448 in 64k range.
I tried your code it doesn't work too.
The reason for using auto.inc and failover.inc, i don't want to set cs, ds, es....second time. Also I wonder it will need go back to crt0.s to start from __main ...
YH