Must be your C++ code, asm should be really fast to compiler. What other flags did you set, are you doing multiple passes?

As for your errors: see post above about referencing with asm.