Remove most of m64 type usage
MMX instructions were first SIMD version, the registers management is complex (_mm_empty())
so, it is better to remove it now and use more recent instructions
anyway, gcc/clang should replace it automatically as long as we enable sse2, that should always be the case but for code understanding and for ARM porting for example, it is better to explicitly remove it