maskStore improvements
- fix the aligned version
- make sure the test case is incommensurate with the vector width
- implement a fallback for instruction sets that don't support it natively
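
A rough sketch of what the fallback and the incommensurate test could look like, not the project's actual code. Everything here is an assumption for illustration: the lane count `kWidth`, the names `maskStoreFallback` and `fillWithMaskedTail`, and the use of plain arrays in place of the real vector and mask types.

```cpp
#include <array>
#include <cstddef>

// Hypothetical lane count; real code would take this from the vector type.
constexpr std::size_t kWidth = 4;

// Hypothetical scalar fallback: write only the lanes whose mask entry is set.
// Unselected destination elements are never read or written, so a partially
// masked store at the tail of a buffer stays in bounds.
inline void maskStoreFallback(float* dst,
                              const std::array<float, kWidth>& values,
                              const std::array<bool, kWidth>& mask) {
    for (std::size_t i = 0; i < kWidth; ++i) {
        if (mask[i]) {
            dst[i] = values[i];
        }
    }
}

// Hypothetical driver: fill n elements with `value`, where n need not be a
// multiple of kWidth. The final iteration gets a partial mask.
inline void fillWithMaskedTail(float* out, std::size_t n, float value) {
    std::array<float, kWidth> v;
    v.fill(value);
    for (std::size_t base = 0; base < n; base += kWidth) {
        std::array<bool, kWidth> mask;
        for (std::size_t i = 0; i < kWidth; ++i) {
            mask[i] = (base + i) < n;  // false for lanes past the end
        }
        maskStoreFallback(out + base, v, mask);
    }
}
```

Running this with a length such as 7 against a width of 4, and placing a sentinel element just past the end, checks that the masked tail neither skips valid elements nor writes beyond them.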