Add CI job for non-x86 vectorization

16 jobs for qemu in 49 minutes and 30 seconds (queued for 3 seconds)