Automatically align to what is required for vectorization

If this cannot be detected because cpuinfo is missing, use 512 bit
8 jobs for alignment in 1 minute and 38 seconds (queued for 1 second)