Make create_staggered_kernel work with OpenMP

8 jobs for staggered in 2 minutes and 21 seconds (queued for 2 seconds)