Improve documentation of GPU code
- explain the differences between the various packing schemes and communication schemes for GPU
- explain how communicators work for blocks on the same process vs. blocks on different processes
- fix broken links to classes and methods
- reduce number of doxygen warnings