Thanks for all the suggestions, turned out we had NPROC set too high on Packages.
This server had the most significant reduction (from load spikes of 120+ to a load average under 1), surprisingly enough it was still functioning pretty well under those crazy load spikes, high iowait (between 60-95%) was causing the load:
data:image/s3,"s3://crabby-images/9b05f/9b05fd52a68e6a3a1b5f7a7bbe8e8c243d394b31" alt=""
The following commands were helpful in narrowing the problem down in case anyone else runs into this:
Monitor CPU / RAM usage:
top -c
Input/output disk statistics by process / thread:
iotop
Virtual memory statistics:
vmstat
Monitor system input / output statistics:
iostat
View the processes when a load spike hits, ultimately for us there were a bunch of D state processes causing this:
ps -e v