CrossBow: Scaling Deep Learning on Multi-GPU Servers