Optimizing Sorting for Chiplet-Based CPUs