Abstract: Modern training clusters run deep learning jobs in a distributed fashion. In each communication round, distributed machine learning jobs generate multiple parallel data flows, ...