Cloud operators utilize collective communication optimizers to enhance the efficiency of the single-tenant, centrally managed training clusters they manage. However, current optimizers struggle to ...
Abstract: We propose a deep reinforcement learning (DRL) approach to optimize the multi commodity flow problem under uncertainty. We frame the problem as a Markov decision problem (MDP) and develop a ...