The Effect of Network Width on the Performance of Large-batch Training

Published in NeurIPS, 2018