RDMA over Commodity Ethernet at Scale - Chuanxiong Guo

TCP does not work for distributed DNN training; For 16-GPU, 2-host speech training with CNTK, TCP communications dominant the training ...
展开查看详情