Towards understanding the mixture-of-experts layer in deep learning

Jan 1, 2022
Zixiang Chen, Yihe Deng, Yue Wu, Quanquan Gu, Yuanzhi Li
Publication: Advances in Neural Information Processing Systems (NeurIPS)