Towards understanding the mixture-of-experts layer in deep learningJan 1, 2022ยทZixiang ChenYihe Deng,Yue Wu,Quanquan Gu,Yuanzhi Liยท 0 min read PDF CiteTypeJournal articlePublicationAdvances in neural information processing systems (NeurIPS)Last updated on Jan 1, 2022Mixture of Experts Deep Learning Theory AuthorsYihe DengPhD Student ← PhyGCN: Pre-trained Hypergraph Convolutional Neural Networks with Self-supervised Learning Oct 1, 2023Adversarial training with fast gradient projection method against synonym substitution based text attacks Jan 1, 2021 →