1. Efficient representation learning via adaptive context pooling;huang;International Conference on Machine Learning,2022
2. Ponet: Pooling network for efficient token mixing in long sequences;tan;International Conference on Learning Representations,2022
3. IEMOCAP: interactive emotional dyadic motion capture database
4. SUPERB: Speech Processing Universal PERformance Benchmark
5. Branchformer: Parallel mlp-attention architectures to capture local and global context for speech recognition and understanding;peng;International Conference on Machine Learning,2022