1. X-LANCE Lab, Department of Computer Science and Engineering, MoE Key Lab of Artificial Intelligence, AI Institute, Shanghai Jiao Tong University; State Key Lab of Media Convergence Production Technology and Systems, Beijing, China. zhenchi713@sjtu.edu.cn
2. X-LANCE Lab, Department of Computer Science and Engineering, MoE Key Lab of Artificial Intelligence, AI Institute, Shanghai Jiao Tong University; State Key Lab of Media Convergence Production Technology and Systems, Beijing, China
3. X-LANCE Lab, Department of Computer Science and Engineering, MoE Key Lab of Artificial Intelligence, AI Institute, Shanghai Jiao Tong University; State Key Lab of Media Convergence Production Technology and Systems, Beijing, China. chenlusz@sjtu.edu.cn
4. AISpeech Co., Ltd., Suzhou, China
5. X-LANCE Lab, Department of Computer Science and Engineering, MoE Key Lab of Artificial Intelligence, AI Institute, Shanghai Jiao Tong University; State Key Lab of Media Convergence Production Technology and Systems, Beijing, China. kai.yu@sjtu.edu.cn