Alexa Teacher Model

Authors:

Jack FitzGerald (1), Shankar Ananthakrishnan (2), Konstantine Arkoudas (3), Davide Bernardi (4), Abhishek Bhagia (5), Claudio Delli Bovi (6), Jin Cao (3), Rakesh Chada (5), Amit Chauhan (5), Luoxin Chen (2), Anurag Dwarakanath (7), Satyam Dwivedi (7), Turan Gojayev (6), Karthik Gopalakrishnan (8), Thomas Gueudre (6), Dilek Hakkani-Tur (9), Wael Hamza (3), Jonathan J. Hüser (6), Kevin Martin Jose (6), Haidar Khan (3), Beiye Liu (3), Jianhua Lu (2), Alessandro Manzotti (10), Pradeep Natarajan (11), Karolina Owczarzak (2), Gokmen Oz (2), Enrico Palumbo (12), Charith Peris (2), Chandana Satya Prakash (2), Stephen Rawls (3), Andy Rosenbaum (2), Anjali Shenoy (7), Saleh Soltan (3), Mukund Harakere Sridhar (2), Lizhen Tan (2), Fabian Triefenbach (6), Pan Wei (2), Haiyang Yu (2), Shuai Zheng (5), Gokhan Tur (9), Prem Natarajan (13)

Affiliations:

1. Amazon, Denver, CO, USA

2. Amazon, Cambridge, MA, USA

3. Amazon, New York, NY, USA

4. Amazon, Turin, Italy

5. Amazon, Seattle, WA, USA

6. Amazon, Aachen, Germany

7. Amazon, Bangalore, India

8. Amazon, Santa Clara, CA, USA

9. Amazon, Sunnyvale, CA, USA

10. Amazon, Turin, Italy

11. Amazon, Chicago, IL, USA

12. Spotify, Turin, Italy

13. Amazon, Los Angeles, CA, USA

Publisher:

ACM

References (55 articles):

1. Armen Aghajanyan, Anchit Gupta, Akshat Shrivastava, Xilun Chen, Luke Zettlemoyer, and Sonal Gupta. 2021. Muppet: Massive Multi-task Representations with Pre-Finetuning. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), 5799--5811. https://doi.org/10.18653/v1/2021.emnlp-main.468

2. Jimmy Ba and Rich Caruana. 2014. Do Deep Nets Really Need to be Deep? In Advances in Neural Information Processing Systems, Z. Ghahramani, M. Welling, C. Cortes, N. Lawrence, and K. Q. Weinberger (Eds.), Vol. 27. Curran Associates, Inc. https://proceedings.neurips.cc/paper/2014/file/ea8fcd92d59581717e06eb187f10666d-Paper.pdf

3. Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D. Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel Ziegler, Jeffrey Wu, Clemens Winter, Chris Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam McCandlish, Alec Radford, Ilya Sutskever, and Dario Amodei. 2020. Language Models are Few-Shot Learners. In Advances in Neural Information Processing Systems, Vol. 33 (2020), 1877--1901. https://proceedings.neurips.cc/paper/2020/file/1457c0d6bfcb4967418bfb8ac142f64a-Paper.pdf

4. Cristian Buciluǎ, Rich Caruana, and Alexandru Niculescu-Mizil. 2006. Model Compression. In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (Philadelphia, PA, USA) (KDD '06). Association for Computing Machinery, New York, NY, USA, 535--541. https://doi.org/10.1145/1150402.1150464

5. Jin Cao, Jun Wang, Wael Hamza, Kelly Vanee, and Shang-Wen Li. 2020. Style Attuned Pre-training and Parameter Efficient Fine-tuning for Spoken Language Understanding. (2020).

Cited by 6 articles:

1. A Lightweight and Effective Multi-View Knowledge Distillation Framework for Text-Image Retrieval. 2024 International Joint Conference on Neural Networks (IJCNN), 2024-06-30.

2. A criteria-based classification model using augmentation and contrastive learning for analyzing imbalanced statement data. Heliyon, 2024-06.

3. Initial Development and Performance Evaluation of a Bengali Voice-Operated Virtual Assistant for Personal Computer Control. 2023 IEEE 64th International Scientific Conference on Information Technology and Management Science of Riga Technical University (ITMS), 2023-10-05.

4. A Mixed-Methods Approach to Understanding User Trust after Voice Assistant Failures. Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023-04-19.

5. Optimal Transport Posterior Alignment for Cross-lingual Semantic Parsing. Transactions of the Association for Computational Linguistics, 2023.
