Single-shot optical neural network-Reference-Cited by-同舟云学术

Single-shot optical neural network

Published:2023-06-23 Issue:25 Volume:9 Page:
ISSN:2375-2548
Container-title:Science Advances
language:en
Short-container-title:Sci. Adv.

Author:

Bernstein Liane¹^ORCID,Sludds Alexander¹²^ORCID,Panuski Christopher¹,Trajtenberg-Mills Sivan¹,Hamerly Ryan¹³^ORCID,Englund Dirk¹^ORCID

Affiliation:

1. Research Laboratory of Electronics, Massachusetts Institute of Technology, 50 Vassar St, Cambridge, MA 02139, USA.

2. Lightmatter Inc., 100 Summer St, Boston, MA 02110, USA.

3. NTT Research Inc., Physics and Informatics Laboratories, Sunnyvale, CA 94085, USA.

Abstract

Analog optical and electronic hardware has emerged as a promising alternative to digital electronics to improve the efficiency of deep neural networks (DNNs). However, previous work has been limited in scalability (input vector length K ≈ 100 elements) or has required nonstandard DNN models and retraining, hindering widespread adoption. Here, we present an analog, CMOS–compatible DNN processor that uses free-space optics to reconfigurably distribute an input vector and optoelectronics for static, updatable weighting and the nonlinearity—with K ≈ 1000 and beyond. We demonstrate single-shot-per-layer classification of the MNIST, Fashion-MNIST, and QuickDraw datasets with standard fully connected DNNs, achieving respective accuracies of 95.6, 83.3, and 79.0% without preprocessing or retraining. We also experimentally determine the fundamental upper bound on throughput (∼0.9 exaMAC/s), set by the maximum optical bandwidth before substantial increase in error. Our combination of wide spectral and spatial bandwidths enables highly efficient computing for next-generation DNNs.

Publisher

American Association for the Advancement of Science (AAAS)

Subject

Multidisciplinary

Link

https://www.science.org/doi/pdf/10.1126/sciadv.adg7904

Reference75 articles.

1. A. Krizhevsky I. Sutskever G. E. Hinton ImageNet classification with deep convolutional neural networks in Advances in Neural Information Processing Systems F. Pereira C. J. Burges L. Bottou K. Q. Weinberger Eds. (Curran Associates Inc. 2012) vol. 25 pp. 1097–1105.

2. A. Vaswani N. Shazeer N. Parmar J. Uszkoreit L. Jones A. N. Gomez Ł. Kaiser I. Polosukhin Attention is all you need in Advances in Neural Information Processing Systems I. Guyon U. Von Luxburg S. Bengio H. Wallach R. Fergus S. Vishwanathan R. Garnett Eds. (Curran Associates Inc. 2017) vol. 30 pp. 5998–6008.

3. A guide to deep learning in healthcare

4. Scaling for edge inference of deep neural networks

5. J. Kaplan S. McCandlish T. Henighan T. B. Brown B. Chess R. Child S. Gray A. Radford J. Wu D. Amodei Scaling laws for neural language models. arXiv:2001.08361 [cs.LG] (23 January 2020).

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Role of all-optical neural networks;Physical Review Applied;2024-01-17

2. Multichannel meta-imagers for accelerating machine vision;Nature Nanotechnology;2024-01-04

3. Photonic optical accelerators: The future engine for the era of modern AI?;APL Photonics;2023-11-01

4. The physics of optical computing;Nature Reviews Physics;2023-10-09