Certifying the True Error: Machine Learning in Coq with Verified Generalization Guarantees

Published: 2019-07-17 Volume: 33 Page: 2662-2669
ISSN: 2374-3468
Container-title: Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title: AAAI

Author:

Bagnall Alexander, Stewart Gordon

Abstract

We present MLCERT, a novel system for doing practical mechanized proof of the generalization of learning procedures, bounding expected error in terms of training or test error. MLCERT is mechanized in that we prove generalization bounds inside the theorem prover Coq; thus the bounds are machine checked by Coq’s proof checker. MLCERT is practical in that we extract learning procedures defined in Coq to executable code; thus procedures with proved generalization bounds can be trained and deployed in real systems. MLCERT is well documented and open source; thus we expect it to be usable even by those without Coq expertise. To validate MLCERT, which is compatible with external tools such as TensorFlow, we use it to prove generalization bounds on neural networks trained using TensorFlow on the extended MNIST data set.
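
To make "bounding expected error in terms of training or test error" concrete, here is a minimal numeric sketch of a textbook Hoeffding-style bound. It is not taken from MLCERT and does not reproduce its Coq proofs. The function name `hoeffding_error_bound` and its parameters are hypothetical, and the sketch assumes i.i.d. samples, a 0/1 loss, and (for the training-error case) a finite hypothesis class handled by a union bound.

```python
import math


def hoeffding_error_bound(empirical_error: float, n: int, delta: float,
                          log_num_hypotheses: float = 0.0) -> float:
    """Upper bound on expected 0/1 error that holds with probability >= 1 - delta.

    Assumptions (illustrative, not from the paper):
      * the n samples are drawn i.i.d. from the target distribution;
      * log_num_hypotheses = ln|H| for a finite hypothesis class H when the
        empirical error is a *training* error (union bound over H);
        leave it at 0.0 when the empirical error is measured on a held-out
        test set, since a single fixed hypothesis is being evaluated.
    """
    if n <= 0:
        raise ValueError("n must be positive")
    if not 0.0 < delta < 1.0:
        raise ValueError("delta must lie strictly between 0 and 1")
    if not 0.0 <= empirical_error <= 1.0:
        raise ValueError("empirical_error must lie in [0, 1]")
    if log_num_hypotheses < 0.0:
        raise ValueError("log_num_hypotheses must be non-negative")

    # One-sided Hoeffding inequality: P(true - empirical >= eps) <= exp(-2 n eps^2).
    # Setting |H| * exp(-2 n eps^2) = delta and solving for eps gives:
    eps = math.sqrt((log_num_hypotheses + math.log(1.0 / delta)) / (2.0 * n))

    # An error rate can never exceed 1, so clip the bound.
    return min(1.0, empirical_error + eps)


if __name__ == "__main__":
    # Held-out test set: 2% measured error on 10,000 samples, 95% confidence.
    print(hoeffding_error_bound(0.02, n=10_000, delta=0.05))
    # Training error with a union bound over 2**16 hypotheses.
    print(hoeffding_error_bound(0.02, n=10_000, delta=0.05,
                                log_num_hypotheses=16 * math.log(2)))
```

The gap term shrinks as the sample size grows and widens as the hypothesis class grows. Per the abstract, the point of the system described here is that bounds of this general kind are proved and checked inside Coq rather than computed informally.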

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Cited by 9 articles.

1. Verifying the Generalization of Deep Learning to Out-of-Distribution Domains; Journal of Automated Reasoning; 2024-08-03

2. Experimenting with an Intrinsically-Typed Probabilistic Programming Language in Coq; Programming Languages and Systems; 2023

3. Formalizing Piecewise Affine Activation Functions of Neural Networks in Coq; Lecture Notes in Computer Science; 2023

4. CheckINN: Wide Range Neural Network Verification in Imandra; Proceedings of the 24th International Symposium on Principles and Practice of Declarative Programming; 2022-09-20

5. Neural Networks in Imandra: Matrix Representation as a Verification Choice; Lecture Notes in Computer Science; 2022