Journal of Computational and Applied Mathematics2022,Vol.40517.DOI:10.1016/j.cam.2021.113943

A sequential quadratic hamiltonian algorithm for training explicit RK neural networks

Hofmann, S. Borzi, A.
Journal of Computational and Applied Mathematics2022,Vol.40517.DOI:10.1016/j.cam.2021.113943

A sequential quadratic hamiltonian algorithm for training explicit RK neural networks

Hofmann, S. 1Borzi, A.1
扫码查看

作者信息

  • 1. Univ Wurzburg
  • 折叠

Abstract

A sequential quadratic hamiltonian (SQH) algorithm for solving nonsmooth supervised learning problems (SLPs) in the framework of residual neural networks is presented. In this framework, a SLP is interpreted as an optimal control problem and the SQH algorithm determines a solution using the characterization of optimality given by a discrete version of the Pontryagin maximum principle. Convergence and stability of the proposed algorithm is investigated theoretically in the framework of residual neural networks with Runge-Kutta structure, and its computational performance is compared to that of the so-called extended method of successive approximations. Results of numerical experiments demonstrate the superior performance of the SQH algorithm in terms of efficiency and robustness of the training process. (c) 2021 Elsevier B.V. All rights reserved.

Key words

Residual neural networks/Runge-Kutta neural networks/Discrete Pontryagin maximum principle/Sequential quadratic hamiltonian algorithm/Method of successive approximations/Numerical optimization/CONVERGENCE

引用本文复制引用

出版年

2022
Journal of Computational and Applied Mathematics

Journal of Computational and Applied Mathematics

EISCI
ISSN:0377-0427
被引量4
参考文献量37
段落导航相关论文