Learning the nonlinear flux function of a hidden scalar conservation law from data
Peer reviewed, Journal article
Published version
Permanent link
https://hdl.handle.net/11250/3088309
Publication date
2023-10
Original version
Li, Q. & Evje, S. (2023) Learning the nonlinear flux function of a hidden scalar conservation law from data. Networks and Heterogeneous Media, 18 (1), 48-79. 10.3934/nhm.2023003
Abstract
Nonlinear conservation laws are widely used in fluid mechanics, biology, physics, and chemical engineering. However, deriving such nonlinear conservation laws is a significant and challenging problem. An attractive alternative is to extract conservation laws more directly from observation data using machine learning methods. We propose a framework that combines a symbolic multi-layer neural network and a discrete scheme to learn the nonlinear, unknown flux function f(u) of the scalar conservation law
u_t + f(u)_x = 0    (*)
with u as the main variable. This identification is based on using observation data u(x_j, t_i) on a spatial grid x_j, j = 1, ..., N_x, at specified times t_i, i = 1, ..., N_obs.
A main challenge with Eq (*) is that the solution typically develops shocks, i.e., one or several jumps of the form (u_L, u_R) with u_L ≠ u_R that move in space and may change over time, so that information about f(u) in the interval associated with a jump is sparse or entirely absent from the observation data. Secondly, the lack of regularity in the solution of (*) and the nonlinear form of f(u) hamper the use of previously proposed physics-informed neural network (PINN) methods, where the underlying form of the sought differential equation is accounted for in the loss function. We circumvent this obstacle by approximating the unknown conservation law (*) by an entropy-satisfying discrete scheme where f(u) is represented through a symbolic multi-layer neural network. Numerical experiments show that the proposed method can uncover the hidden conservation law for a wide variety of nonlinear flux functions, ranging from purely concave/convex to highly non-convex shapes. This is achieved by relying on a relatively sparse amount of observation data combined with a selection of different initial data.
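
The idea of replacing (*) by a discrete scheme and comparing its output with the observations can be illustrated with a minimal sketch. The abstract does not say which entropy-satisfying scheme or which network architecture the authors use, so everything here is an assumption made for illustration: a Lax-Friedrichs-type (Rusanov) flux with a global wave-speed bound stands in for the scheme, a two-parameter polynomial stands in for the symbolic network, and the names `rusanov_step`, `simulate`, `candidate_flux`, and `loss` are hypothetical.

```python
import numpy as np

def rusanov_step(u, flux, max_speed, dx, dt):
    """One conservative update with a Rusanov (Lax-Friedrichs-type) flux."""
    # Zero-gradient boundaries via edge padding (an assumption for this sketch).
    up = np.pad(u, 1, mode="edge")
    ul, ur = up[:-1], up[1:]  # left and right states at each cell interface
    # Central average of the flux plus dissipation scaled by the speed bound.
    F = 0.5 * (flux(ul) + flux(ur)) - 0.5 * max_speed * (ur - ul)
    return u - dt / dx * (F[1:] - F[:-1])

def simulate(u0, flux, max_speed, dx, t_obs, cfl=0.5):
    """Advance u0 and return snapshots at the observation times t_obs."""
    dt_max = cfl * dx / max_speed  # CFL-limited time step
    u, t, snapshots = u0.copy(), 0.0, []
    for t_target in t_obs:
        while t < t_target - 1e-12:
            dt = min(dt_max, t_target - t)
            u = rusanov_step(u, flux, max_speed, dx, dt)
            t += dt
        snapshots.append(u.copy())
    return np.array(snapshots)

def candidate_flux(theta):
    """Hypothetical stand-in for the learned flux: f(u) = theta0*u + theta1*u^2."""
    return lambda u: theta[0] * u + theta[1] * u**2

def loss(theta, u0, u_obs, dx, t_obs, max_speed):
    """Mean squared mismatch between simulated and observed snapshots."""
    u_sim = simulate(u0, candidate_flux(theta), max_speed, dx, t_obs)
    return float(np.mean((u_sim - u_obs) ** 2))

# Synthetic observations from a "hidden" Burgers flux f(u) = u^2 / 2.
nx, length = 200, 2.0
dx = length / nx
x = (np.arange(nx) + 0.5) * dx                      # cell centres x_j
u0 = np.where((x > 0.2) & (x < 0.6), 1.0, 0.0)      # box-shaped initial data
t_obs = [0.25, 0.5]                                 # observation times t_i
max_speed = 2.0  # assumed bound on |f'(u)| for every candidate considered here

theta_true = (0.0, 0.5)
u_obs = simulate(u0, candidate_flux(theta_true), max_speed, dx, t_obs)

loss_true = loss(theta_true, u0, u_obs, dx, t_obs, max_speed)
loss_wrong = loss((1.0, 0.0), u0, u_obs, dx, t_obs, max_speed)  # linear advection
print(loss_true, loss_wrong)
```

In this sketch the mismatch vanishes for the parameters that generated the data and is positive for the linear-advection candidate. A learning procedure would minimise such a loss over the flux parameters, typically with automatic differentiation and with several initial states `u0` so that more of the range of u is visited.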