Gradient Information and Regularization for Gene Expression Programming to Develop Data-Driven Physics Closure Models. Waschkowski, F., Li, H., Deshmukh, A., Grenga, T., Zhao, Y., Pitsch, H., Klewicki, J., & Sandberg, R. D. November, 2022. arXiv:2211.12341 [physics]
Gradient Information and Regularization for Gene Expression Programming to Develop Data-Driven Physics Closure Models [link]Paper  doi  abstract   bibtex   
Learning accurate numerical constants when developing algebraic models is a known challenge for evolutionary algorithms, such as Gene Expression Programming (GEP). This paper introduces the concept of adaptive symbols to the GEP framework by Weatheritt and Sandberg (2016) to develop advanced physics closure models. Adaptive symbols utilize gradient information to learn locally optimal numerical constants during model training, for which we investigate two types of nonlinear optimization algorithms. The second contribution of this work is implementing two regularization techniques to incentivize the development of implementable and interpretable closure models. We apply $L_2$ regularization to ensure small magnitude numerical constants and devise a novel complexity metric that supports the development of low complexity models via custom symbol complexities and multi-objective optimization. This extended framework is employed to four use cases, namely rediscovering Sutherland's viscosity law, developing laminar flame speed combustion models and training two types of fluid dynamics turbulence models. The model prediction accuracy and the convergence speed of training are improved significantly across all of the more and less complex use cases, respectively. The two regularization methods are essential for developing implementable closure models and we demonstrate that the developed turbulence models substantially improve simulations over state-of-the-art models.
@misc{waschkowski_gradient_2022,
	title = {Gradient {Information} and {Regularization} for {Gene} {Expression} {Programming} to {Develop} {Data}-{Driven} {Physics} {Closure} {Models}},
	url = {http://arxiv.org/abs/2211.12341},
	doi = {10.48550/arXiv.2211.12341},
	abstract = {Learning accurate numerical constants when developing algebraic models is a known challenge for evolutionary algorithms, such as Gene Expression Programming (GEP). This paper introduces the concept of adaptive symbols to the GEP framework by Weatheritt and Sandberg (2016) to develop advanced physics closure models. Adaptive symbols utilize gradient information to learn locally optimal numerical constants during model training, for which we investigate two types of nonlinear optimization algorithms. The second contribution of this work is implementing two regularization techniques to incentivize the development of implementable and interpretable closure models. We apply \$L\_2\$ regularization to ensure small magnitude numerical constants and devise a novel complexity metric that supports the development of low complexity models via custom symbol complexities and multi-objective optimization. This extended framework is employed to four use cases, namely rediscovering Sutherland's viscosity law, developing laminar flame speed combustion models and training two types of fluid dynamics turbulence models. The model prediction accuracy and the convergence speed of training are improved significantly across all of the more and less complex use cases, respectively. The two regularization methods are essential for developing implementable closure models and we demonstrate that the developed turbulence models substantially improve simulations over state-of-the-art models.},
	urldate = {2022-11-28},
	publisher = {arXiv},
	author = {Waschkowski, Fabian and Li, Haochen and Deshmukh, Abhishek and Grenga, Temistocle and Zhao, Yaomin and Pitsch, Heinz and Klewicki, Joseph and Sandberg, Richard D.},
	month = nov,
	year = {2022},
	note = {arXiv:2211.12341 [physics]},
	keywords = {gene expression programming, mentions sympy, model complexity, nonlinear optimization},
}

Downloads: 0