Slide 1
18-660: Numerical Methods for Engineering Design and Optimization
Xin Li
Department of ECE
Carnegie Mellon University, Pittsburgh, PA 15213
Slide 2
Overview
Unconstrained Optimization
- Gradient method
- Newton method
Slide 3
Unconstrained Optimization
Linear regression with regularization:
$$\min_{\alpha}\ \left\| A\alpha - B \right\|_2^2 + \lambda \cdot \left\| \alpha \right\|_2^2$$
Unconstrained optimization: minimizing a cost function without any constraint
- Golden section search (non-derivative method)
- Downhill simplex method (non-derivative method)
- Gradient method (relies on derivatives; this lecture)
- Newton method (relies on derivatives; this lecture)
Slide 4
Gradient Method
If a cost function is smooth, its derivative information can be used to search for the optimal solution
$$\min_{x}\ f(x)$$
[Figure: a smooth cost function f(x) plotted over x]
Slide 5
Gradient Method
For illustration purposes, we start from the one-dimensional case
$$\min_{x}\ f(x)$$
$$\Delta x^{(k)} = x^{(k+1)} - x^{(k)} = -\lambda^{(k)} \cdot \left. \frac{df}{dx} \right|_{x^{(k)}} \qquad \left( \lambda^{(k)} > 0 \right)$$
where $\lambda^{(k)}$ is the step size and $df/dx$ is the derivative. Equivalently,
$$x^{(k+1)} = x^{(k)} - \lambda^{(k)} \cdot \left. \frac{df}{dx} \right|_{x^{(k)}} \qquad \left( \lambda^{(k)} > 0 \right)$$
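As a concrete illustration, here is a minimal Python sketch of this one-dimensional update (the test function, fixed step size, and stopping tolerance are illustrative assumptions, not part of the original slides):

```python
def gradient_descent_1d(df_dx, x0, step=0.1, tol=1e-8, max_iter=1000):
    """Iterate x^(k+1) = x^(k) - step * df/dx until the derivative vanishes."""
    x = x0
    for _ in range(max_iter):
        g = df_dx(x)
        if abs(g) < tol:      # derivative ~ 0: local optimum reached
            break
        x = x - step * g      # move against the derivative
    return x

# Example: f(x) = (x - 2)^2 with df/dx = 2*(x - 2); minimum at x = 2
x_opt = gradient_descent_1d(lambda x: 2.0 * (x - 2.0), x0=0.0)
```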
Slide 6
Gradient Method
One-dimensional case (continued)
$$\Delta x^{(k)} = x^{(k+1)} - x^{(k)} = -\lambda^{(k)} \cdot \left. \frac{df}{dx} \right|_{x^{(k)}}$$
First-order Taylor expansion:
$$f\left[ x^{(k+1)} \right] \approx f\left[ x^{(k)} \right] + \left. \frac{df}{dx} \right|_{x^{(k)}} \cdot \Delta x^{(k)}$$
$$f\left[ x^{(k+1)} \right] \approx f\left[ x^{(k)} \right] - \lambda^{(k)} \cdot \left( \left. \frac{df}{dx} \right|_{x^{(k)}} \right)^2$$
The cost function f(x) keeps decreasing if the derivative is non-zero and λ^(k) > 0.
Slide 7
Gradient Method
One-dimensional case (continued)
$$\Delta x = -\lambda \cdot \frac{df}{dx} = 0$$
[Figure: cost function f(x) versus x, with the iterates settling at the minimum]
The derivative is zero at a local optimum (the gradient method converges).
Slide 8
Gradient Method
Two-dimensional case
$$\min_{x_1, x_2}\ f(x_1, x_2)$$
$$\begin{bmatrix} \Delta x_1^{(k)} \\ \Delta x_2^{(k)} \end{bmatrix} = \begin{bmatrix} x_1^{(k+1)} - x_1^{(k)} \\ x_2^{(k+1)} - x_2^{(k)} \end{bmatrix} = -\lambda^{(k)} \cdot \nabla f\left[ x_1^{(k)}, x_2^{(k)} \right]$$
where the gradient is
$$\nabla f(x_1, x_2) = \begin{bmatrix} \partial f / \partial x_1 \\ \partial f / \partial x_2 \end{bmatrix}$$
Equivalently,
$$\begin{bmatrix} x_1^{(k+1)} \\ x_2^{(k+1)} \end{bmatrix} = \begin{bmatrix} x_1^{(k)} \\ x_2^{(k)} \end{bmatrix} - \lambda^{(k)} \cdot \nabla f\left[ x_1^{(k)}, x_2^{(k)} \right]$$
Slide 9
Gradient Method
Two-dimensional case (continued)
First-order Taylor expansion:
$$f\left[ x_1^{(k+1)}, x_2^{(k+1)} \right] \approx f\left[ x_1^{(k)}, x_2^{(k)} \right] + \nabla f\left[ x_1^{(k)}, x_2^{(k)} \right]^T \cdot \begin{bmatrix} \Delta x_1^{(k)} \\ \Delta x_2^{(k)} \end{bmatrix}$$
Substituting the update $\left[ \Delta x_1^{(k)},\, \Delta x_2^{(k)} \right]^T = -\lambda^{(k)} \cdot \nabla f\left[ x_1^{(k)}, x_2^{(k)} \right]$:
$$f\left[ x_1^{(k+1)}, x_2^{(k+1)} \right] \approx f\left[ x_1^{(k)}, x_2^{(k)} \right] - \lambda^{(k)} \cdot \nabla f\left[ x_1^{(k)}, x_2^{(k)} \right]^T \cdot \nabla f\left[ x_1^{(k)}, x_2^{(k)} \right]$$
$$f\left[ x_1^{(k+1)}, x_2^{(k+1)} \right] \approx f\left[ x_1^{(k)}, x_2^{(k)} \right] - \lambda^{(k)} \cdot \left\| \nabla f\left[ x_1^{(k)}, x_2^{(k)} \right] \right\|_2^2$$
The cost function f(x1, x2) keeps decreasing if the gradient is non-zero and λ^(k) > 0.
Slide 10
Gradient Method
N-dimensional case
$$\min_{X}\ f(X)$$
$$\nabla f(X) = \begin{bmatrix} \partial f / \partial x_1 \\ \partial f / \partial x_2 \\ \vdots \end{bmatrix}$$
$$X^{(k+1)} = X^{(k)} - \lambda^{(k)} \cdot \nabla f\left[ X^{(k)} \right]$$
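The same iteration carries over directly to vectors; a minimal NumPy sketch (the quadratic test function, fixed step size, and tolerance are illustrative assumptions):

```python
import numpy as np

def gradient_descent(grad_f, x0, step=0.1, tol=1e-8, max_iter=10000):
    """N-dimensional update: X^(k+1) = X^(k) - step * grad_f(X^(k))."""
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        g = grad_f(x)
        if np.linalg.norm(g) < tol:   # gradient ~ 0: local optimum reached
            break
        x = x - step * g
    return x

# Example: f(X) = ||X - c||^2 with gradient 2*(X - c); minimum at X = c
c = np.array([1.0, -2.0, 3.0])
x_opt = gradient_descent(lambda x: 2.0 * (x - c), x0=np.zeros(3))
```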
Slide 11
Newton Method
Gradient method relies on first-order derivative information
- Each iteration is simple, but it converges to the optimum slowly, i.e., it requires a large number of iteration steps
- The step size λ^(k) can be optimized by one-dimensional search for fast convergence

Alternative algorithm: Newton method
- Relies on both first-order and second-order derivatives
- Each iteration is more expensive, but it converges to the optimum more quickly, i.e., requires a smaller number of iterations to reach convergence

The one-dimensional search for the step size solves
$$\min_{\lambda^{(k)}}\ f\left\{ X^{(k)} - \lambda^{(k)} \cdot \nabla f\left[ X^{(k)} \right] \right\}$$
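A minimal sketch of this line search built on SciPy's scalar minimizer (the bounded search interval and the badly scaled quadratic test function are illustrative assumptions):

```python
import numpy as np
from scipy.optimize import minimize_scalar

def line_search_step(f, grad_f, x):
    """Choose lambda minimizing f(x - lambda * grad_f(x)) along the gradient ray."""
    g = grad_f(x)
    res = minimize_scalar(lambda lam: f(x - lam * g),
                          bounds=(0.0, 1.0), method='bounded')
    return x - res.x * g

# Example: badly scaled quadratic f(X) = x1^2 + 10 * x2^2
f = lambda x: x[0]**2 + 10.0 * x[1]**2
grad_f = lambda x: np.array([2.0 * x[0], 20.0 * x[1]])
x = np.array([1.0, 1.0])
for _ in range(20):
    x = line_search_step(f, grad_f, x)   # x approaches the minimum at the origin
```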
Slide 12
Newton Method
One-dimensional case
$$\min_{x}\ f(x)$$
First-order Taylor expansion of the derivative:
$$\left. \frac{df}{dx} \right|_{x^{(k+1)}} \approx \left. \frac{df}{dx} \right|_{x^{(k)}} + \left. \frac{d^2 f}{dx^2} \right|_{x^{(k)}} \cdot \left[ x^{(k+1)} - x^{(k)} \right]$$
The first-order derivative is zero at a local optimum:
$$0 = \left. \frac{df}{dx} \right|_{x^{(k)}} + \left. \frac{d^2 f}{dx^2} \right|_{x^{(k)}} \cdot \left[ x^{(k+1)} - x^{(k)} \right]$$
Slide 13
Newton Method
One-dimensional case (continued)
$$\left. \frac{df}{dx} \right|_{x^{(k)}} + \left. \frac{d^2 f}{dx^2} \right|_{x^{(k)}} \cdot \left[ x^{(k+1)} - x^{(k)} \right] = 0$$
Solving for the step:
$$\Delta x^{(k)} = x^{(k+1)} - x^{(k)} = -\left[ \left. \frac{d^2 f}{dx^2} \right|_{x^{(k)}} \right]^{-1} \cdot \left. \frac{df}{dx} \right|_{x^{(k)}}$$
$$x^{(k+1)} = x^{(k)} - \left[ \left. \frac{d^2 f}{dx^2} \right|_{x^{(k)}} \right]^{-1} \cdot \left. \frac{df}{dx} \right|_{x^{(k)}}$$
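A minimal Python sketch of this one-dimensional Newton iteration (the test function and tolerance are illustrative assumptions):

```python
def newton_1d(df_dx, d2f_dx2, x0, tol=1e-10, max_iter=100):
    """Iterate x^(k+1) = x^(k) - [f''(x^(k))]^-1 * f'(x^(k))."""
    x = x0
    for _ in range(max_iter):
        g = df_dx(x)
        if abs(g) < tol:           # first-order optimality reached
            break
        x = x - g / d2f_dx2(x)     # Newton step
    return x

# Example: f(x) = x^4 - 3x with f'(x) = 4x^3 - 3 and f''(x) = 12x^2
x_opt = newton_1d(lambda x: 4 * x**3 - 3, lambda x: 12 * x**2, x0=1.0)
```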
Slide 14
Newton Method
One-dimensional case (continued)
$$\Delta x^{(k)} = -\left[ \left. \frac{d^2 f}{dx^2} \right|_{x^{(k)}} \right]^{-1} \cdot \left. \frac{df}{dx} \right|_{x^{(k)}}$$
First-order Taylor expansion:
$$f\left[ x^{(k+1)} \right] \approx f\left[ x^{(k)} \right] + \left. \frac{df}{dx} \right|_{x^{(k)}} \cdot \Delta x^{(k)}$$
$$f\left[ x^{(k+1)} \right] \approx f\left[ x^{(k)} \right] - \left[ \left. \frac{d^2 f}{dx^2} \right|_{x^{(k)}} \right]^{-1} \cdot \left( \left. \frac{df}{dx} \right|_{x^{(k)}} \right)^2$$
The second-order derivative is positive for a convex function, so the cost function keeps decreasing.
Slide 15
Newton Method
One-dimensional case (continued)
Newton method gives an estimate of the optimal step size λ^(k) using the second-order derivative

Gradient method:
$$x^{(k+1)} = x^{(k)} - \lambda^{(k)} \cdot \left. \frac{df}{dx} \right|_{x^{(k)}}$$
Newton method:
$$x^{(k+1)} = x^{(k)} - \left[ \left. \frac{d^2 f}{dx^2} \right|_{x^{(k)}} \right]^{-1} \cdot \left. \frac{df}{dx} \right|_{x^{(k)}}$$
Comparing the two, the implied step size is
$$\lambda^{(k)} = \left[ \left. \frac{d^2 f}{dx^2} \right|_{x^{(k)}} \right]^{-1} > 0 \qquad \text{(convex function)}$$
Slide 16
Newton Method
One-dimensional case (continued)
The step size estimate is based on the following linear approximation of the first-order derivative:
$$\left. \frac{df}{dx} \right|_{x^{(k+1)}} \approx \left. \frac{df}{dx} \right|_{x^{(k)}} + \left. \frac{d^2 f}{dx^2} \right|_{x^{(k)}} \cdot \left[ x^{(k+1)} - x^{(k)} \right]$$
The approximation is exact if and only if f(x) is quadratic and, therefore, df/dx is linear:
$$f(x) = ax^2 + bx + c \quad \Rightarrow \quad \frac{df}{dx} = 2ax + b$$
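As a quick check (not on the original slides), one Newton step applied to this quadratic lands exactly on the stationary point, regardless of the starting point:
$$x^{(k+1)} = x^{(k)} - \frac{2ax^{(k)} + b}{2a} = -\frac{b}{2a}, \qquad \left. \frac{df}{dx} \right|_{x = -b/2a} = 0$$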
Slide 17
Newton Method
One-dimensional case (continued)
If the actual f(x) is not quadratic, the following step size estimate may be non-optimal:
$$\lambda^{(k)} = \left[ \left. \frac{d^2 f}{dx^2} \right|_{x^{(k)}} \right]^{-1}$$
Using this step size may result in bad convergence. In these cases, a damping factor β is typically introduced:
$$x^{(k+1)} = x^{(k)} - \beta \cdot \left[ \left. \frac{d^2 f}{dx^2} \right|_{x^{(k)}} \right]^{-1} \cdot \left. \frac{df}{dx} \right|_{x^{(k)}} \qquad (0 < \beta < 1)$$
Slide 18
Newton Method
Two-dimensional case
$$\min_{x_1, x_2}\ f(x_1, x_2)$$
First-order Taylor expansion of the gradient:
$$\nabla f\left[ x_1^{(k+1)}, x_2^{(k+1)} \right] \approx \nabla f\left[ x_1^{(k)}, x_2^{(k)} \right] + \nabla^2 f\left[ x_1^{(k)}, x_2^{(k)} \right] \cdot \begin{bmatrix} x_1^{(k+1)} - x_1^{(k)} \\ x_2^{(k+1)} - x_2^{(k)} \end{bmatrix}$$
where
$$\nabla f = \begin{bmatrix} \partial f / \partial x_1 \\ \partial f / \partial x_2 \end{bmatrix} \qquad \text{and} \qquad \nabla^2 f = \begin{bmatrix} \partial^2 f / \partial x_1^2 & \partial^2 f / \partial x_1 \partial x_2 \\ \partial^2 f / \partial x_2 \partial x_1 & \partial^2 f / \partial x_2^2 \end{bmatrix} \quad \text{(Hessian matrix)}$$
Slide 19
Newton Method
Two-dimensional case (continued)
Setting the gradient at the new point to zero:
$$\nabla f\left[ x_1^{(k)}, x_2^{(k)} \right] + \nabla^2 f\left[ x_1^{(k)}, x_2^{(k)} \right] \cdot \begin{bmatrix} x_1^{(k+1)} - x_1^{(k)} \\ x_2^{(k+1)} - x_2^{(k)} \end{bmatrix} = 0$$
$$\begin{bmatrix} \Delta x_1^{(k)} \\ \Delta x_2^{(k)} \end{bmatrix} = \begin{bmatrix} x_1^{(k+1)} - x_1^{(k)} \\ x_2^{(k+1)} - x_2^{(k)} \end{bmatrix} = -\nabla^2 f\left[ x_1^{(k)}, x_2^{(k)} \right]^{-1} \cdot \nabla f\left[ x_1^{(k)}, x_2^{(k)} \right]$$
$$\begin{bmatrix} x_1^{(k+1)} \\ x_2^{(k+1)} \end{bmatrix} = \begin{bmatrix} x_1^{(k)} \\ x_2^{(k)} \end{bmatrix} - \nabla^2 f\left[ x_1^{(k)}, x_2^{(k)} \right]^{-1} \cdot \nabla f\left[ x_1^{(k)}, x_2^{(k)} \right]$$
Slide 20
Newton Method
Two-dimensional case (continued)
$$\begin{bmatrix} \Delta x_1^{(k)} \\ \Delta x_2^{(k)} \end{bmatrix} = -\nabla^2 f\left[ x_1^{(k)}, x_2^{(k)} \right]^{-1} \cdot \nabla f\left[ x_1^{(k)}, x_2^{(k)} \right]$$
First-order Taylor expansion:
$$f\left[ x_1^{(k+1)}, x_2^{(k+1)} \right] \approx f\left[ x_1^{(k)}, x_2^{(k)} \right] + \nabla f\left[ x_1^{(k)}, x_2^{(k)} \right]^T \cdot \begin{bmatrix} \Delta x_1^{(k)} \\ \Delta x_2^{(k)} \end{bmatrix}$$
$$f\left[ x_1^{(k+1)}, x_2^{(k+1)} \right] \approx f\left[ x_1^{(k)}, x_2^{(k)} \right] - \nabla f\left[ x_1^{(k)}, x_2^{(k)} \right]^T \cdot \nabla^2 f\left[ x_1^{(k)}, x_2^{(k)} \right]^{-1} \cdot \nabla f\left[ x_1^{(k)}, x_2^{(k)} \right]$$
The Hessian (and hence its inverse) is positive definite for a convex function, so the cost function keeps decreasing.
Slide 21
Newton Method
Two-dimensional case (continued)
Gradient method and Newton method do not move along the same direction:
Gradient method:
$$\Delta X^{(k)} = -\lambda^{(k)} \cdot \nabla f\left[ x_1^{(k)}, x_2^{(k)} \right]$$
Newton method:
$$\Delta X^{(k)} = -\nabla^2 f\left[ x_1^{(k)}, x_2^{(k)} \right]^{-1} \cdot \nabla f\left[ x_1^{(k)}, x_2^{(k)} \right]$$
[Figure: contours of the quadratic $f(X) = X^T A X + B^T X + C$ in the (x1, x2) plane, with the gradient and Newton directions diverging from the same point]
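A small NumPy sketch comparing the two directions on such a quadratic; the particular A and B below are illustrative assumptions (for symmetric A, the gradient is 2AX + B and the Hessian is 2A):

```python
import numpy as np

A = np.array([[3.0, 0.0], [0.0, 0.5]])    # symmetric positive definite
B = np.array([1.0, -1.0])

x = np.array([1.0, 1.0])
grad = 2.0 * A @ x + B                     # gradient of X^T A X + B^T X + C
hess = 2.0 * A                             # Hessian (constant for a quadratic)

d_gradient = -grad                         # gradient method direction
d_newton = -np.linalg.solve(hess, grad)    # Newton direction

# For a quadratic, one full Newton step lands exactly on the optimum:
x_opt = x + d_newton
assert np.allclose(2.0 * A @ x_opt + B, 0.0)
```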
Slide 22
Newton Method
Newton method can be extended to high-dimensional cases
The following Hessian matrix is N×N if we have N variables
Numerically computing the Hessian matrix and its inverse (by Cholesky decomposition) can be quite expensive for large N
$$\nabla^2 f(X) = \begin{bmatrix} \partial^2 f / \partial x_1^2 & \partial^2 f / \partial x_1 \partial x_2 & \cdots & \partial^2 f / \partial x_1 \partial x_N \\ \partial^2 f / \partial x_2 \partial x_1 & \partial^2 f / \partial x_2^2 & \cdots & \partial^2 f / \partial x_2 \partial x_N \\ \vdots & \vdots & \ddots & \vdots \\ \partial^2 f / \partial x_N \partial x_1 & \partial^2 f / \partial x_N \partial x_2 & \cdots & \partial^2 f / \partial x_N^2 \end{bmatrix}$$
$$X^{(k+1)} = X^{(k)} - \nabla^2 f\left[ X^{(k)} \right]^{-1} \cdot \nabla f\left[ X^{(k)} \right]$$
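A minimal N-dimensional sketch, solving the Newton system with SciPy's Cholesky routines in line with the slide's remark (the convex quadratic test problem is an illustrative assumption):

```python
import numpy as np
from scipy.linalg import cho_factor, cho_solve

def newton_method(grad_f, hess_f, x0, tol=1e-10, max_iter=50):
    """X^(k+1) = X^(k) - [Hessian]^-1 * gradient, via Cholesky factorization."""
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        g = grad_f(x)
        if np.linalg.norm(g) < tol:
            break
        c, low = cho_factor(hess_f(x))    # assumes a positive-definite Hessian
        x = x - cho_solve((c, low), g)    # Newton step without forming the inverse
    return x

# Example: convex quadratic f(X) = 0.5 * X^T Q X - b^T X; minimum at Q^-1 b
Q = np.array([[4.0, 1.0], [1.0, 3.0]])
b = np.array([1.0, 2.0])
x_opt = newton_method(lambda x: Q @ x - b, lambda x: Q, x0=np.zeros(2))
```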
Slide 23
Newton Method
A number of modified algorithms have been developed to address this complexity issue, e.g., the quasi-Newton method
The key idea is to approximate the Hessian matrix and its inverse so that:
- The computational cost is significantly reduced
- Fast convergence can still be achieved
More details can be found in Numerical Recipes: The Art of Scientific Computing, 2007
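For instance, SciPy exposes a BFGS quasi-Newton optimizer; a minimal usage sketch (the test function is an illustrative assumption):

```python
import numpy as np
from scipy.optimize import minimize

# Illustrative test function: f(X) = (x1 - 1)^2 + 10 * (x2 + 2)^2
f = lambda x: (x[0] - 1.0) ** 2 + 10.0 * (x[1] + 2.0) ** 2
grad_f = lambda x: np.array([2.0 * (x[0] - 1.0), 20.0 * (x[1] + 2.0)])

# BFGS builds an inverse-Hessian approximation from successive gradients,
# avoiding the O(N^3) cost of factoring the exact Hessian at every step
result = minimize(f, x0=np.zeros(2), jac=grad_f, method='BFGS')
print(result.x)   # close to [1, -2]
```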
Slide 24
Summary
Unconstrained Optimization
- Gradient method
- Newton method