2_Dynamic Learning Rate (ηD) for Recurrent_diapositivas_XP
Transcript of 2_Dynamic Learning Rate (ηD) for Recurrent_diapositivas_XP
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
1/36
Dynamic Learning Rate (D) for Recurrent High Order
Neural Observer (RHONO): Anaerobic Process Application
K. J. Gurubel1, E. N. Snchez1 and S. Carlos-Hernndez2
2011 International Joint Conference on Neural Networks
San Jose, California July 31 - August 5, 2011
1Cinvestav Unidad Guadalajara
2Cinvestav Unidad Saltillo
1
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
2/36
In order to increase the performance of the neuronalobserver, a dynamic learning rate (D) for the EKFtraining algorithm is proposed which depends on
operations condition for a system under control, in orderto improve the learning of the neuronal network inpresence of disturbances. Considering that the pH is on-line measured, this variable is used to determine the
proposed D.
2
OBJECTIVE
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
3/36
INTRODUCTION
Anaerobic digestion is a biological process in which organicmatter (substrate) is degraded by anaerobic bacteria(biomass), in absence of oxygen. Such degradationproduces biogas, consisting primarily of methane (CH4)
and carbon dioxide (CO2), and stable organic residues.
Anaerobic process is a complex and sequential processwhich occurs in four basic stages: Hydrolysis, Acidogenesis,Acetogenesis y Methanogenesis. Each stage has a specific
dynamics; methanogenesis which is the slowest oneimposes the dynamics of the process and is considered asthe critical stage. Then, special attention is focused onmethanogenesis. This process is developed in a continuousstirred tank reactor (CSTR) with biomass filter.
3
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
4/36
INTRODUCTION
Different biogas sensors have been developed in order tomeasure CH4; however, substrate and biomass measuresare more restrictive.
A nonlinear discrete-time neural observer for unknownnonlinear systems in presence of external disturbancesand parameter uncertainties is proposed in a previouswork for estimate unmeasurable variables.
The neural observer is based on a discrete-time recurrenthigh-order neural network trained with an extendedKalman-filter based algorithm. The objective is toestimate biomass concentration, substrate degradationand inorganic carbon in the anaerobic process.
4
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
5/36
MATHEMATICAL MODEL
The biological phenomena are modeled by ordinarydifferential equations, which represent the dynamicalpart of the process
5
ZZDdt
dZ
ICICD
XRRXRXRRdt
dIC
SSDXRXRdt
dS
Xkdt
dX
SSDXRdtdS
Xkdt
dX
inin
inin
inin
d
inin
d
,
,
,
,
,
22311152232
221142232
2222
111161
1111
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
6/36
MATHEMATICAL MODEL
where X1, corresponding to hydrolytic, acidogenic andacetogenic bacteria and X2, corresponding tomethanogenic bacteria. On the other hand, the organicload is classified in S1, the components equivalentglucose, which model complex molecules and S2 , the
components equivalent acetic acid. 1 is the growth rate(Haldane type) of X1 (h1), 2 the growth rate (Haldanetype) of X2 (h1), kd1 the death rate of X1 (mol L1), kd2the death rate of X2 (mol L1), Din the dilution rate(h1), S1in the fast degradable substrate input (mol L1),S2in the slow degradable substrate input (mol L1), ICinorganic carbon (mol L1), Z the total of cations(mol L1), ICin the inorganic carbon input (mol L1), Zinthe input cations (mol L-1), is a coefficient consideringlaw of partial pressure for the dissolved CO2 andR1, . . .,R6 are the yield coefficients.
6
1X2
X
1S
2S
1
2
1X
2X
1h
1h
1X
1dk
1Lmol 2dk2X
1Lmol
1h
inD
inS1
inS
2
1Lmol
1Lmol
1Lmol
1Lmol
1Lmol
1Lmol
inIC
IC
inZZ
61,..., RR
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
7/36
The discrete-time RHONO estimates the variables of themethanogenesis stage: biomass ( X2), substrate ( S2) andinorganic carbon ( IC). The observability property of thisanaerobic digestion process was analyzed in a previous work.
DISCRETE-TIME RHONO
7
Observer scheme
2X
2S
IC
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
8/36
DISCRETE TIME RHONO
Observer structure
8
,
1
,
1
,
1
3
2
35
2
34
233
2
3231
2222
24
2322
222212
122
14
1322
122112
kegkICkICSwkDkICSw
kXSwkICSwkICSwkIC
kegkSkSSw
kICSwkSSwkSSwkS
kegkDkXSw
kICSwkXSwkXSwkX
inin
in
in
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
9/36
DISCRETE TIME RHONO
where wij is the respective on-line adapted weight vector;X2, S2 and IC are the estimated states; S(.) is the sigmoidfunction defined as S(x)= tanh(x); Din is the inputdilution rate and e is the output error.
9
22 ,
SX
IC )(S)tanh()( xxS inD
ijw
e
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
10/36
THE EKF TRAINING ALGORITHM
In this work, we use an EKF-based training algorithmdescribed by
where is the output estimation error and
is the weight estimation error covariance matrix, isthe weight (state) vector, is the respective number ofneural network weights
10
i i i iK k P k H k M k
1T
i i i i i iP k P k K k H k P k Q k
1
, 1, ,T
i i i iiM k R k H k P k H k i n
pe k i iL LiP k
iL
iw
iL
kykykekekKkwkw iii
,1
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
11/36
THE EKF TRAINING ALGORITHM
is the plant output, is the neural networkoutput, , is the number of states, is the Kalmangain matrix, , is the NN weight estimation noisecovariance matrix, , is the error noise covariance,
and is a matrix, given as follows:
11
T
ij
ij
y kH
w k
py
py
iL p
iK
i iL L
iQ
p p
iR
iL piH
where and .
n
1, ,i n 1, , ij L
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
12/36
STATE ESTIMATION WITH
The process model and the observer are implementedusing Matlab/SimulinkTM. Nominal values of weredetermined in a previous work for each one of the
estimated variables as: 1 for the estimation of X2 , 2 forthe estimation of S2 and 3 for the estimation of IC.
12
2
X
2S IC
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
13/36
STATE ESTIMATION WITH
In order to test the observer sensitivity to input changes,a disturbance on the input substrate (270% S2in increase)is incepted at t = 200 h
13
0 500 1000 15000.05
0.1
0.15
0.2
0.25
0.3
Time (h)
S2in(mol/L)
inS2
Disturbance in S2ininS2
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
14/36
STATE ESTIMATION WITH
14
0 500 1000 15000.005
0.01
0.015
Time (h)
X2
(UA)
X2
Estimated
X2
System
0 500 1000 1500
0.01
0.02
0.03
Time (h)
S2
(mol/L)
S2 EstimatedS
2System
0 500 1000 1500
0.1
0.2
0.3
Time (h)
IC
(mol/L)
IC2
Estimated
IC2
System
State estimation
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
15/36
STATE ESTIMATION WITH
States estimation are obtained by the RHONO, with anerror when the step on the input substrate is incepted,which is eliminated in the steady state. This error could
be due to the observer structure; it is possible that theNN is not able to learn all the nonlinear dynamicsrelated to the variables.
15
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
16/36
DYNAMIC LEARNING RATE
pH is a sensible and determining variable in the processfor the adequate growth of bacteria and then for thewastes transformation; therefore it is used to determineD as proportional ( ) to the on-line measurement
substrate pH
16
6 6.5 7 7.5 80.8
0.9
1
1.1
1.2
1.3
pH
D
pHD
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
17/36
DYNAMIC LEARNING RATE
In order to implement this time varying learning rate,the EKF training algorithm is modified as follows:
The other components of algorithm remain unchanged.
17
,1 kekKkwkwiiDiii
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
18/36
SIMULATION RESULTS
With the purpose to analyze the effects ofD, three testsare realized with different initial values (D0), andconsidering the same disturbance on S2in. First, D0 equal
to the nominal values (), second, D0 100 % larger thannominal and third, D0 50 % smaller than nominal .These initial values are selected heuristic ally.
18
inS2
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
19/36
TEST 1: D0 =
19
0 500 1000 1500
0.95
1
Time (h)
1
0 500 1000 1500
0.48
0.5
Time (h)
2
0 500 1000 1500
9.5
10
Time (h)
3
D3
D2
D1
Dynamic trajectories D
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
20/36
20
0 500 1000 15000.005
0.01
0.015
Time (h)
X2
(UA)
X2
Estimated
X2
System
0 500 1000 1500
0.01
0.02
0.03
Time (h)
S2
(m
ol/L)
S2
Estimated
S2
System
0 500 1000 1500
0.1
0.2
0.3
Time (h)
IC
(mol/L)
IC2
Estimated
IC2
System
State estimation
TEST 1: D0 =
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
21/36
and S2 are estimated with a smaller error when thestep on the input substrate is incepted. As is easy to seethe error vanishes quickly. This error could be inducedby the abrupt change in the system conditions. Since the
pH measure is directly related to IC , this variable isestimated with a negligible error during all thesimulation. In general, these estimation errors could bedue to the observer structure, which is a simple one. Theproposed
D
with initial nominal values produces adiminution in the transient error as compared withresults calculated with .
21
2X
2S
IC
TEST 1: D0 =
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
22/36
22
0 500 1000 15001.8
2
2.2
Time (h)
D1
0 500 1000 15000.9
1
1.1
Time (h)
D2
0 500 1000 150018
19
20
21
Time (h)
D3
D1
D2
D3
TEST 2: D0 = 2 (100% larger)
Dynamic trajectories D
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
23/36
TEST 2: D0 = 2 (100% larger)
23
0 500 1000 15000.005
0.01
0.015
Time (h)
X2
(UA)
X2
Estimated
X2
System
0 500 1000 1500
0.01
0.02
0.03
Time (h)
S2
(mol/L)
S2
Estimated
S2
System
0 500 1000 1500
0.1
0.2
0.3
Time (h)
IC
(mo
l/L)
IC2
Estimated
IC2 System
state estimation
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
24/36
24
TEST 2: D0 = 2 (100% larger)
0 500 1000 15000
0.02
0.04
0.06
0.08
0.1
0.12
0.14
0.16
0.18
0.2
Time (h)
W1
Eucliden norm of W1
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
25/36
25
TEST 2: D0 = 2 (100% larger)
0 500 1000 15000
0.05
0.1
0.15
0.2
0.25
Time (h)
W2
Eucliden norm of W2
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
26/36
26
TEST 2: D0 = 2 (100% larger)
0 500 1000 15000
1
2
3
4
5
6
7
8
Time (h)
W3
Eucliden norm of W3
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
27/36
TEST 2: D0 = 2 (100% larger)
X2 and S2 are better estimated compared with resultscalculated with the initial nominal ones, when the stepon the input substrate is incepted. IC is again estimated
with a negligible error. A considerable reduction of thetransient error is obtained, improving the observerperformance.
27
2X
2S
IC
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
28/36
TEST 3: D0 = 0.5 (50% smaller)
28
0 500 1000 15000.46
0.48
0.5
0.52
Time (h)
D1
0 500 1000 15000.23
0.24
0.25
0.26
Time (h)
D2
0 500 1000 1500
4.64.8
5
5.2
Time (h)
D3
D1
D2
D3
Dynamic trajectories D
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
29/36
TEST 3: D0 = 0.5 (50% smaller)
29
0 500 1000 15000.005
0.01
0.015
Time (h)
X2
(UA)
X2
Estimated
X2
System
0 500 1000 15000
0.01
0.02
0.03
Time (h)
S2
(mol/L)
S2
Estimated
S2
System
0 500 1000 1500
0.1
0.2
0.3
Time (h)
IC
(mol/L)
IC2
Estimated
IC2
System
state estimation
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
30/36
TEST 3: D0 = 0.5 (50% smaller)
X2 and S2 estimated present an error when the step onthe input substrate is incepted. From the simulationresults of the three tests, it is clear that the best solution
is to select D larger than the nominal values.
30
2X
2S
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
31/36
ROBUSTNESS OF THE RHONO
On the other hand, observer tolerance to change on thesystem parameters is tested; such variation is incepted asa disturbance on the bacteria concentration. A 30%positive variation in the maximum rate growth of the
metanogenic bacteria (2max), a 15% negative variation inthe maximum rate growth of the hydrolytic, acidogenicand acetogenic bacteria (1max) and the samedisturbance on S2in are considered.
31
m ax2
m ax1
inS2
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
32/36
ROBUSTNESS OF THE RHONO
32
0 500 1000 15001.9
2
2.1
Time (h)
D1
0 500 1000 15000.95
1
1.05
Time (h)
D2
0 500 1000 150019
20
21
Time (h)
D3
D1
D2
D3
D trajectories with disturbance on thebacteria concentration
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
33/36
ROBUSTNESS OF THE RHONO
33
0 500 1000 15000.005
0.01
0.015
Time (h)
X2
(UA)
X2
Estimated
X2
System
0 500 1000 1500
0
0.01
0.02
0.03
Time (h)
S2
(mol/L)
S2
Estimated
S2
System
0 500 1000 15000
0.1
0.2
0.3
Time (h)
IC(mol/L)
IC2
Estimated
IC2
System
state estimation
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
34/36
ROBUSTNESS OF THE RHONO
D trajectories and the performance of the proposedRHONO are illustrated in the previously figuresrespectively, where it is clear that X2 , S2 and IC are wellestimated. Thus, the robustness of the proposed
RHONO to parameters variations is verified.
34
2X 2S IC
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
35/36
CONCLUSIONS
A dynamic learning rate, named as D, is proposed forRHONO, trained with an EKF-based algorithm, applied toanaerobic process.
From simulation results, it is possible to see that D withinitial values larger than nominal ones contributed to the
best performance of the observer with a smaller transienterror compared with the other tests.
Simulation results also illustrate the observer robustness inpresence of disturbances on the system parameters.
Thus, it can be noticed that D is an adequate proposal toimprove the performance of the observer and to estimatethe states of the anaerobic process.
35
-
8/4/2019 2_Dynamic Learning Rate (D) for Recurrent_diapositivas_XP
36/36
FUTURE WORK This research will be pursued in order to evaluate the
application of the proposed D for control of theanaerobic process.
In order to improve the RHONO performance, theobserver scheme may be complemented by adding moreterms to the proposed structure.