identification and control of a nonlinear discrete time system based on its linearization a unified...

11
IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 15, NO. 3, MAY 2004 663 Identification and Control of a Nonlinear Discrete-Time System Based on its Linearization: A Unified Framework Lingji Chen, Member, IEEE, and Kumpati S. Narendra, Life Fellow, IEEE Abstract—This paper presents a unified theoretical framework for the identification and control of a nonlinear discrete-time dynamical system, in which the nonlinear system is represented explicitly as a sum of its linearized component and the residual nonlinear component referred to as a “higher order function.” This representation substantially simplifies the procedure of applying the implicit function theorem to derive local properties of the nonlinear system, and reveals the role played by the linearized system in a more transparent form. Under the assumption that the linearized system is controllable and observable, it is shown that: 1) the nonlinear system is also controllable and observable in a local domain; 2) a feedback law exists to stabilize the nonlinear system locally; and 3) the nonlinear system can exactly track a constant or a periodic sequence locally, if its linearized system can do so. With some additional assumptions, the nonlinear system is shown to have a well-defined relative degree (delay) and zero-dynamics. If the zero-dynamics of the linearized system is asymptotically stable, so is that of the nonlinear one, and in such a case, a control law exists for the nonlinear system to asymptotically track an arbitrary reference signal exactly, in a neighborhood of the equilibrium state. The tracking can be achieved by using the state vector for feedback, or by using only the input and the output, in which case the nonlinear autoregressive moving-average (NARMA) model is established and utilized. These results are important for understanding the use of neural networks as identifiers and controllers for general nonlinear discrete-time dynamical systems. Index Terms—Control, discrete-time, identification, lineariza- tion, neural networks, nonlinear autoregressive moving-average (NARMA) model, nonlinear dynamical systems, normal form, tracking. I. INTRODUCTION I N 1990, it was first proposed [1] that neural networks should be used as components in dynamical systems, and that iden- tification and control of such systems using neural networks should be undertaken within a unified framework of systems theory. Since that time, a great deal of progress has been made both in the theory and the practice of neural network based con- trol (e.g., [2]–[7]). In a number of papers, system theoretic is- sues such as stabilization [8], representation and identification of single-input–single-output (SISO) [9] and multivariable [10] nonlinear systems, existence of controllers for exact and asymp- totic tracking [11], and disturbance rejection [12] have been ad- Manuscript received June 5, 2002; revised August 11, 2003. This work was supported by the NSF by Grant ECS-0113239. L. Chen is with Scientific Systems Company, Inc., Woburn, MA 01801 USA. K. S. Narendra is with Yale University, NewHaven, CT 06510 USA. Digital Object Identifier 10.1109/TNN.2004.826206 dressed. In all the above cases, the results are local in nature, and are valid in a domain around the equilibrium state. More specifically, let a nonlinear system be described by (1) where the input , the state , and the output at time belong respectively to , , and , the functions and are smooth, and the origin is an equilibrium state. The local results for are based on properties of its linearized system described by (2) where The results are derived, for the most part, by using the Implicit Function Theorem: First the nonlinear mapping of interest is derived (which usually involves compositing nonlinear map- pings and ), and then the Jacobian of this mapping is obtained and identified with its counterpart in the linearized system. Since is assumed to be controllable and observable, the Jacobian can be shown to be nonsingular and, therefore, the implicit function theorem can be applied to derive the desired local results. While the procedure is conceptually straightforward, deter- mining the Jacobian is known to be an involved process, as is evident from [11]. In this paper an attempt is made to simplify the above procedure and to display in a more revealing fashion the role played by the linearized system . Toward this end, we rewrite the system equations for in the following equiva- lent form (3) where the nonlinear functions and are naturally defined by the differences, and will be called “higher order functions.” 1045-9227/04$20.00 © 2004 IEEE

Upload: shihab4806

Post on 21-Jul-2016

5 views

Category:

Documents


0 download

DESCRIPTION

Implementation of Neural-network-based Inverse-model Control Strategies on an Exothermic Reactor

TRANSCRIPT

Page 1: Identification and Control of a Nonlinear Discrete Time System Based on Its Linearization a Unified Framework

IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 15, NO. 3, MAY 2004 663

Identification and Control of a NonlinearDiscrete-Time System Based on its Linearization:

A Unified FrameworkLingji Chen, Member, IEEE, and Kumpati S. Narendra, Life Fellow, IEEE

Abstract—This paper presents a unified theoretical frameworkfor the identification and control of a nonlinear discrete-timedynamical system, in which the nonlinear system is representedexplicitly as a sum of its linearized component and the residualnonlinear component referred to as a “higher order function.” Thisrepresentation substantially simplifies the procedure of applyingthe implicit function theorem to derive local properties of thenonlinear system, and reveals the role played by the linearizedsystem in a more transparent form. Under the assumption thatthe linearized system is controllable and observable, it is shownthat: 1) the nonlinear system is also controllable and observable ina local domain; 2) a feedback law exists to stabilize the nonlinearsystem locally; and 3) the nonlinear system can exactly track aconstant or a periodic sequence locally, if its linearized systemcan do so. With some additional assumptions, the nonlinearsystem is shown to have a well-defined relative degree (delay)and zero-dynamics. If the zero-dynamics of the linearized systemis asymptotically stable, so is that of the nonlinear one, andin such a case, a control law exists for the nonlinear systemto asymptotically track an arbitrary reference signal exactly, ina neighborhood of the equilibrium state. The tracking can beachieved by using the state vector for feedback, or by usingonly the input and the output, in which case the nonlinearautoregressive moving-average (NARMA) model is establishedand utilized. These results are important for understanding theuse of neural networks as identifiers and controllers for generalnonlinear discrete-time dynamical systems.

Index Terms—Control, discrete-time, identification, lineariza-tion, neural networks, nonlinear autoregressive moving-average(NARMA) model, nonlinear dynamical systems, normal form,tracking.

I. INTRODUCTION

I N 1990, it was first proposed [1] that neural networks shouldbe used as components in dynamical systems, and that iden-

tification and control of such systems using neural networksshould be undertaken within a unified framework of systemstheory. Since that time, a great deal of progress has been madeboth in the theory and the practice of neural network based con-trol (e.g., [2]–[7]). In a number of papers, system theoretic is-sues such as stabilization [8], representation and identificationof single-input–single-output (SISO) [9] and multivariable [10]nonlinear systems, existence of controllers for exact and asymp-totic tracking [11], and disturbance rejection [12] have been ad-

Manuscript received June 5, 2002; revised August 11, 2003. This work wassupported by the NSF by Grant ECS-0113239.

L. Chen is with Scientific Systems Company, Inc., Woburn, MA 01801 USA.K. S. Narendra is with Yale University, New Haven, CT 06510 USA.Digital Object Identifier 10.1109/TNN.2004.826206

dressed. In all the above cases, the results are local in nature,and are valid in a domain around the equilibrium state. Morespecifically, let a nonlinear system be described by

(1)

where the input , the state , and the output attime belong respectively to , , and , the functions

and are smooth, and the origin is an equilibrium state.The local results for are based on properties of its linearizedsystem described by

(2)

where

The results are derived, for the most part, by using the ImplicitFunction Theorem: First the nonlinear mapping of interest isderived (which usually involves compositing nonlinear map-pings and ), and then the Jacobian of this mappingis obtained and identified with its counterpart in the linearizedsystem. Since is assumed to be controllable and observable,the Jacobian can be shown to be nonsingular and, therefore, theimplicit function theorem can be applied to derive the desiredlocal results.

While the procedure is conceptually straightforward, deter-mining the Jacobian is known to be an involved process, as isevident from [11]. In this paper an attempt is made to simplifythe above procedure and to display in a more revealing fashionthe role played by the linearized system . Toward this end,we rewrite the system equations for in the following equiva-lent form

(3)

where the nonlinear functions and are naturally definedby the differences, and will be called “higher order functions.”

1045-9227/04$20.00 © 2004 IEEE

Page 2: Identification and Control of a Nonlinear Discrete Time System Based on Its Linearization a Unified Framework

664 IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 15, NO. 3, MAY 2004

A block-diagram representation of a SISO system is shown inFig. 1.

There are several reasons for using the representation given in(3) when considering system theoretic properties and problemsof identification, stabilization, regulation, and tracking.

i) From a pedagogic point of view, many of the results thatare currently available can be derived in a unified fashionby using such a representation, so that the underlyingtheoretical concepts become transparent.

ii) Since the best developed part of control theory deals withlinear systems [of the form described in (2)], themethods proposed in i) may suggest how the same con-cepts can also be applied to problems in the nonlineardomain.

iii) Once the basic concepts become clear, it is possible tosuggest new methods for solving difficult identificationand control problems. For an example, see [13].

iv) After years of computer simulation studies in whichneural networks have been used to identify and controldynamical systems, there is general agreement at presentin the community that linear identifiers and controllersshould be first designed before nonlinear identificationand control is attempted. The representation of thesystem as given in (3) lends itself readily to suchcontrol.

This paper is organized as follows. Section II presentsmathematical preliminaries and problem statement. Section IIIexamines system theoretic properties such as controllability andobservability. The problem of tracking a constant, a periodicsequence, and an arbitrary reference signal are considered inSection IV when the state vector is accessible. The case whenonly the input and the output are available is considered inSection V. Conclusions are drawn in Section VI.

II. MATHEMATICAL PRELIMINARIES AND STATEMENT

OF THE PROBLEM

A. Higher Order Functions and Two Theorems

Since we will explicitly use addition, multiplication, andcomposition of “higher order terms” as functions of theirarguments, we now formally define “higher order functions”as follows.

Definition 1: A continuously differentiable functionis called a “Higher Order Function” if: a) ,

and b) . We denote the class of higher orderfunctions by .

Thus, any function can be equivalently written as a sum of alinear function and a higher order function. When the system in(1) is written in the form of (3), the functions and arehigher order functions.

The following properties of higher order functions arefound to be useful and can be verified in a straightforwardmanner:

i) If is a constant matrix and , then .ii) If , then and .

iii) If and and is continuously differ-entiable, then the composition .

Fig. 1. A SISO system represented using higher order function.

Two theorems that play an important role in nonlinear iden-tification and control are the inverse function theorem and theimplicit function theorem [14]. We now present their specializedforms for cases where some of the relevant functions belong to

.Theorem 1 (Inverse Function Theorem): In some neighbor-

hood of the origin, let

where is a nonsingular matrix and . Then there existsan open set containing the origin such that the set ,defined by , is open, and for all

Theorem 2 (Implicit Function Theorem): Let be an opensubset of containing the origin. Let an element of bedenoted by with and . Let

be a function from to , where is anonsingular matrix and . Then there exists an open set

containing the origin such that

satisfies the equation .By the above two theorems, when the underlying nonlinear

functions are expressed as the sum of linear and higher orderfunctions, the application of the inverse function theorem andthe implicit function theorem becomes a matter of simple ma-nipulation of equations. No differentiation or function evalua-tion is needed, because the Jacobian matrix is explicitly given.Note also that the vector can be a concatenation of several vec-tors. This leads to the following corollary:

Corollary 1: If , , and isnonsingular, then locally , .

Sometimes it is helpful to inspect a system in its block dia-gram form. For linear systems, the inverse operations of “sum”and “gain” can be obtained in a straightforward manner. It is nolonger the case for systems with nonlinear operations. However,with the help of the above theorems, some inverse operations (ina neighborhood of the equilibrium point) can be obtained by in-spection, and the relationship between the signals and canbe established as in Fig. 2.

B. Statement of the Problem

Let a single-input–single-output (SISO) nonlinear systems bedescribed by (3). The order of the system and the nonlinear func-tions are all assumed to be known. We are interested in the fol-lowing problems related to :

i) system representation;ii) stabilization using state feedback;

Page 3: Identification and Control of a Nonlinear Discrete Time System Based on Its Linearization a Unified Framework

CHEN AND NARENDRA: NONLINEAR DISCRETE-TIME SYSTEM 665

Fig. 2. Inverse function theorem expressed in block diagram form.

iii) existence of a controller for regulation;iv) existence of a controller for tracking.

We will first study the case when the state vector is accessible,and then study the case when only the input and the output areaccessible.

Prior to addressing the problems stated above, we considersystem theoretic properties of that are important for all thediscussions.

III. SYSTEM THEORETIC PROPERTIES

A. Controllability

Qualitatively speaking, controllability is the ability to transferthe state of a system from one point to another in the state space.For the nonlinear system represented by (3), the followingdefinition of local controllability is relevant.

Definition 2 (Local Controllability): A system is locally con-trollable if there exists a neighborhood of the origin suchthat, for any , there is a sequence of inputs

that transfers the system from to .It is, in general, quite difficult to demonstrate the controlla-

bility of a complex nonlinear system [15]. However, if the lin-earized system is controllable, sufficient conditions can bestated for , as shown below in Theorem 3.

Theorem 3: If the system is controllable, then the systemis locally controllable.

Proof: We will show the existence of a control sequencethat transfers the SISO system from to in steps.

The properties of higher order functions discussed in Section IIwill be repeatedly used in the following derivations.

...

...

(4)

where

and are all properly defined higher order func-tions. Since is controllable, its controllability matrixis nonsingular. By Corollary 1 in Section II, in a neighborhood

of the origin in the space , the desired controlsequence is given by

(5)

The above result is well known. However, it has been derivedin a much more straightforward manner in the proof, withoutcumbersome function composition and Jacobian calculation.The same is also true for many other results presented in therest of the paper.

B. Stability and Stabilizability

The linear unforced system

(6)

is asymptotically stable if (and only if) the eigenvalues of thematrix lie within the unit circle of the complex plane, or equiv-alently, if (and only if) the Lyapunov equation

has a positive definite solution . In such a case

is a Lyapunov function for (6). The same Lyapunov function canbe used to prove the (local) asymptotic stability of the nonlinearsystem

since

(7)

in a neighborhood of the origin1 .A controllable linear system can be stabilized by a feedback

control law of the form

(8)

so that the closed-loop system

is asymptotically stable. It follows that the same feedback lawstabilizes also, since the closed-loop system

is also asymptotically stable in a neighborhood of the origin, asdiscussed earlier.

For discrete-time linear systems, the feedback gain can bechosen so that becomes nil-potent, i.e., .In such a case, any state can be brought back to the origin in nomore than steps by a state feedback law (“deadbeat control”).This result can be extended to the nonlinear system .

1By definition of higher order functions, for any � > 0, there exists a neigh-borhood of the origin in which � is bounded by �x.

Page 4: Identification and Control of a Nonlinear Discrete Time System Based on Its Linearization a Unified Framework

666 IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 15, NO. 3, MAY 2004

Theorem 4 (Finite Stabilizability): If the system is con-trollable, then the system can be stabilized in at most stepsby a state feedback law.

Proof: Let be zero in (5):

This is the (unique) control sequence of length that brings anystate to the origin. The first element of this sequence is

We claim that the state feedback law

stabilizes the system in at most steps, as a consequence of thefollowing two facts:

i) the solution (5) is unique in the neighborhood of theorigin;

ii) the origin is an equilibrium state.For a complete proof, see Appendix.

C. Observability

Definition 3 (Local Observability): Ifcan be re-constructed from a sequence of observations

and , then thesystem is locally observable.

Theorem 5: If the system is observable, then the systemis locally observable.

Proof: The procedure follows along the same line as in thecase of controllability in Section III-A. Define the vectors

and the matrix

and it can be shown that

where

is the observability matrix of , which is assumed to benonsingular. Therefore

(9)

where .

In summary, the results stated in this section merely makeprecise, in specific contexts, the well-known linearization prin-ciple, that design based on linearizations works locally fornonlinear systems. The controllability (stabilizability) and ob-servability of assure the local controllability (stabilizability)and observability of .

IV. REGULATION AND TRACKING; STATE VECTOR ACCESSIBLE

As stated earlier, our interest is in examining critically the roleplayed by in many of the identification and control problemscommonly addressed in the context of the nonlinear system .We first consider the problem of set-point regulation or regula-tion around an equilibrium state that is different from the origin.

A. Set-Point Regulation

The output of a stabilizable linear system can beregulated around a constant value asymptotically, using staticstate feedback, if , or equivalently, if thetransfer function of , , does not havea zero at .

We now consider the set-point regulation of the nonlinearsystem described by (3). Since is stabilizable, so is .Let

be the input to , where is a stable matrix (andhence is well-defined), and is a constant. The closedloop system has the form (for notational convenience, we dropthe time and denote by a superscript )

We will first show that with a constant input , the state ,and hence the output , will tend to constant values, and thenthe “nonlinear gain” from to can be determined. For a dis-crete-time system, consider the equality

Let

be an explicit representation of in terms of by using theimplicit function theorem in Section II. With the coordinatetransformation

is transformed into the origin of the -coordinates. The rep-resentation of the system now becomes (with definitions

and )

where is a constant matrix for any given constant .Since and, hence, and are zero, by

Page 5: Identification and Control of a Nonlinear Discrete Time System Based on Its Linearization a Unified Framework

CHEN AND NARENDRA: NONLINEAR DISCRETE-TIME SYSTEM 667

continuity, remains a stable matrix for sufficiently small, and, therefore

or

Thus we have shown that, after the system is stabilized by statefeedback, a constant input produces a constant output for thenonlinear system . Next we will examine the “nonlinear gain”of the system so that a desired constant output can be producedby an appropriate constant input. Let be a constant referenceoutput that is to follow asymptotically. This implies that

must be such that

(10)

where

Since , , a solution for in terms of (in a neighborhoodof the origin) to (10) exists if

This corresponds to the condition that the transfer functiondoes not have a zero at , or

equivalently, that the transfer functiondoes not have a zero at (since zeros are invariant understate feedback). Thus, we have the following theorem.

Theorem 6 (Set-Point Regulation): The output of the non-linear system can be regulated around a constant value in aneighborhood of the origin, if this can be achieved for the lin-earized system .

B. Periodic Sequence Tracking

The above result can be generalized to the tracking of anyperiodic sequence in a neighborhood of the origin.

Theorem 7 (Periodic Sequence Tracking): Given systemand any -periodic sequence , the output of can track

if the transfer function of has no zero that coincideswith an root of 1.

Proof: For a detailed proof, see [16]. The essence of theproof is to examine the evolution of the system on an -timescale: at , at ,etc., A periodic input to the system translates to constant inputsto the above -time scale subsystems and, therefore, the resultsobtained for the case of regulation can be extended to it.

C. Tracking an Arbitrary Signal

We have thus far considered the regulation problem and theproblem of tracking an -periodic signal. We now proceed toconsider the more general case where the output to befollowed is any arbitrary signal. However, further constraint onthe structure of has to be imposed before the problem can beposed precisely. This, in turn, leads to the concepts of relativedegree, normal form, and zero dynamics of the system, all ofwhich are central, both to the statement as well as the solution of

the general tracking problem [11], [17], [18]. We discuss theseconcepts first in the case of , before extending them to .

1) Relative Degree, Normal Form, and Zero Dynamics for: Consider the linear system described by

Let . Then

If , let . Then

Similarly, if , let . Then

As an example, assume that . Since , and, it follows that the fact implies that ,

and are linearly independent. Let be a nonsingularmatrix such that , and are its first three rows. Then, thetransformation

will transform the system into the normal form

(11)

where , , andis the state of the system, and and are row vec-

tors in and respectively, and ,and . Because of linearity, it can be

shown that can be chosen so that , and this is assumedin the following discussions.

We have assumed , and for illus-tration. In a more general case we would have

and

In such a case, , , ,, , and

while the equations for and are given by (11).The objective is for to follow a desired trajectory .

Intuitively this can be achieved by setting

(12)

However, the subsystem associated with in Fig. 3 is drivenby an arbitrary signal , and can become unbounded. To poseconditions that guarantee the boundedness of and other sig-nals, we proceed as follows. Choose to be identically zero.

Page 6: Identification and Control of a Nonlinear Discrete Time System Based on Its Linearization a Unified Framework

668 IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 15, NO. 3, MAY 2004

Fig. 3. Linear normal form.

From (11) and the fact that and we have a systemevolving as

This is called the zero dynamics of the linear system. If is anasymptotically stable matrix, it follows that .In such a case, even when is nonzero but bounded,is a bounded sequence, and so is the control input .

2) Relative Degree, Normal Form, and Zero Dynamics for: The results obtained in Section III for are global in na-

ture. In this section, we show that similar results can be obtained,in a neighborhood of the origin, for the nonlinear system .

Let be described by

(13)

Assume that

Let

Then

where , . If, in a neighborhood of the origin

i.e.,

(14)

where subscripts denote Jacobians, it follows that, . In other words, is only a func-

tion of . A necessary (but not sufficient) condition for (14) tohold in a neighborhood of the origin is that .

Fig. 4. Nonlinear normal form.

We now define to obtain

The last equality holds if

in a neighborhood of the origin, or equivalently

Again, a necessary but not sufficient condition for the above tohold is that .

Continuing in this manner, if , then

in a neighborhood of the origin, and therefore as in the linearcase, a nonsingular matrix can be chosen whose first threerows are , and , respectively, and a vector canbe chosen whose first three elements are , , and

. If

is chosen as the new system of coordinates, (13) can be trans-formed into the normal form

(15)

where , ,and , . The vector is the state of the system. Ablock diagram representation is shown in Fig. 4.

As in the linear case, a matrix can be chosen so thatin the expression for and we will therefore assume to bezero in the following discussions. However, still depends on

through the higher order terms in .

Page 7: Identification and Control of a Nonlinear Discrete Time System Based on Its Linearization a Unified Framework

CHEN AND NARENDRA: NONLINEAR DISCRETE-TIME SYSTEM 669

The general case with relative degree instead of 3 followsin a straightforward fashion.

Definition 4 (Relative Degree): The nonlinear system issaid to have a well-defined relative degree if

i) its linearized system has relative degree :

andii) the nonlinear system satisfies

in a neighborhood of the origin, where the functions, are defined recursively

as indicated previously, and subscripts denote Jacobians.We note that in the normal form, are delayed

versions of which is the first component of that can bedirectly controlled by . Since , it follows that, givena desired trajectory at time , the control can bedetermined as

(16)

where , so that follows . Sinceis bounded, are also bounded. How-

ever, it has to be demonstrated that is also a bounded se-quence. This will assure that the entire state of the systemis bounded. It is only then that the control can be computedusing (16).

To study the behavior of , we first consider the case when(and hence, ). The output can be forced to

zero by the input

Note that is a function only of . Under such an input,evolves according to

Definition 5 (Zero Dynamics): The dynamics associatedwith the part of the state vector when the output is forcedto be identically zero, i.e.

(17)

is called the zero-dynamics of the nonlinear system .A sufficient condition for the asymptotic stability of the non-

linear zero dynamics is that is a stable matrix, or equivalently,the zero dynamics of the linearized system is asymptoticallystable.

The zero dynamics given by (17) is an autonomous nonlineardifference equation. By Malkin’s theorem [19], if it is (uni-formly) asymptotically stable, it is also totally stable, or, stableunder sufficiently small perturbations. If we consider , thedesired output signal, as a sufficiently small perturbation, the wecan conclude that all signals are bounded, and that in a neighbor-hood of the origin, the control law (16) can be used for tracking.

With a change of coordinates, the control law can also beexpressed as

(18)

where .Theorem 8 (General Tracking): If the nonlinear system

has a well defined relative degree, and if the zero dynamics ofthe linearized system is asymptotically stable, then a controllaw (18) exists such that the output of follows any desiredsignal of sufficiently small magnitude, and that the state onlyevolves in a neighborhood of the origin.

V. NARMA MODEL AND TRACKING: INPUT AND

OUTPUT ACCESSIBLE

In Section IV, problems of regulation and tracking of a non-linear system are studied under the assumption that the statevector is accessible. In practical situations, it is often the casethat the state vector is not accessible while only the input and theoutput are accessible. This makes the study of such control prob-lems very important. For mathematical tractability we alwaysassume that, even when only the input and the output are acces-sible, the system has an underlying state space representation(3). Since it has been established that, if the linearized system

is observable, the nonlinear system is also observable in aneighborhood of the origin, it follows that the state of thesystem can be reconstructed from measurements

, . Thus,for the purpose of establishing existence of solutions for suchproblems as stabilization, regulation and tracking, we could usethe results obtained in Section IV and replace with its re-constructed value (for a more rigorous treatment, readers arereferred to [20]). However, rather than relating all variables ex-plicitly to the state vector at every step, we prefer a rep-resentation of the system entirely in terms of its input andoutput . Therefore, we will establish the nonlinear autore-gressive moving average (NARMA) model of the system ina neighborhood of the origin, and study the properties of thisrepresentation and the problems of regulation and tracking per-taining to it.

Before we proceed, the meaning of a (local) input–outputrepresentation of a nonlinear system has to be clarified. Aninput–output representation has the following properties.

i) For each initial condition in the state representation,there exists a corresponding initial condition vector interms of past values of and in the input–output repre-sentation.

ii) If and , , then the outputdetermined by the state representation will be identicalto the one determined by the input–output representation,where and are some neighborhoods of the origin.

Page 8: Identification and Control of a Nonlinear Discrete Time System Based on Its Linearization a Unified Framework

670 IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 15, NO. 3, MAY 2004

As pointed out in [11], since our main analytical tool is theimplicit function theorem, the results obtained for the nonlinearsystem will be local in nature, i.e., they are valid in aneighborhood of the origin. If is an unstable system,then there exist input sequences which lead to unboundedoutput sequences, thus leaving the region of validity of theimplicit function theorem. Therefore, strictly speaking, a localinput–output representation will fail to be valid for an unstable

. However, from a practical point of view, it is usually thecase that a linear controller is already in operation when weattempt nonlinear control using neural networks to improveperformance, and therefore our starting point will usually be thatall the signals are in the region of validity for all time. With thisunderstanding we will treat the input–output representationsof both stable and unstable systems in the same fashion.

A. The NARMA Model

From Section III it is known that if the linearized systemis observable, then the state can be reconstructed from the inputand the output, i.e.

where . If we iterate the state equation in (3) to obtain, and note that the system is time-invariant, we will have

for all ,

(19)

where the matrices , and are suitably defined, and. It follows that:

(20)

where . Thus the future value of the output, , isexpressed in terms of the current and past values of the input andthe output. Note that this has been achieved by only assumingthat the linearized system is observable.

Suppose has a relative degree (delay) , i.e., in (20),, and . Then

do not affect through linear terms,but they may affect through nonlinear terms in thehigher order function . Since our main analytical tools arethe inverse function theorem and the implicit function theorem,we want to restrict our attention to systems that have dominantlinear terms (as has been done in [21] and other papers). In thefollowing, it is assumed that the nonlinear system itself has awell-defined relative degree, as described in Section IV-C.2.

From the normal form (15) it follows that:

(21)

where

In this representation, affects through a nonzerolinear gain and a nonlinear higher order function .Since is part of the state vector, our objective is to eliminate itfrom the above equation and obtain an input–output representa-tion. There are three steps involved.

Step 1: Define

and (21) becomes

and our objective is to express in terms of and ,and their past or future values.

Step 2: Define

and, from (15), can be considered as the output of the fol-lowing system::

(22)

where is the state variable of dimension , and andare the inputs. Now we need the following:Proposition 1: The system in its normal form (11) is ob-

servable if and only if is an observable pair.Proposition 2: If is an observable pair, then the

system described by (22) is observable, i.e., the state can bereconstructed from the input and the output .

The aforementioned proposition is a straightforward exten-sion of the observability result for SISO systems presented inSection III. The following notations are used for the past andpresent values of the input and the output

Step 3: Since

it follows that:

and now we can express in terms ofand . The latter two are and

, respectively, by definition of .

Page 9: Identification and Control of a Nonlinear Discrete Time System Based on Its Linearization a Unified Framework

CHEN AND NARENDRA: NONLINEAR DISCRETE-TIME SYSTEM 671

Finally, it follows that:

(23)

where and the vectors and are suitably defined,with the latter having a nonzero leading element .

In this input–output representation, the input affects theoutput through a nonzero linear term .Mathematically speaking, a suitable value of can be deter-mined using the implicit function theorem for a desired value of

. However, from a control point of view, such a controlinput cannot be determined at instant , when the outputs

are not available. To overcome thisdifficulty, (23) is recursively used to replace future outputs withpast outputs and results in

(24)

where and . A diagrammaticrepresentation of a NARMA model is shown in Fig. 5.

B. The Tracking Problem

From the discussion in Section IV, it follows that, if the de-sired value is given at time , a control law can bedetermined as

(25)

where , so that follows .To ensure that all signals are bounded while using the

control law (25), we define zero dynamics in the input–outputrepresentation (24), which corresponds to the zero dynamics inthe state representation. As discussed earlier, the state vector

of the system (22) can be uniquely determined from, and , or more explicitly,

from and . Therefore, for a fixedsequence of output , there are values of the input thatuniquely determine the state . Thus, the zero dynamics interms of in (17) is manifested through the evolution ofwhen the output is constrained to be identically zero forall

(26)

Fig. 5. The NARMA model.

The above is called the zero dynamics in the input–output rep-resentation. The final results can be summarized as follows.

Theorem 9 (NARMA Model): If the system has a well de-fined relative degree, it has an input–output representation (24).And if the zero dynamics (26) is asymptotically stable, thenunder the control law (25), the output will follow any ref-erence signal with a sufficiently small amplitude, whileall signals in the system remain in a neighborhood of the origin.

C. Identifiers and Controllers

We have so far considered the case when the nonlinearities areknown. This provides the basis for dealing with the case whenthe nonlinearities are unknown and the plant has to be identifiedand controlled adaptively. In view of the results obtained so far,the identifier and the controller can be depicted in more detail,as shown in Fig. 6.

Identifiers and controllers are usually approximated by neuralnetworks. After collecting the input and the output of the plant,the training of an identifier can be conducted in a standard way.When the training is complete and the identifier is obtained, itscorresponding controller, as shown in Fig. 6, can also be trainedby using random numbers. Interested readers are referred to [22]for details.

VI. CONCLUSION

This paper presents a unified theoretical framework for theidentification and control of a nonlinear discrete-time dynam-ical system, in which the nonlinear system is represented ex-plicitly as a sum of its linearized component and the residualhigher order functions. This representation substantially simpli-fies the procedure of applying the implicit function theorem toderive local properties of the nonlinear system, and reveals therole played by the linearized system in a more transparent form.Under the assumption that the linearized system is controllableand observable, it is shown that: 1) the nonlinear system is alsocontrollable and observable in a local domain; 2) a feedback lawexists to stabilize the nonlinear system locally in a finite numberof steps; and 3) the nonlinear system can track a constant or a pe-riodic sequence locally, if its linearized system can do so. Withsome additional assumptions, the nonlinear system is shown tohave a well-defined relative degree (delay) and zero-dynamics.If the zero-dynamics of the linearized system is asymptoticallystable, so is the nonlinear one, and in such a case, a control lawexists for the nonlinear system to track an arbitrary reference

Page 10: Identification and Control of a Nonlinear Discrete Time System Based on Its Linearization a Unified Framework

672 IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 15, NO. 3, MAY 2004

Fig. 6. Identifier and controller in more detail.

signal locally. The tracking can be achieved by using the statevector for feedback, or by using only the input and the output,in which case the NARMA model is established and utilized.

Even though many of the results have appeared in [8], [9],and particularly in [11], the main contribution of this paper isto present all the results in a unified fashion, so that the roleplayed by the linearized system in the analysis of the non-linear system becomes evident. It is the authors’ belief thatthe framework presented will provide a significantly clearer un-derstanding of the use of neural networks as identifiers and con-trollers for general nonlinear discrete-time dynamical systems.

APPENDIX

A. Complete Proof of Theorem 4

Proof: Let be zero in (5):

This is the (unique) control sequence of length that brings anystate to the origin. Define the first element of this sequenceby

(27)

We claim that the state feedback law

(28)

stabilizes the system in at most steps. This is a consequenceof the following two facts:

i) the solution (5) is unique in the neighborhood of theorigin;

ii) the origin is an equilibrium state.To see this, let the control sequence that transfers any initial state

to the origin in exactly steps be denoted by ; recall thatany such sequence has to satisfy (4), which has a unique solution(5) and, therefore, is uniquely determined by the initial state .Let the first element of the sequence be denoted by , thesecond element by , and the rest of the elements by ,i.e.

(29)

Consider the case in which transfers the state to theorigin in exactly steps. Let be the state of the systemafter the control input is applied. Then we knowthat the remaining control sequencetransfers to the origin in steps. Since the originis an equilibrium state, it follows that the control sequence

transfers to the origin in exactlysteps. By uniqueness, it must follow that

hence

as a consequence of (27) and (29). Continuing the same argu-ment for , we conclude that the state feedback law

stabilizes the system in at most steps.

Page 11: Identification and Control of a Nonlinear Discrete Time System Based on Its Linearization a Unified Framework

CHEN AND NARENDRA: NONLINEAR DISCRETE-TIME SYSTEM 673

ACKNOWLEDGMENT

L. Chen would like to thank Dr. J. B. D. Cabrera [11], [16]for many insightful discussions.

REFERENCES

[1] K. S. Narendra and K. Parthasarathy, “Identification and control of dy-namical systems using neural networks,” IEEE Trans. Neural Networks,vol. 1, pp. 4–27, Mar. 1990.

[2] M. D. Lemmon and A. N. Michel, “Guest editorial: Special section onneural networks in control, identification, and decision making,” IEEETrans. Automat. Contr., vol. 44, pp. 1993–1994, Nov. 1999.

[3] K. S. Narendra and F. L. Lewis, “Introduction to the special issueon neural network feedback control,” Automatica, vol. 37, no. 8, pp.1147–1148, Aug. 2001.

[4] Y. H. Kim and F. L. Lewis, High-Level Feedback Control With NeuralNetworks, Singapore: World Scientific, 1999.

[5] S. S. Ge, C. C. Hang, T. H. Lee, and T. Zhang, Stable Adaptive NeuralNetwork Control. Norwell, MA: Kluwer, 2001.

[6] F. L. Lewis, S. Jagannathan, and A. Yesildirek, Neural Network Controlof Robot Manipulators and Nonlinear Systems. London, U.K.: TaylorFrancis, 1999.

[7] R. M. Sanner and J.-J. E. Slotine, “Gaussian networks for direct adaptivecontrol,” IEEE Trans. Neural Networks, vol. 10, pp. 898–906, 1999.

[8] A. U. Levin and K. S. Narendra, “Control of nonlinear dynamicalsystems using neural networks. controllability and stabilization,” IEEETrans. Neural Networks, vol. 4, pp. 192–206, Mar. 1993.

[9] , “Control of nonlinear dynamical systems using neural net-works—Part II: Observability, identification, and control,” IEEE Trans.Neural Networks, vol. 7, pp. 30–42, Jan. 1996.

[10] K. S. Narendra and S. Mukhopadhyay, “Adaptive control of nonlinearmultivariable systems using neural networks,” Neural Networks, vol. 7,no. 5, pp. 737–752, 1994.

[11] J. B. D. Cabrera and K. S. Narendra, “Issues in the application of neuralnetworks for tracking based on inverse control,” IEEE Trans. Automat.Contr., vol. 44, pp. 2007–2027, Nov. 1999.

[12] S. Mukhopadhyay and K. S. Narendra, “Disturbance rejection in non-linear systems using neural networks,” IEEE Trans. Neural Networks,vol. 4, pp. 63–72, Jan. 1993.

[13] L. Chen and K. Narendra, “Nonlinear adaptive control using neural net-works and multiple models,” Automatica, Special Issue on Neural Net-work Feedback Control, vol. 37, no. 8, pp. 1245–1255, Aug. 2001.

[14] L. J. Corwin and R. H. Szczarba, Multivariable Calculus. New York:Marcel Dekker, 1982.

[15] B. Jakubczyk and E. Sontag, “Controllability of nonlinear discrete-timesystems: A lie-algebraic approach,” SIAM J. Contr. Opt., pp. 1–33, 1990.

[16] J. B. D. Cabrera, “Nonlinear Arma Models and the General TrackingProblem for Discrete-Time Dynamical Systems,” Ph.D. dissertation,Yale University, New Haven, CT, 1997.

[17] A. Isidori, Nonlinear Control Systems II, ser. Commun. Contr.Eng.. New York: Springer Verlag, 1999.

[18] S. Monaco and D. Normand-Cyrot, “Minimum-phase nonlineardiscrete-time systems and feedback stabilization,” in Proc. 26th IEEEConf. Decision and Control, 1987, pp. 979–986.

[19] J. P. LaSalle, Stability Theory for Difference Equations, ser. Studies inOrdinary Differential Equations, MAA Studies in Mathematics: Amer-ican Mathematical Assoc., 1977.

[20] L. Chen, “Nonlinear adaptive control of discrete-time systems usingneural networks and multiple models,” Ph.D. dissertation, Yale Univ.,New Haven, CT, 2001.

[21] I. J. Leontaritis and S. A. Billings, “Input–output parametric modelsfor nonlinear systems. Part I: Deterministic nonlinear systems,” Int. J.Contr., vol. 41, no. 2, pp. 303–328, 1985.

[22] L. Chen and K. Narendra, “On a new approach to the design of trackingcontrollers for nonlinear dynamical systems,” in Proc. 1999 Amer. Con-trol Conf., vol. 5, 1999, pp. 3534–3538.

Lingji Chen (S’98–M’99) received the B.E. andM.E. degrees in electrical engineering from theUniversity of Science and Technology of China,Hefei, Anhui, in 1990 and 1993, respectively, andthe Ph.D. degree in electrical engineering from YaleUniversity, New Haven, CT, in 2001.

Currently, he is a Senior Research Engineer withScientific Systems Company Inc., Woburn, MA.His research interests include systems biology,bioinformatics, data fusion, nonlinear control, andneural networks.

Kumpati S. Narendra (S’55–M’60–SM’63–F’79–LF’97) received the B.E. degree (with honors) inelectrical engineering, from Madras University,India, in 1954, and the M.S. and Ph.D. degreesfrom Harvard University, Cambridge, MA, in 1955and 1959, respectively. He received an honoraryM.A. degree from Yale University, New Haven, CT,in 1968, and in 1995, his alma mater, in Madras,awarded him an honorary Doctor of Science degree.

From 1961 to 1965, he was an Assistant Professorwith the Division of Engineering and Applied

Science at Harvard University. He joined the Department of Engineering andApplied Science of Yale University in 1965, and became Professor in 1968.He was Chairman of the Department of Electrical Engineering from 1984to 1987, and Director of the Neuro-Engineering and Neuro-Science Centerduring 1995–1996. Currently, he is the Harold W. Cheel Professor of ElectricalEngineering and Director of the Center for Systems Science at Yale University.He is the author of more than 175 technical articles in the area of systemstheory, three books, and the editor of four others in the areas of adaptivecontrol, stability theory, and learning. He has served on various internationaltechnical committees, and on several editorial boards of technical journals. Heis currently a member of the Advisory Committee of the Institute of AdvancedEngineering in Seoul, Korea, the Scientific Advisor of the Hamilton Institutein Dublin, Ireland. He has also been a consultant to more than 15 industrialresearch laboratories during the past 30 years.

Prof. Narendra is the recipient of numerous awards, including the Franklin V.Taylor Memorial Award (IEEE SMC Society, 1972), the George Axelby BestPaper Award (IEEE Control Systems Society, 1988), the John R. Ragazzini Ed-ucation Award (American Automatic Control Council, 1990), the OutstandingPaper Award of the Neural Network Council (1991), the Neural Network Lead-ership Award (1994), the Bode Prize/Lecture (IEEE Control Systems Society,1995), and the Richard E. Bellman Control Heritage Award of the AACC, forpioneering contributions to adaptive control and learning theory (June 2003).He is a Fellow of the Institute of Electrical Engineers (U.K.) and a member ofSigma Xi, and the Connecticut Academy of Science and Engineering.