Composite learning tracking control for underactuated marine surface vessels with output constraints

View article
PeerJ Computer Science

Introduction

In recent years, with the development of the marine economy, marine transport vehicles have gained much attention (Shen et al., 2020; Yu, Guo & Yan, 2019). Marine surface vehicles (MSVs) have been widely used in marine exploration, marine transportation, marine survey and other fields (Liu et al., 2016; Shao et al., 2019). To accomplish these tasks, the trajectory tracking control of MSVs plays a significant role. Due to the influence of the external environment, the kinetics of MSVs inevitably have unknown dynamics and unknown time-varying environmental disturbances.

In view of this, a series of control approaches have been utilized for control of MSVs, including neural network (NN) control (Zhu et al., 2021; Li et al., 2015), fuzzy logic system (FLS) control (Peng, Wang & Wang, 2018; Wang, Sun & Er, 2018), disturbance observer-based (DOB) control (Guo & Zhang, 2020; Hu et al., 0000), and the finite-time control (Zhu, Ma & Hu, 2020; Wang, Pan & Su, 2019; Wang & Deng, 2020). In Zhu et al. (2021), Li et al. (2015), Peng, Wang & Wang (2018), Wang, Sun & Er (2018), NNs and FLSs are used to approximate the uncertain terms, such as unmodeled dynamics, unknown dynamics. In Guo & Zhang (2020), Hu et al. (0000), a DOB control approach was adopted to compensate compound uncertainty of parameter perturbations and unknown disturbances. In Do (2016) and Ghommam & Saad (2018), the dynamic uncertainties of MSVs were dealt with by parameter adaptive technique and a backstepping design tool.

To address the underactuation problem of MSVs, several control methods are introduced, such as additional control method (Do, 2010; Park, Kwon & Kim, 2017; Chen et al., 2020), output redefinition control (Shojaei & Arefi, 2015; Shojaei, 2017), line-of-sight (LOS) (Shojaei, 2015; Gao et al., 2016; Jia, Hu & Zhang, 2019; Liu, 2019), etc. Three additional control terms were adopted to address the underactuation problem of MSV in Do (2010), Park, Kwon & Kim (2017), Chen et al. (2020). To achieve the design of trajectory tracking control laws, the output redefinition control approach in Shojaei & Arefi (2015) and Shojaei (2017) was introduced to handle the underactuation problem, the combination of adaptive technique, NNs and saturation function to solve the unknown disturbances, unknown dynamic and input saturation, respectively. In Shojaei (2015), Gao et al. (2016), Jia, Hu & Zhang (2019) and Liu (2019), the LOS method was utilized to solve the underactuation problem of MSVs, the combination of parameter adaptive technology and NN approximation are used to successfully solve the time-varying external disturbance and parameter uncertainty.

For the sake of navigation safety, the output constraint problem is inevitably in practice. In practice, the navigable water areas are restricted, and then surface vessels should navigate in the navigable water areas. When the position error is too large, it may lead to collision accident of MSVs. When the yaw angle errors become excessive, the actuator will be damaged due to overload. Therefore, it is necessary to further study the MSVs trajectory tracking system with output constraints. Several methods have been presented to solve the output constraint problem, such as moving-horizon optimal control (Mayne & Michalska, 1990), artificial potential field (Sun & Ge, 2014), barrier Lyapunov function (BLF) (Tee et al., 2011) and output error transformation method (Zheng et al., 2020; Zhu, Du & Kao, 2020). In Zheng et al. (2020) and Zhu, Du & Kao (2020), the output constraint problem is transformed into a tracking error constraint problem by using the coordinate transformation. Coordinate transformation ensures that the tracking error always stays within predefined boundaries. Duo to the structure of Lyapunov function can be constructed by a barrier function, the BLF-based approach can solve the problem of trajectory tracking control for MSVs under the output constraint (Zhu, Du & Kao, 2020; Zhao, He & Ge, 2014). In simultaneous consideration of unknown dynamics and time-varying disturbances, Zhu, Du & Kao (2020) use a log-BLF method to solve the constant symmetric output constraint, Zhao, He & Ge (2014) utilize the asymmetric BLF method to deal with the asymmetric output constraints.

All the literature mentioned before have concentrated on the tracking and stability of the system. Most literature have not mentioned the precision accuracy of identifying models. In practice, the model uncertainty should be approximated as precisely as possible. In generally, the unknown dynamics of the system can be compensated by using adaptive control technique. In order to achieve better control performance, composite adaptive control scheme is developed in Patre & Bhasin (2010). It makes the system realize faster parameter convergence as well as smaller tracking error, and has been applied in various fields (Sun, Pan & Yang, 2017; Pan, Sun & Yu, 2016). By approximating the unknown dynamic items faster and more accurately to obtain better control performance, the prediction errors can be constructed by the serial-parallel estimation model (SPEM) (Peng, Wang & Wang, 2017). Then, the updating law of the neural network is designed by using the prediction error, which improves the transient performance effectively. To update the laws and optimize the system’s transient performance, Yucelen & Haddad (2013) presented an adaptive control modification. An error feedback term was included in the reference model in Pan, Sun & Yu (2016) and Stepanyan & Krishnakumar (2010) to improve the transient performance of the model. In Xu & Sun (2018), both the prediction errors and the tracking errors were applied to construct the updating law of NNs weights. The index of learning performance is introduced in the update rate, some literature focus on constructing composite learning laws by introducing auxiliary filter (Na et al., 2015; Huang et al., 2018) or using time interval data (Xu et al., 2019; Xu et al., 2018).

In this paper, we propose a composite learning control strategy for underactuated MSVs subject to unknown dynamics, ocean environmental disturbances, and output constraints based on the discussion above. The main contributions can be summarized as follows.

  • Position error and yaw angle error constraints are addressed by employing the BLF-based method. The dynamic surface control approach is used to decrease the computation of the explosion problem that exists in the backstepping method.

  • The composite NNs are employed to approximate the unknown dynamics of MSVs. Different from the traditional NN in which only the tracking errors are used to update the NN weights, both the tracking errors and prediction errors are used to update the NN weights. Therefore, the unknown dynamics can be approximated faster and more accurately.

  • Using the approximation to the unknown dynamics of MSVs, the NDOs are constructed to estimate time-varying disturbances. By combining the dynamic surface control technique with disturbance observers and composite NNs, a trajectory tracking control system is developed. Compared with the control scheme based on neural networks, the proposed control scheme can effectively improve the transient and steady-state performance of MSVs trajectory tracking control.

The rest of this paper is arranged as follows. In Section 2, the mathematical model of MSVs and the problem formulation are introduced. In Section 3, the principle of intelligent approximation using NN is presented. In Section 4, proposes the details of controller design procedures. In Section 5, the simulation results are given to show the effectiveness of the controller. In Section 6, the entire work is summarized.

Problem formulation and preliminaries

MSV kinematic and dynamic models

The mathematical model of underactuated MSVs with 3 degrees of freedom can be described as x ̇ = u cos φ v sin φ y ̇ = u sin φ + v cos φ φ ̇ = r u ̇ = 1 m 11 m 22 v r d 11 u + τ u + Δ f u + d u v ̇ = 1 m 22 m 11 u r d 22 v + Δ f v + d v r ̇ = 1 m 33 m 11 m 22 u v d 33 r + τ r + Δ f r + d r where [xyφ]T denotes the position and heading angle in the inertial reference frame. [uvr]T denotes surge, sway and angular velocity in the body-fixed frame. The mii, i = 1, 2, 3 represent the inertia including added mass. The dii, i = 1, 2, 3 stand for the hydrodynamic damping in surge, sway and yaw. The dj, j = uvr denote unknown environmental disturbances. Δfu, Δfv and Δfr represent unknown dynamics of the MSVs. τu and τr are the control force and moment in the surge and yaw directions.

Assumption 1: The environmental disturbances dj are unknown bounded and there exists d ̇ j d ̄ j , j = uvr, d ̄ j are unknown positive constants.

Remark 1: The ocean disturbances include slowly changing disturbances caused by second-order waves, currents, winds and unknown dynamics, as well as norm-bound disturbances caused by ocean uncertainties. The energy in the marine environment is finite. The rate of change of ocean disturbance is unknown bounded.

Remark 2: Since these parameters of MSVs are affected by operational conditions and marine environment. These factors change frequently, which makes these parameters of MSVs are uncertainties. where mii and dii, i = 1, 2, 3 represent nominal values of the inertia including added mass and the hydrodynamic damping, respectively. Where Δfj, j = uvr represent unknown dynamics includes uncertain parts of the model parameters.

Assumption 2: The desired smooth reference signal xd, yd and its first two time derivatives are bounded.

The position errors and orientation tracking errors will be defined in the body-fixed frame x e = x x d cos φ + y y d sin φ y e = x x d sin φ + y y d cos φ

The time derivative of Eqs. (3a) and (3b) can be expressed as x ̇ e = u + r y e x ̇ d cos φ y ̇ d sin φ y ̇ e = v r x e + x ̇ d sin φ y ̇ d cos φ

In engineering practice, the MSV position, heading, velocities in surge and sway, and yaw rate can be measured by the global positioning system, the gyro compass, the Doppler log, and the rate gyro, respectively. Then, we define the tracking position error ρs and yaw angle error θ as ρ s = ρ e ρ 0 = x e 2 + y e 2 ρ 0 θ = arctan 2 y e , x e By combining Eqs. (3a)(3b) and Eqs. (5a)(5b) we can get x e = ρ e cos θ y e = ρ e sin θ

To avoid the possible singularity of the virtual control law, a positive constant ρ0 is introduced. Considering Assumption 1 and Assumption 2, the control objective is to construct the composite intelligent learning control law τu and τr for MSVs to make sure the ρs and θ can converge to arbitrarily small errors under unknown dynamics, time-varying disturbances and output constraints.

Radial basis function neural network (RBFNN) approximation

In this paper, the RBF NNs are employed for approximation. For an arbitrary continuous function f(ς) over a compact set Ω(ς) → Rn, there exists an RBF NN with the following form: f ς = ω T ψ ς + ξ w , ς Ω ς ψ ς = e x p ς c j T ς c j / b j l 2 , j = 1 , 2 , , l where f(ς) ∈ Rp denotes the output vector of the RBF NN, ς ∈ Rq denotes the input vector of the RBF NN. ψ(ς) is Gaussian basis function. cj is the center of the basis function and bj is the width of the Gaussian function. ξw is the approximation error that satisfies ξ w ξ ̄ , ξ ̄ is an unknown positive constant.

According to Eq.(43), ω is the ideal weight parameter that satisfies ω = arg min ω R sup ς Ω ς f ς ω T ψ ς represent NN weights parameter. However, it is very difficult to determine the ideal weight parameter. ω ˆ is the estimate of the NN weights parameter. However, it is very difficult to determine the ideal weight parameter. The estimate of the NN weights parameter is usually used to approximate the unknown nonlinear term such as f ˆ = ω ˆ T ψ in practice.

Control Law Design

In this section, we can design the control law for the MSVs under Assumption 1–2. The block diagram of the trajectory tracking control system of MSVs is presented in Fig. 1. Combing Eqs. (5a) and (5b) with Eqs. (6a) and (6b), the time derivative of ρs can be written as ρ ̇ s = u cos θ + v sin θ + cos θ ζ 1 + sin θ ζ 2 where ζ1 and ζ2 are defined as follows ζ 1 = x ̇ d cos φ y ̇ d sin φ ζ 2 = x ̇ d sin φ y ̇ d cos φ

Schematic of the MSV closed-loop tracking control.

Figure 1: Schematic of the MSV closed-loop tracking control.

When MSV pass through a narrow passage, it is necessary to limit the position error ρs to prevent vehicle collisions. The BLF can be selected as the following form V 1 = 1 2 log k a 2 k a 2 ρ s 2

where log(∗) is the natural logarithm of (∗), ka is the constraint of ρs, there exist ρ s < k a .

Taking time derivative of Eq. (10) , it can be further written as V ̇ 1 = ρ s ρ ̇ s k a 2 ρ s 2 = ρ s k a 2 ρ s 2 u cos θ + v sin θ + cos θ ζ 1 + sin θ ζ 2 The virtual control law can be designed as α u = sec θ k ρ ρ s v sin θ cos θ ζ 1 sin θ ζ 2 where kρ is a positive constant.

In the surge direction, Let αu pass through a first-order filter with a time constant Tu > 0 to get a new state variable βu. T u β ̇ u + β u = α u , β u 0 = α u 0

Then, the filter error and velocity error can be defined as λu and ue, respectively. So, it can be expressed as λ u = β u α u , u e = u β u

The time derivative of λu can be calculated as λ ̇ u = λ u T u α ̇ u = λ u T u + B u where Bu is a continuous function and has a maximum value Hu.

Then, V2 can be further chosen as V 2 = 1 2 log k a 2 k a 2 ρ s 2 + 1 2 m 11 u e 2

The time derivative of Eq. (16) can be written as V ̇ 2 = ρ s ρ ̇ s k a 2 ρ s 2 + m 11 u e u ̇ e = ρ s k a 2 ρ s 2 u e cos θ λ u cos θ k ρ ρ s + m 11 u e u ̇ e

According to Eqs. (2a) and (12), we can obtain the time derivative of as m 11 u ̇ e = m 22 v r d 11 u + τ u + Δ f u + d u m 11 β ̇ u

The unknown term can be approximate using NN. We have m22vr − d11u + Δfu = ωuTψu + ξu. Here, let Du = ξu + du. The ξu is the approximation error that satisfies the time derivative of ξu is bound. With Assumption 1, we can get D u χ u 0 , D ̇ u χ u where χu0 and χu are unknown positive constants.

Therefore, the time derivative of V2 can be further written as V ̇ 2 = ρ s ρ ̇ s k a 2 ρ s 2 + m 11 u e u ̇ e = ρ s k a 2 ρ s 2 u e cos θ + λ u cos θ k ρ ρ s + u e ω u T ψ u + D u + τ u m 11 β ̇ u

Then, we can design the control law as τ u = ω ˆ u T ψ u D ˆ u + m 11 β ̇ u k u u e ρ s cos θ k a 2 ρ s 2 where ku is a positive constant. ω ˆ u is the estimation of the ω u . D ˆ u is the estimation of the Du. ω ̃ u = ω u ω ˆ u , D ̃ u = D u D ˆ u

From Eq. (21) along Eq. (20), we can get V ̇ 2 = ρ s λ u cos θ k ρ ρ s 2 k a 2 ρ s 2 + u e ω ̃ u T ψ u + u e D ̃ u k u u e 2

Then, we can define zu as prediction error z u = u u ˆ

u ˆ can be defined with SPEM u ˆ ̇ = 1 m 11 τ u + ω ˆ u T ψ u + D ˆ u + ϕ u z u where u ˆ 0 = u 0 , ϕu is a positive constant.

The prediction error is employed to construct the weight updating ω ˆ ̇ u = γ u u e + γ z u z u ψ u ϑ u ω ˆ u where γu , γzu and ϑu are the positive constants to be designed.

The approximation information is employed to construct the NDO in the following form D ˆ u = m 11 u σ u σ ̇ u = ω ˆ u T ψ u + D ˆ u + τ u u e + γ z u z u

According to Eqs. (2a), (27a) and (27b), the derivative of D ˆ u can be expressed as D ˆ ̇ u = ω ̃ u T ψ u + D ̃ u + u e + γ z u z u

Then, the D ̃ ̇ u can be calculated D ̃ ̇ u = D ̇ u ω ̃ u T ψ u D ̃ u u e γ z u z u

Combining Eqs. (5a)(5b) with Eqs. (6a)(6b), the time derivative of θ can be written as θ ̇ = r + 1 ρ e u sin θ + v cos θ sin θ ζ 1 + cos θ ζ 2

It is also necessary to restrict θ in practice, there exist θ < k b . Similar to the above, we select the following BLF candidates as V 3 = 1 2 log k b 2 k b 2 θ 2

Taking time derivative of Eq. (31), it can be further written as V ̇ 3 = θ k b 2 θ 2 r + 1 ρ e u sin θ + v cos θ sin θ ζ 1 + cos θ ζ 2

According to Eq. (32), we can get virtual control law αr for the yaw direction α r = k θ θ + 1 ρ e u sin θ + v cos θ sin θ ζ 1 + cos θ ζ 2 where kθ is a positive constant.

Remark 3: From Eq. (33), it can be seen αr is undefined when ρe = 0. The positive constant ρ0 is designed to make ρe − ρ0 can converge to the neighbor of zero. It means that ρe can converge to the neighbor of ρe. Therefore, the singularity of αr can be avoided.

Let αr pass through a first-order filter with a time constant Tr > 0 to get a new state variable βr. T r β ̇ r + β r = α r , β r 0 = α r 0

Then, the filter error and velocity error can be defined as λr and re, respectively. So, it can be expressed as λ r = β r α r , r e = r β r

The time derivative of λr can be calculated as λ ̇ r = λ r T r α ̇ r = λ r T r + B r where Br is a continuous function and has a maximum value Hr.

Then, V4 can be further chosen as V 4 = 1 2 log k b 2 k b 2 φ e 2 + 1 2 m 33 r e 2

The time derivative of Eq. (37) can be written as V ̇ 4 = θ θ ̇ k b 2 θ 2 + m 33 r e r ̇ e = θ k b 2 θ 2 r e λ r k θ θ + m 33 r e r ̇ e

According to Eqs. (2c) and (35), we can obtain the derivative of re as m 33 r ̇ e = m 11 m 22 u v d 33 r + τ r + Δ f r + d r m 33 β ̇ r

The unknown term can be approximate using NN. We have (m11 − m22)uv − d33r + Δfr = ωrTψr + ξr . we can define Dr = ξr + dr, The ξr is the approximation error that satisfies the time derivative of ξr is bound. With Assumption 1, we can get D r χ r 0 , D ̇ r χ u where χr0 and χr are unknown positive constants.

Then, the time derivative of V4 can be further written as V ̇ 4 = θ k b 2 θ 2 r e λ r k θ θ + r e ω r T ψ r + D r + τ r m 33 β ̇ r

Then, we can get τ r = ω ˆ r T φ r D ˆ r + m 33 β ̇ r k r r e + θ k b 2 θ 2 where kr is a positive constant. ω ˆ r is the estimation of the ω r . D ˆ r is the estimation of the Dr. ω ̃ r = ω r ω ˆ r , D ̃ r = D r D ˆ r

From Eqs. (41) along (40), we can get V ̇ 4 = θ k b 2 θ 2 λ r k θ θ + r e ω ̃ r T ψ r + r e D ̃ r k r r e 2

Then, we can define zr as prediction error z r = r r ˆ

r ˆ can be defined with SPEM r ˆ ̇ = 1 m 33 τ r + ω ˆ r T ψ r + D ˆ r + ϕ r z r where r ˆ 0 = r 0 , ϕr is a positive constant.

The prediction error is employed to construct the weight updating ω ˆ ̇ r = γ r r e + γ z r z r ψ r ϑ r ω ˆ r where γr , γzr and ϑr are the positive constants to be designed.

The approximation information is employed to construct the NDO in the following form D ˆ r = m 33 r σ r σ ̇ r = ω ˆ r T ψ r + D ˆ r + τ r r e + γ z r z r

According to Eqs. (2a), (48a) and (48b), the derivative of D ˆ r can be expressed as D ˆ ̇ r = ω ̃ r T ψ r + D ̃ r + r e + γ z r z r

Then, the D ̃ ̇ r can be calculated D ̃ ̇ r = D ̇ r ω ̃ r T ψ r D ̃ r r e γ z r z r

Remark 4: From Eqs. (26) and (47), it can easily obtain the weight updating of composite NN is designed by employing tracking error and prediction error. The prediction error can provide extra information for learning NN weight updating. Thus, better tracking performance can be achieved.

Remark 5: In Eqs. (26) and (47), γu and γr are positive constants used to optimize the learning rate. The ω ˆ u and ω ˆ r mainly tuned by the prediction errors if and are chosen larger, while if γzu and γzr are chosen smaller, the ω ˆ u and ω ˆ r mainly tuned by the tracking errors.

The compound unknown terms consist of unknown dynamics and time-varying disturbances are expressed as ∑u and ∑r. m 22 v r d 11 u + Δ f u + d u = Σ u m 11 m 22 u v d 33 r + Δ f r + τ w r = Σ r

Remark 6: The disturbance observer and neural network contain each other’s information. If compound unknown terms can be perfect follow by ω ˆ u T ψ u + D ˆ u and ω ˆ r T ψ r + D ˆ r , the system’s estimation of unknown information can be more accurate. As a result, the objective of composite learning combining NN and NDO is accomplished.

Remark 7: Through trial and error, we first choose the appropriate design parameters kρ, kθ, ku, and kr to ensure that the system is stable. Furthermore, we properly regulate the other design parameters γuγzuγrγzrϑu, ϑr, ϕu and ϕr to get the satisfactory control performance. A large number of simulations in many cases show that the larger kρ, kθ,, ku, kr , γzu, γzr, ϕu and ϕr are, the MSVs can obtain higher tracking accuracy.

Theorem 1: Considering the closed-loop system Eqs. (1a)(1c) and Eqs.(2a)(2c) with unknown dynamics, time-varying disturbances and output constraint under Assumption 1–Assumption 2, if virtual control law Eqs. (12), (33), control law Eqs. (21), (42), the NN updating laws Eqs. (26), (47) and NDOs Eqs. (27a)(27b), Eqs. (48a)(48b) are designed. It is guaranteed that all signals include in Eq. (52) are uniformly ultimately bounded (UUB).

Proof: Consider the following Lyapunov function V = V 2 + V 4 + 1 2 1 γ u ω ̃ u T ω ̃ u + D ̃ u 2 + λ u 2 + m 11 γ z u z u 2 + 1 γ r ω ̃ r T ω ̃ r + D ̃ r 2 + λ r 2 + m 33 γ z r z r 2

The time derivative of Eq. (52) can be calculated as V ̇ = V ̇ 2 + V ̇ 4 + 1 γ u ω ̃ u T ω ˆ ̇ u + D ̃ u D ˆ ̇ u + m 11 γ z u z u z ̇ u + λ u λ ̇ u + λ r λ ̇ r + 1 γ r ω ̃ r T ω ˆ ̇ r + D ̃ r D ˆ ̇ r + m 33 γ z r z r z ̇ r

In the view of Eqs. (15), (36) and Young’s inequality, we can get λ u λ ̇ u λ u 2 T u + 1 2 ι λ u 2 + 2 ι H u 2 λ r λ ̇ r λ r 2 T r + 1 2 ι λ r 2 + 2 ι H r 2

Using Eqs. (26) and (47), we have 1 γ u ω ̃ u T ω ˆ ̇ u = ω ̃ u T u e + γ z u z u ψ u ϑ u ω ˆ u 1 γ r ω ̃ r T ω ˆ ̇ r = ω ̃ r T r e + γ z r z r ψ r ϑ r ω ˆ r

From Eqs. (29) and (50), we have D ̃ u D ̃ ̇ u = D ̃ u D ̇ u ω ̃ u T ψ u D ̃ u u e γ z u z u D ̃ r D ̃ ̇ r = D ̃ r D ̇ r ω ̃ r T ψ r D ̃ r r e γ z r z r

Combining Eqs. (2a)(2c), (24), (25), (45) with Eq. (46), we can get m 11 γ z u z u z ̇ u = γ z u z u ω ̃ u T ψ u + D ̃ u ϕ u z u m 33 γ z r z r z ̇ r = γ z r z r ω ̃ r T ψ r + D ̃ r ϕ r z r

Combining Eqs. (23), (44), (52)(58) and Young’s inequality, Eq. (53) can be expressed as V ̇ k ρ A 2 ρ s 2 k a 2 ρ s 2 k u u e 2 γ z u ϕ u z u 2 D ̃ u 2 1 T u 1 2 A k a 2 ρ s 2 1 2 ι λ u 2 + 2 ι H u 2 + ω ̃ u T ϑ u ω ˆ u + D ̃ u D ̇ u D ̃ u ω ̃ u T ψ u k θ 1 2 θ 2 k b 2 θ 2 k r r e 2 D ̃ r 2 1 T r 1 2 k b 2 θ 2 λ r 2 γ z r ϕ r z r 2 + 1 2 ι λ r 2 + 2 ι H r 2 + ω ̃ r T ϑ r ω ˆ r + D ̃ r D ̇ r D ̃ r ω ̃ r T ψ r

According to Young’s inequality, we can obtain D ̃ g ω ̃ g T ψ g 1 2 ζ g D ̃ g 2 ϖ g 2 + 1 2 ζ g ω ̃ g T ω ̃ g D ̃ g D ̇ g 1 2 D ̃ g 2 + 1 2 χ g 2 ω ̃ g T ω ˆ g 1 2 ω ̃ g T ω ̃ g + 1 2 ω g 2 where ζg is positive user-defined parameter, ∥ψg ∥  ≤ ϖg, D ̇ g χ g , g = ur.χg and ∥ωg ∥ are positive constants.

From Eqs. (63)(65), Eq. (62) can be expressed as

V ̇ k ρ A 2 ρ s 2 k a 2 ρ s 2 k u u e 2 1 2 ϑ u 1 2 μ u ω ̃ u T ω u 1 T u 1 2 A k a 2 ρ s 2 1 2 ι λ u 2 1 2 1 2 μ u ϖ u 2 D ̃ u 2 γ z u ϕ u z u 2 k θ 1 2 θ 2 k b 2 θ 2 k r r e 2 γ z r ϕ r z r 2 1 T r 1 2 k b 2 θ 2 1 2 ι λ r 2 1 2 ϑ r 1 2 μ r ω ̃ r T ω r 1 2 1 2 μ r ϖ r 2 D ̃ r 2 + 2 ι H u 2 + 1 2 ϑ u ω u 2 + 1 2 χ u 2 + 2 ι H r 2 + 1 2 ϑ r ω r 2 + 1 2 χ r 2 2 a V + b where a = min k ρ A 2 , k u , 1 T u 1 2 A k a 2 ρ s 2 1 2 ι , γ z u ϕ u , 1 2 ϑ u 1 2 μ u , 1 2 1 2 μ u ϖ u 2 , k θ 1 2 , k r , 1 T r 1 2 k b 2 θ 2 1 2 ι , 1 2 ϑ r 1 2 μ r , 1 2 1 2 μ r ϖ r 2 , γ z r ϕ r , b = 2 ι H u 2 + 1 2 ϑ u ω u 2 + 1 2 χ u 2 + 2 ι H r 2 + 1 2 ϑ r ω r 2 + 1 2 χ r 2 .

By choosing the appropriate design parameters to make k ρ > A 2 , k u > 0 , 1 T u 1 2 A k a 2 ρ s 2 1 2 ι > 0 , γ z u ϕ u > 0 , 1 2 ϑ u 1 2 μ u > 0 , 1 2 1 2 μ u ϖ u 2 > 0 , k θ > 1 2 , k r > 0 , 1 T r 1 2 k b 2 θ 2 1 2 ι > 0 , 1 2 ϑ r 1 2 μ r > 0 , 1 2 1 2 μ r ϖ r 2 > 0 , γ z r ϕ r > 0 .

By solving Eq. (66), we have 0 V b 2 a + [ V (0) b 2 a ] e 2 a t

From Eq. (67), we can obtain that V b 2 a as t → ∞. All signals in the Lyapunov function Eq. (52) are UUB. This concludes the proof.

Simulation Results

In this section, to demonstrate the effectiveness of the proposed control system, the dynamic model of an MSV in Do, Jiang & Pan (2004) is considered.

The model parameters of the MSV are presented as follows: m11 =120 × 103 kgm22 = 177.9 × 103 kgm33 = 636 × 105 kg m2. d11 =215 × 102 kg/sd22 = 147 × 103 kg/sd33 = 802 × 104 kg/m2s.

The proposed control scheme is marked as τCL. The control strategy without considering the prediction error is denoted as τNN.

Case 1: The reference trajectory is selected as xd = 200sin(0.02t), yd = 200cos(0.02t).

Unknown dynamics are selected as [Δfu, Δfv, Δfr]T = 0 . 2 d 11 u u , 0 . 2 d 22 v v , 0 . 2 r r T . The external disturbances are given as [dudvdr]T = [104sin(0.3t − π/4) + 104cos(0.2t + π/4) + 2 × 104, 103sin(0.2t − π/4)) + 103cos(0.3t − π/4) + 3 × 103, 105sin(0.2t + π/6) + 105cos(0.5t − π/4) (−3 × 105]T.

The initial condition is chosen as [x(0), y(0), φ(0), u(0), v(0), r(0)] =[20, 190,  − 0.02π,0,0,0]. The control laws design parameters are designed as ρ0 = 10, kρ = 0.4, ku = 6 × 103, kr = 3.18 × 106, Tu = 0.8, Tr = 0.3, γu = 10000, γr = 100, γzu = 20, γzr = 3000, ϑu = 0.00001, ϑr = 0.0001, ϕu = 10, ϕr = 1.

Figures 2A2F illustrate the simulation results for the MSV under the two control strategies. Fig. 2A clearly illustrates that the MSV can track the reference trajectory in the presence of unknown dynamics, time-varying disturbances and output constraint under two control methods. The result in Fig. 2B shows that MSV can accomplish faster and more precise tracking under τCL. The results of approximation of unknown information in Figs. 2C and 2D further support this conclusion. The estimates of 2-norms weights are more sensitive under as illustrated in Fig. 2E. The control inputs τu and τr are plotted in Fig. 2F.

Simulation results under τNN and τCL for case 1.

Figure 2: Simulation results under τNN and τCL for case 1.

(A) Reference and actual trajectories of the MSV. (B) Tracking position error and yaw angle error. (C) ∑u and its estimation. (D) ∑r and its estimation. (E) 2-norms ω ˆ u , ω ˆ r of parameter estimates ω ˆ u and ω ˆ r . (F) Control signals τu and τr.

Case 2: The MSV’s unknown dynamics are raised 1.2 × Δfn. The control law’s initial conditions and design parameters are the same as in Case 1, and the larger time-varying disturbances can be chosen as [dudvdr]T = [1.5 × 104sin(0.3t − π/4) + 1.5 × 104cos(0.2t + π/4) + 3 × 104, 1.5 × 103sin(0.2t − π/4) + 1.5 × 103cos(0.3t − π/4) + 3 × 103, 1.5 × 105sin(0.2t + π/6) + 1.5 × 105cos(0.5t − π/4) − 4.5 × 105]T.

Under two control systems, MSV can track a reference trajectory in the presence of unknown dynamics, time-varying disturbances and output constraint as shown in Fig. 3A. As demonstrated in Fig. 3B, MSV can obtain higher tracking performance under τCL. The proposed control scheme has better robustness performance. As shown in Figs. 3C3D, a similar result can be illustrated in case 1. The estimates of 2-norms weights are more sensitive under as illustrated in Fig. 3E. The control inputs are presented in Fig. 3F.

Simulation results under τNN and τCL for case 2.

Figure 3: Simulation results under τNN and τCL for case 2.

(A) Reference and actual trajectories of the MSV. (B) Tracking position error and yaw angle error. (C) ∑u and its estimation. (D) ∑r and its estimation. (E) 2-norms ω ˆ u , ω ˆ r of parameter estimates ω ˆ u and ω ˆ r . (F) Control signals τu and τr.

Case 3: The initial conditions and design parameters of the control law are the same as those in case 1. To further verify the superiority and effectiveness of the control scheme, another form of environmental disturbance are given as [dudvdr]T = d + h. where d is d = [104sin(0.3t − π/4) + 104cos(0.2t + π/4) + 2 × 104, 103sin(0.2t − π/4) + 103cos(0.3t − π/4) + 3 × 103, 105sin(0.2t + π/6) + 105cos(0.5t − π/4) − 3 × 105]T. h is selected by the first-order Markov process. h ̇ = −Λh + Γ℘, where ℘ ∈ R3 is the zero-mean Gaussian white noise.

The simulation results are depicted in Figs. 4A4F. Under two control systems, MSV can track a reference trajectory under unknown dynamics, time-varying disturbances and output constraint as shown in Fig. 4A. As demonstrated in Fig. 4B, MSV can achieve better tracking performance under τCL. As shown in Figs. 4C4D, a similar result can be verified. The estimates of 2-norms weights are more sensitive under as shown in Fig. 4E. The control inputs are presented in Fig. 4F.

Simulation results under τNN and τCL for case 3.

Figure 4: Simulation results under τNN and τCL for case 3.

(A) Reference and actual trajectories of the MSV. (B) Tracking position error and yaw angle error. (C) ∑u and its estimation. (D) ∑r and its estimation. (E) 2-norms ω ˆ u , ω ˆ r of parameter estimates ω ˆ u and ω ˆ r . (F) Control signals τu and τr.

Conclusions

In this paper, a composite learning trajectory tracking control scheme is proposed for underactuated MSVs in the presence of unknown dynamics, time-varying disturbances and output constraints. The underactuation problem of the MSVs is addressed by the LOS approach. The barrier Lyapunov function is introduced to deal with the problem of output constraint. The composite learning control scheme is utilized to approximate unknown dynamics. The prediction errors and the tracking errors are adopted to construct the NN weight updating. Using approximation information, the disturbance observers are designed to estimates unknown time-varying disturbances. The Lyapunov method is used to demonstrate the stability of a closed-loop system. The simulation results demonstrate the effectiveness and superiority of the proposed control scheme.

Furthermore, the finite-time control can be further considered. The control scheme in this paper can be easily combined with event-triggered control.

Supplemental Information

Raw data of the simulation results

DOI: 10.7717/peerj-cs.863/supp-1
  Visitors   Views   Downloads