Composite learning tracking control for underactuated marine surface vessels with output constraints

Huaran Yan; Yingjie Xiao; Honghang Zhang

doi:10.7717/peerj-cs.863

Composite learning tracking control for underactuated marine surface vessels with output constraints

Huaran Yan ¹, Yingjie Xiao¹, Honghang Zhang²

1Merchant Marine College, Shanghai Maritime University, Shanghai, China

2Maritime College, Zhejiang Ocean University, Zhoushan, China

DOI: 10.7717/peerj-cs.863

Published: 2022-02-03
Accepted: 2022-01-03
Received: 2021-11-05

Academic Editor: Qichun Zhang

Subject Areas: Adaptive and Self-Organizing Systems, Autonomous Systems
Keywords: Disturbance observer, Trajectory tracking, Line-of-sight, Output constraints, Composite learning

Copyright: © 2022 Yan et al.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.

Cite this article: Yan H, Xiao Y, Zhang H. 2022. Composite learning tracking control for underactuated marine surface vessels with output constraints. PeerJ Computer Science 8:e863 https://doi.org/10.7717/peerj-cs.863

The authors have chosen to make the review history of this article public.

Abstract

In this paper, a composite learning control scheme was proposed for underactuated marine surface vessels (MSVs) subject to unknown dynamics, time-varying external disturbances and output constraints. Based on the line-of-sight (LOS) approach, the underactuation problem of the MSVs was addressed. To deal with the problem of output constraint, the barrier Lyapunov function-based method was utilized to ensure that the output error will never violate the constraint. The composite neural networks (NNs) are employed to approximate unknown dynamics. The prediction errors can be obtained using the serial-parallel estimation model (SPEM). Both the prediction errors and the tracking errors were employed to construct the NN weight updating. Using approximation information, the disturbance observers were designed to estimate unknown time-varying disturbances. The stability analysis via the Lyapunov approach indicates that all signals of unmanned marine surface vessels are uniformly ultimate boundedness. The simulation results verify the effectiveness of the proposed control scheme.

Introduction

In recent years, with the development of the marine economy, marine transport vehicles have gained much attention (Shen et al., 2020; Yu, Guo & Yan, 2019). Marine surface vehicles (MSVs) have been widely used in marine exploration, marine transportation, marine survey and other fields (Liu et al., 2016; Shao et al., 2019). To accomplish these tasks, the trajectory tracking control of MSVs plays a significant role. Due to the influence of the external environment, the kinetics of MSVs inevitably have unknown dynamics and unknown time-varying environmental disturbances.

In view of this, a series of control approaches have been utilized for control of MSVs, including neural network (NN) control (Zhu et al., 2021; Li et al., 2015), fuzzy logic system (FLS) control (Peng, Wang & Wang, 2018; Wang, Sun & Er, 2018), disturbance observer-based (DOB) control (Guo & Zhang, 2020; Hu et al., 0000), and the finite-time control (Zhu, Ma & Hu, 2020; Wang, Pan & Su, 2019; Wang & Deng, 2020). In Zhu et al. (2021), Li et al. (2015), Peng, Wang & Wang (2018), Wang, Sun & Er (2018), NNs and FLSs are used to approximate the uncertain terms, such as unmodeled dynamics, unknown dynamics. In Guo & Zhang (2020), Hu et al. (0000), a DOB control approach was adopted to compensate compound uncertainty of parameter perturbations and unknown disturbances. In Do (2016) and Ghommam & Saad (2018), the dynamic uncertainties of MSVs were dealt with by parameter adaptive technique and a backstepping design tool.

To address the underactuation problem of MSVs, several control methods are introduced, such as additional control method (Do, 2010; Park, Kwon & Kim, 2017; Chen et al., 2020), output redefinition control (Shojaei & Arefi, 2015; Shojaei, 2017), line-of-sight (LOS) (Shojaei, 2015; Gao et al., 2016; Jia, Hu & Zhang, 2019; Liu, 2019), etc. Three additional control terms were adopted to address the underactuation problem of MSV in Do (2010), Park, Kwon & Kim (2017), Chen et al. (2020). To achieve the design of trajectory tracking control laws, the output redefinition control approach in Shojaei & Arefi (2015) and Shojaei (2017) was introduced to handle the underactuation problem, the combination of adaptive technique, NNs and saturation function to solve the unknown disturbances, unknown dynamic and input saturation, respectively. In Shojaei (2015), Gao et al. (2016), Jia, Hu & Zhang (2019) and Liu (2019), the LOS method was utilized to solve the underactuation problem of MSVs, the combination of parameter adaptive technology and NN approximation are used to successfully solve the time-varying external disturbance and parameter uncertainty.

For the sake of navigation safety, the output constraint problem is inevitably in practice. In practice, the navigable water areas are restricted, and then surface vessels should navigate in the navigable water areas. When the position error is too large, it may lead to collision accident of MSVs. When the yaw angle errors become excessive, the actuator will be damaged due to overload. Therefore, it is necessary to further study the MSVs trajectory tracking system with output constraints. Several methods have been presented to solve the output constraint problem, such as moving-horizon optimal control (Mayne & Michalska, 1990), artificial potential field (Sun & Ge, 2014), barrier Lyapunov function (BLF) (Tee et al., 2011) and output error transformation method (Zheng et al., 2020; Zhu, Du & Kao, 2020). In Zheng et al. (2020) and Zhu, Du & Kao (2020), the output constraint problem is transformed into a tracking error constraint problem by using the coordinate transformation. Coordinate transformation ensures that the tracking error always stays within predefined boundaries. Duo to the structure of Lyapunov function can be constructed by a barrier function, the BLF-based approach can solve the problem of trajectory tracking control for MSVs under the output constraint (Zhu, Du & Kao, 2020; Zhao, He & Ge, 2014). In simultaneous consideration of unknown dynamics and time-varying disturbances, Zhu, Du & Kao (2020) use a log-BLF method to solve the constant symmetric output constraint, Zhao, He & Ge (2014) utilize the asymmetric BLF method to deal with the asymmetric output constraints.

All the literature mentioned before have concentrated on the tracking and stability of the system. Most literature have not mentioned the precision accuracy of identifying models. In practice, the model uncertainty should be approximated as precisely as possible. In generally, the unknown dynamics of the system can be compensated by using adaptive control technique. In order to achieve better control performance, composite adaptive control scheme is developed in Patre & Bhasin (2010). It makes the system realize faster parameter convergence as well as smaller tracking error, and has been applied in various fields (Sun, Pan & Yang, 2017; Pan, Sun & Yu, 2016). By approximating the unknown dynamic items faster and more accurately to obtain better control performance, the prediction errors can be constructed by the serial-parallel estimation model (SPEM) (Peng, Wang & Wang, 2017). Then, the updating law of the neural network is designed by using the prediction error, which improves the transient performance effectively. To update the laws and optimize the system’s transient performance, Yucelen & Haddad (2013) presented an adaptive control modification. An error feedback term was included in the reference model in Pan, Sun & Yu (2016) and Stepanyan & Krishnakumar (2010) to improve the transient performance of the model. In Xu & Sun (2018), both the prediction errors and the tracking errors were applied to construct the updating law of NNs weights. The index of learning performance is introduced in the update rate, some literature focus on constructing composite learning laws by introducing auxiliary filter (Na et al., 2015; Huang et al., 2018) or using time interval data (Xu et al., 2019; Xu et al., 2018).

In this paper, we propose a composite learning control strategy for underactuated MSVs subject to unknown dynamics, ocean environmental disturbances, and output constraints based on the discussion above. The main contributions can be summarized as follows.

Position error and yaw angle error constraints are addressed by employing the BLF-based method. The dynamic surface control approach is used to decrease the computation of the explosion problem that exists in the backstepping method.
The composite NNs are employed to approximate the unknown dynamics of MSVs. Different from the traditional NN in which only the tracking errors are used to update the NN weights, both the tracking errors and prediction errors are used to update the NN weights. Therefore, the unknown dynamics can be approximated faster and more accurately.
Using the approximation to the unknown dynamics of MSVs, the NDOs are constructed to estimate time-varying disturbances. By combining the dynamic surface control technique with disturbance observers and composite NNs, a trajectory tracking control system is developed. Compared with the control scheme based on neural networks, the proposed control scheme can effectively improve the transient and steady-state performance of MSVs trajectory tracking control.

The rest of this paper is arranged as follows. In Section 2, the mathematical model of MSVs and the problem formulation are introduced. In Section 3, the principle of intelligent approximation using NN is presented. In Section 4, proposes the details of controller design procedures. In Section 5, the simulation results are given to show the effectiveness of the controller. In Section 6, the entire work is summarized.

Problem formulation and preliminaries

MSV kinematic and dynamic models

The mathematical model of underactuated MSVs with 3 degrees of freedom can be described as (1a) $\dot{x} = u cos φ - v sin φ$ (1b) $\dot{y} = u sin φ + v cos φ$ (1c) $\dot{φ} = r$ (2a) $\dot{u} = \frac{1}{m_{11}} (m_{22} v r - d_{11} u + τ_{u} + Δ f_{u} + d_{u})$ (2b) $\dot{v} = \frac{1}{m_{22}} (- m_{11} u r - d_{22} v + Δ f_{v} + d_{v})$ (2c) $\dot{r} = \frac{1}{m_{33}} [(m_{11} - m_{22}) u v - d_{33} r + τ_{r} + Δ f_{r} + d_{r}]$ where [x, y, φ]^T denotes the position and heading angle in the inertial reference frame. [u, v, r]^T denotes surge, sway and angular velocity in the body-fixed frame. The m_ii, i = 1, 2, 3 represent the inertia including added mass. The d_ii, i = 1, 2, 3 stand for the hydrodynamic damping in surge, sway and yaw. The d_j, j = u, v, r denote unknown environmental disturbances. Δf_u, Δf_v and Δf_r represent unknown dynamics of the MSVs. τ_u and τ_r are the control force and moment in the surge and yaw directions.

Assumption 1: The environmental disturbances d_j are unknown bounded and there exists $|{\dot{d}}_{j}| \leq {\bar{d}}_{j}$ , j = u, v, r, ${\bar{d}}_{j}$ are unknown positive constants.

Remark 1: The ocean disturbances include slowly changing disturbances caused by second-order waves, currents, winds and unknown dynamics, as well as norm-bound disturbances caused by ocean uncertainties. The energy in the marine environment is finite. The rate of change of ocean disturbance is unknown bounded.

Remark 2: Since these parameters of MSVs are affected by operational conditions and marine environment. These factors change frequently, which makes these parameters of MSVs are uncertainties. where m_ii and d_ii, i = 1, 2, 3 represent nominal values of the inertia including added mass and the hydrodynamic damping, respectively. Where Δf_j, j = u, v, r represent unknown dynamics includes uncertain parts of the model parameters.

Assumption 2: The desired smooth reference signal x_d, y_d and its first two time derivatives are bounded.

The position errors and orientation tracking errors will be defined in the body-fixed frame (3a) $x_{e} = (x - x_{d}) cos φ + (y - y_{d}) sin φ$ (3b) $y_{e} = - (x - x_{d}) sin φ + (y - y_{d}) cos φ$

The time derivative of Eqs. (3a) and (3b) can be expressed as (4a) ${\dot{x}}_{e} = u + r y_{e} - {\dot{x}}_{d} cos φ - {\dot{y}}_{d} sin φ$ (4b) ${\dot{y}}_{e} = v - r x_{e} + {\dot{x}}_{d} sin φ - {\dot{y}}_{d} cos φ$

In engineering practice, the MSV position, heading, velocities in surge and sway, and yaw rate can be measured by the global positioning system, the gyro compass, the Doppler log, and the rate gyro, respectively. Then, we define the tracking position error ρ_s and yaw angle error θ as (5a) $ρ_{s} = ρ_{e} - ρ_{0} = \sqrt{x_{e}^{2} + y_{e}^{2}} - ρ_{0}$ (5b) $θ = arctan 2 (y_{e}, x_{e})$ By combining Eqs. (3a)–(3b) and Eqs. (5a)–(5b) we can get (6a) $x_{e} = ρ_{e} cos θ$ (6b) $y_{e} = ρ_{e} sin θ$

To avoid the possible singularity of the virtual control law, a positive constant ρ₀ is introduced. Considering Assumption 1 and Assumption 2, the control objective is to construct the composite intelligent learning control law τ_u and τ_r for MSVs to make sure the ρ_s and θ can converge to arbitrarily small errors under unknown dynamics, time-varying disturbances and output constraints.

Radial basis function neural network (RBFNN) approximation

In this paper, the RBF NNs are employed for approximation. For an arbitrary continuous function f(ς) over a compact set Ω(ς) → Rⁿ, there exists an RBF NN with the following form: (7a) $f (ς) = ω^{T} ψ (ς) + ξ_{w}, \forall ς \in Ω (ς)$ (7b) $ψ (ς) = e x p (- {(ς - c_{j})}^{T} (ς - c_{j}) / b_{j l}^{2}), j = 1, 2, \dots, l$ where f(ς) ∈ R^p denotes the output vector of the RBF NN, ς ∈ R^q denotes the input vector of the RBF NN. ψ(ς) is Gaussian basis function. c_j is the center of the basis function and b_j is the width of the Gaussian function. ξ_w is the approximation error that satisfies $|ξ_{w}| \leq \bar{ξ}$ , $\bar{ξ}$ is an unknown positive constant.

According to Eq.(43), ω is the ideal weight parameter that satisfies $ω = arg {min}_{ω \in R^{ℓ}} \{{sup}_{ς \in Ω (ς)} |f (ς) - ω^{T} ψ (ς)|\}$ represent NN weights parameter. However, it is very difficult to determine the ideal weight parameter. $\hat{ω}$ is the estimate of the NN weights parameter. However, it is very difficult to determine the ideal weight parameter. The estimate of the NN weights parameter is usually used to approximate the unknown nonlinear term such as $\hat{f} = {\hat{ω}}^{T} ψ$ in practice.

Control Law Design

In this section, we can design the control law for the MSVs under Assumption 1–2. The block diagram of the trajectory tracking control system of MSVs is presented in Fig. 1. Combing Eqs. (5a) and (5b) with Eqs. (6a) and (6b), the time derivative of ρ_s can be written as (8) ${\dot{ρ}}_{s} = u cos θ + v sin θ + cos θ ζ_{1} + sin θ ζ_{2}$ where ζ₁ and ζ₂ are defined as follows (9a) $ζ_{1} = - {\dot{x}}_{d} cos φ - {\dot{y}}_{d} sin φ$ (9b) $ζ_{2} = {\dot{x}}_{d} sin φ - {\dot{y}}_{d} cos φ$

Figure 1: Schematic of the MSV closed-loop tracking control.

Download full-size image

DOI: 10.7717/peerjcs.863/fig-1

When MSV pass through a narrow passage, it is necessary to limit the position error ρ_s to prevent vehicle collisions. The BLF can be selected as the following form (10) $V_{1} = \frac{1}{2} log \frac{k_{a}^{2}}{k_{a}^{2} - ρ_{s}^{2}}$

where log(∗) is the natural logarithm of (∗), k_a is the constraint of ρ_s, there exist $|ρ_{s}| < k_{a}$ .

Taking time derivative of Eq. (10) , it can be further written as ${\dot{V}}_{1} = \frac{ρ_{s} {\dot{ρ}}_{s}}{k_{a}^{2} - ρ_{s}^{2}}$ (11) $= \frac{ρ_{s}}{k_{a}^{2} - ρ_{s}^{2}} (u cos θ + v sin θ + cos θ ζ_{1} + sin θ ζ_{2})$ The virtual control law can be designed as (12) $α_{u} = sec θ (- k_{ρ} ρ_{s} - v sin θ - cos θ ζ_{1} - sin θ ζ_{2})$ where k_ρ is a positive constant.

In the surge direction, Let α_u pass through a first-order filter with a time constant T_u > 0 to get a new state variable β_u. (13) $T_{u} {\dot{β}}_{u} + β_{u} = α_{u}, β_{u} (0) = α_{u} (0)$

Then, the filter error and velocity error can be defined as λ_u and u_e, respectively. So, it can be expressed as (14) $λ_{u} = β_{u} - α_{u}, u_{e} = u - β_{u}$

The time derivative of λ_u can be calculated as ${\dot{λ}}_{u} = - \frac{λ_{u}}{T_{u}} - {\dot{α}}_{u}$ (15) $= - \frac{λ_{u}}{T_{u}} + B_{u}$ where B_u is a continuous function and has a maximum value H_u.

Then, V₂ can be further chosen as (16) $V_{2} = \frac{1}{2} log \frac{k_{a}^{2}}{k_{a}^{2} - ρ_{s}^{2}} + \frac{1}{2} m_{11} u_{e}^{2}$

The time derivative of Eq. (16) can be written as ${\dot{V}}_{2} = \frac{ρ_{s} {\dot{ρ}}_{s}}{k_{a}^{2} - ρ_{s}^{2}} + m_{11} u_{e} {\dot{u}}_{e}$ (17) $= \frac{ρ_{s}}{k_{a}^{2} - ρ_{s}^{2}} (- u_{e} cos θ - λ_{u} cos θ - k_{ρ} ρ_{s}) + m_{11} u_{e} {\dot{u}}_{e}$

According to Eqs. (2a) and (12), we can obtain the time derivative of as (18) $m_{11} {\dot{u}}_{e} = m_{22} v r - d_{11} u + τ_{u} + Δ f_{u} + d_{u} - m_{11} {\dot{β}}_{u}$

The unknown term can be approximate using NN. We have m₂₂vr − d₁₁u + Δf_u = ω_u^Tψ_u + ξ_u. Here, let D_u = ξ_u + d_u. The ξ_u is the approximation error that satisfies the time derivative of ξ_u is bound. With Assumption 1, we can get (19) $|D_{u}| \leq χ_{u 0}, |{\dot{D}}_{u}| \leq χ_{u}$ where χ_u0 and χ_u are unknown positive constants.

Therefore, the time derivative of V₂ can be further written as ${\dot{V}}_{2} = \frac{ρ_{s} {\dot{ρ}}_{s}}{k_{a}^{2} - ρ_{s}^{2}} + m_{11} u_{e} {\dot{u}}_{e}$ (20) $= \frac{ρ_{s}}{k_{a}^{2} - ρ_{s}^{2}} (u_{e} cos θ + λ_{u} cos θ - k_{ρ} ρ_{s}) + u_{e} ({ω_{u}}^{T} ψ_{u} + D_{u} + τ_{u} - m_{11} {\dot{β}}_{u})$

Then, we can design the control law as (21) $τ_{u} = - {\hat{ω}}_{u}^{T} ψ_{u} - {\hat{D}}_{u} + m_{11} {\dot{β}}_{u} - k_{u} u_{e} - \frac{ρ_{s} cos θ}{(k_{a}^{2} - ρ_{s}^{2})}$ where k_u is a positive constant. ${\hat{ω}}_{u}$ is the estimation of the $ω_{u} . {\hat{D}}_{u}$ is the estimation of the D_u. (22) ${\tilde{ω}}_{u} = ω_{u} - {\hat{ω}}_{u}, {\tilde{D}}_{u} = D_{u} - {\hat{D}}_{u}$

From Eq. (21) along Eq. (20), we can get (23) ${\dot{V}}_{2} = \frac{ρ_{s} λ_{u} cos θ - k_{ρ} {ρ_{s}}^{2}}{k_{a}^{2} - ρ_{s}^{2}} + u_{e} {\tilde{ω}}_{u}^{T} ψ_{u} + u_{e} {\tilde{D}}_{u} - k_{u} {u_{e}}^{2}$

Then, we can define z_u as prediction error (24) $z_{u} = u - \hat{u}$

$\hat{u}$ can be defined with SPEM (25) $\dot{\hat{u}} = \frac{1}{m_{11}} (τ_{u} + {\hat{ω}}_{u}^{T} ψ_{u} + {\hat{D}}_{u} + ϕ_{u} z_{u})$ where $\hat{u} (0) = u (0)$ , ϕ_u is a positive constant.

The prediction error is employed to construct the weight updating (26) ${\dot{\hat{ω}}}_{u} = γ_{u} [(u_{e} + γ_{z u} z_{u}) ψ_{u} - ϑ_{u} {\hat{ω}}_{u}]$ where γ_u , γ_zu and ϑ_u are the positive constants to be designed.

The approximation information is employed to construct the NDO in the following form (27a) ${\hat{D}}_{u} = m_{11} u - σ_{u}$ (27b) ${\dot{σ}}_{u} = {\hat{ω}}_{u}^{T} ψ_{u} + {\hat{D}}_{u} + τ_{u} - (u_{e} + γ_{z u} z_{u})$

According to Eqs. (2a), (27a) and (27b), the derivative of ${\hat{D}}_{u}$ can be expressed as (28) ${\dot{\hat{D}}}_{u} = {\tilde{ω}}_{u}^{T} ψ_{u} + {\tilde{D}}_{u} + u_{e} + γ_{z u} z_{u}$

Then, the ${\dot{\tilde{D}}}_{u}$ can be calculated (29) ${\dot{\tilde{D}}}_{u} = {\dot{D}}_{u} - {\tilde{ω}}_{u}^{T} ψ_{u} - {\tilde{D}}_{u} - u_{e} - γ_{z u} z_{u}$

Combining Eqs. (5a)–(5b) with Eqs. (6a)–(6b), the time derivative of θ can be written as (30) $\dot{θ} = - r + \frac{1}{ρ_{e}} (- u sin θ + v cos θ - sin θ ζ_{1} + cos θ ζ_{2})$

It is also necessary to restrict θ in practice, there exist $|θ| < k_{b}$ . Similar to the above, we select the following BLF candidates as (31) $V_{3} = \frac{1}{2} log \frac{k_{b}^{2}}{k_{b}^{2} - θ^{2}}$

Taking time derivative of Eq. (31), it can be further written as (32) ${\dot{V}}_{3} = \frac{θ}{k_{b}^{2} - θ^{2}} (- r + \frac{1}{ρ_{e}} (- u sin θ + v cos θ - sin θ ζ_{1} + cos θ ζ_{2}))$

According to Eq. (32), we can get virtual control law α_r for the yaw direction (33) $α_{r} = k_{θ} θ + \frac{1}{ρ_{e}} (- u sin θ + v cos θ - sin θ ζ_{1} + cos θ ζ_{2})$ where k_θ is a positive constant.

Remark 3: From Eq. (33), it can be seen α_r is undefined when ρ_e = 0. The positive constant ρ₀ is designed to make ρ_e − ρ₀ can converge to the neighbor of zero. It means that ρ_e can converge to the neighbor of ρ_e. Therefore, the singularity of α_r can be avoided.

Let α_r pass through a first-order filter with a time constant T_r > 0 to get a new state variable β_r. (34) $T_{r} {\dot{β}}_{r} + β_{r} = α_{r}, β_{r} (0) = α_{r} (0)$

Then, the filter error and velocity error can be defined as λ_r and r_e, respectively. So, it can be expressed as (35) $λ_{r} = β_{r} - α_{r}, r_{e} = r - β_{r}$

The time derivative of λ_r can be calculated as ${\dot{λ}}_{r} = - \frac{λ_{r}}{T_{r}} - {\dot{α}}_{r}$ (36) $= - \frac{λ_{r}}{T_{r}} + B_{r}$ where B_r is a continuous function and has a maximum value H_r.

Then, V₄ can be further chosen as (37) $V_{4} = \frac{1}{2} log \frac{k_{b}^{2}}{k_{b}^{2} - φ_{e}^{2}} + \frac{1}{2} m_{33} r_{e}^{2}$

The time derivative of Eq. (37) can be written as ${\dot{V}}_{4} = \frac{θ \dot{θ}}{k_{b}^{2} - θ^{2}} + m_{33} r_{e} {\dot{r}}_{e}$ (38) $= \frac{θ}{k_{b}^{2} - θ^{2}} (- r_{e} - λ_{r} - k_{θ} θ) + m_{33} r_{e} {\dot{r}}_{e}$

According to Eqs. (2c) and (35), we can obtain the derivative of r_e as (39) $m_{33} {\dot{r}}_{e} = (m_{11} - m_{22}) u v - d_{33} r + τ_{r} + Δ f_{r} + d_{r} - m_{33} {\dot{β}}_{r}$

The unknown term can be approximate using NN. We have (m₁₁ − m₂₂)uv − d₃₃r + Δf_r = ω_r^Tψ_r + ξ_r . we can define D_r = ξ_r + d_r, The ξ_r is the approximation error that satisfies the time derivative of ξ_r is bound. With Assumption 1, we can get (40) $|D_{r}| \leq χ_{r 0}, |{\dot{D}}_{r}| \leq χ_{u}$ where χ_r0 and χ_r are unknown positive constants.

Then, the time derivative of V₄ can be further written as (41) ${\dot{V}}_{4} = \frac{θ}{k_{b}^{2} - θ^{2}} (- r_{e} - λ_{r} - k_{θ} θ) + r_{e} ({ω_{r}}^{T} ψ_{r} + D_{r} + τ_{r} - m_{33} {\dot{β}}_{r})$

Then, we can get (42) $τ_{r} = - {\hat{ω}}_{r}^{T} φ_{r} - {\hat{D}}_{r} + m_{33} {\dot{β}}_{r} - k_{r} r_{e} + \frac{θ}{k_{b}^{2} - θ^{2}}$ where k_r is a positive constant. ${\hat{ω}}_{r}$ is the estimation of the $ω_{r} . {\hat{D}}_{r}$ is the estimation of the D_r. (43) ${\tilde{ω}}_{r} = ω_{r} - {\hat{ω}}_{r}, {\tilde{D}}_{r} = D_{r} - {\hat{D}}_{r}$

From Eqs. (41) along (40), we can get (44) ${\dot{V}}_{4} = \frac{θ}{k_{b}^{2} - θ^{2}} (- λ_{r} - k_{θ} θ) + r_{e} {\tilde{ω}}_{r}^{T} ψ_{r} + r_{e} {\tilde{D}}_{r} - k_{r} {r_{e}}^{2}$

Then, we can define z_r as prediction error (45) $z_{r} = r - \hat{r}$

$\hat{r}$ can be defined with SPEM (46) $\dot{\hat{r}} = \frac{1}{m_{33}} (τ_{r} + {\hat{ω}}_{r}^{T} ψ_{r} + {\hat{D}}_{r} + ϕ_{r} z_{r})$ where $\hat{r} (0) = r (0)$ , ϕ_r is a positive constant.

The prediction error is employed to construct the weight updating (47) ${\dot{\hat{ω}}}_{r} = γ_{r} [(r_{e} + γ_{z r} z_{r}) ψ_{r} - ϑ_{r} {\hat{ω}}_{r}]$ where γ_r , γ_zr and ϑ_r are the positive constants to be designed.

The approximation information is employed to construct the NDO in the following form (48a) ${\hat{D}}_{r} = m_{33} r - σ_{r}$ (48b) ${\dot{σ}}_{r} = {\hat{ω}}_{r}^{T} ψ_{r} + {\hat{D}}_{r} + τ_{r} - (r_{e} + γ_{z r} z_{r})$

According to Eqs. (2a), (48a) and (48b), the derivative of ${\hat{D}}_{r}$ can be expressed as (49) ${\dot{\hat{D}}}_{r} = {\tilde{ω}}_{r}^{T} ψ_{r} + {\tilde{D}}_{r} + r_{e} + γ_{z r} z_{r}$

Then, the ${\dot{\tilde{D}}}_{r}$ can be calculated (50) ${\dot{\tilde{D}}}_{r} = {\dot{D}}_{r} - {\tilde{ω}}_{r}^{T} ψ_{r} - {\tilde{D}}_{r} - r_{e} - γ_{z r} z_{r}$

Remark 4: From Eqs. (26) and (47), it can easily obtain the weight updating of composite NN is designed by employing tracking error and prediction error. The prediction error can provide extra information for learning NN weight updating. Thus, better tracking performance can be achieved.

Remark 5: In Eqs. (26) and (47), γ_u and γ_r are positive constants used to optimize the learning rate. The ${\hat{ω}}_{u}$ and ${\hat{ω}}_{r}$ mainly tuned by the prediction errors if and are chosen larger, while if γ_zu and γ_zr are chosen smaller, the ${\hat{ω}}_{u}$ and ${\hat{ω}}_{r}$ mainly tuned by the tracking errors.

The compound unknown terms consist of unknown dynamics and time-varying disturbances are expressed as ∑_u and ∑_r. (51a) $m_{22} v r - d_{11} u + Δ f_{u} + d_{u} = Σ_{u}$ (51b) $(m_{11} - m_{22}) u v - d_{33} r + Δ f_{r} + τ_{w r} = Σ_{r}$

Remark 6: The disturbance observer and neural network contain each other’s information. If compound unknown terms can be perfect follow by ${\hat{ω}}_{u}^{T} ψ_{u} + {\hat{D}}_{u}$ and ${\hat{ω}}_{r}^{T} ψ_{r} + {\hat{D}}_{r}$ , the system’s estimation of unknown information can be more accurate. As a result, the objective of composite learning combining NN and NDO is accomplished.

Remark 7: Through trial and error, we first choose the appropriate design parameters k_ρ, k_θ, k_u, and k_r to ensure that the system is stable. Furthermore, we properly regulate the other design parameters γ_u, γ_zu, γ_r, γ_zr, ϑ_u, ϑ_r, ϕ_u and ϕ_r to get the satisfactory control performance. A large number of simulations in many cases show that the larger k_ρ, k_θ,, k_u, k_r , γ_zu, γ_zr, ϕ_u and ϕ_r are, the MSVs can obtain higher tracking accuracy.

Theorem 1: Considering the closed-loop system Eqs. (1a)–(1c) and Eqs.(2a)–(2c) with unknown dynamics, time-varying disturbances and output constraint under Assumption 1–Assumption 2, if virtual control law Eqs. (12), (33), control law Eqs. (21), (42), the NN updating laws Eqs. (26), (47) and NDOs Eqs. (27a)–(27b), Eqs. (48a)–(48b) are designed. It is guaranteed that all signals include in Eq. (52) are uniformly ultimately bounded (UUB).

Proof: Consider the following Lyapunov function (52) $V = V_{2} + V_{4} + \frac{1}{2} (\frac{1}{γ_{u}} {\tilde{ω}}_{u}^{T} {\tilde{ω}}_{u} + {\tilde{D}}_{u}^{2} + {λ_{u}}^{2} + m_{11} γ_{z u} z_{u}^{2} + \frac{1}{γ_{r}} {\tilde{ω}}_{r}^{T} {\tilde{ω}}_{r} + {\tilde{D}}_{r}^{2} + {λ_{r}}^{2} + m_{33} γ_{z r} z_{r}^{2})$

The time derivative of Eq. (52) can be calculated as (53) $\dot{V} = {\dot{V}}_{2} + {\dot{V}}_{4} + \frac{1}{γ_{u}} {\tilde{ω}}_{u}^{T} (- {\dot{\hat{ω}}}_{u}) + {\tilde{D}}_{u} (- {\dot{\hat{D}}}_{u}) + m_{11} γ_{z u} z_{u} {\dot{z}}_{u} + λ_{u} {\dot{λ}}_{u} + λ_{r} {\dot{λ}}_{r} + \frac{1}{γ_{r}} {\tilde{ω}}_{r}^{T} (- {\dot{\hat{ω}}}_{r}) + {\tilde{D}}_{r} (- {\dot{\hat{D}}}_{r}) + m_{33} γ_{z r} z_{r} {\dot{z}}_{r}$

In the view of Eqs. (15), (36) and Young’s inequality, we can get (54) $λ_{u} {\dot{λ}}_{u} \leq - \frac{{λ_{u}}^{2}}{T_{u}} + \frac{1}{2 ι} {λ_{u}}^{2} + 2 ι {H_{u}}^{2}$ (55) $λ_{r} {\dot{λ}}_{r} \leq - \frac{{λ_{r}}^{2}}{T_{r}} + \frac{1}{2 ι} {λ_{r}}^{2} + 2 ι {H_{r}}^{2}$

Using Eqs. (26) and (47), we have (56) $\frac{1}{γ_{u}} {\tilde{ω}}_{u}^{T} (- {\dot{\hat{ω}}}_{u}) = - {\tilde{ω}}_{u}^{T} [(u_{e} + γ_{z u} z_{u}) ψ_{u} - ϑ_{u} {\hat{ω}}_{u}]$ (57) $\frac{1}{γ_{r}} {\tilde{ω}}_{r}^{T} (- {\dot{\hat{ω}}}_{r}) = - {\tilde{ω}}_{r}^{T} [(r_{e} + γ_{z r} z_{r}) ψ_{r} - ϑ_{r} {\hat{ω}}_{r}]$

From Eqs. (29) and (50), we have (58) ${\tilde{D}}_{u} {\dot{\tilde{D}}}_{u} = {\tilde{D}}_{u} ({\dot{D}}_{u} - {\tilde{ω}}_{u}^{T} ψ_{u} - {\tilde{D}}_{u} - u_{e} - γ_{z u} z_{u})$ (59) ${\tilde{D}}_{r} {\dot{\tilde{D}}}_{r} = {\tilde{D}}_{r} ({\dot{D}}_{r} - {\tilde{ω}}_{r}^{T} ψ_{r} - {\tilde{D}}_{r} - r_{e} - γ_{z r} z_{r})$

Combining Eqs. (2a)–(2c), (24), (25), (45) with Eq. (46), we can get (60) $m_{11} γ_{z u} z_{u} {\dot{z}}_{u} = γ_{z u} z_{u} ({\tilde{ω}}_{u}^{T} ψ_{u} + {\tilde{D}}_{u} - ϕ_{u} z_{u})$ (61) $m_{33} γ_{z r} z_{r} {\dot{z}}_{r} = γ_{z r} z_{r} ({\tilde{ω}}_{r}^{T} ψ_{r} + {\tilde{D}}_{r} - ϕ_{r} z_{r})$

Combining Eqs. (23), (44), (52)–(58) and Young’s inequality, Eq. (53) can be expressed as (62) $\dot{V} \leq - (k_{ρ} - \frac{A}{2}) \frac{{ρ_{s}}^{2}}{k_{a}^{2} - ρ_{s}^{2}} - k_{u} {u_{e}}^{2} - γ_{z u} ϕ_{u} {z_{u}}^{2} - {\tilde{D}}_{u}^{2} - (\frac{1}{T_{u}} - \frac{1}{2 A (k_{a}^{2} - ρ_{s}^{2})} - \frac{1}{2 ι}) {λ_{u}}^{2} + 2 ι {H_{u}}^{2} + {\tilde{ω}}_{u}^{T} ϑ_{u} {\hat{ω}}_{u} + {\tilde{D}}_{u} {\dot{D}}_{u} - {\tilde{D}}_{u} {\tilde{ω}}_{u}^{T} ψ_{u} - (k_{θ} - \frac{1}{2}) \frac{θ^{2}}{k_{b}^{2} - θ^{2}} - k_{r} {r_{e}}^{2} - {\tilde{D}}_{r}^{2} - (\frac{1}{T_{r}} - \frac{1}{2 (k_{b}^{2} - θ^{2})}) {λ_{r}}^{2} - γ_{z r} ϕ_{r} {z_{r}}^{2} + \frac{1}{2 ι} {λ_{r}}^{2} + 2 ι {H_{r}}^{2} + {\tilde{ω}}_{r}^{T} ϑ_{r} {\hat{ω}}_{r} + {\tilde{D}}_{r} {\dot{D}}_{r} - {\tilde{D}}_{r} {\tilde{ω}}_{r}^{T} ψ_{r}$

According to Young’s inequality, we can obtain (63) $- {\tilde{D}}_{g} {\tilde{ω}}_{g}^{T} ψ_{g} \leq \frac{1}{2} ζ_{g} {\tilde{D}}_{g}^{2} ϖ_{g}^{2} + \frac{1}{2 ζ_{g}} {\tilde{ω}}_{g}^{T} {\tilde{ω}}_{g}$ (64) ${\tilde{D}}_{g} {\dot{D}}_{g} \leq \frac{1}{2} {\tilde{D}}_{g}^{2} + \frac{1}{2} χ_{g}^{2}$ (65) ${\tilde{ω}}_{g}^{T} {\hat{ω}}_{g} \leq - \frac{1}{2} {\tilde{ω}}_{g}^{T} {\tilde{ω}}_{g} + \frac{1}{2} ∥ {ω_{g}}^{*} ∥^{2}$ where ζ_g is positive user-defined parameter, ∥ψ_g ∥ ≤ ϖ_g, $|{\dot{D}}_{g}| \leq χ_{g}$ , g = u, r.χ_g and ∥ω_g^∗ ∥ are positive constants.

From Eqs. (63)–(65), Eq. (62) can be expressed as

$\dot{V} \leq - (k_{ρ} - \frac{A}{2}) \frac{{ρ_{s}}^{2}}{k_{a}^{2} - ρ_{s}^{2}} - k_{u} {u_{e}}^{2} - (\frac{1}{2} ϑ_{u} - \frac{1}{2 μ_{u}}) {\tilde{ω}}_{u}^{T} ω_{u} - (\frac{1}{T_{u}} - \frac{1}{2 A (k_{a}^{2} - ρ_{s}^{2})} - \frac{1}{2 ι}) {λ_{u}}^{2} - (\frac{1}{2} - \frac{1}{2} μ_{u} ϖ_{u}^{2}) {\tilde{D}}_{u}^{2} - γ_{z u} ϕ_{u} {z_{u}}^{2} - (k_{θ} - \frac{1}{2}) \frac{θ^{2}}{k_{b}^{2} - θ^{2}} - k_{r} {r_{e}}^{2} - γ_{z r} ϕ_{r} {z_{r}}^{2} - (\frac{1}{T_{r}} - \frac{1}{2 (k_{b}^{2} - θ^{2})} - \frac{1}{2 ι}) {λ_{r}}^{2} - (\frac{1}{2} ϑ_{r} - \frac{1}{2 μ_{r}}) {\tilde{ω}}_{r}^{T} ω_{r} - (\frac{1}{2} - \frac{1}{2} μ_{r} ϖ_{r}^{2}) {\tilde{D}}_{r}^{2} + 2 ι {H_{u}}^{2} + \frac{1}{2} ϑ_{u} ∥ ω_{u} ∥^{2} + \frac{1}{2} χ_{u}^{2} + 2 ι {H_{r}}^{2} + \frac{1}{2} ϑ_{r} ∥ ω_{r} ∥^{2} + \frac{1}{2} χ_{r}^{2}$ (66) $\leq - 2 a V + b$ where $a = min ((k_{ρ} - \frac{A}{2}), k_{u}, (\frac{1}{T_{u}} - \frac{1}{2 A (k_{a}^{2} - ρ_{s}^{2})} - \frac{1}{2 ι}), γ_{z u} ϕ_{u}, (\frac{1}{2} ϑ_{u} - \frac{1}{2 μ_{u}}), (\frac{1}{2} - \frac{1}{2} μ_{u} ϖ_{u}^{2}), (k_{θ} - \frac{1}{2}), k_{r}, (\frac{1}{T_{r}} - \frac{1}{2 (k_{b}^{2} - θ^{2})} - \frac{1}{2 ι}), (\frac{1}{2} ϑ_{r} - \frac{1}{2 μ_{r}}), (\frac{1}{2} - \frac{1}{2} μ_{r} ϖ_{r}^{2}), γ_{z r} ϕ_{r})$ , $b = 2 ι {H_{u}}^{2} + \frac{1}{2} ϑ_{u} ∥ ω_{u} ∥^{2}$ $+ \frac{1}{2} χ_{u}^{2} + 2 ι {H_{r}}^{2} + \frac{1}{2} ϑ_{r} ∥ ω_{r} ∥^{2} + \frac{1}{2} χ_{r}^{2} .$

By choosing the appropriate design parameters to make $k_{ρ} > \frac{A}{2}, k_{u} > 0, (\frac{1}{T_{u}} - \frac{1}{2 A (k_{a}^{2} - ρ_{s}^{2})} - \frac{1}{2 ι}) > 0, γ_{z u} ϕ_{u} > 0, (\frac{1}{2} ϑ_{u} - \frac{1}{2 μ_{u}}) > 0, (\frac{1}{2} - \frac{1}{2} μ_{u} ϖ_{u}^{2}) > 0, k_{θ} > \frac{1}{2}, k_{r} > 0, (\frac{1}{T_{r}} - \frac{1}{2 (k_{b}^{2} - θ^{2})} - \frac{1}{2 ι}) > 0, (\frac{1}{2} ϑ_{r} - \frac{1}{2 μ_{r}}) > 0, (\frac{1}{2} - \frac{1}{2} μ_{r} ϖ_{r}^{2}) > 0, γ_{z r} ϕ_{r} > 0 .$

By solving Eq. (66), we have (67) $0 \leq V \leq \frac{b}{2 a} + [V (0) - \frac{b}{2 a}] e^{- 2 a t}$

From Eq. (67), we can obtain that $V \to \frac{b}{2 a}$ as t → ∞. All signals in the Lyapunov function Eq. (52) are UUB. This concludes the proof.

Simulation Results

In this section, to demonstrate the effectiveness of the proposed control system, the dynamic model of an MSV in Do, Jiang & Pan (2004) is considered.

The model parameters of the MSV are presented as follows: m₁₁ =120 × 10³ kg, m₂₂ = 177.9 × 10³ kg, m₃₃ = 636 × 10⁵ kg m². d₁₁ =215 × 10² kg/s, d₂₂ = 147 × 10³ kg/s, d₃₃ = 802 × 10⁴ kg/m²s.

The proposed control scheme is marked as τ_CL. The control strategy without considering the prediction error is denoted as τ_NN.

Case 1: The reference trajectory is selected as x_d = 200sin(0.02t), y_d = 200cos(0.02t).

Unknown dynamics are selected as [Δf_u, Δf_v, Δf_r]^T = ${[(- 0.2 d_{11} |u|) u, (- 0.2 d_{22} |v|) v, (- 0.2 |r|) r]}^{T} .$ The external disturbances are given as [d_u, d_v, d_r]^T = [10⁴sin(0.3t − π/4) + 10⁴cos(0.2t + π/4) + 2 × 10⁴, 10³sin(0.2t − π/4)) + 10³cos(0.3t − π/4) + 3 × 10³, 10⁵sin(0.2t + π/6) + 10⁵cos(0.5t − π/4) (−3 × 10⁵]^T.

The initial condition is chosen as [x(0), y(0), φ(0), u(0), v(0), r(0)] =[20, 190, − 0.02π,0,0,0]. The control laws design parameters are designed as ρ₀ = 10, k_ρ = 0.4, k_u = 6 × 10³, k_r = 3.18 × 10⁶, T_u = 0.8, T_r = 0.3, γ_u = 10000, γ_r = 100, γ_zu = 20, γ_zr = 3000, ϑ_u = 0.00001, ϑ_r = 0.0001, ϕ_u = 10, ϕ_r = 1.

Figures 2A–2F illustrate the simulation results for the MSV under the two control strategies. Fig. 2A clearly illustrates that the MSV can track the reference trajectory in the presence of unknown dynamics, time-varying disturbances and output constraint under two control methods. The result in Fig. 2B shows that MSV can accomplish faster and more precise tracking under τ_CL. The results of approximation of unknown information in Figs. 2C and 2D further support this conclusion. The estimates of 2-norms weights are more sensitive under as illustrated in Fig. 2E. The control inputs τ_u and τ_r are plotted in Fig. 2F.

Simulation results under τNN and τCL for case 1. — Figure 2: Simulation results under τ_NN and τ_CL for case 1.
(A) Reference and actual trajectories of the MSV. (B) Tracking position error and yaw angle error. (C) ∑_u and its estimation. (D) ∑_r and its estimation. (E) 2-norms $∥ {\hat{ω}}_{u} ∥$ , $∥ {\hat{ω}}_{r} ∥$ of parameter estimates ${\hat{ω}}_{u}$ and ${\hat{ω}}_{r}$ . (F) Control signals τ_u and τ_r.

Download full-size image

DOI: 10.7717/peerjcs.863/fig-2

Case 2: The MSV’s unknown dynamics are raised 1.2 × Δf_n. The control law’s initial conditions and design parameters are the same as in Case 1, and the larger time-varying disturbances can be chosen as [d_u, d_v, d_r]^T = [1.5 × 10⁴sin(0.3t − π/4) + 1.5 × 10⁴cos(0.2t + π/4) + 3 × 10⁴, 1.5 × 10³sin(0.2t − π/4) + 1.5 × 10³cos(0.3t − π/4) + 3 × 10³, 1.5 × 10⁵sin(0.2t + π/6) + 1.5 × 10⁵cos(0.5t − π/4) − 4.5 × 10⁵]^T.

Under two control systems, MSV can track a reference trajectory in the presence of unknown dynamics, time-varying disturbances and output constraint as shown in Fig. 3A. As demonstrated in Fig. 3B, MSV can obtain higher tracking performance under τ_CL. The proposed control scheme has better robustness performance. As shown in Figs. 3C–3D, a similar result can be illustrated in case 1. The estimates of 2-norms weights are more sensitive under as illustrated in Fig. 3E. The control inputs are presented in Fig. 3F.

Simulation results under τNN and τCL for case 2. — Figure 3: Simulation results under τ_NN and τ_CL for case 2.
(A) Reference and actual trajectories of the MSV. (B) Tracking position error and yaw angle error. (C) ∑_u and its estimation. (D) ∑_r and its estimation. (E) 2-norms $∥ {\hat{ω}}_{u} ∥$ , $∥ {\hat{ω}}_{r} ∥$ of parameter estimates ${\hat{ω}}_{u}$ and ${\hat{ω}}_{r}$ . (F) Control signals τ_u and τ_r.

Download full-size image

DOI: 10.7717/peerjcs.863/fig-3

Case 3: The initial conditions and design parameters of the control law are the same as those in case 1. To further verify the superiority and effectiveness of the control scheme, another form of environmental disturbance are given as [d_u, d_v, d_r]^T = d + h. where d is d = [10⁴sin(0.3t − π/4) + 10⁴cos(0.2t + π/4) + 2 × 10⁴, 10³sin(0.2t − π/4) + 10³cos(0.3t − π/4) + 3 × 10³, 10⁵sin(0.2t + π/6) + 10⁵cos(0.5t − π/4) − 3 × 10⁵]^T. h is selected by the first-order Markov process. $\dot{h}$ = −Λh + Γ℘, where ℘ ∈ R³ is the zero-mean Gaussian white noise.

The simulation results are depicted in Figs. 4A–4F. Under two control systems, MSV can track a reference trajectory under unknown dynamics, time-varying disturbances and output constraint as shown in Fig. 4A. As demonstrated in Fig. 4B, MSV can achieve better tracking performance under τ_CL. As shown in Figs. 4C–4D, a similar result can be verified. The estimates of 2-norms weights are more sensitive under as shown in Fig. 4E. The control inputs are presented in Fig. 4F.

Simulation results under τNN and τCL for case 3. — Figure 4: Simulation results under τ_NN and τ_CL for case 3.
(A) Reference and actual trajectories of the MSV. (B) Tracking position error and yaw angle error. (C) ∑_u and its estimation. (D) ∑_r and its estimation. (E) 2-norms $∥ {\hat{ω}}_{u} ∥$ , $∥ {\hat{ω}}_{r} ∥$ of parameter estimates ${\hat{ω}}_{u}$ and ${\hat{ω}}_{r}$ . (F) Control signals τ_u and τ_r.

Download full-size image

DOI: 10.7717/peerjcs.863/fig-4

Conclusions

In this paper, a composite learning trajectory tracking control scheme is proposed for underactuated MSVs in the presence of unknown dynamics, time-varying disturbances and output constraints. The underactuation problem of the MSVs is addressed by the LOS approach. The barrier Lyapunov function is introduced to deal with the problem of output constraint. The composite learning control scheme is utilized to approximate unknown dynamics. The prediction errors and the tracking errors are adopted to construct the NN weight updating. Using approximation information, the disturbance observers are designed to estimates unknown time-varying disturbances. The Lyapunov method is used to demonstrate the stability of a closed-loop system. The simulation results demonstrate the effectiveness and superiority of the proposed control scheme.

Furthermore, the finite-time control can be further considered. The control scheme in this paper can be easily combined with event-triggered control.

Supplemental Information

Raw data of the simulation results

DOI: 10.7717/peerj-cs.863/supp-1

Download

Computer code

DOI: 10.7717/peerj-cs.863/supp-2

Download

[1] Chen L, Cui R, Yang C, Yan W. 2020. Adaptive neural network control of underactuated surface vessels with guaranteed transient performance: theory and experimental results. IEEE Transactions on Industrial Electronics 67(5):4024-4035

[2] Do KD. 2010. Practical control of underactuated ships. Ocean Engineering 37(13):1111-1119

[3] Do KD. 2016. Global robust adaptive path-tracking control of underactuated ships under stochastic disturbances. Ocean Engineering 111:267-278

[4] Do KD, Jiang ZP, Pan J. 2004. Global robust adaptive path following of underactuated ships. Automatica 40(6):929-944

[5] Gao T, Huang J, Zhou Y, Song Y-D. 2016. Robust adaptive tracking control of an underactuated ship with guaranteed transient performance. International Journal of Systems Science 48(2):272-279

[6] Ghommam J, Saad M. 2018. Adaptive leader-follower formation control of underactuated surface vessels under asymmetric range and bearing constraints. IEEE Transactions on Vehicular Technology 67(2):852-865

[7] Gibson TE, Annaswamy AM, Lavretsky E. 2013. On adaptive control with closed-loop reference models: transients, oscillations, and peaking. IEEE Access 1:703-717

[8] Guo G, Zhang P. 2020. Asymptotic stabilization of usvs with actuator dead-zones and yaw constraints based on fixed-time disturbance observer. IEEE Transactions on Vehicular Technology 69(1):302-316

[9] Hu X, Wei X, Kao Y, Han J. 2021. Robust synchronization for under-actuated vessels based on disturbance observer. In: IEEE transactions on intelligent transportation systems.

[10] Huang Y, Na J, Wu X, Gao G-B, Guo Y. 2018. Adaptive nonsingular fast terminal sliding-mode control for the tracking problem of uncertain dynamical systems. Transactions of the Institute of Measurement and Control 40(4):1237-1249

[11] Jia Z, Hu Z, Zhang W. 2019. Adaptive output-feedback control with prescribed performance for trajectory tracking of underactuated surface vessels. ISA Transactions 95:18-26

[12] Li G, Li W, Hildre HP, Zhang H. 2015. Online learning control of surface vessels for fine trajectory tracking. Journal of Marine Science and Technology 21(2):251-260

[13] Liu Z. 2019. Practical backstepping control for underactuated ship path following associated with disturbances. IET Intelligent Transport Systems 13(5):834-840

[14] Liu Z, Zhang Y, Yu X, Yuan C. 2016. Unmanned surface vehicles: an overview of developments and challenges. Annual Reviews in Control 41:71-93

[15] Mayne DQ, Michalska H. 1990. Receding horizon control of nonlinear systems. IEEE Transactions on Automatic Control 35(7):814-824

[16] Na J, Mahyuddin MN, Herrmann G, Ren X, Barber P. 2015. Robust adaptive finite-time parameter estimation and control for robotic systems. International Journal of Robust and Nonlinear Control 25(16):3045-3071

[17] Pan Y, Sun T, Yu H. 2016. Composite adaptive dynamic surface control using online recorded data. International Journal of Robust and Nonlinear Control 26(18):3921-3936

[18] Park BS, Kwon J-W, Kim H. 2017. Neural network-based output feedback control for reference tracking of underactuated surface vessels. Automatica 77:353-359

[19] Patre ZDWPM, Bhasin SWE. 2010. Composite adaptation for neural network-based controllers. IEEE Transactions on Automatic Control 55(4):944-950

[20] Peng Z, Wang D, Wang J. 2017. Predictor-based neural dynamic surface control for uncertain nonlinear systems in strict-feedback form. IEEE Transactions on Neural Networks and Learning Systems 28(9):2156-2167

[21] Peng Z, Wang J, Wang D. 2018. Distributed maneuvering of autonomous surface vehicles based on neurodynamic optimization and fuzzy approximation. IEEE Transactions on Control Systems Technology 26(3):1083-1090

[22] Shao G, Ma Y, Malekian R, Yan X, Li Z. 2019. A novel cooperative platform design for coupled USV-UAV systems. IEEE Transactions on Industrial Informatics 15(9):4913-4922

[23] Shen Z, Wang Y, Yu H, Guo C. 2020. Finite-time adaptive tracking control of marine vehicles with complex unknowns and input saturation. Ocean Engineering 198:106980

[24] Shojaei K. 2015. Neural adaptive robust control of underactuated marine surface vehicles with input saturation. Applied Ocean Research 53:267-278

[25] Shojaei K. 2017. Three-dimensional tracking control of autonomous underwater vehicles with limited torque and without velocity sensors. Robotica 36(3):374-394

[26] Shojaei K, Arefi MM. 2015. On the neuro-adaptive feedback linearising control of underactuated autonomous underwater vehicles in three-dimensional space. IET Control Theory and Applications 9(8):1264-1273

[27] Stepanyan V, Krishnakumar K. 2010. MRAC revisited: guaranteed performance with reference model modification. Proceedings of the American Control Conference 2010:93-98

[28] Sun X, Ge SS. 2014. Adaptive neural region tracking control of multi-fully actuated ocean surface vessels. IEEE/CAA Journal of Automatica Sinica 1(1):77-83

[29] Sun T, Pan Y, Yang C. 2017. Composite adaptive locally weighted learning control for multi-constraint nonlinear systems. Applied Soft Computing 55(4):944-950

[30] Tee KP, Ge SS, Li H, Ren B. 2011. Control of nonlinear systems with time-varying output constraints. Automatica, 2020 47(11):2511-2516

[31] Wang N, Deng Z. 2020. Finite-time fault estimator based fault-tolerance control for a surface vehicle with input saturations. IEEE Transactions on Industrial Informatics, [11]Vol 16(2):1172-1181

[32] Wang N, Pan X, Su S-F. 2019. Finite-time fault-tolerant trajectory tracking control of an autonomous surface vehicle. Journal of the Franklin Institute

[33] Wang N, Sun J-C, Er MJ. 2018. Tracking-error-based universal adaptive fuzzy control for output tracking of nonlinear systems with completely unknown dynamics. IEEE Transactions on Fuzzy Systems 26(2):869-883

[34] Xu B, Shou Y, Luo J, Pu H, Shi Z. 2019. Neural learning control of strict-feedback systems using disturbance observer. IEEE Trans Neural Netw Learn Syst 30(5):1296-1237

[35] Xu B, Sun F. 2018. Composite intelligent learning control of strict-feedback systems with disturbance. IEEE Trans Neural Netw Learn Syst 48(2):730-741

[36] Xu B, Yang D, Shi Z, Pan Y, Chen B, Sun F. 2018. ‘Online recorded data-based composite neural control of strict-feedback systems with application to hypersonic flight dynamics. IEEE Trans Neural Netw Learn Syst 29(8):3839-3849

[37] Yu H, Guo C, Yan Z. 2019. Globally finite-time stable three-dimensional trajectory-tracking control of underactuated UUVs. Ocean Engineering 189:106329

[38] Yucelen T, Haddad WM. 2013. Low-frequency learning and fast adaptation in model reference adaptive control. IEEE Transactions on Automatic Control 58(4):1080-1085

[39] Zhao Z, He W, Ge SS. 2014. Adaptive neural network control of a fully actuated marine surface vessel with multiple output constraints. IEEE Transactions on Control Systems Technology 22(4):1536-1543

[40] Zheng Z, Ruan L, Zhu M, Guo X. 2020. Reinforcement learning control for underactuated surface vessel with output error constraints and uncertainties. Neurocomputing 399:479-490