Wang , Jianliang* , and Zhang: Fuzzy Neural Network Active Disturbance Rejection Control for Two-Wheeled Self-Balanced Robot

# Fuzzy Neural Network Active Disturbance Rejection Control for Two-Wheeled Self-Balanced Robot

Abstract: Considering the problems of poor control effect, weak disturbance rejection ability and adaptive ability of two-wheeled self-balanced robot (TWSBR) systems on undulating roads, this paper proposes a fuzzy neural network active disturbance rejection controller (FNNADRC), that is based on fuzzy neural network (FNN) for online correction of active disturbance rejection controller (ADRC)’s nonlinear control rate. Firstly, the dynamic model of the TWSBR is established and decoupled, the extended state observer (ESO) is used to compensate dynamically and linearize the upright and displacement subsystems. Then, the nonlinear PD control rate and FNN are designed, and the FNN is used to modify the control parameters of the nonlinear PD control rate in real time. Finally, the proposed control strategy is simulated and compared with the traditional ADRC and fuzzy active disturbance rejection controller (FADRC). The simulation results show that the control effect of the proposed control strategy is slightly better than ADRC and FADRC.

Keywords: ADRC , Compensate Dynamically , ESO , FNN , Nonlinear PD Control Rate , TWSBR

## 1. Introduction

Two-wheeled self-balanced robots (TWSBRs) are characterized by simple structure, small size, light weight and flexible movement, so it has been widely applied in different industry fields, such as duty patrol, industrial transportation, daily travel [1]. As a multi-purpose load-bearing tool, TWSBR needs to adapt to different terrains. Dynamic disturbing forces will be generated at contact points between wheels and different ground. These disturbances are caused by the change of the contact angle between the wheel and the ground. Currently, many control methods have been proposed for TWSBRs [2-8]. In these references, proportional-integral-derivative (PID), linear quadratic regulator (LQR), neural network and other control methods are used, but they are all based on the mathematical model which is on the plane. Because most strategies have unsatisfactory disturbance rejection and adaptive ability, it is difficult to apply a TWSBR to slope road surfaces, undulating road surfaces and other complex working environments. Therefore, how to enhance the control ability of the control strategy is the problem that needs to be solved.

In 2014, Hu and Yan [9] estimated the perturbation of the dynamic system through a system state observer and designed the controller combined with LQR approach, which improved the disturbance rejection effect to a certain extent. In 2016, Jiang et al. [10] proposed an adaptive differential active disturbance rejection controller (ADRC), which adjusted the parameters of ADRC through an adaptive differential evolution algorithm, however, only the compensation factor is adjusted without considering the adjustment of control rate parameters and observer parameters [10]. In 2017, Unluturk and Aydogdu [11] proposed an adaptive switch controller based on the neural network, which realized the balance control under the soil pavement, pebble pavement, and sand and gravel pavement, without considering the slope and undulating pavement. In the same year, Liang et al. [12] established the dynamic TWSBR model on the undulating pavement surface, and designed a LQR controller. The convergence speed of the controller needs to be improved [12]. In 2020, Tran et al. [13] proposed a simple controller based on a composite observer for trajectory tracking and balance control with unknown disturbance of input and uncertain functions of a TWSBR model. Then, the closed-loop control system’s stability is proved, and a simulation comparison is made at last. In the same year, Karam et al. [14] designed a robust controller and used state feedback to estimate the equivalent term of SMC control law to overcome the dependence of the model. The simulation results show that the property of SFSMC is better than that of classical SMC in terms of tracking error. In 2021, Fikri et al. [15] designed the [TeX:] $$H_{\infty}$$ controller. The research shows that the [TeX:] $$H_{\infty}$$ controller has satisfactory disturbance rejection performance. None of the above research methods considers how to enhance the anti-interference capability and adaptive ability of the TWSBR on undulating road surface. Therefore, this paper proposes a control strategy for the TWSBR on undulating road surface.

We present a fuzzy neural network active disturbance rejection controller (FNNADRC) strategy for a TWSBR under complex road conditions in this paper. In order to improve the disturbance rejection capability of the TWSBR, the disturbance is estimated and compensated by the extended state observer (ESO) in ADRC method. The parameters of nonlinear PD control rate are fixed, which leads to its weak adaptive ability. Combining neural network and fuzzy control makes the network have both self-learning ability and reasoning ability [16], so as to correct the control parameters of nonlinear PD control rate in real time according to the size of control error and error rate of change. In this way, the control effect is improved, so that it can move stably in complex road conditions.

The advantages of this study can be summarized as follows:

On the basis of traditional ADRC control, the idea of fuzzy neural network (FNN) is integrated. According to the state error, the ADRC parameters are adjusted in real time to overcome the serious interference of TWSBR control systems under complex road conditions. By using the self-learning advantage of FNN, the problem that parameters need to be adjusted through certain exploration experience in fuzzy control is solved.

This paper is organized as follows: Section 2 discusses the TWSBR model under rolling pavement. In Section 3, we propose the FNNADRC based ESO. In Section 4, the superiority of FNNADRC method is compared by simulation. Section 5 concludes this.

## 2. Dynamic Model

Fig. 1 shows the TWSBR on the undulating pavement. The special case of the undulating road is the slope road; therefore, the undulating road can be regarded as the slope road whose inclination angle changes continuously and slowly [12]. The schematic diagram of instantaneous movement of the TWSBR on undulating road surface is shown in Fig. 2.

Fig. 1.

Fig. 2.

Forward motion schematic diagram.

In this case, the dynamic model can be expressed as follows:

##### (1)
[TeX:] $$\left\{\begin{array}{c} \dot{x}_1=x_2 \\ \dot{x}_2=\frac{B_1+B_2-B_3-B_4}{A} \\ \dot{x}_3=x_4 \\ \dot{x}_4=\frac{\left(u_1-u_2\right) d r}{a_3} \\ \dot{x}_5=x_6 \\ \dot{x}_6=\frac{C_1-C_2-C_3-C_4+C_5}{A} \end{array}\right.$$

where,

[TeX:] $$\begin{gathered} B_1=a_1\left[M_B g l \sin \left(x_1+\theta_{\varepsilon}\right)+M_B x_6 l \sin \left(x_1+\alpha+\theta_{\varepsilon}\right) \dot{\alpha}\right] \\ B_2=a_4\left(2 M_W+M_B\right) r g \sin \alpha \cos \left(x_1+\alpha+\theta_{\varepsilon}\right) \\ B_3=a_4^2 x_2\left(x_2+\dot{\alpha}\right) \sin \left(x_1+\alpha+\theta_{\varepsilon}\right) \cos \left(x_1+\alpha+\theta_{\varepsilon}\right) \\ B_4=\left[a_1+a_4 \cos \left(x_1+\alpha+\theta_{\varepsilon}\right)\right]\left(u_1+u_2\right) \\ \mathrm{A}=a_1 a_2-a_4{ }^2 \cos ^2\left(x_1+\theta_{\varepsilon}+\alpha\right) \\ C_1=a_2 a_4 r x_2 \sin \left(x_1+\theta_{\varepsilon}+\alpha\right)\left(x_2+\dot{\alpha}\right) \\ C_2=a_2\left(M_B+2 M_W\right) g r^2 \sin \alpha \end{gathered} \\ \begin{gathered} C_3=a_4^2 g \sin \left(x_1+\theta_{\varepsilon}\right) \cos \left(x_1+\alpha+\theta_{\varepsilon}\right) \\ C_4=a_4^2 x_6 \dot{\alpha} \sin \left(x_1+\alpha+\theta_s\right) \cos \left(x_1+\alpha+\theta_{\varepsilon}\right) \\ C_5=\left[a_2 r+a_4 r \cos \left(x_1+\alpha+\theta_{\varepsilon}\right)\right]\left(u_1+u_2\right) \\ a_1=M_B r^2+2 M_W r^2+2 I \\ a_2=M_B l^2+J \\ a_3=M_W d^2 r^2+I d^2+2 K r^2 \\ a_4=M_B l r \\ \theta_{\varepsilon}=\arcsin \frac{\left(2 M_W+M_B\right) r \sin \alpha}{M_B l} \\ x_1=\theta-\theta_{\varepsilon}, x_2=\dot{\theta}, x_3=\varphi, x_4=\dot{\varphi}, x_5=x, x_6=\dot{x}, u_1=T_L, u_2=T_R, \end{gathered}$$

[TeX:] $$\theta$$ is pitch angle, [TeX:] $$\varphi$$ is yaw angle, x is displacement, [TeX:] $$T_L$$ is left wheel’s torque, [TeX:] $$T_R$$ is right wheel’s torque, [TeX:] $$\alpha$$ is the angle between the tangent line of the wheel and the road surface and the horizontal line. Table 1 shows the values of relevant parameters.

Table 1.

Model parameters
Parameter Description Value
[TeX:] $$M_W$$ Mass of wheel 0.05 kg
[TeX:] $$M_B$$ Mass of body 1.125 kg
I Torque of the wheel [TeX:] $$4.77 \times 10^{-5} \mathrm{~kg} \cdot \mathrm{m}^2$$
J Inertia of the body rotating about the motor axis [TeX:] $$5.19 \times 10^{-3} \mathrm{~kg} \cdot \mathrm{m}^2$$
K Inertia of the body rotating about the pendulum [TeX:] $$6.3 \times 10^{-3} \mathrm{~kg} \cdot \mathrm{m}^2$$
l Center distance of the robot body and the motor axis 0.053 m
d Wheel axle distance 0.24 m

## 3. Controller Design

##### 3.1 System Structure Design

The overall structure of FNN control system is shown in Fig. 3.

Fig. 3.

Fuzzy neural network control block diagram of two-wheeled self-balanced robot.

The TWSBR control system has three subsystems: upright control, turning control and displacement control. The three subsystems are connected through the “static coupling” matrix. According to the separation principle of ADRC design, ADRCs are proposed for the upright and displacement subsystem respectively. In addition, the transition process is implemented by a tracking differentiator (TD), the ESO and error feedback control law are designed. Since the disturbance model is not included in the turning subsystem, a simple PD control approach is presented for the system. Based on the traditional ADRC of upright subsystem and displacement subsystem, two FNNs with the same structure are introduced respectively. The error of angle and angle error rate of change, the error of displacement and displacement error rate of change are taken as the inputs of the two networks, and the outputs of the two networks are used to regulate the parameters of the nonlinear PD control rate, so as to enhance the disturbance rejection ability of the system.

##### 3.2 MIMO System Decoupling

From the above expression, it can be concluded that the system is a multiple-input and multiple-out (MIMO) coupled system. In order to use decoupling control for a multivariable system, the system is expressed in the following form:

##### (2)
[TeX:] $$\left\{\begin{array}{c} \dot{x}_1=x_2 \\ \dot{x}_2=f_1\left(x_1, x_2, x_6, \alpha\right)+b_{11}\left(x_1, \alpha\right) u_1+b_{12}\left(x_1, \alpha\right) u_2 \\ \dot{x}_3=x_4 \\ \dot{x}_4=f_2+b_{21} u_1-b_{22} u_2 \\ \dot{x}_5=x_6 \\ \dot{x}_6=f_3\left(x_1, x_2, x_6, \alpha\right)+b_{31}\left(x_1, \alpha\right) u_1+b_{32}\left(x_1, \alpha\right) u_2 \end{array}\right.$$

where,

[TeX:] \begin{aligned} &f_1\left(x_1, x_2, x_6, \alpha\right)=\frac{B_1+B_2-B_3}{A}\\ &f_2=0\\ &f_3\left(x_1, x_2, x_6, \alpha\right)=\frac{C_1-C_2-C_3-C_4}{A}\\ &b_{11}\left(x_1, \alpha\right)=b_{12}\left(x_1, \alpha\right)=\frac{-\left[a_1+a_4 \cos \left(x_1+\alpha+\theta_{\varepsilon}\right)\right]}{a_1 a_2-a_4^2 \cos ^2\left(x_1+\theta_{\varepsilon}+\alpha\right)}\\ &b_{21}=-b_{22}=\frac{d r}{a_3}\\ &b_{31}\left(x_1, \alpha\right)=b_{32}\left(x_1, \alpha\right)=\frac{\left[a_2 r+a_4 r \cos \left(x_1+\alpha+\theta_{\varepsilon}\right)\right]}{a_1 a_2-a_4^2 \cos ^2\left(x_1+\theta_{\varepsilon}+\alpha\right)} . \end{aligned}

According to expression (2), the amplification coefficient matrix of the control force is

##### (3)
[TeX:] $$B=\left[\begin{array}{cc} b_{11}\left(x_1, \alpha\right) & b_{12}\left(x_1, \alpha\right) \\ b_{21} & b_{22} \\ b_{31}\left(x_1, \alpha\right) & b_{32}\left(x_1, \alpha\right) \end{array}\right]$$

Virtual control variables [TeX:] $$U_1, U_2, U_3$$ are introduced, and the input-output relationship of each channel is

##### (4)
[TeX:] $$\left\{\begin{array}{c} \dot{x}_1=x_2 \\ \dot{x}_2=f_1\left(x_1, x_2, x_6, \alpha\right)+U_1 \\ \dot{x}_3=x_4 \\ \dot{x}_4=f_2+U_2 \\ \dot{x}_5=x_6 \\ \dot{x}_6=f_3\left(x_1, x_2, x_6, \alpha\right)+U_3 \\ y_1=x_1, y_2=x_3, y_3=x_5 \end{array}\right.$$

In this way, the relationship between the virtual control signal of each channel and the controlled output is single-input and single-output (SISO). Each channel’s output and the intermediate control signal are perfectly decoupled, and the controller of each subsystem should be presented on the basis of the approach of the SISO system.

The real control signal can be expressed as follows:

##### (5)
[TeX:] $$u=X_B U$$

Select the [TeX:] $$x_1$$ in sight of the equilibrium point, [TeX:] $$x_1=0, \alpha=0$$ can be selected to calculate the approximate value of B. Then calculate the inverse of B, since B is irreversible in this system, the pseudo-inverse matrix [TeX:] $$X_B$$ is calculated for B.

##### 3.3.1 Design of ADRC for the upright subsystem

After the system is decoupled, the expression of the upright subsystem is

##### (6)
[TeX:] $$\left\{\begin{array}{c} \dot{x}_1=x_2 \\ \dot{x}_2=f_1\left(x_1, x_2, x_6, \alpha\right)+U_1 \end{array}\right.$$

The expected angle value of the subsystem is 0, so it is not necessary to have the tracking differentiator. Only the nonlinear control rate and ESO need to be designed, and then the dynamic compensation linearization is carried out. The ESO of the upright subsystem is

##### (7)
[TeX:] $$\left\{\begin{array}{c} e=z_{11}-y_1 \\ \dot{z}_{11}=z_{12}-\beta_{11} e \\ \dot{z}_{12}=z_{13}-\beta_{12} f a l(e, 0.5, \delta)+U_1 \\ \dot{z}_{13}=-\beta_{13} f a l(e, 0.25, \delta) \end{array}\right.$$

where,

[TeX:] $$f a l(e, \gamma, \delta)=\left\{\begin{array}{c} |e|^\gamma \operatorname{sign}(e),|e|>\delta \\ e / \delta^{\gamma-1},|e| \leq \delta \end{array}\right.$$

[TeX:] $$z_{11}$$ is the estimation of the system output, [TeX:] $$z_{12}$$ is the estimation of system state [TeX:] $$x_{1=} \quad Z_{13}$$ is the estimation of system state [TeX:] $$x_2.$$

The nonlinear PD control rate of the upright subsystem is

##### (8)
[TeX:] $$\begin{gathered} u_{10}=\beta_1 f a l\left(e_1, \alpha_1, \delta\right)+\beta_2 \operatorname{fal}\left(e_2, \alpha_2, \delta\right) \\ 0<\alpha_1<1<\alpha_2 \end{gathered}$$

where [TeX:] $$\beta_1, \beta_2$$ are the parameters of nonlinear PD control rate.

Because it is necessary to design the FNN to correct [TeX:] $$\beta_1, \beta_2$$ in real time according to the inclination error signal and inclination velocity error signal, the correction parameters [TeX:] $$\Delta \beta_1, \Delta \beta_2$$ are introduced. The values are directly applied to the original control rate parameter to modify it. Therefore, the expression (8) is written as follows:

##### (9)
[TeX:] $$u_{10}=\left(\beta_1+\Delta \beta_1\right) f a l\left(e_1, \alpha_1, \delta\right)+\left(\beta_2+\Delta \beta_2\right) f a l\left(e_2, \alpha_2, \delta\right)$$

The compensation of the disturbance for the upright subsystem is

##### (10)
[TeX:] $$U_1=u_{10}-z_{13}.$$

##### 3.3.2 Design of PD for the turning subsystem

When the dynamic system is decoupled, the expression of the turning subsystem is

##### (11)
[TeX:] $$\left\{\begin{array}{c} \dot{x}_3=x_4 \\ \dot{x}_4=f_2+U_2 \end{array}\right.$$

Since the turning subsystem is a plain integrator series system, the control system of upright and displacement has little negative effect on the turning subsystem, the simple PD approach is adopted for the turning subsystem.

##### (12)
[TeX:] $$U_2=K_p\left(\varphi_r-\varphi\right)+K_d\left(\dot{\varphi}_r-\dot{\varphi}\right)$$

where [TeX:] $$\varphi_r$$ is the desirable turning angle and [TeX:] $$\varphi$$ is the real turning angle.

##### 3.3.3 Design of ADRC for displacement subsystem

Since the system is decoupled, the displacement subsystem can be expressed as

##### (13)
[TeX:] $$\left\{\begin{array}{c} \dot{x}_5=x_6 \\ \dot{x}_6=f_3\left(x_1, x_2, x_6, \alpha\right)+U_3 \end{array} .\right.$$

The subsystem of displacement and the subsystem of upright interact with each other. In order to avoid overshoot of the subsystem of displacement and the effect of the upright control, the transition process is arranged. The other designs are similar to the upright subsystem. The TD used to realize the transition process is

##### (14)
[TeX:] $$\left\{\begin{array}{c} \dot{v}_1=v_2 \\ \dot{v}_2=-r \operatorname{sign}\left(v_1-y_3+\frac{v_2\left|v_2\right|}{2 r}\right) \end{array}\right.$$

where, the solution [TeX:] $$v_1$$ of the equation can approach [TeX:] $$y_3$$ in any finite time, [TeX:] $$v_2$$ is the differential signal of [TeX:] $$v_1.$$

The ESO of the displacement subsystem is

##### (15)
[TeX:] $$\left\{\begin{array}{c} e=z_{31}-y_3 \\ \dot{z}_{31}=z_{32}-\beta_{31} e \\ \dot{z}_{32}=z_{33}-\beta_{32} f a l(e, 0.5, \delta)+U_3 \\ \dot{z}_{33}=-\beta_{33} f a l(e, 0.25, \delta) \end{array}\right.$$

where,

[TeX:] $$Z_{31}$$ is the estimation of the system output, [TeX:] $$Z_{32}$$ is the estimation of system state [TeX:] $$x_{5,} \quad Z_{33}$$ is the estimation of system state [TeX:] $$x_6.$$

The nonlinear PD control rate of the displacement subsystem is

##### (16)
[TeX:] $$u_{30}=\left(\beta_3+\Delta \beta_3\right) f a l\left(e_3, \alpha_3, \delta\right)+\left(\beta_4+\Delta \beta_4\right) f a l\left(e_4, \alpha_4, \delta\right)$$

where,[TeX:] $$\beta_3 \text { and } \beta_4$$ are the parameters of nonlinear PD control rate, [TeX:] $$\Delta \beta_3 \text { and } \Delta \beta_4$$ are the correction parameters.

The disturbance compensation of displacement subsystem is

##### (17)
[TeX:] $$U_3=u_{30}-z_{33} .$$

##### 3.4.1 Structure of fuzzy neural network

The structure of FNN is shown in Fig. 4. The entire network has two inputs and two outputs, there are five layers in total. For the upright and displacement subsystem, the network is applied to adjust the parameters of the control rate respectively.

Fig. 4.

Fuzzy neural network structure diagram.

The first layer is the input layer which has two nodes. The inputs of the layer are the error and the error rate of the change. Its function is to pass the input to the next layer node.

The second layer is the membership layer which has six nodes, and each input corresponding to the first layer has three fuzzy sets {"negative", "zero", "positive"}. Gaussian functions are adopted as membership functions, then

##### (18)
[TeX:] $$A_i^j\left(x_i\right)=\exp \left(-\frac{\left(x_i-a_{i j}\right)^2}{b_{i j}^2}\right)$$

where [TeX:] $$i=1,2 ; j=1,2,3 ; A_i^j\left(x_i\right)$$ indicates that the i-th input corresponds to the j-th linguistic variable value.

The third layer is the inference layer, which has nine nodes, each node represents a fuzzy rule, and the applicability of each rule is calculated as

##### (19)
[TeX:] $$\alpha_l=A_1^{j_1}\left(x_1\right) \cdot A_2^{j_2}\left(x_2\right)$$

where, [TeX:] $$j_1=1,2,3 ; j_2=1,2,3 ; l=1,2, \ldots, 9.$$

The fourth layer is the normalized layer, which also has nine nodes, and the normalized calculation result is

##### (20)
[TeX:] $$\hat{\alpha}_l=\frac{\alpha_l}{\sum_{i=1}^9 \alpha_i}$$

where [TeX:] $$l=1,2, \ldots, 9.$$

The fifth layer is the output layer, which has two nodes for de-fuzzification calculation. The outputs of the layer are correction offsets of control parameters.

##### (21)
[TeX:] $$y_k=\sum_{l=1}^9 \omega_{k l} \hat{\alpha}_l$$

where, [TeX:] $$k=1,2 ; \omega_{k l}$$ is the weight of the network, [TeX:] $$y_k$$ is the output of the network, [TeX:] $$\Delta \beta_1=y_1, \Delta \beta_2=y_2.$$

##### 3.4.2 Learning algorithm

The fuzzy segmentation number in this design has been determined, and the parameters to be trained include the connection weight [TeX:] $$\omega_{k l}$$ of the fifth layer, and the center value parameters [TeX:] $$a_{i j} \text { and } b_{i j}$$ of the second layer.

Let the desired output of the system be [TeX:] $$\hat{y}_k,$$ the training objective functions are

##### (22)
[TeX:] $$J=\frac{1}{2} \sum_{k=1}^2\left(\hat{y}_k-y_k\right)^2.$$

The iterative formulas for adjusting parameters are

##### (23)
[TeX:] $$\omega_{k l}(t+1)=\omega_{k l}(t)-\eta_1 \frac{\partial J}{\partial \omega_{k l}}$$

where, [TeX:] $$k=1,2 ; l=1,2, \ldots, 9.$$

##### (24)
[TeX:] $$a_{i j}(t+1)=a_{i j}(t)-\eta_2 \frac{\partial J}{\partial a_{i j}}$$

where, [TeX:] $$i=1,2,3 ; j=1,2,3.$$

##### (25)
[TeX:] $$b_{i j}(t+1)=b_{i j}(t)-\eta_3 \frac{\partial J}{\partial b_{i j}}$$

where, [TeX:] $$\eta_1, \eta_2, \eta_3$$ are the learning rates.

## 4. Simulation Results

In order to examinate the control effect, ADRC, FADRC, and FNNADRC are used to control the TWSBR under different road surfaces, and the simulation results are analysed and compared. The parameters of ADRC are determined by the trial and error method, and then corrected by FNN when the system is running. The parameters of FNN are adjusted by error back-propagation. According to the stress analysis, the maximum [TeX:] $$x_1$$ of TWSBR is [TeX:] $$\arctan \mu, \text { where } \mu$$ is ground friction coefficient [7], therefore, the initial value of inclination should be less than this value. There is no restriction on the initial values of steering angle and displacement. Based on the above instructions, set the initial states [TeX:] $$x_1=\pi / 20, x_3=\pi / 4, x_5=0,$$ desired states [TeX:] $$x_1=0, x_3=0, x_5=0.5.$$

In case of flat road surface [TeX:] $$\alpha=0,$$ ADRC, FADRC and FNNADRC are used for control simulation respectively. The upright inclination, turning angle and displacement control effects are shown in Figs. 5–7. Fig. 5 shows that the upright curves of ADRC and FADRC oscillate greatly. The error between the real dip angle and the expected value is large, while the error between the dip curve of FNNADRC and the target value is the small. There is no disturbance in the turning subsystem of TWSBR dynamic model, so the little error between the real and the expected value of turning angle in Fig. 6 is showed. Therefore, the error between the real and the expected value of the turning angle under any road condition is relatively small, and the control result is similar to Fig. 6, which will not be described later. Fig. 7 shows that the error of FNNADRC between the displacement curve and the target value is small, and it is better than that of ADRC and FADRC. In conclusion, the control effect of FNNADRC is more satisfactory than the other two methods on flat road surface.

Fig. 5.

Upright curve on the flat road.

Fig. 6.

Turning curve on the flat road.

Fig. 7.

Displacement curve on the flat road.

In case of slope road surface [TeX:] $$\alpha=0.174 \mathrm{rad},$$ ADRC, FADRC and FNNADRC are used for control simulation respectively. The control effects of upright inclination and displacement are displayed in Figs. 8 and 9. Fig. 8 exhibits that the upright curve oscillation amplitude of ADRC and FADRC is large, while the error between the inclination curve of FNNADRC and the target value is small. Fig. 8 shows that the dip and displacement curves of FNNADRC are smoother and have less errors. To sum up, FNNADRC has the best comprehensive control effect under the slope.

Fig. 8.

Upright curve on the slope road.

Fig. 9.

Displacement curve on the slope road.

Under the condition of undulating road surface [TeX:] $$\alpha=\sin x_1,$$ ADRC, FADRC and FNNADRC are used for control simulation respectively. The upright inclination and displacement control effects are exhibited in Figs. 10 and 11. Fig. 10 shows that the upright curve oscillation amplitude of ADRC and FADRC is large, while the error between the inclination curve of FNNADRC and the target value is more acceptable. Fig. 11 shows that the displacement curves of ADRC and FADRC have large oscillation amplitude, while the error between the displacement curve of FNNADRC and the target value is more in line with the requirements. In summary, FNNADRC has the best comprehensive control effect under undulating road surface.

Fig. 10.

Upright curve on the undulating road.

Fig. 11.

Displacement curve on the undulating road.

## 5. Conclusion

In this research, the disturbance rejection problem of the TWSBR in complex environments is studied. According to the dynamics model of the TWSBR system on undulating pavement, the FNNADRC is designed. The nonlinear system is decoupled into three subsystems. Then, considering the characteristics of the subsystems, an ADRC is designed to ensure the stability of the subsystems. And the controller parameters are corrected by the self-learning function of the FNN. Under flat road surface, slope road surface and undulating road surface, the control methods of ADRC, FADRC and FNNADRC are simulated and compared respectively to examinate the disturbance rejection capability of FNNADRC. The results of the simulation display that FNNADRC has improved anti-interference capability compared with the other two methods.

## Biography

##### Chao Wang
https://orcid.org/0000-0003-1342-9706

He received M.S. degrees in School of marine electrical engineering from Dalian Maritime University. He is a teaching assistant. His current research interests include intelligent control and Stability Analysis of Nonlinear Systems.

## Biography

##### Xiao Jianliang
https://orcid.org/0000-0001-6559-6341

He graduated from Dalian Marine University. Currently, he is an associate professor at the City Institute, Dalian University of Technology. His current research interests include network security and Artificial Intelligence.

## Biography

##### Cheng Zhang
https://orcid.org/0000-0001-8439-4892

He graduated from Dalian University of Technology. Currently, he is a professor at the City Institute, Dalian University of Technology. He has published over 20 papers in the field of digital image processing and virtual reality.

## References

• 1 R. P . M. Chan, K. A. Stol, and C. R. Halkyard, "Review of modelling and control of two-wheeled robots," Annual Reviews in Control, vol. 37, no. 1, pp. 89-103, 2013.doi:[[[10.1016/j.arcontrol.2013.03.004]]]
• 2 B. Mahler and J. Haase, "Mathematical model and control strategy of a two-wheeled self-balancing robot," in Proceedings of the 39th Annual Conference of the IEEE Industrial Electronics Society, Vienna, Austria, 2013, pp. 4198-4203.doi:[[[10.1109/IECON.2013.6699809]]]
• 3 M. A. Imtiaz, M. Naveed, N. Bibi, S. Aziz, and S. Z. H. Naqvi, "Control system design, analysis & implementation of two wheeled self balancing robot (TWSBR)," in Proceedings of 2018 IEEE 9th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON), V ancouver, Canada, 2018, pp. 431-437.doi:[[[10.1109/IEMCON.2018.8614858]]]
• 4 C. C. Tsai, H. C. Huang, and S. C. Lin, "Adaptive neural network control of a self-balancing two-wheeled scooter," IEEE Transactions on Industrial Electronics, vol. 57, no. 4, pp. 1420-1428, 2010.doi:[[[10.1109/TIE.2009.2039452]]]
• 5 Z. Kausar, K. Stol, and N. Patel, "The effect of terrain inclination on performance and the stability region of two-wheeled mobile robots," International Journal of Advanced Robotic Systems, vol. 9, no. 5, article no. 218, 2012. https://doi.org/10.5772/52894doi:[[[10.5772/52894]]]
• 6 M. I. Ali and M. M. Hossen, "A two-wheeled self-balanced robot with dynamics model," in Proceedings of 2017 4th International Conference on Advances in Electrical Engineering (ICAEE), Dhaka, Bangladesh, 2017, pp. 271-275.doi:[[[10.1109/ICAEE.2017.8255365]]]
• 7 Z. Kausar, K. Stol, and N. Patel, "Performance enhancement of a statically unstable two wheeled mobile robot traversing on an uneven surface," in Proceedings of 2010 IEEE Conference on Robotics, Automation and Mechatronics, Singapore, 2010, pp. 156-162.doi:[[[10.1109/ramech.2010.5513197]]]
• 8 J. Wu, W. Zhang, and S. Wang, "A two-wheeled self-balancing robot with the fuzzy PD control method," Mathematical Problems in Engineering, vol. 2012, article no. 469491, 2012. https://doi.org/10.1155/2012/469491doi:[[[10.1155//469491]]]
• 9 J. Hu and G. F. Yan, "Analysis of two-wheeled self-balancing mobile robots based on ADRC," Journal of Mechanical & Electrical Engineering, vol. 31, no. 2, pp. 159-164, 2014.custom:[[[-]]]
• 10 L. Jiang, H. Qiu, Z. Wu, and J. He, "Active disturbance rejection control based on adaptive differential evolution for two-wheeled self-balancing robot," in Proceedings of 2016 Chinese Control and Decision Conference (CCDC), Yinchuan, China, 2016, pp. 6761-6766.doi:[[[10.1109/ccdc.2016.7532214]]]
• 11 A. Unluturk and O. Aydogdu, "Adaptive control of two-wheeled mobile balance robot capable to adapt different surfaces using a novel artificial neural network–based real-time switching dynamic controller," International Journal of Advanced Robotic Systems, vol. 14, no. 2, article no. 1729881417700893, 2017. https://doi.org/10.1177/1729881417700893doi:[[[10.1177/1729881417700893]]]
• 12 D. Liang, N. Sun, Y . Wu, and Y . Fang, "Dynamic modeling and control of inverted pendulum robots moving on undulating pavements," in Proceedings of 2017 Seventh International Conference on Information Science and Technology (ICIST), Da Nang, Vietnam, 2017, pp. 115-120.doi:[[[10.1109/ICIST.2017.7926503]]]
• 13 K. G. Tran, N. H. Nguyen, and P . D. Nguyen, "Observer-based controllers for two-wheeled inverted robots with unknown input disturbance and model uncertainty," Journal of Control Science and Engineering, vol. 2020, article no. 7205737, 2020. https://doi.org/10.1155/2020/7205737doi:[[[10.1155/2020/737]]]
• 14 E. H. Karam, R. S. Habeeb, and N. Mjeed, "A new approach in design of sliding mode controller by optimization state feedback for two wheeled self balancing robot," International Journal of Open Information Technologies, vol. 8, no. 10, pp. 39-44, 2020.custom:[[[-]]]
• 15 M. R. Fikri, D. W. Djamari, and S. M. Levy, "H-infinity controller design of two-wheeled mobile robot under disturbance," in Proceedings of 2021 International Seminar on Intelligent Technology and Its Applications (ISITIA), Surabaya, Indonesia, 2021, pp. 147-152.doi:[[[10.1109/ISITIA52817.2021.9502218]]]
• 16 Y. Wang, Robot Intelligent Control Engineering. Beijing, China: Science Press, 2004.custom:[[[-]]]