1. Introduction
Two-wheeled self-balanced robots (TWSBRs) are characterized by simple structure, small size, light weight and flexible movement, so it has been widely applied in different industry fields, such as duty patrol, industrial transportation, daily travel [1]. As a multi-purpose load-bearing tool, TWSBR needs to adapt to different terrains. Dynamic disturbing forces will be generated at contact points between wheels and different ground. These disturbances are caused by the change of the contact angle between the wheel and the ground. Currently, many control methods have been proposed for TWSBRs [2-8]. In these references, proportional-integral-derivative (PID), linear quadratic regulator (LQR), neural network and other control methods are used, but they are all based on the mathematical model which is on the plane. Because most strategies have unsatisfactory disturbance rejection and adaptive ability, it is difficult to apply a TWSBR to slope road surfaces, undulating road surfaces and other complex working environments. Therefore, how to enhance the control ability of the control strategy is the problem that needs to be solved.
In 2014, Hu and Yan [9] estimated the perturbation of the dynamic system through a system state observer and designed the controller combined with LQR approach, which improved the disturbance rejection effect to a certain extent. In 2016, Jiang et al. [10] proposed an adaptive differential active disturbance rejection controller (ADRC), which adjusted the parameters of ADRC through an adaptive differential evolution algorithm, however, only the compensation factor is adjusted without considering the adjustment of control rate parameters and observer parameters [10]. In 2017, Unluturk and Aydogdu [11] proposed an adaptive switch controller based on the neural network, which realized the balance control under the soil pavement, pebble pavement, and sand and gravel pavement, without considering the slope and undulating pavement. In the same year, Liang et al. [12] established the dynamic TWSBR model on the undulating pavement surface, and designed a LQR controller. The convergence speed of the controller needs to be improved [12]. In 2020, Tran et al. [13] proposed a simple controller based on a composite observer for trajectory tracking and balance control with unknown disturbance of input and uncertain functions of a TWSBR model. Then, the closed-loop control system’s stability is proved, and a simulation comparison is made at last. In the same year, Karam et al. [14] designed a robust controller and used state feedback to estimate the equivalent term of SMC control law to overcome the dependence of the model. The simulation results show that the property of SFSMC is better than that of classical SMC in terms of tracking error. In 2021, Fikri et al. [15] designed the [TeX:] $$H_{\infty}$$ controller. The research shows that the [TeX:] $$H_{\infty}$$ controller has satisfactory disturbance rejection performance. None of the above research methods considers how to enhance the anti-interference capability and adaptive ability of the TWSBR on undulating road surface. Therefore, this paper proposes a control strategy for the TWSBR on undulating road surface.
We present a fuzzy neural network active disturbance rejection controller (FNNADRC) strategy for a TWSBR under complex road conditions in this paper. In order to improve the disturbance rejection capability of the TWSBR, the disturbance is estimated and compensated by the extended state observer (ESO) in ADRC method. The parameters of nonlinear PD control rate are fixed, which leads to its weak adaptive ability. Combining neural network and fuzzy control makes the network have both self-learning ability and reasoning ability [16], so as to correct the control parameters of nonlinear PD control rate in real time according to the size of control error and error rate of change. In this way, the control effect is improved, so that it can move stably in complex road conditions.
The advantages of this study can be summarized as follows:
On the basis of traditional ADRC control, the idea of fuzzy neural network (FNN) is integrated. According to the state error, the ADRC parameters are adjusted in real time to overcome the serious interference of TWSBR control systems under complex road conditions. By using the self-learning advantage of FNN, the problem that parameters need to be adjusted through certain exploration experience in fuzzy control is solved.
This paper is organized as follows: Section 2 discusses the TWSBR model under rolling pavement. In Section 3, we propose the FNNADRC based ESO. In Section 4, the superiority of FNNADRC method is compared by simulation. Section 5 concludes this.
2. Dynamic Model
Fig. 1 shows the TWSBR on the undulating pavement. The special case of the undulating road is the slope road; therefore, the undulating road can be regarded as the slope road whose inclination angle changes continuously and slowly [12]. The schematic diagram of instantaneous movement of the TWSBR on undulating road surface is shown in Fig. 2.
TWSBR on the undulating road.
Forward motion schematic diagram.
In this case, the dynamic model can be expressed as follows:
where,
[TeX:] $$\theta$$ is pitch angle, [TeX:] $$\varphi$$ is yaw angle, x is displacement, [TeX:] $$T_L$$ is left wheel’s torque, [TeX:] $$T_R$$ is right wheel’s torque, [TeX:] $$\alpha$$ is the angle between the tangent line of the wheel and the road surface and the horizontal line. Table 1 shows the values of relevant parameters.
3. Controller Design
3.1 System Structure Design
The overall structure of FNN control system is shown in Fig. 3.
Fuzzy neural network control block diagram of two-wheeled self-balanced robot.
The TWSBR control system has three subsystems: upright control, turning control and displacement control. The three subsystems are connected through the “static coupling” matrix. According to the separation principle of ADRC design, ADRCs are proposed for the upright and displacement subsystem respectively. In addition, the transition process is implemented by a tracking differentiator (TD), the ESO and error feedback control law are designed. Since the disturbance model is not included in the turning subsystem, a simple PD control approach is presented for the system. Based on the traditional ADRC of upright subsystem and displacement subsystem, two FNNs with the same structure are introduced respectively. The error of angle and angle error rate of change, the error of displacement and displacement error rate of change are taken as the inputs of the two networks, and the outputs of the two networks are used to regulate the parameters of the nonlinear PD control rate, so as to enhance the disturbance rejection ability of the system.
3.2 MIMO System Decoupling
From the above expression, it can be concluded that the system is a multiple-input and multiple-out (MIMO) coupled system. In order to use decoupling control for a multivariable system, the system is expressed in the following form:
where,
According to expression (2), the amplification coefficient matrix of the control force is
Virtual control variables [TeX:] $$U_1, U_2, U_3$$ are introduced, and the input-output relationship of each channel is
In this way, the relationship between the virtual control signal of each channel and the controlled output is single-input and single-output (SISO). Each channel’s output and the intermediate control signal are perfectly decoupled, and the controller of each subsystem should be presented on the basis of the approach of the SISO system.
The real control signal can be expressed as follows:
Select the [TeX:] $$x_1$$ in sight of the equilibrium point, [TeX:] $$x_1=0, \alpha=0$$ can be selected to calculate the approximate value of B. Then calculate the inverse of B, since B is irreversible in this system, the pseudo-inverse matrix [TeX:] $$X_B$$ is calculated for B.
3.3 Design of ADRC
3.3.1 Design of ADRC for the upright subsystem
After the system is decoupled, the expression of the upright subsystem is
The expected angle value of the subsystem is 0, so it is not necessary to have the tracking differentiator. Only the nonlinear control rate and ESO need to be designed, and then the dynamic compensation linearization is carried out. The ESO of the upright subsystem is
where,
[TeX:] $$z_{11}$$ is the estimation of the system output, [TeX:] $$z_{12}$$ is the estimation of system state [TeX:] $$x_{1=} \quad Z_{13}$$ is the estimation of system state [TeX:] $$x_2.$$
The nonlinear PD control rate of the upright subsystem is
where [TeX:] $$\beta_1, \beta_2$$ are the parameters of nonlinear PD control rate.
Because it is necessary to design the FNN to correct [TeX:] $$\beta_1, \beta_2$$ in real time according to the inclination error signal and inclination velocity error signal, the correction parameters [TeX:] $$\Delta \beta_1, \Delta \beta_2$$ are introduced. The values are directly applied to the original control rate parameter to modify it. Therefore, the expression (8) is written as follows:
The compensation of the disturbance for the upright subsystem is
3.3.2 Design of PD for the turning subsystem
When the dynamic system is decoupled, the expression of the turning subsystem is
Since the turning subsystem is a plain integrator series system, the control system of upright and displacement has little negative effect on the turning subsystem, the simple PD approach is adopted for the turning subsystem.
where [TeX:] $$\varphi_r$$ is the desirable turning angle and [TeX:] $$\varphi$$ is the real turning angle.
3.3.3 Design of ADRC for displacement subsystem
Since the system is decoupled, the displacement subsystem can be expressed as
The subsystem of displacement and the subsystem of upright interact with each other. In order to avoid overshoot of the subsystem of displacement and the effect of the upright control, the transition process is arranged. The other designs are similar to the upright subsystem. The TD used to realize the transition process is
where, the solution [TeX:] $$v_1$$ of the equation can approach [TeX:] $$y_3$$ in any finite time, [TeX:] $$v_2$$ is the differential signal of [TeX:] $$v_1.$$
The ESO of the displacement subsystem is
where,
[TeX:] $$Z_{31}$$ is the estimation of the system output, [TeX:] $$Z_{32}$$ is the estimation of system state [TeX:] $$x_{5,} \quad Z_{33}$$ is the estimation of system state [TeX:] $$x_6.$$
The nonlinear PD control rate of the displacement subsystem is
where,[TeX:] $$\beta_3 \text { and } \beta_4$$ are the parameters of nonlinear PD control rate, [TeX:] $$\Delta \beta_3 \text { and } \Delta \beta_4$$ are the correction parameters.
The disturbance compensation of displacement subsystem is
3.4 Design of Fuzzy Neural Network
3.4.1 Structure of fuzzy neural network
The structure of FNN is shown in Fig. 4. The entire network has two inputs and two outputs, there are five layers in total. For the upright and displacement subsystem, the network is applied to adjust the parameters of the control rate respectively.
Fuzzy neural network structure diagram.
The first layer is the input layer which has two nodes. The inputs of the layer are the error and the error rate of the change. Its function is to pass the input to the next layer node.
The second layer is the membership layer which has six nodes, and each input corresponding to the first layer has three fuzzy sets {"negative", "zero", "positive"}. Gaussian functions are adopted as membership functions, then
where [TeX:] $$i=1,2 ; j=1,2,3 ; A_i^j\left(x_i\right)$$ indicates that the i-th input corresponds to the j-th linguistic variable value.
The third layer is the inference layer, which has nine nodes, each node represents a fuzzy rule, and the applicability of each rule is calculated as
where, [TeX:] $$j_1=1,2,3 ; j_2=1,2,3 ; l=1,2, \ldots, 9.$$
The fourth layer is the normalized layer, which also has nine nodes, and the normalized calculation result is
where [TeX:] $$l=1,2, \ldots, 9.$$
The fifth layer is the output layer, which has two nodes for de-fuzzification calculation. The outputs of the layer are correction offsets of control parameters.
where, [TeX:] $$k=1,2 ; \omega_{k l}$$ is the weight of the network, [TeX:] $$y_k$$ is the output of the network, [TeX:] $$\Delta \beta_1=y_1, \Delta \beta_2=y_2.$$
3.4.2 Learning algorithm
The fuzzy segmentation number in this design has been determined, and the parameters to be trained include the connection weight [TeX:] $$\omega_{k l}$$ of the fifth layer, and the center value parameters [TeX:] $$a_{i j} \text { and } b_{i j}$$ of the second layer.
Let the desired output of the system be [TeX:] $$\hat{y}_k,$$ the training objective functions are
The iterative formulas for adjusting parameters are
where, [TeX:] $$k=1,2 ; l=1,2, \ldots, 9.$$
where, [TeX:] $$i=1,2,3 ; j=1,2,3.$$
where, [TeX:] $$\eta_1, \eta_2, \eta_3$$ are the learning rates.
4. Simulation Results
In order to examinate the control effect, ADRC, FADRC, and FNNADRC are used to control the TWSBR under different road surfaces, and the simulation results are analysed and compared. The parameters of ADRC are determined by the trial and error method, and then corrected by FNN when the system is running. The parameters of FNN are adjusted by error back-propagation. According to the stress analysis, the maximum [TeX:] $$x_1$$ of TWSBR is [TeX:] $$\arctan \mu, \text { where } \mu$$ is ground friction coefficient [7], therefore, the initial value of inclination should be less than this value. There is no restriction on the initial values of steering angle and displacement. Based on the above instructions, set the initial states [TeX:] $$x_1=\pi / 20, x_3=\pi / 4, x_5=0,$$ desired states [TeX:] $$x_1=0, x_3=0, x_5=0.5.$$
4.1 Flat Road
In case of flat road surface [TeX:] $$\alpha=0,$$ ADRC, FADRC and FNNADRC are used for control simulation respectively. The upright inclination, turning angle and displacement control effects are shown in Figs. 5–7. Fig. 5 shows that the upright curves of ADRC and FADRC oscillate greatly. The error between the real dip angle and the expected value is large, while the error between the dip curve of FNNADRC and the target value is the small. There is no disturbance in the turning subsystem of TWSBR dynamic model, so the little error between the real and the expected value of turning angle in Fig. 6 is showed. Therefore, the error between the real and the expected value of the turning angle under any road condition is relatively small, and the control result is similar to Fig. 6, which will not be described later. Fig. 7 shows that the error of FNNADRC between the displacement curve and the target value is small, and it is better than that of ADRC and FADRC. In conclusion, the control effect of FNNADRC is more satisfactory than the other two methods on flat road surface.
Upright curve on the flat road.
Turning curve on the flat road.
Displacement curve on the flat road.
4.2 Slope Road
In case of slope road surface [TeX:] $$\alpha=0.174 \mathrm{rad},$$ ADRC, FADRC and FNNADRC are used for control simulation respectively. The control effects of upright inclination and displacement are displayed in Figs. 8 and 9. Fig. 8 exhibits that the upright curve oscillation amplitude of ADRC and FADRC is large, while the error between the inclination curve of FNNADRC and the target value is small. Fig. 8 shows that the dip and displacement curves of FNNADRC are smoother and have less errors. To sum up, FNNADRC has the best comprehensive control effect under the slope.
Upright curve on the slope road.
Displacement curve on the slope road.
4.3 Undulating Road
Under the condition of undulating road surface [TeX:] $$\alpha=\sin x_1,$$ ADRC, FADRC and FNNADRC are used for control simulation respectively. The upright inclination and displacement control effects are exhibited in Figs. 10 and 11. Fig. 10 shows that the upright curve oscillation amplitude of ADRC and FADRC is large, while the error between the inclination curve of FNNADRC and the target value is more acceptable. Fig. 11 shows that the displacement curves of ADRC and FADRC have large oscillation amplitude, while the error between the displacement curve of FNNADRC and the target value is more in line with the requirements. In summary, FNNADRC has the best comprehensive control effect under undulating road surface.
Upright curve on the undulating road.
Displacement curve on the undulating road.
5. Conclusion
In this research, the disturbance rejection problem of the TWSBR in complex environments is studied. According to the dynamics model of the TWSBR system on undulating pavement, the FNNADRC is designed. The nonlinear system is decoupled into three subsystems. Then, considering the characteristics of the subsystems, an ADRC is designed to ensure the stability of the subsystems. And the controller parameters are corrected by the self-learning function of the FNN. Under flat road surface, slope road surface and undulating road surface, the control methods of ADRC, FADRC and FNNADRC are simulated and compared respectively to examinate the disturbance rejection capability of FNNADRC. The results of the simulation display that FNNADRC has improved anti-interference capability compared with the other two methods.