Min-jie Ding*, Shao-zhong Zhang*, Hai-dong Zhong**,***, Yao-hui Wu* and Liang-bin Zhang*

A Prediction Model of the Sum of Container Based on Combined BP Neural Network and SVM

Abstract: Predicting the sum of containers is very important in the field of container transport. Many influencing factors can affect the prediction results. These factors usually consist of many variables whose composition is often very complex. In this paper, we use gray relational analysis to set up a proper forecast index system for predicting the sum of containers in foreign trade. To address the low accuracy of traditional prediction models and the difficulty of fully considering all the factors, this paper puts forward a prediction model that combines a back-propagation (BP) neural network with a support vector machine (SVM). First, the BP neural network is applied to the normalized data to generate a preliminary forecast. Second, SVM is employed to correct the residuals of the preliminary forecast. The results of practical examples show that the overall relative error of the combined prediction model is no more than 1.5%, which is less than the relative error of the single prediction models. It is hoped that the research can provide a useful reference for the prediction of the sum of containers and related studies.

Keywords: BP Neural Network, Grey Relational Analysis, Sum of Container Prediction, Support Vector Machine

1. Introduction

Predicting the volume of container traffic is the basic premise for the planning and layout of the container transportation area, the container system, and container ports. It also plays a very important role in determining a port’s direction of development, its scale of infrastructure investment, berth locations, business strategy, etc.
At present, studies on container numbers mainly address the factors influencing container volume and the models for predicting it. The influencing factors can be divided mainly into economic and non-economic factors [1]. Macroeconomic indicators are usually treated as economic factors, while indicators related to the port’s construction are generally treated as non-economic factors. Nevertheless, these indexes are usually abstract, and they cannot directly reflect the characteristics of foreign trade containers or the reasons for changing trends. Using unrepresentative indexes for prediction reduces its precision and makes the results unreliable. The volume of foreign trade containers is subject to many influencing factors, and is notably affected by import- and export-related elements. For the importing country, it can be limited by national politics, economy, culture, and the distance to the exporter; the exporter’s exports are influenced by its location, political environment, level of economic development, resource reserves, and other factors [2]. Existing prediction models mainly adopt time series analysis and causality prediction methods such as moving averages, exponential smoothing, trend extrapolation, regression analysis and so on [3-6]. However, the prediction precision of these approaches is somewhat limited. The volume of foreign trade containers has many influencing factors, and it is difficult to select indexes from the great quantities of data because it is usually unknown whether a given index is related to the forecasting object [7,8]. In the data mining field, many models can be used to find the relationships among large numbers of factors. Among these models, gray relational analysis is one of the most well-known approaches.
It was initially applied to determine the suitable selection of machining parameters for the wire electrical discharge machining (Wire-EDM) process [9]. Later, many scholars used this method to select the influencing factors that are most closely related to the forecasting object. It has been shown that the gray relational analysis method can make the prediction index system more scientific and accurate [10].

The back-propagation (BP) neural network is one of the most widely applied neural network models and can learn and store a great number of input-output mapping relations. It is now widely applied in many areas such as pattern recognition, classification, data compression, data prediction, etc. [11,12]. However, the learning speed of the BP neural network is usually very slow because it requires small learning rates for stable learning. Support vector machine (SVM) is another popular algorithm for supervised machine learning and classification. It has many unique advantages in solving small-sample, nonlinear and high-dimensional pattern recognition problems [13]. However, choosing a proper kernel function is one of the most important factors in applying SVM, and the error of the input data needs to be well controlled. The current studies show that both the BP neural network and the SVM have their advantages and disadvantages [14,15]. Given the complexity of the influencing factors in container volume prediction, joint forecasting models can exploit the advantages of each single model and improve the accuracy significantly. In this paper, we propose a gray relational analysis approach to select the indexes of container volume and combine the BP neural network and SVM [16] methods to forecast the foreign trade container volume. The remainder of the article is organized as follows: the background and the indicators, technologies, and measures related to container volume are reviewed in Section 2.
Then the detailed concepts of the proposed prediction models and algorithms are explained in Section 3. Experiments on publicly available ports-of-entry yearbook data and an analysis of the results are presented in Section 4. Finally, we conclude the paper in Section 5.

2. Related Works

At present, the topic of container volume prediction is being intensively investigated by scholars all over the world. Their studies mainly focus on two aspects. On the one hand, they discuss the prediction index system and how to select the main influencing factors from numerous other indicators. On the other hand, they select, analyze, establish and optimize the prediction models, and evaluate the predicted results accordingly.

2.1 Container Volume Impact Factors

Research on the factors that affect foreign trade container volume is a hot topic. In the existing studies, the related indicators fall into three categories [17-19]: world economy environment, regional economy scale, and technological advances.

(1) The environment of the world economy factors: Foreign trade can be easily influenced by the environment of the world economy, and especially by changes in the international exchange rate. For example, the exchange rate in bilateral trade plays a significant role in promoting or inhibiting imports, which affects exports. If the exchange rate changes, the volumes of imports and exports of the importing and exporting countries will be impacted almost immediately. In the last 3 years, the growth rate of China’s foreign trade container volumes has decreased year by year. One important reason is that, along with changes in the industrial structure and increases in labor costs and raw material prices, many developed countries’ processing factories have turned to Southeast Asia with its cheaper labor.

(2) The regional economy scale factors: The scale of the regional economy affects the foreign trade container volumes directly.
For the importing country, the scale of its regional economy influences its purchasing power and the relative price of imported goods. For the exporter, the scale of its regional economy influences export volumes directly. These influencing factors can be reflected in the GDP and the total imports and exports of goods. With the continuous improvement of foreign trade policies in China, the total imports and exports of goods increased from $0.28 trillion to $4.30 trillion between 1995 and 2014 (https://tradingeconomics.com/china/indicators). Along with more countries signing free trade agreements, the reduction of tariffs, and the popularity of cross-border e-commerce, the quantity of bulk cargo will increase significantly.

(3) The technological advance factors: With progress in the science and technology of container operations and processes, diverse types of special containers are coming into being to meet the transportation requirements of an ever-increasing range of special goods. Meanwhile, the utilization rate of containers is improving, and both the box change rate of common containers and the dead weight of the average heavy box are increasing. However, the appearance of high-value products usually leads to fewer container needs.

2.2 Container Volume Prediction Methods

In general, there are two kinds of container volume prediction models, qualitative and quantitative [20,21].

(1) Qualitative prediction models: Qualitative prediction models are widely used to analyze the factors influencing container volumes. They are also popularly applied to predict the total number of containers. Chou et al. [4] estimated the container imports of Taiwan using an improved regression prediction model and verified that the model had greater predictive accuracy; Hwang et al.
[5] proposed a fuzzy-neural network model GMDH to predict the container volumes for the port of Busan in South Korea; Freitas and Rodrigues [14] studied the feasibility of combining neural network models with the Gaussian radial basis function network approach, and then put forward a linear joint estimation model that was expanded from many commonly used methods. Although much attention has been paid to this research area, many scholars have shown that the accuracy of the qualitative prediction models is rather low and that the results tend to be influenced by many subjective elements [22,23].

(2) Quantitative prediction models: Quantitative prediction models are usually based on neural networks, time series analysis, historical data extrapolation, system dynamics, and so on. These kinds of models can predict effectively, but they suffer from fixed, insufficient predictive indexes and low precision. Makridakis and Winkler [1] found, in research using time series, that combined prediction models are more accurate than single models: when two kinds of prediction models are combined, the error is reduced by 7.2%; when five kinds of prediction models are used jointly, the error can be reduced by 16.3%. Similarly, Huang et al. [24] proposed a partially combined prediction framework for container throughput based on big data composed of structured historical data and unstructured data, and predicted the container traffic through Qingdao Port using the combined prediction approach. Wu [25] presented a load forecasting model based on hybrid particle swarm optimization with Gaussian and adaptive mutation (HAGPSO) and a wavelet v-support vector machine (Wv-SVM).

At present, a great many studies focus on container volume prediction models, while less attention is paid to the factors that influence the prediction object and to the accuracy of the predicted results. Therefore, we propose a joint approach to predict the sum of containers.
First, the paper analyzes the factors influencing foreign trade container volumes through gray relational analysis, which can reduce the complexity of the prediction and alleviate the loss of prediction accuracy caused by small amounts of data. Then, the paper puts forward a combined prediction model based on a BP neural network and SVM, aiming to overcome the fixed indexes and the low predictive accuracy of previous prediction models.

3. Prediction Model and Algorithm

3.1 The Combined Prediction Flow Chart

The complete process of the proposed combined prediction algorithm is shown in Fig. 1. First, the algorithm obtains the relevant preliminary data according to the predictive index system established for foreign trade container volumes, and normalizes the preliminary data. Then, it feeds the preprocessed data to the BP neural network model and acquires the preliminary prediction data. It uses the difference between the actual (preliminary) data and the BP network prediction data as a new sequence, i.e., the residual sequence. The algorithm analyzes the residual sequence by SVM and obtains the revised prediction residual values. Finally, it obtains the final prediction results by adding the prediction data from the BP neural network and the residual values predicted by the SVM model. In short, we use a combined prediction model that corrects the residual error.

3.2 The Algorithm Design

Studies show that the BP neural network model is effective with training data values between 0 and 1. To improve the learning speed of the model, we normalize the input data according to formula (3-1).
(3-1) $$x_{i}^{\prime}=\frac{x_{i}-x_{\min }}{x_{\max }-x_{\min }}$$
where $x_{i}$ is the preliminary data, $x_{i}^{\prime}$ is the normalized data, and $x_{\min}$ and $x_{\max}$ are the minimum and maximum values of each variable. The data $x_{i}^{\prime}$ are used as input to the BP neural network model to generate the preliminary predicted data $\widehat{y}_{k}$. In this process, we initialize the weights and threshold values to small random values, set the number of training iterations to 50, and denote the error tolerance by $\varepsilon$. For the input matrix $x_{k i}^{\prime}$ $(k=1,2, \ldots, R ; i=1,2, \ldots, M)$, $R$ and $M$ are the numbers of rows and columns of the input matrix, respectively. The prediction output $y_{k}$ is obtained through the neural network according to the following nonlinear activation function.
(3-2) $$y_{k}=f\left(net_{k}\right)=\frac{1}{1+e^{-net_{k}}}$$
where $net_{k}$ is the input value of node $k$ in the output layer, $net_{k}=\sum_{k=1}^{L} \omega_{j k} x_{j k}^{\prime}+\theta_{k}$, $L$ is the total number of nodes in the output layer, $\omega_{j k}$ is the weight of the output layer, and $\theta_{k}$ is the threshold value of node $k$. During training of the BP neural network, the error values of the output layer and the hidden layer must be corrected repeatedly until the prediction results satisfy the accuracy requirement, i.e., until the error function satisfies $E_{p} \leq \varepsilon$.
(3-3) $$E_{p}=\frac{1}{2} \sum_{k=1}^{L}\left(y_{p k}-a_{p k}\right)^{2}$$
where $y_{p k}$ is the expected output value of node $k$ in the output layer and $a_{p k}$ is the actual output value of node $k$ in the output layer. The weight values $\omega_{i j}$ and $\omega_{j k}$ of the hidden layer and the output layer are updated by formulas (3-4) and (3-5).
(3-4) $$\omega_{i j}(k+1)=\omega_{i j}(k)+\eta_{j} \delta_{j} a_{j}+a_{j}\left(\omega_{i j}(k)-\omega_{i j}(k-1)\right)$$
(3-5) $$\omega_{j k}(k+1)=\omega_{j k}(k)+\eta_{k} \delta_{k} a_{k}+a_{k}\left(\omega_{j k}(k)-\omega_{j k}(k-1)\right)$$
where $\delta_{j}=a_{j}\left(1-a_{j}\right) \sum_{k=1}^{L} \delta_{k} \omega_{j k}$ is the error value of the hidden layer, $\delta_{k}=y_{k}\left(1-y_{k}\right)\left(y_{p k}-y_{k}\right)$, $k=1,2, \ldots, L$, is the error value of the output layer, $\eta_{j}$ and $\eta_{k}$ are the learning rates, and $a_{j}$ and $a_{k}$ are the output values of the hidden layer and the output layer, respectively. We combine the actual data with the values predicted by the BP neural network to generate a new residual sequence, represented as $\zeta_{k}=y_{k}-\widehat{y}_{k}$. According to the SVM prediction model in formula (3-6), the amended residual sequence $\zeta_{k}^{\prime}$ can be calculated.
where $\omega$ is a weight vector and $b$ is the bias (offset) term. The research adopts the $\varepsilon$-insensitive loss function and establishes the model by cross-validation on the training sets. The function can be represented as
(3-7) $$L_{\varepsilon}=\left\{\begin{array}{ll}{\left|f\left(\zeta_{k}\right)-\zeta_{k}^{\prime}\right|-\varepsilon,} & {\left|f\left(\zeta_{k}\right)-\zeta_{k}^{\prime}\right| \geq \varepsilon} \\ {0,} & {\left|f\left(\zeta_{k}\right)-\zeta_{k}^{\prime}\right|<\varepsilon}\end{array}\right.$$
To obtain better generalization ability on the training sets of the SVM prediction model, minimize the empirical prediction risk, and reduce the computational complexity, we introduce Lagrange multipliers into the SVM:
(3-8) $$f\left(\zeta_{k}\right)=\sum_{k=1}^{n} a_{k}^{*} k\left(\zeta_{k}, \zeta_{k}^{\prime}\right)+b$$
where $k\left(\zeta_{k}, \zeta_{k}^{\prime}\right)$ is the kernel function of the SVM and $a_{k}^{*}$ is the Lagrange multiplier. Following existing research [13,16], we use the radial basis kernel function presented in formula (3-9) to establish the SVM prediction model.
(3-9) $$k\left(\zeta_{k}, \zeta_{k}^{\prime}\right)=\exp \left(-\frac{\left\|\zeta_{k}-\zeta_{k}^{\prime}\right\|^{2}}{\sigma^{2}}\right)$$
Finally, we obtain the final prediction result $\widehat{y}=\widehat{y}_{k}+\zeta_{k}^{\prime}$, where $\widehat{y}_{k}$ is calculated by the BP neural network model and $\zeta_{k}^{\prime}$ is the amended residual value estimated by SVM.

4. Experiments and Analysis

4.1 Indexes for Foreign Trade Container Volume Prediction

Based on the analysis of the factors influencing the volume of foreign trade containers, we select the regional GDP, total volume of import and export trade, total investment in fixed assets, container throughput, container loading, the rate of container cargo loading and the rate of foreign trade goods suitable for containers as candidate indexes to check the effectiveness of the proposed approaches. The indexes are then selected with the help of the gray relational analysis method, which consists of three steps:

(1) The raw data are preprocessed by the average transformation, according to the following formula:
(4-1) $$X_{i}^{\prime}=X_{i} / X_{1}=\left(X_{i}^{\prime}(1), X_{i}^{\prime}(2), \ldots, X_{i}^{\prime}(n)\right), \quad i=1,2, \ldots, m$$
(2) Compute the absolute difference $\Delta_{0 i}\left(t_{j}\right)$ between the reference sequence $X_{0}$ and each comparison sequence $X_{i}$ at each time point, and calculate the relational coefficient from the maximum and minimum values of $\Delta_{0 i}\left(t_{j}\right)$ by formula (4-2).
(4-2) $$\xi_{0 i}\left(t_{j}\right)=\frac{\Delta_{\min }+\Delta_{\max }}{\Delta_{0 i}\left(t_{j}\right)+\Delta_{\max }}$$
where $\xi_{0 i}\left(t_{j}\right)$ is the relational coefficient at time $t_{j}$, $\Delta_{\max}$ is the maximum of $\left|X_{i}-X_{0}\right|$, $\Delta_{\min}$ is the minimum of $\left|X_{i}-X_{0}\right|$, and $\Delta_{0 i}\left(t_{j}\right)$ is the value of $\left|X_{i}-X_{0}\right|$ at the time $t_{j}$.

(3) Calculate the average of all the relational coefficients of each index to obtain its gray relational degree, and choose the indexes according to these values.

With the help of MATLAB, we calculate the gray relational degree of the pre-selected indexes. In descending order of gray relational degree, the indexes are: container throughput, total volume of import and export trade, regional GDP, container loading, total investment in fixed assets, the rate of foreign trade goods suitable for containers and the rate of container cargo loading. Therefore, we select the top five indexes to predict the volume of foreign trade containers. According to the result of the gray relational analysis, the foreign trade container volume (ten thousand TEU) is set as Y, the regional GDP (¥100 million) is set as X1, the total volume of import and export trade ($ billion) is set as X2, the total investment in fixed assets (¥100 million) is set as X3, the container loading (ten thousand tons) is set as X4 and the container throughput (ten thousand TEU) is set as X5. Relevant data for Ningbo and Wenzhou in 2002–2014 are selected for simulation and verification.
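The three steps of the gray relational analysis can be sketched in Python as follows. This is only an illustration under stated assumptions, not the MATLAB code used in the study: the function name `gray_relational_degrees`, the distinguishing coefficient `rho` (0.5 is a common convention; `rho=1.0` gives the form without a weighting factor), the division of each sequence by its mean in step (1), and the sample sequences are all hypothetical.

```python
def gray_relational_degrees(reference, candidates, rho=0.5):
    """Return the gray relational degree of each candidate sequence.

    reference  -- list of numbers, the forecasting object (X0)
    candidates -- dict mapping an index name to its sequence (X1..Xm)
    rho        -- distinguishing coefficient (assumed; not given in the paper)
    """
    def mean_scale(seq):
        # Step (1): make sequences dimensionless by dividing by their mean.
        avg = sum(seq) / len(seq)
        return [v / avg for v in seq]

    ref = mean_scale(reference)

    # Step (2a): absolute differences to the reference at every time point.
    diffs = {
        name: [abs(r - c) for r, c in zip(ref, mean_scale(seq))]
        for name, seq in candidates.items()
    }
    all_diffs = [d for row in diffs.values() for d in row]
    d_min, d_max = min(all_diffs), max(all_diffs)
    if d_max == 0:
        # Every candidate coincides with the reference after scaling.
        return {name: 1.0 for name in diffs}

    # Step (2b) and (3): relational coefficients, then their average.
    degrees = {}
    for name, row in diffs.items():
        coeffs = [(d_min + rho * d_max) / (d + rho * d_max) for d in row]
        degrees[name] = sum(coeffs) / len(coeffs)
    return degrees


# Made-up sequences, not the yearbook data.
reference = [10, 12, 15, 19]
candidates = {
    "proportional": [20, 24, 30, 38],  # same shape as the reference
    "flat": [5, 5, 5, 5],              # unrelated to the reference
}
degrees = gray_relational_degrees(reference, candidates)
ranking = sorted(degrees, key=degrees.get, reverse=True)
```

Sorting the indexes by their degree, as in `ranking`, corresponds to the descending order from which the top five indexes are taken.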
The data for 2002–2009 serve as the training data set and are used to establish and optimize the model’s parameters; the data from 2010 to 2014 are used to examine the accuracy and adjust the parameter values of the proposed prediction models; finally, the model with the optimized parameter values is used to predict the volume of foreign trade containers for the two cities during 2015–2019.

4.2 Primary Data Acquisition and Preprocessing

To evaluate the effectiveness of our methods, we take the indexes related to foreign trade container volume from publicly available statistical yearbooks (as shown in Table 1).

4.3 Prediction with Combined Models

In MATLAB, we establish a BP neural network model with 5 neurons in the input layer, 20 neurons in the hidden layer and 1 neuron in the output layer. The normalized data indexes X1, X2, X3, X4, X5 (see Table 2) are used as the input training dataset to predict the Y values and obtain the residual sequence accordingly. The residual sequence is amended by using the SVM toolbox of MATLAB, with a radial basis kernel function. The final prediction results are obtained by adding the prediction data of the BP neural network and the corrected residual sequence.

Table 1.
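The step from the raw indexes to the normalized network input is a min-max scaling of each index to the range [0, 1]. A minimal sketch in Python with NumPy is given below; the function name `min_max_normalize`, the guard for constant columns, and the sample values are assumptions made for illustration and are not taken from the yearbook data.

```python
import numpy as np


def min_max_normalize(x):
    """Scale each column of x to [0, 1] via (x - x_min) / (x_max - x_min)."""
    x = np.asarray(x, dtype=float)
    x_min = x.min(axis=0)
    x_max = x.max(axis=0)
    # Guard (an added assumption): a constant column would divide by zero,
    # so its span is replaced by 1 and the column maps to 0.
    span = np.where(x_max > x_min, x_max - x_min, 1.0)
    return (x - x_min) / span


# Made-up values: rows are years, columns are two hypothetical indexes.
raw = np.array([[100.0, 2.0],
                [150.0, 4.0],
                [200.0, 8.0]])
normalized = min_max_normalize(raw)
```

Each column is scaled independently, so indexes with different units (for example ¥100 million and ten thousand TEU) end up on the same scale.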
Table 2.
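The two-stage procedure of Section 4.3 (a network forecast followed by an SVM correction of its residual) can be sketched in Python with scikit-learn. This is an illustration under several assumptions and not the MATLAB setup of the study: the data are synthetic, the residual model is assumed to take the same normalized indexes as input, the `lbfgs` solver stands in for gradient descent with momentum, and the values of `C`, `epsilon` and `gamma` are arbitrary.

```python
import numpy as np
from sklearn.neural_network import MLPRegressor
from sklearn.svm import SVR

rng = np.random.default_rng(0)

# Synthetic stand-in data: 13 "years" of 5 normalized indexes in [0, 1].
X = rng.random((13, 5))
y = X @ np.array([0.3, 0.25, 0.2, 0.15, 0.1]) + 0.05 * np.sin(6 * X[:, 0])

X_train, y_train = X[:8], y[:8]   # mirrors the 2002-2009 training span
X_test, y_test = X[8:], y[8:]     # mirrors the 2010-2014 check span

# Stage 1: 5 inputs -> 20 sigmoid hidden units -> 1 output.
bp = MLPRegressor(hidden_layer_sizes=(20,), activation="logistic",
                  solver="lbfgs", max_iter=2000, random_state=0)
bp.fit(X_train, y_train)

# Stage 2: an RBF-kernel SVR learns the residual left by stage 1.
residual = y_train - bp.predict(X_train)
svr = SVR(kernel="rbf", C=1.0, epsilon=0.001, gamma="scale")
svr.fit(X_train, residual)

# Final forecast = network prediction + predicted residual.
bp_test_pred = bp.predict(X_test)
residual_pred = svr.predict(X_test)
combined_pred = bp_test_pred + residual_pred
```

Comparing `combined_pred` and `bp_test_pred` against `y_test` shows whether the correction helps on held-out years; with only eight training rows, as in this sketch, that outcome depends strongly on the chosen hyperparameters.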
We compare the foreign trade container volumes predicted by the BP neural network, SVM and the combined approach for Ningbo and Wenzhou in 2010–2014 (as shown in Table 3). According to the data in Table 3, the volume of foreign trade containers in Ningbo increases steadily during 2010–2014. In contrast, the volume of foreign trade containers in Wenzhou increases suddenly between 2010 and 2011, but in the following 2 years it declines steadily. Figs. 2 and 3 show that the predicted values from the BP neural network model, SVM and the combined model generally reflect these tendencies. However, the values predicted by the SVM and the combined prediction model are closer to the actual values than those of the BP neural network method. One reason may be that the BP neural network model obtains its results by repeated training, while the target precision and the number of training iterations are set manually. If these values are not properly set, the error may be very large. The combined prediction model is based on the prediction of the BP neural network and the residual sequence amended by SVM, which improves the precision of the prediction effectively and yields more accurate results.

Table 3.
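The model comparison relies on the relative error of each prediction. A minimal sketch in Python, assuming the usual definition |predicted − actual| / |actual| expressed as a percentage (the text does not spell the formula out), is shown below; the function names and all numbers are made up for illustration and are not the values of Table 3.

```python
def relative_errors(actual, predicted):
    """Percentage relative error for each pair of actual and predicted values."""
    return [100.0 * abs(p - a) / abs(a) for a, p in zip(actual, predicted)]


def worst_case_by_model(actual, predictions):
    """Map each model name to its largest relative error (in percent)."""
    return {name: max(relative_errors(actual, pred))
            for name, pred in predictions.items()}


# Made-up illustrative numbers.
actual = [100.0, 200.0, 400.0]
predictions = {
    "bp": [110.0, 190.0, 420.0],
    "svm": [97.0, 204.0, 410.0],
    "combined": [99.0, 202.0, 404.0],
}
worst = worst_case_by_model(actual, predictions)
best_model = min(worst, key=worst.get)
```

Reporting the largest error per model matches statements of the form "the relative error is less than a given percentage" for every year in the test span.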
Figs. 4 and 5 plot the relative errors of the predicted results for Ningbo and Wenzhou from the three prediction models. They show that the relative error of the combined prediction model is smaller than that of the two single prediction models, and that the predictions of the BP neural network deviate most from the actual values. As can be seen from the data in Table 3, the relative errors of SVM and the combined prediction model are less than 3.0% and 1.6%, respectively. All these results indicate the superiority of the joint prediction method.

4.5 Foreign Trade Container Volume Prediction in 2015–2019

The experiment on real data in the previous section shows that the joint prediction model that combines the BP neural network with SVM fits the data better than either of the two methods used alone. Therefore, we employ the combined model to predict the foreign trade container volumes for Ningbo and Wenzhou in 2015–2019. The predicted results are shown in Table 4. The overall trend of the foreign trade container volume and the yearly growth ratio for the two cities are plotted in Figs. 6 and 7. Based on the prediction, we can draw the general conclusion that the foreign trade container volumes in both Ningbo and Wenzhou will continue to increase in the next 5 years. Additionally, the yearly growth rate of foreign trade container volume for Ningbo will be higher than that for Wenzhou in 2015–2019.

5. Conclusion

In this paper, we study the factors influencing the foreign trade container volume and establish a joint prediction method that combines a BP neural network model and SVM. Based on gray relational analysis, the top five influencing factors are selected to predict the foreign trade container volume in Ningbo and Wenzhou.
The proposed joint prediction model uses SVM to correct the residual sequence after the initial prediction by the BP neural network model, and real statistical yearbook data for the two cities in 2010–2014 are used to compare the accuracy of the prediction methods. In our prediction experiment, the relative error of the proposed combined prediction method is much lower than that of the single models. This result suggests that the method has practical application value in logistics demand prediction. Based on the proposed prediction approach, we find that the foreign trade container volume in both Ningbo and Wenzhou will increase steadily in 2015–2019, and that the yearly growth rate of Ningbo will be higher than that of Wenzhou.

Acknowledgement

This work was supported by the Natural Science Foundation of Zhejiang (No. LY16G020012), the Major Research Projects of Humanities and Social Sciences in Colleges and Universities of Zhejiang (No. 2014GH015), the Open Research Funding Program of the Key Laboratory of Geographic Information Science (Ministry of Education), East China Normal University (No. KLGIS2014A0l), the Zhijiang Youth Action Project: study on mobile e-commerce recommendation (No. G306), the Zhejiang Association for Business Economics project (No. 2016SJYB01), the key project of Ningbo Education Science Planning (No. 2017YZD010), and the Modern Port Service Industry and Culture Research Center of the Key Research Base of Philosophy and Social Sciences of Zhejiang Province.

Biography

Shao-zhong Zhang
https://orcid.org/0000-0002-4345-4171
He is a professor in the Faculty of Information and Intelligence Engineering School in Zhejiang Wanli University. He is also a research fellow of the Electronic Services Research Center of Zhejiang University and Ningbo Institute of Electronic Services. He received his B.S. in world economy from Nankai University in 1999. In 2004, he received his Ph.D. in computer application technology from Dalian University of Technology.
He has published two books and more than 80 articles. At present, his research interests include data mining and knowledge discovery, sentiment analysis, big data mining, and e-commerce and business intelligence.

Hai-dong Zhong
https://orcid.org/0000-0002-5549-412X
He is an associate professor in the Faculty of Logistics and E-commerce School in Zhejiang Wanli University. He received his B.S. in computer science and technology in 2004 and his Ph.D. in cartography and geography information system from East China Normal University in 2011. At present, his main research interests are mobile e-commerce, personalized recommendation and business intelligence.

Yao-hui Wu
https://orcid.org/0000-0003-1347-0442
He is an associate professor in the Faculty of Information and Intelligent Engineering School in Zhejiang Wanli University. He received his B.S. in communication engineering from Beijing University of Posts and Telecommunications, China in 2004, and his Ph.D. in communication engineering from Ningbo University, China in 2018. At present, his main research interests are wireless communication, wireless networking and information fusion.

Liang-bin Zhang
https://orcid.org/0000-0003-0783-7316
He is an associate professor in the Faculty of Big Data and Software Engineering School in Zhejiang Wanli University. He received his B.S. in computer science and application from Hangzhou University of Electronic Science and Technology, China in 2006. At present, his main research interests are cloud computing, edge computing, system integration, image coding, etc.

References