Documentos de Académico
Documentos de Profesional
Documentos de Cultura
Full Paper
Teknologi
CRUDE OIL PRICE FORECASTING BY Article history
Received
CEEMDAN BASED HYBRID MODEL OF 12 April 2017
Received in revised form
ARIMA AND KALMAN FILTER 23 January 2018
Accepted
Muhammad Aamira*, Ani Shabria, Muhammad Ishaqb 15 February 2018
Published online
aDepartment of Mathematical Sciences, Faculty of Science, 3 June 2018
Universiti Teknologi Malaysia
bDepartment of Mathematics, School of Natural Sciences, *Corresponding author
National University of Sciences and Technology, Islamabad, amuhammad29@live.utm.my
Pakistan
Abstrak
Kajian ini merupakan kombinasi penuh penguraian mod emperik dengan
penyesuaian bunyi atau ensemble empirical mode decomposition with adaptive
noise (CEEMDAN) berdasarkan kepada kombinasi model ramalan harga minyak
mentah dunia. Untuk itu, siri harga minyak mentah asal di bahagikan kepada
subkecil terhingga yang diberi nama intrinsic mode functions (IMFs). Kemudian,
model ARIMA digunakan oleh setiap IMF untuk penganggaran parameter. Dengan
nilai penganggaran parameter model ARIMA, Kalman Filter digunakan oleh setiap
IMFs bagi mendapatkan jangkaan bacaan yang tepat. Untuk pengujan dan
pengesahan, dua harga minyak mentah digunakan. Contohnya dengan peranan
siri harga minyak mentah dari Brent dan WTI (West Texas Intermediate). Nilai D-
statistic yag dicadangkan pada model Brent adalah 93.33% dan untuk WTI adalah
89.29% dimana ianya menunjukkan kepentigan CEEMDAN berasaskan model hibrid.
𝑃𝑅𝐸(𝑀𝐴𝐸) =
𝑀𝐴𝐸(𝐴𝑅𝐼𝑀𝐴 𝑚𝑜𝑑𝑒𝑙)
× 100 of 350 observations and the rest 57 observations
𝑀𝐴𝐸((𝜗)𝑚𝑜𝑑𝑒𝑙)
starting from March, 2011 is used as a testing data set.
For WTI crude oil price, the data covers the period
𝑅𝑀𝑆𝐸(𝐴𝑅𝐼𝑀𝐴 𝑚𝑜𝑑𝑒𝑙) from January, 1982 to November, 2015 with a total of
𝑃𝑅𝐸(𝑅𝑀𝑆𝐸) = × 100 407 observations. Similarly, for WTI series the data from
𝑅𝑀𝑆𝐸((𝜗)𝑚𝑜𝑑𝑒𝑙)
January, 1982 to Feb, 2011 is used as a training data
𝑀𝐴𝑃𝐸(𝐴𝑅𝐼𝑀𝐴 𝑚𝑜𝑑𝑒𝑙)
𝑃𝑅𝐸(𝑀𝐴𝑃𝐸) = × 100 set while the last 57 observations are used as a testing
𝑀𝐴𝑃𝐸((𝜗)𝑚𝑜𝑑𝑒𝑙) data set starting from March, 2011. Both the testing
periods is used to evaluate the performance of the
prediction, based on some evaluation criteria’s. In this
where 𝜗 = ARIMA Kalman Filter, EMD ARIMA, EMD study, only the one step ahead forecasting is carried
ARIMA Kalman Filter, CEEMDAN ARIMA and CEEMDAN out.
The ARIMA-Kalman Filter model used the estimated as discussed in the case of EMD, here the only
values of the ARIMA model presented in Table 1 as difference is that IMFs and residue are extracted by
starting values of the Kalman Filter recursion. The EMD- the CEEMDAN technique. While in the case of
ARIMA model uses the ARIMA model to predict each CEEMDAN-ARIMA-Kalman Filter model, the only
IMF and residue extracted by EMD and after that difference is the extraction of IMFs and residual as
simply combine all predicted IMFs and residue to get compared to EMD-ARIMA-Kalman Filter model here
the result. In EMD-ARIMA-Kalman Filter model first the IMFs are extracted by the CEEMDAN technique.
ARIMA model apply to the extracted IMFs and residue Thus, ARIMA, EMD-ARIMA ensemble, EMD-ARIMA-
and obtain the parameters estimate for each ARIMA Kalman Filter ensemble, CEEMDAN-ARIMA ensemble
model after that using the estimated value of ARIMA and CEEMDAN-ARIMA-Kalman Filter ensemble models
as starting values for the Kalman recursion and predict are all used for comparison to predict the crude oil
all IMFs including residual at the end simply combine prices for the two benchmark Brent and WTI
the predicted values of all IMFs to get the result. In respectively.
In this study, the complete intrinsic mode functions applying ARIMA model. First, the EMD technique is
are not used which is generated by EMD/CEEMDAN used to decompose the two crude oil prices for the
technique. The purpose of the limited use of extracted extraction of several independents IMFs. For WTI series
IMFs is to get the better prediction. The limited use of the extracted total number of IMFs including residue
the IMFs was decided based on MAE and RMSE while was eight while for Brent series it was nine. Applying
74 Muhammad Aamir, Ani Shabri & Muhammad Ishaq / Jurnal Teknologi (Sciences & Engineering) 80:4 (2018) 67–79
ARIMA model to a different number of IMFs, the IMFs and residue. The graphical representation of IMFs
minimum MAE and RMSE was obtained by using five by EMD method are shown in Figure 2.
IMFs, where the last IMF called residue is the sum of last
Next, the CEEMDAN technique is used to ARIMA model to different numbers of IMFs extracted
decompose the two crude oil prices series into by CEEMDAN, the minimum MAE and RMSE was
different independent IMFs and the same number of obtained by using five IMFs, where the last IMF called
IMFs were extracted by this method as well. For Brent residue is the sum of last IMFs and residue. The
series, the adaptive noise strength was taken as 0.21 graphical representation of IMFs by CEEMDAN
while for WTI series it was equal to 0.19 with these two method are shown in Figure 3. It is to be noted that
values the optimum mean absolute error, root mean only the single ARIMA and ARIMA-Kalman Filter model
square error and mean absolute percentage error use the original series for prediction purposes. The rest
were obtained for selected number of IMFs. Applying of the ensemble models are used the extracted
75 Muhammad Aamir, Ani Shabri & Muhammad Ishaq / Jurnal Teknologi (Sciences & Engineering) 80:4 (2018) 67–79
decomposed IMFs for the prediction purposes for both MAE, RMSE and MAPE indicators, the CEEMDAN-
crude oil prices series. The ARIMA model estimated ARIMA-Kalman Filter model executes the best
parameters results are shown in Table 1 for Brent and performance for all oil data sets, followed by the
WTI crude oil price series respectively. For the CEEMDAN-ARIMA model, EMD-ARIMA model, EMD-
remaining four models, different orders of ARIMA-Kalman Filter, ARIMA-Kalman Filter and at the
autoregressive and moving averages terms were used end the individual ARIMA model. Remarkably, it is
by ARIMA models for different extracted IMFs in each interesting to note that the performance of the EMD-
case. After selection of best ARIMA model for each ARIMA model is better than the EMD-ARIMA-Kalman
extracted IMFs, the respective parameters were Filter model for both crude oil prices series. From the
estimated in all four models and rest of the analysis above results, it is also noted that the decomposition
performed correspondingly as discussed earlier in this results of the IMFs of two methods that are EMD and
section. At the end, all the IMFs and residuals results CEEMDAN are different. Especially in the case of the
are simply added for obtaining the prediction results. models CEEMDAN-ARIMA-Kalman Filter and EMD-
Next, the evaluation of the prediction results of the two ARIMA-Kalman Filter, the Kalman Filter produces good
benchmark crude oil prices series are shown in Table 2 forecasting results with less MAE, RMSE, and MAPE for
and 4 via MAE, RMSE, MAPE, and D statistic. So, it is the model CEEMDAN-ARIMA-Kalman Filter while for
clear from Table 3 that the CEEMDAN-ARIMA-Kalman the model EMD-ARIMA-Kalman Filter the situation was
Filter model has very promising results as compared to different. From this study, we noted that the Kalman
the other designated models listed in this study. Filter is not producing good results with EMD while with
Hence, the prediction performance of the model CEEMDAN it produces good results. The possible
CEEMDAN-ARIMA-Kalman Filter is much better than reasons may be that the CEEMDAN decomposition
other models by the output of the three-evaluation influences the forecasting performance.
criterion MAE, RMSE, and MAPE. Concentrating on the
Table 2 The MAE, RMSE and MAPE comparisons of the different selected methods
This paper also examines the percentage relative Brent crude oil series. For WTI series the results for the
efficiency (PRE) approach used to compare different model CEEMDAN-ARIMA-Kalman Filter are improved
models with the basic ARIMA model per the three hundred percent associated with the single
percentage improvement in each MAE, RMSE and ARIMA model per the forecasting power of the model.
MAPE criterion respectively. Table 3 presented the The rest of the models forecasting power percentages
PRE-of all models, it is clear from the Table 3 that the are shown in Table 3 for both Brent and WTI crude oil
forecasting power of the model CEEMDAN-ARIMA- prices associated with the single ARIMA model. Figure
Kalman Filter is improved almost two hundred percent 4 also shows the percentage relative efficiency of all
as compared to the single ARIMA model by the models with the three-evaluation criterion namely
evaluation criterion MAE, RMSE, and MAPE for the MAE, RMSE, and MAPE.
Table 3 The percentage relative efficiency comparisons of the different selected methods
The next evaluation criterion is Dstatistic which evaluates strategy. From the outcomes of this paper, we
the directional performance of the model. Since the observed that the decomposition of the series is a
MAE, RMSE, and MAPE do not ensure the high success better strategy regarding the prediction of the series
rate of crude oil price movement direction which is as compared to other single models.
more important for investors because the businessmen
plan their activity per the market trend. So, therefore, Table 4 The percentage Dstatistic comparisons of the
the Dstatistic assessment is obligatory. different selected methods
Table 4 presented the percentage values of
Dstatistic, from the Table 4 we observed that the model Models Brent WTI
CEEMDAN-ARIMA-Kalman Filter perform well with high
Dstatistic Rank Dstatistic Rank
percentage among other models for both crude oil
prices. Regarding the investor point of view, the Dstatistic CEEMDAN-ARIMA-
93.33 1 89.29 1
Kalman Filter
is much important evaluation criterion than the MAE,
RMSE, and MAPE for prediction purposes. According CEEMDAN-ARIMA 87.78 2 85.71 2
with Table 4 shows a significant difference among EMD-ARIMA-
78.89 4 78.57 4
different models although the ranks remain same Kalman Filter
between all evaluation criterion in Table 4, the Brent EMD-ARIMA 82.22 3 82.14 3
crude oil Dstatistic values for single ARIMA model is ARIMA-Kalman
75.00 5 65.45 5
44.44%, for the ARIMA-Kalman Filter model it is 75.00%, Filter
for the EMD-ARIMA model is 82.22%, for the EMD- ARIMA 44.44 6 50.00 6
ARIMA-Kalman Filter model the value is 78.89%, for the
CEEMDAN-ARIMA model the value is 87.78%, and for
the last model which is CEEMDAN-ARIMA-Kalman In this paper, the hybrid model of ARIMA-Kalman
Filter, the Dstatistic value is 93.33%. Except for the ARIMA Filter is used to predict the extracted IMFs from the
model, the rest of the models are having higher original series. So, the performance of the model
percentages for Dstatistic. For WTI series the pattern is the CEEMDAN-ARIMA-Kalman Filter is considerably better
same for Dstatistic with the lowest value for ARIMA than other models considered in this study, namely
model. The main reason is that the nonlinearity, high they are single ARIMA model, ARIMA-Kalman Filter
noise, and complex dynamics are the characteristics model, EMD-ARIMA model, EMD-ARIMA-Kalman Filter
of crude oil price series Yu andWang[38], while ARIMA model and CEEMDAN-ARIMA model for the
is a class of linear models. The study presented in this mentioned two crude oil prices with respect to
paper is the comparison of the three different forecasting power accuracy, as stately measured by
techniques the first one is the single ARIMA model, the MAE, RMSE, MAPE and the directional Dstatistic.
second is the hybrid model of ARIMA and Kalman Filter
while the third is the decomposition and ensemble
4.0 CONCLUSION descriptive measures that are MAE, RMSE, MAPE and
one directional measure that is Dstatistic criterion. On
This paper proposed a CEEMDAN-ARIMA-Kalman Filter every case the MAE, RMSE, and MAPE are the lowest
model for forecasting the world crude oil monthly while the Dstatistic is the highest, demonstrating that the
prices. In terms of experimental results, the suggested CEEMDAN-ARIMA-Kalman Filter technique is a
model performed well by evaluating the three promising approach needs to be used for prediction
of the world crude oil monthly prices in future.
(a) One step ahead forecast for testing data of Brent crude oil prices
(b) One step ahead forecast for testing data of WTI crude oil prices
Figure 5 One Step Ahead Forecast for Testing Data Sets of both Crude Oil Prices
[6] Abosedra, S. and Baghestani, H. 2004. On the Predictive [32] Rosenberg, B. 1973. Random Coefficients Models: The
Accuracy Of Crude Oil Futures Prices. Energy Policy. 32(12): Analysis of a Cross Section of Time Series by Stochastically
1389-1393. Convergent Parameter Regression. Annals of Economic
[7] Wang, S. Y., L. A. Yu and K. K. Lai. 2005. Crude Oil Price and Social Measurement. 2(4): 399-428.
Forecasting with TEI@ I Methodology. Journal of Systematic [33] Engle, R. F. 1979. Estimating Structural Models of
Science. Complexity. 18(2): 145-166. Seasonality. Seasonal Analysis of Economic Time Series.
[8] Huntington, H. G. 1994. Oil Price Forecasting in the 1980s: NBER. 281-308.
What Went Wrong? The Energy Journal. 1-22. [34] Harvey, A. C. and Phillips, G. D. 1979. Maximum Likelihood
[9] Abramson, B. and Finizza, A. 1995. Probabilistic Forecasts Estimation of Regression Models with Autoregressive-
from Probabilistic Models: A Case Study in the Oil Market. Moving Average Disturbances. Biometrika. 66(1): 49-58.
International Journal of Forecasting. 11(1): 63-72. [35] Ravichandran, S. and Prajneshu, J. 2001. State Space
[10] Barone-Adesi, G., Bourgoin, F., and Giannopoulos, K. 1998. Modeling Versus ARIMA Time-Series Modeling. Journal of
Don't Look Back. Risk. 11:100-104. Indian Society of Agricultural Statistics. 54(1): 43-51.
[11] Morana, C. 2001. A Semiparametric Approach to Short- [36] Nikolaisen Sävås, F. 2013. Forecast Comparison of Models
Term Oil Price Forecasting. Energy Economics. 23(3): 325- Based on SARIMA and the Kalman Filter for Inflation. Master
338. Thesis, Uppsala Universitet.
[12] Gülen, S. G. 1998. Efficiency in the Crude Oil Futures Market. [37] Aamir, M. and Shabri, A. 2016. Modelling and Forecasting
Journal of Energy Finance & Development. 3(1): 13-21. Monthly Crude Oil Price of Pakistan: A Comparative Study
[13] Mirmirani, S. and Li, H. C. 2004. A Comparison of VAR and of ARIMA, GARCH and ARIMA Kalman Model. Advances in
Neural Networks with Genetic Algorithm in Forecasting Industrial and Applied Mathematics: Proceedings of 23rd
Price of Oil. Advances in Econometrics. 19: 203-223. Malaysian National Symposium of Mathematical Sciences
[14] Lanza, A., Manera, M., and Giovannini, M. 2005. Modeling (SKSM23). AIP Publishing.
and Forecasting Cointegrated Relationships Among Heavy [38] Yu, L., Wang, S., and Lai, K. K. 2008. Forecasting Crude Oil
Oil and Product Prices. Energy Economics. 27(6): 831-848. Price with an EMD-based Neural Network Ensemble
[15] Ye, M., Zyren, J., and Shore, J. 2002. Forecasting Crude Oil Learning Paradigm. Energy Economics. 30(5): 2623-2635.
Spot Price using OECD Petroleum Inventory Levels. [39] Tang, B., Dong, S., and Song, T. 2012. Method for Eliminating
International Advances in Economic Research. 8(4): 324- Mode Mixing of Empirical Mode Decomposition Based on
333. the Revised Blind Source Separation. Signal Processing.
[16] Ye, M., Zyren, J., and Shore, J. 2005. A Monthly Crude Oil 92(1): 248-258.
Spot Price Forecasting Model Using Relative Inventories. [40] An, X., et al. 2012. Short-term Prediction of Wind Power
International Journal of Forecasting. 21(3): 491-501. Using EMD and Chaotic Theory. Communications in
[17] Ye, M., Zyren, J., and Shore, J. 2006. Forecasting Short-run Nonlinear Science and Numerical Simulation. 17(2): 1036-
Crude Oil Price Using High-and Low-inventory Variables. 1042.
Energy Policy. 34(17): 2736-2743. [41] Guo, Z., et al. 2012. Multi-step Forecasting for Wind Speed
[18] Dées, S., et al. 2008. Assessing the Factors Behind Oil Price Using a Modified EMD-based Artificial Neural Network
Changes. European Central Bank Working Paper. 4-36. Model. Renewable Energy. 37(1): 241-249.
[19] Liu, L. M. 1991. Dynamic Relationship Analysis of US Gasoline [42] Huang, N. E. and Wu, Z. 2008. A Review on Hilbert‐Huang
and Crude Oil Prices. Journal of Forecasting. 10(5): 521-547. Transform: Method and Its Applications to Geophysical
[20] Chinn, M. D., LeBlanc, M., and Coibion, O. 2005. The Studies. Reviews of Geophysics. 46(2).
Predictive Content of Energy Futures: An Update on [43] Schlotthauer, G., Torres, M. E., and Rufiner, H. L. 2009. A New
Petroleum, Natural Gas, Heating Oil and Gasoline. National Algorithm for Instantaneous F 0 Speech Extraction Based on
Bureau of Economic Research. Ensemble Empirical Mode Decomposition. Signal
[21] Agnolucci, P. 2009. Volatility in Crude Oil Futures: A Processing Conference, 2009 17th European. IEEE.
Comparison of the Predictive Ability of GARCH and Implied [44] Wu, Z. and Huang, N. E. 2009. Ensemble Empirical Mode
Volatility Models. Energy Economics. 31(2): 316-321. Decomposition: A Noise-assisted Data Analysis Method.
[22] Ahmad, M. 2012. Modelling and Forecasting Oman Crude Advances in Adaptive Data Analysis. 1(01): 1-41.
Oil Prices Using Box–Jenkins Techniques. International [45] Wu, Z., et al. 2011. On the Time-varying Trend in Global-
Journal of Trade and Global Markets. 5(1): 24-30. Mean Surface Temperature. Climate Dynamics. 37(3-4):
[23] Sadorsky, P. 2006. Modeling and Forecasting Petroleum 759-773.
Futures Volatility. Energy Economics. 28(4): 467-488. [46] Torres, M. E., et al. 2011. A Complete Ensemble Empirical
[24] Hou, A. and Suardi, S. 2012. A Nonparametric GARCH Mode Decomposition with Adaptive Noise. Acoustics,
Model of Crude Oil Price Return Volatility. Energy Speech and Signal Processing (ICASSP), 2011 IEEE
Economics. 34(2): 618-626. International Conference on. IEEE.
[25] Ahmed, R. A. and Shabri, A. B. 2013. Fitting GARCH Models [47] Antico, A., Schlotthauer, G., and Torres, M. 2014. Analysis of
to Crude Oil Spot Price Data. Life Science Journal. 10(4). Hydroclimatic Variability and Trends Using a Novel
[26] Aamir, M. and Shabri, A. B. 2015. Modelling and Forecasting Empirical Mode Decomposition: Application to the Paraná
Monthly Crude Oil Prices of Pakistan: A Comparative Study River Basin. Journal of Geophysical Research:
of ARIMA, GARCH and ARIMA-GARCH Models. Science Atmospheres. 119(3): 1218-1233.
International. 27(3): 2365-2371. [48] Montgomery, D. C., Jennings, C. L., and Kulahci, M. 2015.
[27] Diebold, F. X. and Mariano, R. S. 2012. Comparing Introduction to Time Series Analysis and Forecasting. John
Predictive Accuracy. Journal Of Business & Economic Wiley & Sons.
Statistics. 20(1): 134-144. [49] Box, G. E., Jenkins, G. M., and Reinsel, G. C. 2011. Time Series
[28] Hyndman, R., et al. 2008. Forecasting with Exponential Analysis: Forecasting and Control. Vol. 734. John Wiley &
Smoothing: The State Space Approach. Springer Science & Sons.
Business Media. [50] Brocklebank, J. and Dickey, D. A. 2003. SAS for Forecasting
[29] Harvey, A. C. 1984. A Unified View of Statistical Forecasting Time Series. Vol. 2. SAS institute.
Procedures. Journal of Forecasting. 3(3): 245-275. [51] Hamilton, J. D. 1994. Time Series Analysis. Vol. 2. Princeton
[30] Kalman, R. E. 1960. A New Approach to Linear Filtering and university Press Princeton.
Prediction Problems. Journal of Fluids Engineering. 82(1): 35- [52] Saini, N. and Mittal, A. K. 2014. Forecasting Volatility in
45. Indian Stock Market using State Space Models. Journal of
[31] Morrison, G. W. and Pike, D. H. 1977. Kalman Filtering Statistical and Econometric Methods. 3(1): 115-136.
Applied to Statistical Forecasting. Management Science. [53] Huang, N. E., et al. 1998. The Empirical Mode
23(7): 768-774. Decomposition and the Hilbert Spectrum for Nonlinear and
Non-Stationary Time Series Analysis. Proceedings of the
79 Muhammad Aamir, Ani Shabri & Muhammad Ishaq / Jurnal Teknologi (Sciences & Engineering) 80:4 (2018) 67–79
Royal Society of London A: Mathematical, Physical and for Foreign Exchange Rates. Computers & Operations
Engineering Sciences. The Royal Society. Research. 32(10): 2523-2541.
[54] Huang, N. E., Shen, Z., and Long, S. R. 1999. A New View of [57] Yu, L., Wang, S., and Lai, K. K. 2010. Foreign-Exchange-Rate
Nonlinear Water Waves: The Hilbert Spectrum 1. Annual Forecasting with Artificial Neural Networks. Vol. 107.
Review of Fluid Mechanics. 31(1): 417-457. Springer Science & Business Media.
[55] Colominas, M. A., et al. 2012. Noise-assisted EMD Methods [58] Petris, G. 2010. An R Package for Dynamic Linear Models.
in Action. Advances in Adaptive Data Analysis. 4(04): Journal of Statistical Software. 36(12): 1-16.
1250025. [59] Akaike, H. 1974. A New Look at the Statistical Model
[56] Yu, L., Wang, S., and Lai, K. K. 2005. A Novel Nonlinear Identification. Automatic Control, IEEE Transactions on.
Ensemble Forecasting Model Incorporating GLAR and ANN 19(6): 716-723.