Multisource Data-Driven Carbon Price Composite Forecasting Model: Based on Feature Selection and Multiscale Prediction Strategy
DOI:
https://doi.org/10.13052/spee1048-5236.4342Keywords:
Carbon price forecasting, second decomposition, feature selection, unstructured data, influencing factors, bidirectional long short-term memory network, extreme gradient boosting, particle swarm optimizationAbstract
Accurate prediction of carbon trading prices is crucial for guiding investors to make informed decisions and assisting governments in formulating scientific carbon trading policies. This paper introduces a multisource data-driven carbon price composite forecasting model, aimed at enhancing prediction accuracy through advanced data processing and analysis methods. The model initially employs a secondary decomposition strategy, including Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN) and Variational Mode Decomposition (VMD) methods, to decompose the original data series into three sub-sequences of different frequencies. Subsequently, it utilizes the Partial Autocorrelation Function (PACF) and Random Forest algorithm for feature selection to determine the input variables for different frequency sequences and conducts in-depth analysis and selection of influencing factors, including unstructured data. Furthermore, the model employs a multiscale forecasting strategy, combining Particle Swarm Optimization (PSO) enhanced Bidirectional Long Short-Term Memory (BiLSTM) and Extreme Gradient Boosting (XGBoost) models, along with the Autoregressive Integrated Moving Average (ARIMA) method, to predict based on the characteristics of different frequency components. Finally, the forecasts are integrated using PSO-BiLSTM to form a comprehensive forecast. Notably, given the high correlation between the trend series and influencing factor variables, the model jointly predicts them. A case study based on the Guangdong carbon market in China demonstrates that the proposed composite forecasting model outperforms other benchmark models, with Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and Mean Absolute Percentage Error (MAPE) values of 0.4009, 0.2699, and 0.5183%, respectively. This forecasting model provides an effective tool for predicting and analyzing carbon price fluctuations, offering new insights for precise carbon market price predictions.
Downloads
References
Arouri M E H, Jawadi F, Nguyen D K. Nonlinearities in carbon spot-futures price relationships during Phase II of the EU ETS. Economic Modelling. 2012;29(3):884–892.
Benz E, Trück S. Modeling the price dynamics of CO2
emission allowances. Energy Economics. 2009;31(1):4–15.
Byun S J, Cho H. Forecasting carbon futures volatility using GARCH models with energy volatilities. Energy Economics. 2013;40(Nov.): 207–221.
Zhang Y, Liu Z, Xu Y. Carbon price volatility: The case of China. PLoS ONE. 2018;13(10):e0205317.
Vapnik V. The nature of statistical learning theory. Springer science & business media. 1999;
Cheng Y, Hu B. Forecasting regional carbon prices in China based on secondary decomposition and a hybrid kernel-based extreme learning machine. Energies. 2022;15(10):3562.
Abdi A, Taghipour S. Forecasting carbon price in the Western Climate Initiative market using Bayesian networks. Carbon Management. 2019;10(3):255–268.
Li H, Huang X, Zhou D, Cao A, Su M, Wang Y, Guo L. Forecasting carbon price in China: a multimodel comparison. International Journal of Environmental Research and Public Health. 2022;19(10):6217.
Zhang F, Wen N. Carbon price forecasting: a novel deep learning approach. Environmental Science and Pollution Research. 2022;29(36): 54782–54795.
Wu Y-X, Wu Q-B, Zhu J-Q. Improved EEMD-based crude oil price forecasting using LSTM networks. Physica A: Statistical Mechanics and its Applications. 2019;516:114–124.
Zhang K, Cao H, Thé J, Yu H. A hybrid model for multi-step coal price forecasting using decomposition technique and deep learning algorithms. Applied Energy. 2022;306:118011.
Deng C, Huang Y, Hasan N, Bao Y. Multi-step-ahead stock price index forecasting using long short-term memory model with multivariate empirical mode decomposition. Information Sciences. 2022;607: 297–321.
Hao H, Yu F, Li Q. Soil temperature prediction using convolutional neural network based on ensemble empirical mode decomposition. Ieee Access. 2020;9:4084–4096.
Zhang Z, Hong W-C. Electric load forecasting by complete ensemble empirical mode decomposition adaptive noise and support vector regression with quantum-based dragonfly algorithm. Nonlinear dynamics. 2019;98:1107–1136.
Huang Y, Yang L, Liu S, Wang G. Multi-step wind speed forecasting based on ensemble empirical mode decomposition, long short term memory network and error correction strategy. Energies. 2019;12(10):1822.
Zhu B, Wang P, Chevallier J, Wei Y. Carbon price analysis using empirical mode decomposition. Computational Economics. 2015;45:195–206.
Sun W, Li Z. An ensemble-driven long short-term memory model based on mode decomposition for carbon price forecasting of all eight carbon trading pilots in China. Energy Science & Engineering. 2020;8(11):4094–4115.
Lu H, Ma X, Huang K, Azimi M. Carbon trading volume and price forecasting in China using multiple machine learning models. Journal of Cleaner Production. 2020;249:119386.
Zhou F, Huang Z, Zhang C. Carbon price forecasting based on CEEMDAN and LSTM. Applied Energy. 2022;311:118601.
Sun W, Huang C. A carbon price prediction model based on secondary decomposition algorithm and optimized back propagation neural network. Journal of Cleaner Production. 2020;243:118671.
Zhou J, Wang Q. Forecasting carbon price with secondary decomposition algorithm and optimized extreme learning machine. Sustainability. 2021;13(15):8413.
Bokelmann B, Lessmann S. Spurious patterns in Google Trends data-An analysis of the effects on tourism demand forecasting in Germany. Tourism management. 2019;75:1–12.
Huang X, Zhang L, Ding Y. The Baidu Index: Uses in predicting tourism flows – A case study of the Forbidden City. Tourism management. 2017;58:301–306.
Almaraashi M. Investigating the impact of feature selection on the prediction of solar radiation in different locations in Saudi Arabia. Applied Soft Computing. 2018;66:250–263.
Dragomiretskiy K, Zosso D. Variational mode decomposition. IEEE transactions on signal processing. 2013;62(3):531–544.
Chen T, Guestrin C. XGBoost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. 2016;785–794.