A combined model for short-term wind speed forecasting based on empirical mode decomposition, feature selection, support vector regression and cross-validated lasso

View article
Loading...
PeerJ Computer Science

Main article text

 

Introduction

  1. A novel model based on EMD, FS, SVR and LassoCV is proposed to improve the accuracy of multi-step wind speed forecasting, where EMD is used to extract IMFs from the original wind speed data to reduce the non-stationarity of wind speed.

  2. Based on the principle of EMD, the first IMF component decomposed by EMD contains most of the high-frequency information, and an algorithm with good generalization performance is usually required for prediction. We combine FS and SVR to predict the high-frequency IMF (i.e., the first IMF) component.

  3. Compared with the first IMF component, the frequency of the other IMF components decomposed by EMD is much lower and presents a Sin-like curve. Linear regression usually gets better performance. We introduce LassoCV to complete the prediction of low-frequency IMFs and trend.

Methods

The whole process of the proposed model

  1. Use EMD to decompose wind speed into a series of IMFs. EMD algorithm is introduced in ‘Empirical model decomposition’

  2. Combine FS and SVR to predict the high-frequency IMF obtained by EMD. FS and SVR algorithms are provided in ‘Feature selection’ and ‘Support vector regression’, respectively.

  3. Use LassoCV to complete the prediction of the low-frequency IMF and trend. LassoCV algorithm is listed in ‘Cross-validated lasso’.

  4. Performance evaluation. The performance indicators are introduced in ‘Prediction performance criteria’, and the experimental results and analysis are given in ‘Results’ and ‘Discussion’.

Empirical model decomposition

Feature selection

Support vector regression

Cross-validated lasso

Prediction performance criteria

Results

Wind speed data

Experiments and result analysis

  1. In the 1-step forecasting, for wind station #1, the proposed model obtains the best accuracy: RMSE, MAE, and MAPE are 0.5859, 0.4426, and 21.11%, respectively. The classic individual models from low to high based on RMSE are ELM, ANN, Persistence, SVR, and ARIMA, with MAPE values of 36.20%, 36.24%, 36.20%, 34.87%, and 34.25%, respectively. Likely, in wind station #2, compared with the classic individual models, the proposed model still obtains the best performance, and the MAPE value is 17.10%.

  2. In the 2-step forecasting, when wind station #1 is used, the proposed model has the lowest performance criteria, i.e., the values of RMSE, MAE, and MAPE are 0.7531, 0.5848, and 24.78%, respectively. In addition, for wind station #2, the proposed model still achieves the lowest performance criteria value. Take MAPE as an example, the value of MAPE is 22.99%, which is significantly lower than other models.

  3. In the 3-step forecasting, the proposed model is still the model with the highest prediction accuracy, and the MAPE of wind stations #1 and #2 are 27.55% and 24.59%, respectively. Persistence has the worst RMSE value among these models, with MAPE of 57.64% and 47.99%, respectively.

Compared with traditional EMD methods

  1. Compared with the above-mentioned classic individual models, the performance of the EMD-based method is significantly improved. Take wind station #1 as an example, in the 1-step forecasting, the value of RMSE of the EMD-based methods is around 0.60, while the classic individual model is around 1.20. After the wind speed is decomposed by EMD, the value of RMSE is reduced almost doubled.

  2. For wind station #1, except for the MAE in the 3-step forecasting, the performance indicators obtained from the proposed model are significantly better than those EMD-based combined models. For the 3-step forecasting, the performance of EMD-SVR and EMD-SVR-SP in MAE is slightly better than the proposed combined model, but in other evaluation indicators, the proposed combined model achieves a significantly better performance. Furthermore, EMD-ANN is always worse in MAPE as compared with the other three combined models, with MAPE of 23.55%, 27.67%, and 29.31% for 1- to 3-step forecasting.

  3. For wind station #2, in 1- to 3-step wind speed forecasting, the proposed combined model obtains the best prediction results. The RMSE, MAE and MAPE in the 1-step forecasting are 0.5593, 0.419, and 17.10%, respectively. In comparison, among the other four EMD-based combined models, the EMD-ELM and EMD-ANN models have similar prediction performance in 1- to 3-step forecasting, with MAPE values of 21.59%, 27.49%, 27.65% and 21.83%, 25.3%, 27.86%, respectively.

Discussion

Performance of SVR-SP and LassoCV on different IMFs

Comparison of different signal decomposition techniques

The impact of the number of selected features on performance

Performance under different signal-to-noise ratios

Conclusions

Additional Information and Declarations

Competing Interests

The authors declare there are no competing interests.

Author Contributions

Tao Wang conceived and designed the experiments, performed the experiments, analyzed the data, performed the computation work, prepared figures and/or tables, authored or reviewed drafts of the paper, and approved the final draft.

Data Availability

The following information was supplied regarding data availability:

The code is available at GitHub: https://github.com/JackAndCole/short-term-wind-speed-forecasting.

The data is available at GitHub: https://github.com/JackAndCole/short-term-wind-speed-forecasting/tree/main/data.

Funding

The authors received no funding for this work.

15 Citations 1,713 Views 540 Downloads

Your institution may have Open Access funds available for qualifying authors. See if you qualify

Publish for free

Comment on Articles or Preprints and we'll waive your author fee
Learn more

Five new journals in Chemistry

Free to publish • Peer-reviewed • From PeerJ
Find out more