Prediction of sea ice area based on the CEEMDAN-SO-BiLSTM model

Qiao Guo; Haoyu Zhang; Yuhao Zhang; Xuchu Jiang

doi:10.7717/peerj.15748

Prediction of sea ice area based on the CEEMDAN-SO-BiLSTM model

Qiao Guo, Haoyu Zhang, Yuhao Zhang, Xuchu Jiang

Zhongnan University of Economics and Law, Wuhan, Hubei, China

DOI: 10.7717/peerj.15748

Published: 2023-07-19
Accepted: 2023-06-22
Received: 2023-03-10

Academic Editor: Roger Jones

Subject Areas: Computational Science, Data Mining and Machine Learning, Aquatic and Marine Chemistry, Environmental Impacts, Biological Oceanography
Keywords: Sea ice area, Daily prediction, CEEMDAN, SO, BiLSTM

Copyright: © 2023 Guo et al.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.

Cite this article: Guo Q, Zhang H, Zhang Y, Jiang X. 2023. Prediction of sea ice area based on the CEEMDAN-SO-BiLSTM model. PeerJ 11:e15748 https://doi.org/10.7717/peerj.15748

The authors have chosen to make the review history of this article public.

Abstract

This article proposes a combined prediction model based on a bidirectional long short-term memory (BiLSTM) neural network optimized by the snake optimizer (SO) under complete ensemble empirical mode decomposition with adaptive noise. First, complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) was used to decompose the sea ice area time series data into a series of eigenmodes and perform noise reduction to enhance the stationarity and smoothness of the time series. Second, this article used a bidirectional long short-term memory neural network optimized by the snake optimizer to fully exploit the characteristics of each eigenmode of the time series to achieve the prediction of each. Finally, the predicted values of each mode are superimposed and reconstructed as the final prediction values. Our model achieves a good score of RMSE: 1.047, MAE: 0.815, and SMAPE: 3.938 on the test set.

Introduction

Sea ice extent is a key observational indicator of climate change and diversity (Serreze, Holland & Stroeve, 2007) Over the past half-century, satellite observations have revealed a gradual increase in Arctic temperatures, a gradual decrease in sea ice cover, and the emergence of Arctic amplification (Holland et al., 2019). At different time scales, sea ice cover anomalies can have extreme effects on atmospheric circulation and precipitation patterns, which in turn can further affect the climate at mid- and high latitudes (Screen, 2013), such as the 2021 cold snap in Texas and Oklahoma. Based on current trends, Arctic sea ice could disappear completely by 2050 (Notz & Stroeve, 2018). In addition, accurate daily, quarterly and annual monitoring and prediction of changes in sea ice extent have important implications for human exploitation of maritime resources, navigation activities in sea ice regions, and global climate analysis and prediction (Smith & Stephenson, 2013; Choi, De Silva & Yamaguchi, 2019; Cavalieri et al., 1999). Therefore, accurate prediction of sea ice movement is essential for human activity and climate modeling (Stroeve et al., 2012).

At present, research on modeling the characteristics of sea ice mainly involves statistical models and numerical models. Statistical models are constructed based on historical observations and relationships between atmospheric conditions (e.g., temperature, sea level pressure, and clouds), ocean conditions (e.g., sea surface temperature), and sea ice variables (e.g., concentration, extent, ice type, and thickness). For example, Turner et al. (2013) used statistical regression to analyze the relationship between the Amundsen Sea low pressure and Antarctic Sea ice cover, indicating that the deepening of the Amundsen Sea low pressure is associated with West Antarctic warming and the expansion of the Ross Sea ice cover. However, Wang, Chen & Kumar (2013) believe that statistical methods do not consider the interaction between sea ice and the atmosphere, and there are certain limitations. Numerical models are primarily physically driven models based on the physical equations of control system dynamics and thermodynamics, such as Gent et al. (2011), which describe all developments in the Community Climate System Model (CCSM) and document fully coupled preindustrial control operations compared to previous versions of CCSM3. Guemas et al. (2016) argue that numerical models are generally superior to statistical models in short-term forecasting. However, while inputs such as atmosphere, oceanic and ice parameters can be obtained from remotely sensed data, they must be calibrated and validated through spatially and temporally well-distributed in situ observations, which are difficult and costly to obtain and therefore often inefficient.

Machine learning and deep learning techniques have developed rapidly in recent years and have shown significant advantages in sea ice cover prediction. Barnhart et al. (2016) used support vector machine (SVM) models to analyze the relationship between sea ice and climate variables, successfully predicting Arctic open water expansion and Arctic sea ice changes in the coming decades. Deep learning model prediction also solves the limitation of numerical models in multiparameter accurate acquisition to some extent (Rasp, Pritchard & Gentine, 2018), but there are still some challenges in capturing temporal correlations in the time series prediction of nonlinear sea ice area data (Ren, Li & Zhang, 2022). The LSTM model has attracted great attention due to the rapid development of artificial intelligence and its ability to automatically extract feature modeling (Hochreiter & Schmidhuber, 1997), and the research of Siami-Namini, Tavakoli & Namin (2019) also proved that the predictive performance of bidirectional LSTM is due to LSTM. With the study of time series frequency domain analysis methods, the EMD method (Huang et al., 1998) was developed, which decomposes noisy data according to its own time scale characteristics and does not need to set any basis function in advance, which has obvious advantages in processing nonstationary and nonlinear data. Torres et al. (2011) proposed the adaptive noise complete set empirical modal decomposition (CEEMDAN) algorithm, which overcame the defects of EMD and EEMD decomposition loss of completeness and modal aliasing by adaptive addition of white noise. Hu et al. (2022) integrated CEEMDAN with LSTM and temporal convolutional networks (TCN) to enable ultra-short-term wind power forecasting and real-time prediction of wind energy. Similarly, Gao & Zhang (2023) leveraged a combined approach of variational mode decomposition (VMD) and LSTM for decomposed prediction. Their research focused on the impact of investor sentiment on price volatility in China’s capital market. The results from both studies underscore the efficacy of such combined methodologies in their respective fields.

Therefore, this article explored an optimal data-driven time series model combining the empirical mode decomposition (EMD) method and optimized deep learning neural networks to capture the nonlinear and nonstationary characteristics of sea ice area time series. This combination allows us to better understand the temporal correlations present in the sea ice area series and overcome the limitations of current time series models. This article compared the performance of our proposed model with both a benchmark model and similar approaches and analyze their differences and advantages. Our analysis demonstrates the superiority and effectiveness of our target model.

Theoretical Model Construction

Complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN)

Due to the “mode mixing” caused by EMD and the noise residual caused by EEMD, this article introduces complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN), which overcomes the defects of EEMD decomposition in terms of loss of completeness and mode mixing by adaptively adding white noise. In the algorithm, E_i(⋅) is defined as the i-th mode generated by EMD decomposition, $\bar{C i (t)}$ represents the ith mode generated by CEEMDAN decomposition, ɛ is the standard deviation of the noise, v^j follows N (0,1) and j =1,2, ….,N denotes the number of times white noise is added, while r represents the residue. The specific steps of the CEEMDAN algorithm are as follows:

Add Gaussian white noise to the original signal y(t) to obtain a new signal y(t) + (−1)^qɛv^j, where q = 1,2. EMD is performed on the signal to obtain the first-stage intrinsic mode component C₁ (1) $E (y (t) + {(- 1)}^{q} ɛ v^{j} (t)) = C_{1}^{j} (t) + r^{j}$
Taking the overall average of the N generated mode components produces the first intrinsic mode function of the CEEMDAN decomposition. (2) $\bar{C_{1} (t)} = \frac{1}{N} \sum_{j = 1}^{N} C_{1}^{j} (t)$
Calculate the residue after removing the first modal component (3) $r_{1} (t) = y (t) - \bar{C_{1} (t)}$
Add paired positive and negative Gaussian white noise to r ₁(t) to obtain a new signal, and perform EMD on the new signal to obtain the first-order modal component D ₁. Then, the second intrinsic mode component of the CEEMDAN decomposition can be obtained. (4) $\bar{C_{2} (t)} = \frac{1}{N} \sum_{j = 1}^{N} D_{1}^{j} (t)$
Calculate the residue after removing the second modal component (5) $r_{2} (t) = r_{1} (t) - \bar{C_{2} (t)}$
The above steps are repeated until the obtained residual signal is a monotonic function and cannot be further decomposed, and the algorithm ends. The number of intrinsic mode functions obtained at this time is denoted as K, and the original signal y(t) can be decomposed as: (6) $y (t) = \sum_{k = 1}^{K} \bar{C_{k} (t)} + r_{k} (t)$

Bidirectional long short-term memory neural network (BiLSTM) and optimization

BiLSTM

The BiLSTM allows for the transmission and feedback of past and future states of the hidden layers through a bidirectional network (Fig. 1).

The BiLSTM is interpreted through Eq. (7). (7) $\{\begin{matrix} h_{f} = LSTM (x_{i}, h_{f - 1}) \\ h_{b} = LSTM (x_{t}, h_{b - 1}) \\ h_{t} = w_{t} h_{f} + v_{t} h_{b} + b_{t} \end{matrix}$ where x_i is the input, h_f is the forward-pass implicit layer state, h_b is the reverse-pass implicit layer state, h_t is the implicit layer state, w_t is the forward-pass implicit layer output weight, v_t is the reverse-pass implicit layer output weight, and b_t is the error value.

Adaptive moment estimation (Adam)

Adam’s algorithm improves model accuracy and network training speed by calculating the first-order moments and second-order moment estimates that can be adapted to the corresponding learning rate by computing the gradient of the objective function. Each iteration of Adam’s update of the BiLSTM parameter θ_t is (8) $θ_{t} = θ_{t - 1} - α \frac{{\hat{m}}_{t}}{\sqrt{{\hat{n}}_{t}} + ɛ}$ where ${\hat{m}}_{t}$ and ${\hat{n}}_{t}$ are the corrected first- and second-order moment estimates, respectively, and ɛ is a constant 10⁻⁸.

BiLSTM for snake optimizer optimization (SO-BiLSTM)

Based on the special mating behavior of snakes, Hashim & Hussien (2022) proposed the snake optimizer (SO). The algorithm is divided into two stages: global exploration when there is no food and local exploitation when there is food. When food is scarce (Q<0.25), the snakes search for food and update the location by choosing any random location to achieve global optimization. When there is sufficient food (Q ≥0.25), snakes move toward food and update their position if Temp>0.6; if the temperature Temp ≤0.6, the snakes enter combat mode at a random number Rand<0.6 taking the value of [0,1] and enter mating mode at Rand ≥0.6 and replace the individual snake with the worst fitness value by laying and hatching eggs.

The prediction accuracy of the BiLSTM neural network is affected by the number of neurons in the hidden layer, the learning rate and the L2 regularization coefficient. Considering the randomness of artificially set parameters, the root mean square error RMSE is selected as the fitness function in this article, and the parameters are optimized using the SO algorithm (Fig. 2).

Figure 2: Working process of SO.

Download full-size image

DOI: 10.7717/peerj.15748/fig-2

CEEMDAN-SO-BiLSTM

Through the above analysis, this article introduces the CEEMDAN module for noise reduction of the original data based on the advantages of the optimization algorithm and deep learning and proposes a combined CEEMDAN-SO-BiLSTM prediction model (Fig. 3).

As seen from the above figure, the prediction model in this article is divided into three parts. The first part is CEEMDAN decomposition, which decomposes the time series into K modes for noise reduction; the second part is the SO-BiLSTM neural network model to train the prediction of K modes; and the third part superimposes and reconstructs the prediction results of the second part to obtain the prediction results of the original data.

Experimental Analysis

Data sources

The data set used in this article was obtained from the sea ice data set on the official website of the National Environmental Information Center (https://www.ncei.noaa.gov/) and the Grambling Sea Ice area time series data was used for the study in the model analysis. The time range is from day 272 of 2017 to day 271 of 2022, with a total of 1,824 data points.

Model evaluation criteria

To evaluate the prediction performance of the model, this article selects four indicators: mean absolute error (MAE), root mean square error (RMSE), determinability coefficient R², and symmetric mean absolute percentage error (SMAPE). Equations for calculating these indicators are provided in Eqs. (9)–(12).

These indicators quantify the accuracy of the model’s predictions. MAE and RMSE represent the average and root of the squared errors between the predicted and actual values, respectively, while R² measures the correlation between the predicted and actual values. SMAPE calculates the symmetric percentage difference between the predicted and actual values. (9) $MAE = \frac{1}{n} \sum_{i = 1}^{n} |y_{i} - {\hat{y}}_{i}|$ (10) $RMSE = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}$ (11) $R^{2} = \frac{\sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}{\sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}}$ (12) $SMAPE = \frac{100 %}{n} \sum_{i = 1}^{n} \frac{|{\hat{y}}_{i} - y_{i}|}{(|{\hat{y}}_{i}| + |y_{i}|) / 2}$ where y_i is the i-th observation, ${\hat{y}}_{i}$ is the i-th prediction, $\bar{y}$ is the mean, and n is the number of samples.

Experimental analysis

According to the characteristics of the data sets, this article first partitions the original data into a training set comprising 70% of the data and a testing set consisting of the remaining 30%. Then, using the rolling window method with CEEMDAN decomposition, it predicts the values within each roll-out window. The noise ratio is set at 0.2, with 500 iterations of adding noise to the signal and a maximum span of 2000. For example, it shows the different modalities obtained by decomposing the first prediction window (Fig. 4), including 10 modes representing various frequency dimensions of the ice concentration time series. The high correlation coefficients of IMF8 and IMF9 with the original sequence, reaching 0.91 and 0.77, respectively, indicate that these two modes contribute significantly to the periodic trend of the original signal. IMF1 to IMF3 are considered as noise signals, while IMF10 represents a short-term trend component (Fig. 5). As seen from the figure, compared to the original sequence, the decomposed modalities are more stable and smooth and exhibit clear information features, providing a solid foundation for predictions.

Figure 4: Results of empirical modal decomposition of the original sequence adaptive noise complete set.

Download full-size image

DOI: 10.7717/peerj.15748/fig-4

Figure 5: Adaptation curve of SO.

Download full-size image

DOI: 10.7717/peerj.15748/fig-5

The specific process of making predictions using the rolling window method is as follows: assuming that the model training and test time series data are D, with length T + k, where the first T data points are used for model training and the last k data points are used for model testing. It uses the previous 30 data points to predict the next data point in the testing set. To obtain each predicted value from the testing set, follow these steps:

CEEMDAN decomposition is performed on the first T data points to extract multiple features and train separate models for each feature.
Following completion of model training, utilize data from D[T-29:T] to obtain predicted outputs from each feature model and sum them to acquire the prediction value for D[T+1].
Execute CEEMDAN1 decomposition on data D[2:T+1] to extract multiple features once again and train individual models for every feature. Utilizing 30 data points from D[T-28:T+1], it receives the predicted data from each feature model and aggregate them to attain the forecast for D[T+2].

Repeat actions 1 through 3, sliding the window of 30 data points and performing decomposition and model training on the enclosed data before making predictions, ultimately arriving at projections for all future time intervals D[T+1:T+k].

After applying SO-BiLSTM training for predictive outcomes from the preceding splitting apart, SO adjusts BiLSTM’s primary acquisition pace, opaque node contingent, and L2 regulation factor. Following numerous trials, the versatile extent characteristic ultimately slopes toward equilibrium (Fig. 6), and the versions for every mode within the initial estimation aperture are improved as per Table 1.

Figure 6: Prediction results of each mode.

Download full-size image

DOI: 10.7717/peerj.15748/fig-6

Table 1:

Prediction hyperparameters of each mode.

Modal	Number of hidden layer nodes	Initial learning rate	L2 regularization factor
IMF1	108	0.013	10⁻⁹
IMF2	96	0.017	10⁻¹⁰
IMF3	97	0.018	10⁻⁸
IMF4	100	0.001	10⁻⁸
IMF5	200	0.012	10⁻¹⁰
IMF6	200	0.001	10⁻¹⁰
IMF7	100	10⁻⁴	10⁻¹⁰
IMF8	100	10⁻⁴	10⁻¹⁰
IMF9	100	10⁻⁴	10⁻¹⁰
IMF10	100	10⁻⁴	10⁻¹⁰

DOI: 10.7717/peerj.15748/table-1

For each modality, the subsequent predictive value is recreated so that the concluding forecast can be generated for the initial information. By persistently pushing the prognostication casement ahead and producing supplementary projections, up to thirty percent of all estimations have been accomplished (Fig. 7). Evaluation parameters for the objective model are determined using Table 2.

Figure 7: Reconfiguration results.

Download full-size image

DOI: 10.7717/peerj.15748/fig-7

Table 2:

CEEMDAN-SO-BiLSTM prediction effects.

Evaluation indicators	Indicator value
MAE	0.815
RMSE	1.047
R²	0.998
SMAPE	3.938%

DOI: 10.7717/peerj.15748/table-2

Model comparison

To verify the superiority of the model proposed in this article, models such as BiGRU and BiLSTM were tested and compared with the model in this article, and the test results of each model are shown in Table 3.

Table 3:

Comparison of the predicted effects of the original data.

Models	MAE	RMSE	R²	SMAPE
ARIMA	5.029	6.229	0.913	18.456%
SVR	4.467	5.649	0.929	17.154%
BiLSTM	3.323	4.300	0.961	13.231%
BiGRU	3.726	4.739	0.950	14.688%
CEEMDAN-BiGRU	2.930	3.574	0.971	11.986%
CEEMDAN-BiLSTM	2.583	3.248	0.976	11.848%
VMD-BiGRU	2.610	3.586	0.971	13.050%
VMD-BiLSTM	2.605	3.561	0.972	12.944%
VMD-SO-BiLSTM	2.021	4.182	0.981	10.424%
CEEMDAN-SO-BiGRU	1.917	2.783	0.983	10.137%
CEEMDAN-SO-LSTM	1.828	2.416	0.987	9.060%

DOI: 10.7717/peerj.15748/table-3

Based on our findings (Fig. 8), it appears that the performance of the comparison model deteriorates during specific periods, particularly around May 2021 and January 2022, characterized by substantial fluctuations and increased prediction bias. Notably, the ARIMA model demonstrates a remarkable shortcoming in fitting the original time series during these periods.

Figure 8: Comparison of model prediction results.

Download full-size image

DOI: 10.7717/peerj.15748/fig-8

On the other hand, the proposed CEEMDAN-SO-BiLSTM model yields a superior fitting capability, effectively capturing the variability of the time series. As evidenced in Tables 2 and 3, the proposed method exhibits remarkable advantages over both the single-model counterparts and the benchmark SVR model. Specifically, the combined CEEMDAN-SO-BiLSTM model reduces MAE, RMSE, and SMAPE by approximately 81.8%, 81.5%, and 9.150%, respectively, and raises R² by approximately 3.9%.

Furthermore, our experimental findings reveal that applying decomposition and optimization strategies consistently enhances prediction accuracy, with the CEEMDAN-SO-BiLSTM model outperforming its VMD counterpart. Finally, among the different variants of the LSTM and BiLSTM models, CEEMDAN-SO-BiLSTM provides the best performance. These results underscore the efficacy and robustness of our proposed framework in accurately predicting sea ice areas in the Greenland Sea.

Conclusion

In this article, a combined CEEMDAN-SO-BiLSTM model is proposed for predicting the daily sea ice area in the Greenland Sea. By decomposing the data into multiple relatively stable eigenmodes via the CEEMDAN method, the model takes into account the nonstationarity and nonlinear characteristics of the time series. By optimizing the hyperparameters of the BiLSTM model using the SO algorithm and training each mode separately, the final predictions are then merged and reconstituted to yield daily sea ice area forecasts.

An array of comparative experiments was conducted against alternative hybrid models to evaluate the effectiveness and practicability of the proposed approach. Experimental results indicate that CEEMDAN decomposition considerably enhances the extraction of relevant features from the time series and leads to reduced RMSE and MAE predictions by 2.201 and 0.297, respectively, compared to the single BiLSTM model. Moreover, the hyperparameter optimization through SO strengthens the sensitivity of the CEEMDAN-BiLSTM model to data perturbations, resulting in improved evaluation metrics, including MAE, RMSE, and SMAPE, with respective reductions of 1.768, 2.201, and 7.910% and an increase of 2.20% in R².

Despite these promising findings, there remain some limitations due to insufficient data leading to suboptimal hyperparameter settings and the lack of interpretability in deep learning models hindering exhaustive error analysis. Future research directions may include integrating additional environmental factors, exploiting advanced deep learning structures such as GNNs or attention mechanisms, enhancing data quality and quantity through methods such as data fusion and augmentation, and addressing issues related to interpretability and error diagnosis. Ultimately, advancements along these lines will enable the development of increasingly accurate and applicable sea ice prediction models.

Supplemental Information

Code of CEEMDAN-SO-BILSTM model

DOI: 10.7717/peerj.15748/supp-1

Download

Dataset

DOI: 10.7717/peerj.15748/supp-2

Download

[1] Barnhart KR, Miller CR, Overeem I, Kay JE. 2016. Mapping the future expansion of Arctic open water. Nature Climate Change 6(3):280-285

[2] Cavalieri DJ, Parkinson CL, Gloersen P, Comiso JC, Zwally HJ. 1999. Deriving long-term time series of sea ice cover from satellite passive-microwave multisensor data sets. Journal of Geophysical Research: Oceans 104(C7):15803-15814

[3] Choi M, De Silva LWA, Yamaguchi H. 2019. Artificial neural network for the short-term prediction of arctic sea ice concentration. Remote Sensing 11(9):1071

[4] Gao Z, Zhang J. 2023. The fluctuation correlation between investor sentiment and stock index using VMD-LSTM: evidence from China stock market. The North American Journal of Economics and Finance 66:101915

[5] Gent PR, Danabasoglu G, Donner LJ, Holland MM, Hunke EC, Jayne SR, Lawrence DM, Neale RB, Rasch PJ, Vertenstein M, Worley PH, Yang Z-L, Zhang M. 2011. The community climate system model version 4. Journal of Climate 24(19):4973-4991

[6] Guemas V, Blanchard-Wrigglesworth E, Chevallier M, Day JJ, Déqué M, Doblas-Reyes FJ, Fučkar NS, Germe A, Hawkins E, Keeley S, Koenigk T, Mélia DS, Tietsche S. 2016. A review on Arctic sea-ice predictability and prediction on seasonal to decadal time-scales. Quarterly Journal of the Royal Meteorological Society 142(695):546-561

[7] Hashim FA, Hussien AG. 2022. Snake optimizer: a novel meta-heuristic optimization algorithm. Knowledge-Based Systems 242:108320

[8] Hochreiter S, Schmidhuber J. 1997. Long short-term memory. Neural Computation 9(8):1735-1780

[9] Holland MM, Landrum L, Bailey D, Vavrus S. 2019. Changing seasonal predictability of Arctic summer sea ice area in a warming climate. Journal of Climate 32(16):4963-4979

[10] Hu C, Zhao Y, Jiang H, Jiang M, You F, Liu Q. 2022. Prediction of ultra-short-term wind power based on CEEMDAN-LSTM-TCN. Energy Reports 8:483-492

[11] Huang NE, Shen Z, Long SR, Wu MC, Shih HH, Zheng Q, Yen NC, Tung CC, Liu HH. 1998. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences 4541971:903-995

[12] Notz D, Stroeve J. 2018. The trajectory towards a seasonally ice-free Arctic Ocean. Current Climate Change Reports 4:407-416

[13] Rasp S, Pritchard MS, Gentine P. 2018. Deep learning to represent subgrid processes in climate models. Proceedings of the National Academy of Sciences of the United States of Ameica 115(39):9684-9689

[14] Ren Y, Li X, Zhang W. 2022. A data-driven deep learning model for weekly sea ice concentration prediction of the Pan-Arctic during the melting season. IEEE Transactions on Geoscience and Remote Sensing 60:1-19

[15] Screen JA. 2013. Influence of Arctic sea ice on European summer precipitation. Environmental Research Letters 8(4):044015

[16] Serreze MC, Holland MM, Stroeve J. 2007. Perspectives on the Arctic’s shrinking sea-ice cover. Science 315(5818):1533-1536

[17] Siami-Namini S, Tavakoli N, Namin AS. 2019. The performance of LSTM and BiLSTM in forecasting time series. In: 2019 IEEE international conference on big data (Big Data). Piscataway. IEEE. 3285-3292