A robust chaos-inspired artificial intelligence model for dealing with nonlinear dynamics in wind speed forecasting

View article
PeerJ Computer Science

Main article text

 

Introduction

Related works

Proposed methodology

Robust empirical mode decomposition

  1. Identifying all extrema in the original wind speed time series z(t).

  2. Calculating the upper {zu(t)} and lower {zl(t)} envelopes using cubic spline interpolation.

  3. Calculating the mean envelope m(t)=(zu(t)+zl(t))2.

  4. Subtracting the mean envelope from the original wind speed time series to obtain d(t)=z(t)m(t).

  5. Testing the characteristics of d(t).

    If d(t) qualifies as an IMF, f(t) is set to d(t) and designated as the i-th IMF. The residual z(t) is updated as R(t)=z(t)f(t). If d(t) does not meet the criteria for an IMF, z(t) is replaced with d(t).

  6. (i) to (v) are repeated until the stop conditions are met.

  • Step 1: Identify local maxima and minima:

    In the first step, identify the local maxima and minima points in the time series.

  • Step 2: Smooth the estimated envelope:

    Use the Eq. (2) to calculate the weights for smoothing the estimated envelope: ω{m,n}[k,j]exp(ΔI[m,n;k,j]22rσ2rm2n22rσ2s).

    Here, ΔI[m,n;k,j] represents the intensity difference between pixel pairs.

  • Step 3: Create maximum and minimum envelopes:

    Maximum envelope Emax and minimum envelope Emin are created by solving the Eq. (3):

    E=[PTePe+λ(DW)T(DW)]1PTePeI.

  • Step 4: Calculate the mean envelope:

    Calculate the mean envelope as

    ¯E(i)=Emax+Emin2.

  • Step 5: Extract IMF function:

    Calculate the IMF function using

    hci=I(i)¯E(i).

  • Step 6: Repeat steps to meet IMF requirements:

    Repeat the previous steps (ii) through (v) until the requirements for an IMF are met.

  • Step 7: Determine modulation mixing issues:

    Identify any modulation mixing issues using the Eq. (5).

  • Step 8: Create adaptive mask signal:

    Use the Eq. (6) to create the adaptive mask signal MCOS

    hi=h+i+hi2.

  • Step 9: Calculate residual:

    Calculate the residual as Res=I(˙I)hi. If there is no additional point, the EMD process stops; otherwise, update i=i+1 and revert I(i)=Res.

    The final IMF components {hi} and the residual Resare given as output. After the above nine steps, the Y sedimentation series is expressed as

    Y=mi=1hi+Res.

Long short-term memory network

  1. Decide which information to discard from the cell state: the values of xt and ht1 are passed through a sigmoid function to determine what should be discarded.

    ft=σ(wf[ht1,xt]+bf).

  2. Determine what new information will be stored in the cell state: Initially, a sigmoid layer determines the information to retain in the cell state. Subsequently, the values from xt and ht1 passed through the tanh() layer are regarded as a new candidate value ˜Ct.

    it=σ(wi[ht1,xt]+bi)

    ˜Ct=tanh(wC[ht1,xt]+bC).

  3. Update the previous cell state Ct1 to the new cell state Ct: multiply the previous cell state Ct1 by ft to forget the information we decided to forget, and plus it times ˜Ct to get a new cell state Ct.

    Ct=ftCt1+it˜Ct.

  4. Determine the information to be output: Initially, a sigmoid layer determines which information will be output from the cell state. Then, apply the cell state Ct to a tanh() function and multiply it by the output of the sigmoid gate.

    ot=σ(wo[ht1,xt]+b0

    ht=ottanh(Ct).

Chaotic African vultures optimization

  1. The African vulture’s population consists of N vultures, with N determined by the algorithm user based on the specific context. Each vulture operates in a D-dimensional position space, where D corresponds to the dimensions of the problem being addressed. Additionally, a maximum number of iterations T must be predefined, reflecting the maximum number of actions a vulture can perform. Thus, the position of each vulture i (where 1iN) at different iterations t (where 1tT) is represented by

    Xi(t)=[xi1(t),xi2(t),,xiD(t)].

    This equation represents the position vector of each vulture in the population at iteration t

  2. Based on the living habits of African vultures, the population is divided into three groups. If the fitness value of the feasible solution measures the quality of a vulture’s position, the first group consists of the vultures with the best feasible solution. The second group contains those with the second-best feasible solution. The remaining vultures form the third group.

  3. Vultures forage collectively as a population, with different types of vultures playing distinct roles within the group.

  4. Assuming that the fitness value of the feasible solution indicates the relative quality of vultures, the weakest and hungriest vultures correspond to those with the poorest fitness values. Conversely, the strongest and most satiated vulture represents the one with the best fitness value. In AVO, all vultures strive to approach the best vultures and distance themselves from the worst.

where BestVulturet1=[bt11,,bt1d,,bt1D] represents the best vulture, BestVulturet2=[bt21,,bt2d,,bt2D] represents the second best vulture, L1 and L2 are two random numbers in the range [0,1] whose sum is 1. pti is determined using the roulette wheel strategy, and its calculation is given by

where fti stands for the fitness value of the vultures in the first and second groups, while m denotes the total count of vultures in these groups.

where randti1 is a randomly generated number within the interval [0,1], zt is a random number within the range of [1,1], and gt is computed by

where ht is a randomly generated number within the range of [2,2], and k is a predefined parameter that indicates the likelihood of the vulture transitioning to the exploitation stage. A higher value of k suggests a greater probability of transitioning to the exploration stage in the final optimization phase. Conversely, a lower value of k indicates a higher likelihood of transitioning to the exploitation stage in the final optimization phase.

where Xt+1i denotes the position of the ith vulture at the (t+1)th iteration, randtp1, randti2, and randti3 are random values uniformly distributed within the range [0,1], Rti is determined by Eq. (17), Ft is calculated using Eq. (19), ub and lb denote the upper and lower bounds of the solution space, and Dti is computed using

to represent the distance between the vulture and the current optimal position. Here, Xti denotes the position of the ith vulture at iteration t, and C is a random value uniformly distributed between 0 and 2.

  • i. Food competition

where randti4 is a random variable uniformly distributed within [0,1], and dti is defined by

  • ii. Rotating flight

where Sti1 and Sti2 are defined as

  • i. Aggregation behavior

where Ati1 and Ati2 are determined by

  • ii. Attack behavior

where r1 and r2 are random variables uniformly distributed within [0,1], δ is typically set to 1.5, and σ is computed by

where the gamma function Γ(x) is defined as Γ(x)=(x1).

  • Tent chaotic mapping for population initialization

  • Individual history optimal solution

where Pi represents the best position the ith vulture has historically achieved, and ωt1 and ωt2 are values that vary with iterations, calculated using

ωt2=11.8+e0.015×(T2t)0.8
respectively. Here, T is the maximum number of iterations, and t is the current iteration count. In Eq. (36), two parameters, ωt1 and ωt2, are added to control the influence of the current optimal vulture and the historical optimal vulture, respectively. This design ensures that even if |Ft| remains greater than 1 in the algorithm’s middle and late stages, the optimal vulture does not overly influence the current vultures, allowing for convergence. This time-varying mechanism is detailed in Eqs. (37) and (38).

  • Time-varying mechanism

where ωt3 and ωt4 vary with the number of iterations, calculated by

respectively. Here, T is the maximum number of iterations, and t is the current iteration count.

REMD-LSTM-TAVO framework for wind speed forecasting

Data preprocessing phase

Construction phase of the REMD-LSTM combined wind speed forecasting model

Optimization phase

  • Encoding type (representation scheme): TAVO utilizes a real-valued vector encoding approach, where each individual (or vulture) is represented as a vector of continuous values. Each element of the vector corresponds to a specific parameter or coefficient that needs optimization. This encoding scheme is particularly suitable for refining the weighted coefficients of the IMFs generated through data decomposition, which are critical in enhancing the accuracy of wind speed forecasting models.

  • Population initialization: The initial population of vultures is randomly generated within predefined bounds for each parameter. These bounds are chosen based on prior knowledge or exploratory runs to ensure that the initial solutions are diverse yet relevant to the problem domain. This diversity in the initial population helps the algorithm to explore the solution space effectively from the start.

  • Exploration and exploitation mechanism: TAVO adapts its exploration and exploitation strategies dynamically throughout the optimization process. In the initial stages, the algorithm focuses on broad exploration to cover the search space extensively. As the process continues, the focus shifts towards exploitation, allowing the algorithm to fine-tune the solutions and approach the global optimum. The vultures’ movement and decision-making process are modeled after real vultures’ hunting strategies. Vultures tend to explore extensively when they are far from their target (global optimum) and exploit more aggressively as they get closer to it.

  • Selection mechanism: A tournament selection process is used to choose individuals for reproduction. In this method, a subset of the population is randomly selected, and the best-performing individuals within this subset are chosen as parents for the next generation. This strategy maintains a balance between selecting high-quality candidates and preserving genetic diversity.

  • Crossover and mutation operators: A linear crossover operator combines two parent solutions to produce offspring. This operator computes a weighted average of the parent vectors, facilitating smooth transitions between solutions and enabling the algorithm to explore new regions of the search space. An adaptive mutation strategy introduces small random perturbations to individual parameters. The size of these mutations decreases over time, reducing the likelihood of overshooting the optimal solution in the later stages of the optimization process.

  • Fitness function: The fitness function used in TAVO is the MSE between the forecasted and actual wind speed values. This metric is chosen because it effectively quantifies the prediction accuracy of the model. The algorithm aims to minimize the MSE and thereby improve the accuracy of the forecast model. The fitness function ensures that only solutions that resulting in lower prediction errors are considered superior.

  • Convergence criteria: The algorithm is set to terminate after a fixed number of generations unless the convergence criteria are met earlier. A stagnation detection mechanism is in place, where if no significant improvement in the best fitness value is observed over a certain number of generations, the algorithm concludes that it has reached convergence.

  • Diversity preservation: To prevent premature convergence, TAVO incorporates a diversity maintenance mechanism. This is achieved by periodically introducing new random individuals or slightly modifying existing individuals that are too similar to others in the population. This approach helps to avoid local optima and ensures a more robust search process.

Model evaluation phase

Results and discussion

Dataset

Experimental results and discussions

Conclusions and future research

Supplemental Information

REMD LSTM TAVO function code.

DOI: 10.7717/peerj-cs.2393/supp-2

Additional Information and Declarations

Competing Interests

The authors declare that they have no competing interests.

Author Contributions

Caner Barış conceived and designed the experiments, performed the experiments, analyzed the data, performed the computation work, prepared figures and/or tables, and approved the final draft.

Cağfer Yanarateş conceived and designed the experiments, performed the experiments, prepared figures and/or tables, authored or reviewed drafts of the article, and approved the final draft.

Aytaç Altan conceived and designed the experiments, performed the experiments, analyzed the data, performed the computation work, prepared figures and/or tables, authored or reviewed drafts of the article, and approved the final draft.

Data Availability

The following information was supplied regarding data availability:

The raw data are available in the Supplemental File.

Funding

The authors received no funding for this work.

9 Citations 471 Views 24 Downloads

Your institution may have Open Access funds available for qualifying authors. See if you qualify

Publish for free

Comment on Articles or Preprints and we'll waive your author fee
Learn more

Five new journals in Chemistry

Free to publish • Peer-reviewed • From PeerJ
Find out more