Development of a cryptocurrency price prediction model: leveraging GRU and LSTM for Bitcoin, Litecoin and Ethereum

View article
PeerJ Computer Science

Main article text

 

Introduction

  • This article presents a comprehensive analysis of deep learning models used to predict the prices of three cryptocurrencies named Bitcoin, Ethereum and Litecoin.

  • Two categories of recurrent neural networks, namely long short-term memory (LSTM) and gated recurrent unit (GRU), are utilized for cryptocurrency price prediction. The study leverages these DL algorithms to enhance prediction accuracy and handle the inherent nonlinearities in time-series data.

  • Experimental results represent that GRU model outperforms LSTM model in predicting cryptocurrency prices for BTC, LTC, and ETH, as indicated by lower MAPE values, thereby establishing GRU as the more accurate prediction algorithm.

Materials and Methods

Proposed methodology

Dataset description

Deep learning algorithms

Long short-term memory

  • Forget gate: The forget gate eliminates data that is no longer relevant in the cell state. The gate receives two inputs, xt (input at that specific moment) and ht1 (output from the preceding cell), which are multiplied by weight matrices before bias is added. An activation function sigmoid (σ) is applied to the outcome, producing a binary output. A piece of information is lost if the output for a certain cell state is 0, but it is kept for later use if the output is 1. The forget gate’s Eq. (1) is:

where

  • Input gate: The input gate is responsible for adding valuable information to the cell state. Initially, the sigmoid function is used to regulate the data, and the inputs ht1 and xt are used to filter the values to be remembered in a manner akin to the forget gate. The tanh function is then used to construct a vector that contains all of the possible values from ht1 and xt and has an output ranging from −1 to +1. Finally, the useful information is obtained by multiplying the vector values by the regulated values. The input gate’s Eq. (2) is:

  • Output gate: The output gate is responsible for obtaining valuable information from the current cell state to be displayed as output. The tanh function is first used to the cell to create a vector. The data is then controlled by the sigmoid function and filtered by the values that need to be retained using inputs ht1 and xt. Finally, to be delivered as an output and input to the following cell, the vector values and the controlled values are multiplied. The output gate’s Eq. (3) is:

Gated recurrent unit

where

Experimental setup and hyperparameters tuning for models

  • GRU neurons: 100 neurons, optimizing the model’s capacity for learning temporal dependencies.

  • Epochs: 20, with early stopping applied to prevent overfitting.

  • Batch size: 32, balancing training efficiency and model stability.

  • Loss function: MSE, RMSE, MAE and MAPE chosen for its effectiveness in regression tasks.

  • Dropout rate: 0.2, used to mitigate overfitting by introducing regularization.

  • Optimizer: Adam, selected for its adaptive learning rate and robust optimization performance.

  • Activation function: Linear, suitable for the continuous nature of cryptocurrency price predictions.

Performance metrics

  • Mean squared error: It estimates the average squared difference among real and expected values. It offers a quadratic loss metric that penalizes greater mistakes more harshly. The performance of the prediction model improves with a reduced MSE value. MSE helps improve the accuracy of the GRU and LSTM models for Bitcoin, Litecoin, and Ethereum by efficiently penalizing significant differences in forecast prices. The mathematical representation for MSE is mentioned in Eq. (8):

  • Root mean squared error: The measurement of the discrepancies between values that a model predicts and values that are actually observed is called the root mean square error. It is simpler to read since it gives an error measure in the same units as the target variable. The prediction model performs better with lower RMSE value. By explicitly representing the error magnitude in the same units as the values of Bitcoin, Litecoin, and Ethereum, RMSE offers an interpretable indicator of forecast accuracy for cryptocurrency prices. The mathematical representation for RMSE is mentioned in Eq. (9):

  • Mean absolute error: It estimates the average absolute difference among real and expected values. It offers a linear loss measure that does not severely penalize big mistakes. The prediction model performs better when the MAE value is lower. Without unduly penalizing significant errors, MAE provides a strong evaluation of average prediction error, which aids in assessing how consistently GRU and LSTM models predict Bitcoin prices. Equation (10) describes the MAE mathematical representation:

  • Mean absolute percentage error: It estimates the average absolute percentage difference among real and expected values. With a lower MAPE value, the prediction model performs better. Even if the price ranges of Bitcoin, Litecoin, and Ethereum vary, MAPE makes it easier to compare their price predictions, allowing for a scale-independent assessment of model performance. Equation (11) identifies the mathematical representation of MAPE:

Results

Results for LTC

Results of ETH

Results of BTC

Discussion

Limitation

  • Restricted cryptocurrency scope: Because the study primarily examines the three most important cryptocurrencies—BTC, ETH, and LTC—it may be unable to reliably forecast how other, less liquid, or new cryptocurrencies will behave.

  • Absence of feature diversity: If the model just considers historical prices or currency rates, it may neglect important variables such as transaction volume, market sentiment, or macroeconomic data.

  • Lack of exogenous factors: The model may not account for external factors that have a significant impact on Bitcoin pricing, such as regulatory changes, social media influence, or worldwide financial events.

  • Limited comparisons to other models: This study does not analyze traditional statistical techniques or simpler, non-neural network models such as ARIMA, which could offer light on the relative benefits of LSTM and GRU.

  • Metrics for Evaluation: The study focusses primarily on the MAE, MSE, MAPE and RMSE metrics, which may not truly reflect how well the model performs in real-world trading settings where risk and profit are more relevant factors.

Conclusions and future scope

Supplemental Information

Additional Information and Declarations

Competing Interests

Author Contributions

Data Availability

Funding

This work was supported by the Basic Science Research Program under Grant 2020R1I1A3069700, and by the Technology Development Program of MSS under Grant S3033853. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Your institution may have Open Access funds available for qualifying authors. See if you qualify

Publish for free

Comment on Articles or Preprints and we'll waive your author fee
Learn more

Five new journals in Chemistry

Free to publish • Peer-reviewed • From PeerJ
Find out more