Precision prediction of aquaculture water quality: a spatiotemporal model integrating optimized-LSTM and radial basis function neural networks

PeerJ Computer Science

Introduction

In aquaculture, maintaining optimal water quality is crucial for the growth, development, and reproduction of aquatic organisms. A well-balanced aquatic environment plays a central role in ensuring the success of aquaculture operations (Li et al., 2020; Jongjaraunsuk, Taparhudee & Suwannasing, 2024; Sabwa et al., 2022; El-Sayed, 2002). Low dissolved oxygen (DO) levels in ponds can lead to reduced metabolic rates, diminished disease resistance, and abnormal swimming behaviors in fish. In severe cases, they can result in surface gulping and even mortality. On the other hand, excessively high DO concentrations can increase the risk of gas bubble disease in fish (Dehghani, Torabi Poudeh & Izadi, 2021; Hafez & Lian-Shin, 2021; Cao et al., 2021; Edouard & Roberto, 2022). Several factors, such as water depth, atmospheric pressure, and temperature, vary spatially and temporally, affecting the prediction of water quality parameters. To reduce the impact of randomness, synergistic effects, and correlations among variables, key water quality parameters are often used as independent variables in predictive models. By integrating these parameters into advanced predictive algorithms, multivariate models can be developed to enhance accuracy (Ustaoğlu, Tepe & Taş, 2020; Zhi et al., 2021; Huan et al., 2020). Traditional water quality monitoring relies on manual sampling, which, while capable of measuring a wide range of parameters, is labor-intensive, time-consuming, and costly. Additionally, sampled data only represent specific points in time, making large-scale monitoring impractical (Sun et al., 2022; Min et al., 2022; Yi et al., 2022; Dheda & Cheng, 2020). To effectively mitigate aquaculture risks, relying solely on current water quality monitoring data proves insufficient (Zhou et al., 2022). The integration of artificial intelligence with IoT technologies enables intelligent aquaculture systems to not only monitor but also predict future water quality parameter concentrations and their evolving trends. This predictive capability allows for proactive adjustments to maintain optimal water conditions, thereby creating a stable and healthy environment for aquatic organisms. Such technological advancements play a crucial role in preventing water quality deterioration, minimizing farming risks, and promoting the sustainable development of intensive aquaculture practices (Zheng et al., 2023; Eze et al., 2021; Wang et al., 2021; Biazi & Marques, 2023).

Water quality parameters exhibit nonlinear, time-varying, and unstable characteristics due to their susceptibility to various external influencing factors (Feng et al., 2022). Time-series prediction of water quality parameters involves constructing mathematical models and utilizing algorithms to analyze collected data, enabling the forecasting of future trends in water quality over specific time periods (Farsi et al., 2021). Traditional prediction methods, including time-series analysis, grey theory models, and regression analysis, are commonly used (Wu et al., 2022; Panidhapu et al., 2020; Katimon, Shahid & Mohsenipour, 2018). While these models can extract data features and improve prediction accuracy to some extent, they are hindered by time-delay responses and limitations in capturing long-term dependencies within temporal data (Michael, Zhong & Ridha, 2023). In recent years, deep learning models based on neural networks have shown remarkable capability in capturing the nonlinear characteristics of water quality parameters, significantly enhancing prediction performance (Shahi et al., 2020; Dani et al., 2023; Fafoutellis & Vlahogianni, 2023). For instance, Nong et al. (2023) combined Support Vector Regression (SVR) with Random Forest (RF) for DO prediction, achieving a notable improvement in accuracy. However, SVR is sensitive to missing data and computationally demanding during training. In contrast, Artificial Neural Networks (ANN) offer greater tolerance for incomplete data. Gautam et al. (2023) applied an ANN model to predict sodium levels in groundwater, demonstrating its reliable accuracy. Adnan et al. (2025) achieved an 85.70% prediction accuracy for biochemical oxygen demand in aquatic environments using an enhanced ANN model. With advancements in deep learning, Recurrent Neural Networks (RNN) have proven more effective than ANN for processing nonlinear time-series data. Zhang, Fitch & Thorburn (2020) integrated Kernel Principal Component Analysis (KPCA) with RNN for DO prediction, achieving an impressive accuracy of 90.80% in one-hour-ahead forecasts. However, RNN are susceptible to gradient vanishing and explosion issues. To overcome these challenges, Hochreiter & Schmidhuber (1997) introduced Long Short-Term Memory (LSTM) networks (Graves, 2012; Heddam et al., 2022). Lee et al. (2022) employed LSTM for real-time DO prediction at the Han River confluence, where ANN models produced accuracy ranging from 0.64 to 0.86, while LSTM models achieved an accuracy of 0.93 to 0.97, further validating LSTM’s superior precision in modeling water quality data. The attention mechanism has gained widespread adoption in water quality prediction due to its ability to enhance the weighting of critical features, demonstrating particular advantages in capturing long-term dependencies among water quality parameters (Cheng et al., 2022; Pranolo et al., 2022; Yan et al., 2022). Huang et al. (2023) achieved exceptional performance (R2 = 99.86%) in predicting water flow velocity by integrating a spatiotemporal attention mechanism with LSTM. Similarly, D et al. (2024) developed an A-LSTM model combining attention mechanisms with LSTM architecture, which attained remarkable prediction accuracy ranging from 98.30% to 99.70% across multiple water quality parameters.

The studies mentioned above highlight the strong performance of LSTM-based deep learning models in the time-series prediction of water quality parameters, demonstrating their capacity to effectively capture complex temporal variations due to their powerful nonlinear modeling capabilities. However, water quality parameters are influenced not only by temporal fluctuations but also by spatial correlations at specific locations. Wang et al. (2025) applied Kriging interpolation for spatial prediction of water quality parameters, achieving mean absolute errors (MAE) of 0.025, 0.025, and 0.074 for DO, pH, and turbidity, respectively. Tayyab et al. (2023) assessed 10 interpolation techniques to predict arsenic concentrations in groundwater resources in Punjab, Pakistan, and found that the Inverse Distance Weighting (IDW) algorithm produced the lowest RMSE and MAE among the tested models. Unlike traditional Kriging and IDW methods, Radial Basis Function (RBF) neural networks do not impose strict requirements on data distribution or geometry, making them well-suited for handling non-uniformly distributed data. This flexibility allows RBF networks to effectively address complex multidimensional prediction challenges. Xie et al. (2024) developed an improved RBF neural network to analyze data from 28 reservoir monitoring stations, and the experimental results demonstrated that the enhanced RBF model outperformed all benchmark models across various evaluation metrics.

The current challenges in aquaculture water quality prediction can be summarized into three key issues: (a) Water quality parameters exhibit high non-linear temporal dynamics and spatial heterogeneity, complicating the ability of a single model to manage them concurrently; (b) Existing deep learning models are sensitive to hyperparameters and are challenging to tune, which limits their application in field deployment scenarios with constrained computational resources; (c) There is a lack of a lightweight and robust predictive interpolation framework tailored for small-scale, highly variable aquaculture pond environments.

To gain a comprehensive understanding of variations in water quality parameters, it is essential to develop a spatiotemporal prediction model that simultaneously considers both temporal and spatial variation patterns. This study proposes a combined model that integrates an enhanced Sparrow Search Algorithm (ESSA), a Self-Attention (SA) mechanism, and an LSTM network to predict water quality parameters in a time-series context across five sampling points. The time-series prediction results are subsequently utilized as inputs to an RBF network to facilitate spatial predictions. By merging time-series and spatial prediction techniques, this study constructs a spatiotemporal prediction model that effectively captures long-term dependencies in temporal data while incorporating spatial correlations. The resulting model provides accurate spatiotemporal forecasting of water quality parameters, offering valuable technical support for water quality monitoring and management.

Materials and Methods

Study area and data acquisition

The study focuses on an irregularly shaped aquaculture water area located at coordinates 32°18′50″N, 120°35′31″E, in Changzhou City, China, as depicted in Figs. 1A and 1B. This area measures approximately 150 m in length, 45 m in width, and 2 m in depth. The spatial environment of this region is well-preserved, with no surrounding pollution sources. During the deployment of the buoys, they were strategically positioned at various locations. Each buoy was equipped with multiple sensors installed at the same depth, ensuring uniformity across the sensors on each individual buoy. However, the deployment depths of the sensors varied among different buoys, ranging from 0.3 m to 1.9 m. To provide a clearer illustration of this arrangement, we have included a schematic diagram, as shown in Figs. 1C and 1D. The sensors, housed within waterproof enclosures, are detailed in Table 1. The collected water quality data includes DO concentration, water temperature, pH, ammonia nitrogen level, and oxidation-reduction potential (ORP), with measurements recorded at 10-min intervals.


Figure 1: Study area and deployment of measuring points.

(A) Aerial view of the study area. (B) Geographic location of the study area. (C) Schematic layout of the monitoring points. (D) Coordinates of sensor placement.
Table 1:
Sensor parameters.
Sensor type Model Measurement range Measurement error
DO sensor JXSZ-1001-DOY 0–20 (mg/L) ±0.01 (mg/L)
Water temperature sensor DS18B20 −10 – +85 (°C) ±0.5 (°C)
pH sensor PHG-206A 0–14 (pH) ±0.1
Ammonia nitrogen sensor NHN-206A 0–100 (mg/L) ±0.01 (mg/L)
ORP sensor ORP-206A −1,500 – +1,500 (mV) ±10 (mV)
DOI: 10.7717/peerj-cs.3515/table-1

Data preparation

For the missing data in sensor collection, the adjacent data collected under the same conditions are utilized to estimate the missing values, as demonstrated in Eq. (1).

$X_{k+i} = X_k + \dfrac{i\,(X_{k+j} - X_k)}{j}, \quad 0 < i < j \qquad (1)$

where $X_{k+i}$ represents the missing sensor data at time $k+i$, while $X_k$ and $X_{k+j}$ denote the original sensor data at times $k$ and $k+j$, respectively.

Given that water quality data and meteorological data exhibit continuity and temporal sequence, any instance where the variation range of continuously collected data exceeds that of the preceding and succeeding moments is identified as an outlier. These outliers are subsequently processed horizontally utilizing the mean smoothing method, as illustrated in Eq. (2).

$X_k = \dfrac{X_{k-1} + X_{k+1}}{2} \qquad (2)$

where $X_k$ denotes the outlier at time $k$, while $X_{k-1}$ and $X_{k+1}$ represent the valid neighboring values at the adjacent time points.

Upon partitioning the collected data into training and testing sets, normalization is subsequently applied, as demonstrated in Eq. (3).

$\tilde{X} = \dfrac{X - X_{\min}}{X_{\max} - X_{\min}} \qquad (3)$

where $\tilde{X}$ represents the normalized value, $X$ denotes the original data, and $X_{\max}$ and $X_{\min}$ correspond to the maximum and minimum values of the original sequence, respectively.
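The gap filling, outlier smoothing, and normalization steps of Eqs. (1) to (3) can be summarized in a short sketch. The snippet below is a minimal NumPy illustration, not the released code; the outlier threshold and the example values are assumptions made for demonstration.

```python
import numpy as np

def fill_gaps(x):
    """Linearly interpolate missing values (NaN) between the nearest
    valid neighbors, following Eq. (1)."""
    x = np.asarray(x, dtype=float).copy()
    idx = np.arange(len(x))
    valid = ~np.isnan(x)
    x[~valid] = np.interp(idx[~valid], idx[valid], x[valid])
    return x

def smooth_outliers(x, threshold):
    """Replace a point that jumps away from both neighbors by more than
    `threshold` (an assumed limit) with their mean, following Eq. (2)."""
    x = x.copy()
    for k in range(1, len(x) - 1):
        if abs(x[k] - x[k - 1]) > threshold and abs(x[k] - x[k + 1]) > threshold:
            x[k] = 0.5 * (x[k - 1] + x[k + 1])
    return x

def min_max_normalize(train, test):
    """Scale both subsets with the training-set extrema, following Eq. (3)."""
    x_min, x_max = train.min(axis=0), train.max(axis=0)
    return (train - x_min) / (x_max - x_min), (test - x_min) / (x_max - x_min)

# Example on one DO channel sampled at 10-min intervals (values in mg/L).
series = fill_gaps([9.37, np.nan, 9.44, 9.50, 30.0, 9.18, 9.22, 8.78, 8.84, 8.91])
series = smooth_outliers(series, threshold=5.0)
split = int(0.7 * len(series))                      # 70:30 train/test split
train, test = min_max_normalize(series[:split], series[split:])
```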

The research collected data samples over a 32-day period, specifically from March 15 to April 15, 2024. The dataset was divided into training and testing sets with a ratio of 70:30. Table 2 presents a representative subset of the raw data collected during the monitoring period of 2024.

Table 2:
Representative selections of raw data.
Data Time pH DO (mg/L) Ammonia nitrogen (mg/L) Water temperature (°C) ORP (mV)
3–15 01:00 8.32 9.37 1.33 16.31 361
3–15 02:20 8.41 9.47 1.33 16.42 363
3–15 08:30 8.21 9.44 1.34 17.33 370
3–15 12:20 8.25 9.50 1.35 20.53 390
3–15 16:10 9.29 9.32 1.38 22.16 370
3–15 20:50 8.21 9.18 1.35 19.87 3,634
3–16 00:00 8.13 9.22 1.34 16.54 392
3–16 03:10 7.63 8.78 1.35 16.75 389
3–16 08:30 7.62 8.84 1.36 17.21 383
3–16 12:50 7.68 8.91 1.44 21.32 303
3–16 13:10 7.68 9.28 1.43 21.47 289
4–15 16:00 7.67 9.04 1.52 21.33 413
4–15 17:40 7.64 8.67 1.45 21.21 416
DOI: 10.7717/peerj-cs.3515/table-2

Model’s performance evaluation metrics

To comprehensively evaluate the predictive performance of the model, this study utilizes three key evaluation metrics: RMSE, MAE, and the coefficient of determination (R-squared, R2). A superior model is indicated by lower values of RMSE and MAE, along with a higher R2 value. The mathematical formulas for these metrics are presented in Eqs. (4) to (6).

$\mathrm{RMSE} = \sqrt{\dfrac{1}{n}\sum_{i=1}^{n}\left(y_i - \hat{y}_i\right)^2} \qquad (4)$

$\mathrm{MAE} = \dfrac{1}{n}\sum_{i=1}^{n}\left|\hat{y}_i - y_i\right| \qquad (5)$

$R^2 = 1 - \dfrac{\sum_{i=1}^{n}\left(y_i - \hat{y}_i\right)^2}{\sum_{i=1}^{n}\left(y_i - \bar{y}_i\right)^2} \qquad (6)$

where $y_i$ represents the observed value of the water quality parameter, $\hat{y}_i$ represents the predicted value of the water quality parameter, $n$ is the sample size, and $\bar{y}_i$ is the mean value of the water quality parameter.
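For reference, the three metrics translate directly into a few lines of NumPy; the sketch below mirrors Eqs. (4) to (6) on a small illustrative pair of observed and predicted vectors.

```python
import numpy as np

def rmse(y, y_hat):
    return np.sqrt(np.mean((y - y_hat) ** 2))        # Eq. (4)

def mae(y, y_hat):
    return np.mean(np.abs(y_hat - y))                # Eq. (5)

def r2(y, y_hat):
    ss_res = np.sum((y - y_hat) ** 2)
    ss_tot = np.sum((y - np.mean(y)) ** 2)
    return 1.0 - ss_res / ss_tot                     # Eq. (6)

y     = np.array([9.37, 9.47, 9.44, 9.50])   # illustrative observed DO (mg/L)
y_hat = np.array([9.35, 9.45, 9.48, 9.52])   # illustrative predicted DO (mg/L)
print(rmse(y, y_hat), mae(y, y_hat), r2(y, y_hat))
```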

Descriptions of the proposed models

To improve the spatiotemporal prediction accuracy of aquaculture water quality parameters, this study proposes an innovative prediction model that integrates ESSA-SA-LSTM with RBF neural networks.

(1) SA-LSTM model

The primary objective of this study is to address the challenge of time series prediction for forecasting future water quality parameters based on historical data. To achieve this, an LSTM model is employed to construct a foundational prediction framework, which effectively captures the sequential temporal dependencies inherent in water quality parameters. Furthermore, the model is enhanced by incorporating a self-attention mechanism, leading to the proposed SA-LSTM prediction model. The architecture of the proposed model is depicted in Fig. 2.


Figure 2: Time series prediction structure diagram.

The time step of the LSTM is represented by $d$, with a value of 6. The processed data from time $t-d+1$ to $t$, including pH, ammonia nitrogen, water temperature, DO, and ORP ($X_{t-d+1}^{i}$ to $X_{t}^{i}$), are utilized as inputs to the SA-LSTM model. Initially, feature extraction is conducted using the LSTM network. Subsequently, the data are fed into the self-attention mechanism to capture the relationships between the current time step and historical water quality parameter data, assigning higher weights to more significant time steps. An improved SSA is employed to optimize the hyperparameters of the LSTM network. Finally, the model predicts water quality parameters, outputting the predicted values of pH, ammonia nitrogen, water temperature, DO, and ORP at time $t+1$ as $X_{t+1}^{1}$ to $X_{t+1}^{5}$.
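To make this input layout concrete, the sketch below builds supervised samples from a multivariate series with a window of d = 6 past steps and the five parameter values at t + 1 as targets; the helper is an assumption consistent with the description above, not the released preprocessing code.

```python
import numpy as np

def make_windows(series, d=6):
    """series: array of shape (T, 5) holding pH, ammonia nitrogen,
    water temperature, DO, and ORP at 10-min intervals.
    Returns X of shape (N, d, 5) and y of shape (N, 5), where each
    sample uses steps t-d+1 .. t to predict step t+1."""
    X, y = [], []
    for t in range(d - 1, len(series) - 1):
        X.append(series[t - d + 1 : t + 1])   # window of the d most recent steps
        y.append(series[t + 1])               # the five parameters at t + 1
    return np.stack(X), np.stack(y)

series = np.random.rand(100, 5)               # placeholder monitoring data
X, y = make_windows(series, d=6)
print(X.shape, y.shape)                       # (94, 6, 5) (94, 5)
```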

The LSTM model is a specialized architecture of RNN that incorporates gating mechanisms and hidden states to handle the problems of gradient explosion and vanishing gradients when managing long-term dependencies. In contrast to traditional RNNs, LSTM introduces gating units that regulate the flow of information, allowing the model to retain significant data and essential features while discarding irrelevant information. The cell architecture of the LSTM is depicted in Fig. 3, and the computational processes for the forget gate ($f_t$), input gate ($i_t$), output gate ($o_t$), cell state ($c_t$), and output ($h_t$) are presented in Eqs. (7) to (12).

$f_t = \sigma\left(W_f\left[h_{t-1}; x_t\right] + b_f\right) \qquad (7)$

$i_t = \sigma\left(W_i\left[h_{t-1}; x_t\right] + b_i\right) \qquad (8)$

$o_t = \sigma\left(W_o\left[h_{t-1}; x_t\right] + b_o\right) \qquad (9)$

$\tilde{c}_t = \tanh\left(W_c\left[h_{t-1}; x_t\right] + b_c\right) \qquad (10)$

$c_t = i_t \times \tilde{c}_t + f_t \times c_{t-1} \qquad (11)$

$h_t = o_t \times \tanh\left(c_t\right) \qquad (12)$

where $\sigma$ denotes the sigmoid activation function and $\tilde{c}_t$ the candidate cell state at time $t$; $\tanh$ represents the hyperbolic tangent function, which outputs values in the range (−1, 1); $b_o$, $b_i$ and $b_c$ are constant biases; and $W_i$, $W_o$ and $W_c$ correspond to the weight matrices associated with the input gate, output gate, and the conveyor belt mechanism, respectively.


Figure 3: Structure of LSTM unit.
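For readers who want to trace Eqs. (7) to (12) directly, the NumPy sketch below performs one forward step of a single LSTM cell; the weight shapes and random initialization are illustrative only and do not reflect the trained model.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, W, b):
    """One LSTM step. W is a dict of weight matrices of shape
    (hidden, hidden + input) and b a dict of bias vectors, keyed by
    'f', 'i', 'o', 'c' for the gates and the candidate cell state."""
    z = np.concatenate([h_prev, x_t])           # [h_{t-1}; x_t]
    f_t = sigmoid(W['f'] @ z + b['f'])          # Eq. (7)  forget gate
    i_t = sigmoid(W['i'] @ z + b['i'])          # Eq. (8)  input gate
    o_t = sigmoid(W['o'] @ z + b['o'])          # Eq. (9)  output gate
    c_hat = np.tanh(W['c'] @ z + b['c'])        # Eq. (10) candidate state
    c_t = i_t * c_hat + f_t * c_prev            # Eq. (11) cell state
    h_t = o_t * np.tanh(c_t)                    # Eq. (12) hidden output
    return h_t, c_t

hidden, n_in = 8, 5                             # five water quality inputs
rng = np.random.default_rng(0)
W = {k: rng.normal(size=(hidden, hidden + n_in)) * 0.1 for k in 'fioc'}
b = {k: np.zeros(hidden) for k in 'fioc'}
h, c = np.zeros(hidden), np.zeros(hidden)
h, c = lstm_step(rng.random(n_in), h, c, W, b)
```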

SA represents a significant advancement in attention mechanisms, offering two primary benefits: it facilitates the efficient extraction of critical information while minimizing attention to irrelevant data, and it concurrently reduces reliance on external information sources, thereby enhancing the capacity to capture intrinsic relationships within input data. By integrating the self-attention mechanism, the LSTM can effectively address challenges related to information overload, while simultaneously improving predictive accuracy and system robustness.

The attention mechanism is fundamentally based on three parameters: $Q$, $K$, and $V$. The output of the LSTM, denoted as $h_t$, serves as the input for the SA module. The output of the SA module, denoted as $\mathrm{Attention}(Q, K, V)_i$, is calculated as illustrated in Eq. (13).

$\begin{cases} Q = \omega_Q h_t \\ K = \omega_K h_t \\ V = \omega_V h_t \\ \mathrm{Attention}(Q, K, V)_i = \mathrm{softmax}\!\left(\dfrac{Q K^{T}}{\sqrt{d_k}}\right) v_i \end{cases} \qquad (13)$

The predicted value $y_t^{i}$ of the target water quality parameter is presented in Eq. (14).

$y_t^{i} = \mathrm{softmax}\left(\omega_{out}\,\mathrm{Attention}(Q, K, V) + b_{out}\right) \qquad (14)$

where $\omega_{out}$ represents the output weight matrix of the fully connected network, and $b_{out}$ denotes the output bias vector across the entire network structure.
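As a concrete illustration of Eq. (13), the sketch below applies scaled dot-product self-attention to a short sequence of LSTM outputs; the randomly initialized projection matrices stand in for the learned weights $\omega_Q$, $\omega_K$, and $\omega_V$, so this is an assumption-laden sketch rather than the authors' implementation.

```python
import numpy as np

def self_attention(H, d_k):
    """Scaled dot-product self-attention over LSTM outputs H of shape
    (T, d_h), following Eq. (13); W_q, W_k, W_v are illustrative
    randomly initialized projections."""
    rng = np.random.default_rng(1)
    d_h = H.shape[1]
    W_q, W_k, W_v = (rng.normal(size=(d_h, d_k)) * 0.1 for _ in range(3))
    Q, K, V = H @ W_q, H @ W_k, H @ W_v
    scores = Q @ K.T / np.sqrt(d_k)                      # (T, T) similarity scores
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)       # row-wise softmax
    return weights @ V                                   # attended features

H = np.random.rand(6, 8)         # 6 time steps of 8-dimensional LSTM outputs
context = self_attention(H, d_k=8)
print(context.shape)             # (6, 8)
```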

(2) ESSA module

The SSA, a swarm intelligence-based optimization technique, is widely recognized for its robust optimization capabilities and rapid convergence. However, the standard SSA implementation exhibits certain limitations that hinder its performance. Firstly, the random initialization of sparrow population positions at the algorithm’s inception results in limited population diversity, consequently yielding suboptimal target solutions that adversely affect the algorithm’s iterative performance and error rates. Secondly, the discoverers in SSA tend to exhibit excessive aggressiveness during food source exploration. Upon locating an optimal solution, other individuals rapidly converge towards it, thereby diminishing population diversity and increasing susceptibility to local optima. Furthermore, the underutilization of the finder’s positional information may cause the algorithm to overlook potential optimal regions, thereby missing valuable exploration opportunities. To address these limitations, this study proposes the ESSA module that incorporates two key improvements: (1) composite chaos mapping for population initialization and (2) an adaptive inertial weight factor. These modifications aim to optimize the hyperparameters of LSTM neural networks, effectively resolving issues related to slow parameter convergence and inadequate global search capabilities. Specifically, the improved Tent-Logistic-Cosine composite mapping enhances the initial positioning of sparrows, as detailed in Eqs. (15) to (17), while the adaptive inertia weight factor and finder position optimization are mathematically formulated in Eqs. (18) and (19).

$y_{i+1} = \begin{cases} \cos\left(\pi\left(2 r y_i + 4(1-r)\,y_i(1-y_i) - 0.5\right)\right), & y_i < 0.5 \\ \cos\left(\pi\left(2 r (1-y_i) + 4(1-r)\,y_i(1-y_i) - 0.5\right)\right), & y_i \geq 0.5 \end{cases} \qquad (15)$

where $y_i$ represents the sequence value at the $i$-th iteration, and $r$ denotes the control parameter that governs the system’s behavior.

When $y_i \in [0, 1]$ and $r \in [0, 1]$, Eq. (15) is normalized to constrain its range within the interval [0, 1], as expressed by Eq. (16).

$z_i = \mathrm{abs}\left(\omega\, y_i\right) \qquad (16)$

where $\mathrm{abs}(\cdot)$ represents the absolute value function, while $\omega$ serves as the normalization constant.

The final initialization of sparrow positions is given by Eq. (17).

$X_{i,j}^{1} = lb_j + z_i\left(ub_j - lb_j\right) \qquad (17)$

where $X_{i,j}^{1}$ represents the initial coordinate of the $i$-th sparrow in the $j$-th dimension, and $lb_j$ and $ub_j$ denote the lower and upper bounds of the $j$-th dimension, respectively. The dimension $j$ corresponds to the parameter to be optimized, with $j = 4$ in the current implementation. Specifically, the parameter ranges are defined as follows: the learning rate is bounded within [0.001, 0.1], the iteration number is constrained to [10, 1,000], and the number of hidden layer neurons is limited to [1, 100].
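A compact sketch of this chaotic initialization is given below. The arrangement of terms in the composite map, the choice of $r$ and $\omega$, and the illustrative hyperparameter bounds are reconstructions and assumptions based on the description above, not the authors' code.

```python
import numpy as np

def tlc_map(y, r=0.7):
    """One iteration of the Tent-Logistic-Cosine composite map, Eq. (15);
    the exact arrangement of terms is reconstructed from the text."""
    if y < 0.5:
        return np.cos(np.pi * (2 * r * y + 4 * (1 - r) * y * (1 - y) - 0.5))
    return np.cos(np.pi * (2 * r * (1 - y) + 4 * (1 - r) * y * (1 - y) - 0.5))

def init_population(n_sparrows, lb, ub, omega=1.0, seed=0.37):
    """Chaotic initialization of sparrow positions, Eqs. (16)-(17)."""
    lb, ub = np.asarray(lb, float), np.asarray(ub, float)
    pop = np.empty((n_sparrows, len(lb)))
    y = seed
    for i in range(n_sparrows):
        for j in range(len(lb)):
            y = abs(omega * tlc_map(y))              # Eqs. (15)-(16): z in [0, 1]
            pop[i, j] = lb[j] + y * (ub[j] - lb[j])  # Eq. (17): scale to bounds
    return pop

# Illustrative bounds: learning rate, epochs, hidden-layer neurons (two layers).
pop = init_population(50, lb=[0.001, 10, 1, 1], ub=[0.1, 1000, 100, 100])
```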

$\omega_t = 0.5\left(1 - \left(\dfrac{2t}{iter_{\max}} - 1\right)^{3}\right) \qquad (18)$

$X_{i,j}^{t+1} = \begin{cases} \omega_t\, X_{i,j}^{t} \cdot \exp\left(\dfrac{-i}{\alpha \cdot iter_{\max}}\right), & R_2 < T_s \\ \omega_t\left(X_{i,j}^{t} + Q \cdot L\right), & R_2 \geq T_s \end{cases} \qquad (19)$

where $\omega_t$ represents the adaptive inertia weight factor, with $\omega_{\max}$ set to 0.9 and $\omega_{\min}$ to 0.3. The variable $t$ denotes the current iteration index, and $j$ indicates the dimensionality of the parameters to be optimized. $X_{i,j}^{t}$ represents the position of the $i$-th sparrow in the $j$-th dimension at the $t$-th iteration. The term $L$ denotes a unit row vector, and $\alpha$ is a random number uniformly distributed in the interval [0, 1]. The maximum iteration count is denoted by $iter_{\max}$, and $Q$ represents a random number following the standard normal distribution. The warning threshold $R_2$ satisfies $R_2 \in [0, 1]$, and the safety threshold $T_s$ is bounded within the range [0.5, 1].
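The adaptive weight and discoverer update can be sketched as follows. The negative sign inside the exponential follows the standard SSA discoverer rule and is an assumption where the source rendering is ambiguous; joiner and alarmer updates are not shown.

```python
import numpy as np

def inertia_weight(t, iter_max):
    """Adaptive inertia weight, Eq. (18), decreasing over the iterations."""
    return 0.5 * (1.0 - (2.0 * t / iter_max - 1.0) ** 3)

def update_discoverers(pop, t, iter_max, R2, Ts=0.6):
    """Discoverer position update with the inertia weight, Eq. (19).
    pop has shape (n_discoverers, dim); R2 is the current alarm value."""
    w = inertia_weight(t, iter_max)
    new_pop = np.empty_like(pop)
    for i, x in enumerate(pop, start=1):
        if R2 < Ts:
            alpha = np.random.uniform(1e-6, 1.0)          # avoid division by zero
            new_pop[i - 1] = w * x * np.exp(-i / (alpha * iter_max))
        else:
            Q = np.random.standard_normal()               # Q ~ N(0, 1)
            new_pop[i - 1] = w * (x + Q * np.ones_like(x))  # L is a unit row vector
    return new_pop

pop = np.random.rand(10, 4)
pop = update_discoverers(pop, t=5, iter_max=1000, R2=np.random.rand())
```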

The ESSA module is employed to optimize four critical hyperparameters of the SA-LSTM model: the learning rate, iteration count, neuron count in the first hidden layer, and neuron count in the second hidden layer. The optimization procedure consists of the following specific steps:

  • (a) Initialization: Configure the population size and the proportion of discoverers, joiners, and alarmers within the SSA module. The population is initialized using the Tent-Logistic-Cosine composite mapping to ensure a more uniform distribution of the initial population.

  • (b) Parameter Space Definition: Define the dimensionality of the parameter space and establish the search boundaries for each parameter, thereby determining the initial positions of the sparrow population.

  • (c) Fitness Evaluation: Calculate the fitness value for each sparrow using the error function, identifying the optimal fitness value and its corresponding position.

  • (d) Position Update: Incorporate the adaptive inertia factor to update the positions of discoverers, joiners, and alarmers through the improved dynamic step size mechanism.

  • (e) Termination Check: Re-evaluate the fitness values of the sparrow population. If the error falls within the target threshold, terminate the iteration; otherwise, return to step (d) for further optimization.

The workflow of the ESSA module is depicted in Fig. 4.


Figure 4: Workflow of the ESSA.
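A minimal loop skeleton tying steps (a) to (e) together is sketched below. It reuses the init_population and update_discoverers helpers from the sketches above; the fitness function is a stand-in for training an SA-LSTM candidate and returning its validation error, and the joiner and alarmer updates are omitted for brevity.

```python
import numpy as np

def essa_optimize(fitness, lb, ub, n_sparrows=50, iter_max=100,
                  p_discoverer=0.2, Ts=0.6):
    """Skeleton of the ESSA hyperparameter search, steps (a)-(e).
    `fitness` maps a hyperparameter vector to a validation error; in the
    paper this would train the SA-LSTM with that configuration."""
    lb, ub = np.asarray(lb, float), np.asarray(ub, float)
    pop = init_population(n_sparrows, lb, ub)            # steps (a)-(b)
    scores = np.array([fitness(x) for x in pop])         # step (c)
    best_i = scores.argmin()
    best, best_score = pop[best_i].copy(), scores[best_i]
    for t in range(1, iter_max + 1):                     # step (d)
        order = scores.argsort()
        n_d = max(1, int(p_discoverer * n_sparrows))
        pop[order[:n_d]] = update_discoverers(pop[order[:n_d]], t, iter_max,
                                              R2=np.random.rand(), Ts=Ts)
        pop = np.clip(pop, lb, ub)                        # keep inside search bounds
        scores = np.array([fitness(x) for x in pop])      # step (e): re-evaluate
        if scores.min() < best_score:
            best_i = scores.argmin()
            best, best_score = pop[best_i].copy(), scores[best_i]
    return best, best_score

# Toy fitness for illustration only.
best, err = essa_optimize(lambda x: float(np.sum((x - 0.5) ** 2)),
                          lb=np.zeros(4), ub=np.ones(4), iter_max=20)
```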

(3) Spatial prediction models

The key water quality parameters exhibit distinct three-dimensional distribution characteristics in large surface ponds. Significant variations are observed both across different water layers within the same vertical plane and at different positions within the same water layer. Consequently, measurements from a single point cannot adequately represent the overall water quality dynamics of the entire pond. To better characterize the spatiotemporal distribution of these critical parameters, a three-dimensional spatial analysis method is employed, enabling comprehensive monitoring of water quality parameter distributions throughout the pond.

To enhance the interpolation accuracy of water quality parameters, this study utilizes the RBF for interpolating prediction results. The coordinates of the RBF centers are determined from the training data using the k-means clustering algorithm. For each RBF center, the width parameter σ is calculated based on the average distance from this center to its p nearest neighboring centers, expressed by Eq. (20). To mitigate the risk of overfitting, L2 regularization is applied when solving for the output layer weights.

$\sigma_i = \dfrac{1}{p}\sum_{j=1}^{p}\left\lVert c_i - c_j \right\rVert \qquad (20)$

where $p = 3$ is a predefined hyperparameter that represents the number of the nearest neighboring centers considered; $c_i$ is the $i$-th RBF center; $c_j$ is the $j$-th nearest neighbor of center $c_i$; and $\left\lVert c_i - c_j \right\rVert$ represents the Euclidean distance between the two centers.
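To make the interpolation step concrete, the sketch below is an illustrative reconstruction rather than the released code: k-means centers via scikit-learn, widths from the p = 3 nearest centers as in Eq. (20), a Gaussian basis (an assumption, since the basis function is not specified above), and an L2-regularized least-squares solve for the output weights.

```python
import numpy as np
from sklearn.cluster import KMeans

def fit_rbf(X, Y, n_centers=10, p=3, lam=1e-3):
    """Fit an RBF network: k-means centers, widths from the p nearest
    centers (Eq. 20), and L2-regularized output weights."""
    centers = KMeans(n_clusters=n_centers, n_init=10, random_state=0).fit(X).cluster_centers_
    d = np.linalg.norm(centers[:, None, :] - centers[None, :, :], axis=-1)
    sigma = np.sort(d, axis=1)[:, 1:p + 1].mean(axis=1)        # Eq. (20), skip self-distance
    Phi = np.exp(-np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=-1) ** 2
                 / (2 * sigma ** 2))                            # Gaussian basis activations
    W = np.linalg.solve(Phi.T @ Phi + lam * np.eye(n_centers), Phi.T @ Y)  # ridge solve
    return centers, sigma, W

def predict_rbf(X, centers, sigma, W):
    Phi = np.exp(-np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=-1) ** 2
                 / (2 * sigma ** 2))
    return Phi @ W

# 15 inputs (3 nearest sensors x 5 parameters) -> 5 outputs at the target point.
X = np.random.rand(200, 15)
Y = np.random.rand(200, 5)
centers, sigma, W = fit_rbf(X, Y)
pred = predict_rbf(X[:1], centers, sigma, W)
```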

The spatiotemporal prediction outcomes are demonstrated using DO as a representative example. Figure 5 illustrates the structural framework of the RBF-based spatiotemporal prediction model. The model incorporates data from the three sensors nearest to the interpolation point at time t+1 as input, generating predictions for both water temperature and DO values at the specified interpolation point and time.


Figure 5: Structural framework of the RBF.

This process begins with the construction of an RBF network and the initialization of its parameters. Subsequently, the time series prediction results at time $t+1$ from the three monitoring points nearest to the target prediction location are used as inputs to the RBF network. Specifically, 15 data points, represented as $X_{t+1}^{1,1}$ to $X_{t+1}^{1,5}$, $X_{t+1}^{2,1}$ to $X_{t+1}^{2,5}$, and $X_{t+1}^{3,1}$ to $X_{t+1}^{3,5}$, are employed to predict the target point’s pH, ammonia nitrogen, water temperature, DO, and ORP values ($y_{t+1}^{1}$ to $y_{t+1}^{5}$) at time $t+1$.

Results and discussions

The experimental setup in this study comprised a computing system with the following specifications: an Intel Core i5-8265U processor, 16 GB of RAM, and the Windows 10 operating system. The computational framework was implemented using Python 3.10 through the Anaconda distribution platform. For model training, the experimental protocol employed the gradient descent optimization algorithm with the following parameters: a batch size of 32 samples, an input sequence length (time steps) of 6, and 1,000 training epochs. The Adam optimizer was employed with an initial learning rate of 0.001. To mitigate overfitting, the dropout rate was set to 0.3. The key parameters of the ESSA adopted in this study are shown in Table 3.
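For concreteness, the sketch below assembles a training loop consistent with the configuration reported above (batch size 32, six time steps, Adam with an initial learning rate of 0.001, dropout of 0.3, 1,000 epochs) and the 81 hidden neurons listed in Table 4. The architecture itself, a single LSTM layer followed by single-head self-attention, is a simplified assumption rather than the authors' released implementation, and it is trained here on random placeholder tensors.

```python
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

class SALSTM(nn.Module):
    """Simplified SA-LSTM: LSTM encoder, single-head self-attention,
    dropout, and a linear head predicting the five parameters at t+1."""
    def __init__(self, n_features=5, hidden=81, dropout=0.3):
        super().__init__()
        self.lstm = nn.LSTM(n_features, hidden, batch_first=True)
        self.attn = nn.MultiheadAttention(hidden, num_heads=1, batch_first=True)
        self.drop = nn.Dropout(dropout)                 # dropout rate of 0.3
        self.head = nn.Linear(hidden, n_features)

    def forward(self, x):                               # x: (batch, 6, 5)
        h, _ = self.lstm(x)                             # (batch, 6, hidden)
        a, _ = self.attn(h, h, h)                       # self-attention over the 6 steps
        return self.head(self.drop(a[:, -1]))           # values at t + 1

model = SALSTM()
optimizer = torch.optim.Adam(model.parameters(), lr=0.001)
loss_fn = nn.MSELoss()
loader = DataLoader(TensorDataset(torch.rand(256, 6, 5), torch.rand(256, 5)),
                    batch_size=32, shuffle=True)        # batch size of 32
for epoch in range(1000):                               # 1,000 epochs as reported
    for xb, yb in loader:
        optimizer.zero_grad()
        loss_fn(model(xb), yb).backward()
        optimizer.step()
```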

Table 3:
Key parameters of the ESSA.
Parameter Value
Population size 50
Discoverers ratio 20%
Joiners ratio 70%
Alerters ratio 10%
R2, Alert value 0.8
ST, Safety threshold 0.6
Max iterations 1,000
DOI: 10.7717/peerj-cs.3515/table-3

This study identifies the optimal combination of three key hyperparameters of the SA-LSTM model using the ESSA algorithm. The specified search ranges for optimization and the final results of the ESSA-SA-LSTM model are presented in Table 4.

Table 4:
Optimization hyperparameters for the SA-LSTM using the ESSA.
Optimized parameter Search range Optimal value
Initial learning rate [0.001, 0.01] 0.00369
Epochs [10, 1,000] 512
Hidden neurons [1, 1,000] 81
DOI: 10.7717/peerj-cs.3515/table-4

Ablation experiments

To comprehensively evaluate the predictive performance of the proposed ESSA-SA-LSTM model, systematic ablation experiments were conducted using LSTM as the baseline model. The experimental design included three key components: the baseline LSTM, SA mechanism and ESSA optimization. The comparative results of these experiments are presented in Table 5.

Table 5:
Results of ablation experiments.
Group LSTM SA ESSA RMSE MAE R2
1 ✓ – – 0.0384 0.0287 0.9285
2 ✓ ✓ – 0.0305 0.0270 0.9551
3 ✓ ✓ ✓ 0.0157 0.0122 0.9881
DOI: 10.7717/peerj-cs.3515/table-5

The proposed model demonstrates significant performance improvements across all evaluation metrics compared to alternative architectures. Specifically, when compared to the SA-LSTM model, it achieves an average reduction of 48.52% in RMSE and 54.81% in MAE, while improving the R2 value by an average of 3.46%. More substantial enhancements are observed relative to the LSTM model, with average reductions of 59.11% in RMSE and 57.49% in MAE, respectively, along with an average increase of 6.42% in R2.

These performance gains can be attributed to three key innovations: (1) the implementation of correlation analysis on the original data, which effectively reduces computational redundancy; (2) the integration of a self-attention mechanism that mitigates information loss during sequential processing; (3) the incorporation of composite mapping and adaptive inertia factors within the enhanced SSA algorithm, which optimizes LSTM parameters, accelerates convergence, and enhances global search capabilities, ultimately improving overall model performance. The proposed model requires less than 10 MB of memory and achieves an average inference time of under 20 ms, both of which are entirely acceptable. In scenarios that demand even faster execution, the inference time can be significantly reduced by strategically compromising the density of the spatial prediction points.

Comparison of experimental results on time-series sequence

The model proposed in this study is benchmarked against the SA-LSTM and standard LSTM models, all of which utilize identical datasets. Using Point 1 (Fig. 1D) as a case study, the comparative performance across five key variables is illustrated in Figs. 6A to 6E. The performance metrics reveal substantial improvements: compared to the SA-LSTM model, the proposed model achieves reductions of 75.00% in RMSE and 73.24% in MAE, along with a 3.37% increase in R2. More significant enhancements are observed relative to the LSTM model, with 80.91% and 79.12% reductions in RMSE and MAE, respectively, and a 6.40% improvement in R2. These results substantiate that the proposed model not only accurately tracks the temporal dynamics of dissolved oxygen and water temperature variations but also effectively predicts their absolute values. The comprehensive evaluation confirms the superior predictive capabilities of the proposed architecture over existing models.


Figure 6: Performance of different models across five water quality parameters.

(A) Prediction performance of DO for different algorithms. (B) Prediction performance of ORP for different algorithms. (C) Prediction performance of pH for different algorithms. (D) Prediction performance of water temperature for different algorithms. (E) Prediction performance of ammonia nitrogen for different algorithms.

The proposed model demonstrates a significant enhancement in performance across all five water quality parameters compared to the other two models. Significance testing reveals that all p-values are substantially below the 0.05 threshold, indicating highly statistically significant differences. These results further substantiate the superiority of the model presented in this study.

Spatiotemporal distribution prediction of water quality parameters

Using DO and water temperature as representative cases, the collected dataset reveals distinct diurnal patterns with observable peaks occurring at 12:00 and 24:00. This temporal variation underscores the necessity of developing accurate spatiotemporal distribution models for these critical water quality parameters at these specific time points. To address this, the study employs the ESSA-SA-LSTM model to predict DO and water temperature levels at 12:00 and 24:00 on April 15, 2024. These prediction results subsequently serve as input data for comprehensive spatiotemporal analysis. The spatial prediction component was implemented using the Python computational platform. For enhanced visualization and analysis of the spatiotemporal distribution patterns, the model incorporates RBF interpolation to generate detailed distribution maps. Figure 7 presents the spatial distribution characteristics of DO and water temperature at both 12:00 and 24:00, with subfigures 7A through 7D providing comprehensive visual representations of these patterns.


Figure 7: Spatiotemporal distribution prediction of DO and water temperature at 12:00 and 24:00.

(A) Predicted DO distribution at 12:00. (B) Predicted water temperature distribution at 12:00. (C) Predicted DO distribution at 24:00. (D) Predicted water temperature distribution at 24:00.

As illustrated in Figs. 7A and 7C, the spatiotemporal distribution of dissolved oxygen exhibits distinct vertical stratification, with DO concentration progressively decreasing with increasing depth from the water surface. This phenomenon can be attributed to the primary mechanism of oxygen production in aquatic systems-photosynthesis by phytoplankton and aquatic plants, which predominantly inhabit the surface and littoral zones. As depth increases, photosynthetic activity diminishes due to light attenuation caused by water refraction and turbidity, consequently reducing oxygen production and resulting in lower dissolved oxygen levels in deeper water layers. Similarly, Figs. 7B and 7D reveal a comparable vertical temperature gradient, with water temperature decreasing as distance from the surface increases. This thermal stratification pattern results from the extensive exposure of the surface layer to solar radiation, creating a temperature differential between the warmer surface waters and cooler deeper layers.

To comprehensively evaluate the performance of the proposed RBF method, we conducted comparative experiments using ESSA-SA-LSTM integrated with two alternative interpolation approaches: IDW and linear triangular interpolation. The evaluation was performed on identical datasets to ensure fair comparison. The study employed a cross-validation methodology to assess the accuracy of each algorithm. Table 6 presents the spatiotemporal prediction accuracy metrics for DO and water temperature across all evaluated algorithms. The results demonstrate that, compared to ESSA-SA-LSTM with linear triangular interpolation and ESSA-SA-LSTM with IDW, the proposed ESSA-SA-LSTM-RBF approach achieved reductions in MSE of 38.49% and 48.08%, respectively. Similarly, RMSE decreased by 35.15% and 27.96%, while MAE showed reductions of 15.09% and 3.68%, respectively. Through a comparative analysis of interpolation accuracy for DO and water temperature, the results indicate that the proposed algorithm exhibits superior performance, particularly in scenarios with limited monitoring points. This enhanced performance is attributed to the RBF method’s ability to effectively handle sparse data distributions while maintaining interpolation accuracy.

Table 6:
Spatiotemporal prediction accuracy of the three algorithms.
Algorithm MSE RMSE MAE
Ours 0.1892 0.4349 0.3853
ESSA-SA-LSTM-Linear triangular interpolation 0.3076 0.6706 0.4538
ESSA-SA-LSTM-IDW 0.3644 0.6037 0.4000
DOI: 10.7717/peerj-cs.3515/table-6

Conclusion

To precisely characterize the spatiotemporal distribution of DO and water temperature in aquaculture ponds and mitigate associated farming risks, this study proposes an innovative algorithm that integrates an enhanced sparrow search algorithm with an SA-LSTM network and an RBF network. The primary contributions and findings can be summarized as follows:

  • (1) For the time-series prediction of water quality parameters, the proposed methodology first employs a self-attention mechanism to capture intrinsic correlations within the data and highlight the influence of key features in the input data. Subsequently, an improved sparrow search algorithm is implemented to optimize the hyperparameters of the LSTM network. Finally, ablation experiments demonstrate that the proposed time-series prediction model significantly enhances the accuracy of water quality parameter forecasting.

  • (2) Comprehensive comparative experiments demonstrate the superior performance of ESSA-SA-LSTM-RBF, showing significant improvements over alternative approaches. Specifically, compared to ESSA-SA-LSTM with linear triangular interpolation and ESSA-SA-LSTM-IDW algorithms, the proposed method achieves reductions in MSE of 38.49% and 48.08%, respectively. Similarly, RMSE values decrease by 35.15% and 27.96%, while MAE improvements reach 15.09% and 3.68%, respectively. These results confirm the enhanced predictive capabilities of the proposed algorithm over conventional methods.

  • (3) In the next phase of our research, we will systematically incorporate models such as GRU, Bi-LSTM, TCN, and the classical ARIMA model into our comparative framework. A rigorous hyperparameter tuning process will be conducted for these models to ensure a comprehensive performance evaluation. Furthermore, we will introduce additional spatial interpolation algorithms, such as Kriging, for comparative analysis. Confidence intervals and statistical tests will also be implemented to ascertain whether the observed differences are statistically significant. We also plan to systematically collect monitoring data from ponds and similar water bodies across various seasons, such as summer and autumn, as well as from different geographical locations. Incorporating multi-source external driving factors is essential for enhancing the model’s physical interpretability and generalization performance. Meanwhile, the analysis of the model’s time complexity will be performed to rigorously establish its practical feasibility and scalability.

The developed spatiotemporal model offers dual advantages: it not only enhances the accuracy of time series predictions but also improves the precision of spatial predictions. This advancement provides a robust theoretical foundation for intelligent aquaculture systems.

Supplemental Information

Code for water variables spatiotemporal distribution.

DOI: 10.7717/peerj-cs.3515/supp-1

Comparison prediction data of four models.

The data points indicate the outputs of the four models.

DOI: 10.7717/peerj-cs.3515/supp-2