Artificial intelligence-enhanced CSI-based Wi-Fi sensing for non-contact vital sign monitoring: a systematic review

View article
PeerJ Computer Science

Introduction

In recent years, health monitoring systems have emerged as a crucial component of modern healthcare, with the potential to transform patient care, improve health outcomes, and reduce costs (Majumder, Mondal & Deen, 2017). Advances in technology have enabled innovative approaches that leverage Wi-Fi signals and AI for non-invasive and continuous health monitoring (Armenta-Garcia, Gonzalez-Navarro & Caro-Gutierrez, 2024). These technologies offer unique advantages, including operation without direct patient contact and the ability to process complex data patterns to detect health issues in real time (Wang, Yang & Mao, 2020a).

However, despite these systems’ potential, ensuring their security remains a critical challenge. Health data are highly sensitive, and breaches can have serious consequences for patient privacy and well-being (Meingast, Roosta & Sastry, 2018). Moreover, the accuracy of such health monitoring systems is a significant concern, as environmental factors such as signal interferences can introduce noise, leading to false alarms or missed detections (Wang, Liu & Shahzad, 2016). Additionally, variations in hardware, user positioning, and individual physiological differences can affect system reliability (Abdelnasser, Youssef & Harras, 2015).

This literature review provides a comprehensive overview of the current state of Wi-Fi-based vital sign monitoring systems, with a particular focus on using Channel State Information (CSI) for contactless estimation of heart and respiration rates. Both artificial intelligence (AI)-based and deterministic (non-AI) approaches are examined. We evaluated their effectiveness, reliability, and clinical potential. This review is intended to support clinicians, biomedical engineers, and healthcare policymakers seeking non-invasive, scalable solutions for continuous patient monitoring in both home and clinical settings. Our objectives are fivefold: first, to report recent advances in CSI-based vital signs monitoring systems; second, to compare the methodologies used in CSI-based vital signs monitoring; third, to assess their noted performance in various settings; fourth, to identify the gaps and challenges in the field; and fifth, to suggest promising recommendations for future research and development. By analyzing the findings of recent studies, this review offers valuable insights for researchers, healthcare professionals, and engineers working toward enhancing these critical systems (Newaz et al., 2022).

The scope of this review comprises peer-reviewed journal studies published within the last 5 years. We focus primarily on health monitoring systems using Wi-Fi signals, in particular CSI, for non-invasive tracking of heart rate and respiration rate (Yang, Zhou & Liu, 2013). We consider both AI-based solutions and deterministic solutions to analyze their performance, reliability, and feasibility for clinical or home use. Studies on human activity recognition, fall detection, or those based on non-Wi-Fi wireless protocols were excluded to maintain a focused scope (Zou et al., 2018).

The primary contribution of this work can be outlined as follows:

  • 1.

    A structured systematic review adhering to the guidelines proposed by both Kitchenham & Charters (2007) and PRISMA (Page et al., 2021).

  • 2.

    Formulation of essential research questions exploring CSI-based vital sign monitoring systems, including both AI- and non-AI-based approaches.

  • 3.

    A comprehensive survey of methodologies used for Wi-Fi-based vital sign monitoring systems published over the past five years, including a critical analysis of their findings, strengths, and weaknesses.

  • 4.

    Identification and evaluation of proposed machine learning (ML) and deep learning models used to enhance Wi-Fi-based monitoring systems to make them reliable and trustworthy for clinician adoption.

  • 5.

    A focused discussion on current research in vital sign monitoring using Wi-Fi signals, highlighting its benefits, limitations, challenges, and future directions.

The rest of the article is organized as visualized in Fig. 1.

Overview of the article structure.

Figure 1: Overview of the article structure.

Theoretical background

Overview of health monitoring systems

Health monitoring systems have revolutionized healthcare by offering continuous methods to track vital signs and detect potential health issues. These systems range from wearable devices to ambient sensors, providing real-time data on various physiological parameters. Traditional hospital-based methods typically rely on multiple wired sensors, which restrict patient mobility and comfort (Zhang et al., 2023). These systems often require manual checks, increasing staff workload. Moreover, subtle patient movements can easily affect these devices, leading to inaccurate readings and multiple false alarms (Liu et al., 2021).

To address the limitations of traditional devices, recent advancements in healthcare monitoring have embraced Wi-Fi-based sensing technologies, which show promising results for non-contact vital signs monitoring. For instance, Wang, Yang & Mao (2020b) demonstrated that Wi-Fi can accurately monitor breathing and heart rates. However, challenges remain in data privacy, real-world accuracy, and healthcare workflow integration. Therefore, addressing these issues is crucial for the widespread adoption of advanced health monitoring systems.

CSI and its role in health monitoring

When a transmitter (TX) and a receiver (RX) communicate wirelessly through Wi-Fi, various environmental factors, including walls, windows, obstacles, and even human bodies, can affect the signals. Even the smallest change in surroundings can alter how signals propagate in the medium. CSI essentially describes the characteristics of the communication medium between the transmitter and receiver, providing detailed insights into movements and changes in the environment with each packet. CSI sensitivity to micro movements makes it a perfect fit for sensing vital signs such as respiration rate and heart rate. For instance, subtle chest movements during the breathing cycle—inhaling and exhaling—affect signal propagation, and are therefore captured by CSI. Thus, Wi-Fi signals are a promising tool for non-invasive health monitoring. By analyzing subtle changes in Wi-Fi signal patterns caused by human presence and movement, researchers have developed innovative methods for detecting vital signs and activities. CSI extracted from Wi-Fi signals has been successfully used to monitor respiratory and heart rates. For instance, Zhang et al. (2023) achieved 96.887% accuracy for estimating breathing rate and 94.708% accuracy for heart rate using CSI data. Similarly, Zeng et al. (2019) demonstrated respiration monitoring with 98% detection rate at distances up to 9 m.

More advanced applications include sleep monitoring and apnea detection. Yu et al. (2021) developed Wi-Fi sleep, achieving 81.8% accuracy in four-stage sleep classification, while Yang et al. (2021) created a system for detecting sleep apnea with over 95% accuracy.

Wi-Fi-based health monitoring offers several advantages, such as:

  • Non-invasive and contactless monitoring, ideal for long-term patient observation.

  • Utilizing existing Wi-Fi infrastructure, reducing implementation costs.

  • Ability to operate through walls and furniture, allowing for discreet monitoring.

  • High accuracy in readings and predictions, aiding early intervention in critical cases.

  • Potential for muti-person monitoring, as demonstrated by Gao et al. (2020) for respiration tracking.

However, researchers have yet to fully address challenges such as sensitivity and environmental changes. Ongoing research focuses on improving accuracy of readings and expanding the range of detectable health parameters to enable better health assessments and early disease detection.

Wi-Fi CSI overview

Wi-Fi CSI extraction

CSI extraction can be achieved through various tools, each of which is associated with specific Wi-Fi chipsets. One widely used tool is the Linux 802.11n CSI, which is built on top of the Intel 5300 Wi-Fi chipset. This tool is popular because it can be plugged directly into a PC or laptop, facilitating both CSI extraction and data processing within a single setup.

Another tool for CSI extraction is the Atheros CSI, which is now somewhat outdated. Although building this system from source code is no longer possible due to its obsolescence, prebuilt images are available on GitHub. These images can be flashed onto routers equipped with Atheros Wi-Fi chipsets, such as the TP-Link TL-WDR4300, which is often used with this system. This method allows users to collect CSI data without manually compiling the software.

The ESP32 chip is another versatile and powerful option for capturing CSI data. As a system on chip, it integrates both Wi-Fi and Bluetooth functionalities, making it highly flexible for various applications. The ESP32 supports a wide range of IEEE 802.11 Wi-Fi standards, ensuring compatibility with most Wi-Fi networks. One of its key features is its ability to function in multiple modes: it can act as a transmitter (TX), receiver (RX), or both. Additionally, it can be set up as an access point or operate in master mode, allowing interaction with other devices in various network configurations. For CSI measurement, the ESP32 provides detailed data from 64 subcarriers, enhancing the quality of the collected data. Furthermore, it supports up to 16 antennas, increasing its capacity to capture and analyze complex signal interactions. Thus, the ESP32 is highly effective for collecting and processing CSI data in research and applications such as health monitoring (Atif et al., 2020).

Moreover, CSI can be captured using a Raspberry Pi 4 (RPi), which is equipped with the BCM43455C0 Wi-Fi chip. To enable effective CSI extraction, the chip requires a specialized firmware patch, Nexmon, which modifies the BCM43455C0 to operate in monitor mode. Monitor mode allows the Wi-Fi chip to capture all 802.11 frames, including those not normally visible in standard operating modes—a capability crucial for analyzing Wi-Fi signal interactions as they travel through or bounce off objects. Once Nexmon firmware is installed, the RPi can collect raw CSI data from monitored Wi-Fi frames. Beyond mere CSI capture, the RPi can filter and process CSI data (Chafni, 2022), enhancing its utility in research and applications by providing a comprehensive solution for both data collection and initial data handling.

It is important to point out that CSI extraction at the chipset level is usually handled by low-level tools and firmware written in languages such as C or C++. For instance, the Intel 5300 CSI or Nexmon operate at the Wi-Fi driver or firmware layer, allowing raw CSI data to be captured directly from the network interface card (NIC). Once CSI data are gathered and stored, either as logs or structured data, more advanced tools, such as Python, are commonly used later for post-processing, filtering, visualizing, or applying machine learning and deep learning techniques.

Understanding CSI

Wi-Fi signals can be described using two main metrics: received signal strength (RSS) and CSI. RSS is not always ideal due to its sensitivity to environmental changes, which can lead to false detections (Wang et al., 2018). In contrast, CSI has gained popularity in recent years for its ability to accurately describe signal properties as they travel between transmitters and receivers. CSI offers detailed insights into signal behavior, including reflections, multipath propagation, and the detection of human movements (Ma, Zhou & Wang, 2020).

CSI values are typically represented as complex numbers, comprising both amplitude and phase components of the received wireless signals. The amplitude represents the magnitude of the complex number, while the phase represents the vector’s shift angle. To analyze these values, research articles often segment CSI data into amplitude and phase components before preprocessing. Some studies focus exclusively on either amplitude or phase, while others utilize both, stating that the two features offer complementary information. The general representation of CSI can be expressed as:

CSI=a+bi.

Instead of relying on manual processes, the NumPy library in Python can simplify the extraction of amplitude and phase information from complex numbers. The NumPy library provides functions such as abs() to compute amplitude (magnitude) and angle() to determine phase angle. By applying these functions to a given complex number (a + bj), researchers can efficiently break down a CSI value into its amplitude and phase components.

Typical setup

While study setups vary, the core elements remain consistent. Generally, the setup includes Wi-Fi-enabled devices acting as a transmitter and a receiver, both equipped with antennas. The hardware may consist of routers, laptops, or RPi (Abdelnasser, Youssef & Harras, 2015; Zeng, Pathak & Mohapatra, 2016). However, CSI extraction is limited by the availability of the specific tools discussed above, as each tool supports only a certain NIC, which restricts the range of compatible hardware (Abdelnasser, Youssef & Harras, 2015; Li et al., 2020). The main advantage of this setup is that individuals do not need to wear any devices; instead, existing Wi-Fi signals are employed. As people move or breathe, these signals are affected, and the resulting changes are captured in the CSI data. The data are then sent to a computer for preprocessing, such as noise removal or dimension reduction, and then analyzed using machine learning or AI models for tasks like breathing tracking, fall detection, or gesture recognition (Abdelnasser, Youssef & Harras, 2015).

Methodology

This section presents the systematic review process used to search, screen, and evaluate applicable studies. This review investigates peer-reviewed articles from the past 5 years that use Wi-Fi CSI for contactless vital signs monitoring. To identify relevant contributions and summarize results, we examined published articles on AI and its associated fields were examined. The main aim of this review is to summarize methodologies, evaluate reported performance, and identify research gaps in the use Wi-Fi-based technologies for monitoring vital signs. To ensure the inclusion of the most relevant articles, we followed established guidelines set by Kitchenham & Charters (2007) and PRISMA (Page et al., 2021), as shown in the Appendix of this article. In addition, the detailed search procedure is outlined in the Appendix.

Research questions

The aim of the research questions is to identify a well-established plan to retrieve articles restricted only from the areas of consideration. Table 1 outlines the research questions identified in this article.

Table 1:
Research questions.
No. Research question Motivation
RQ1 What methods have been used in Wi-Fi sensing systems for non-invasive monitoring of vital signs such as heart rate and respiration rate? To identify and classify the technical approaches used in Wi-Fi-based vital signs monitoring.
RQ2 How reliable and effective are Wi-Fi signals for monitoring vital signs such as heart rate and breathing rate? To assess the effectiveness of incorporating Wi-Fi signals to non-invasive health monitoring in real-world healthcare environments.
RQ3 What are the main AI techniques which can enhance the accuracy and reliability of Wi-Fi health monitoring systems? To explore the impact of incorporating AI to enhance the performance of the vital signs monitoring systems, so that they become reliable to be used for clinical and home use.
RQ4 What are the key challenges, limitations, and research gaps in deploying Wi-Fi-based vital signs monitoring systems in clinical or home settings? To identify the challenges and research gaps in the application of these technologies along with highlighting future research opportunities.
DOI: 10.7717/peerj-cs.3375/table-1

Search strategy

To ensure a comprehensive and inclusive search of the literature, we developed a multi-layered keyword-based strategy using Boolean logic across six major databases: IEEE Xplore, ScienceDirect, SpringerLink, ACM Digital Library, Google Scholar, and ResearchGate. We focused on keyword searches in titles, abstracts, and keywords, which is appropriate given the topic’s engineering and computer science orientation. While PubMed and MeSH indexing are highly valuable for biomedical and public health research, many studies on Wi-Fi-based sensing and signal-level data processing are not indexed there. Therefore, we prioritized repositories that more comprehensively cover technical research. At the same time, we recognize the clinical relevance of our topic and the importance of including healthcare-related work. Although PubMed was not part of our main search strategy due to its limited coverage of wireless sensing technologies, we sought to compensate by using broader repositories such as Google Scholar and ResearchGate, which include research across both technical and medical disciplines. We also incorporated clinically related keywords, such as “heart rate,” “respiration rate,” “health monitoring,” and “patient monitoring,” to capture articles that bridge both fields. This approach helped us address the overlap between clinical applications and signal-level AI techniques. More details about the search logic, keywords used, and database selection are provided in Appendix.

Articles selection criteria

Choosing the right search terms is an important step in conducting a thorough and wide-ranging systematic review. In this study, we carefully selected search terms that included key concepts such as Wi-Fi signals and CSI. The goal was to strike a balance, not too generic to avoid unrelated articles and not too narrow to not miss related articles (Cooper et al., 2018). The primary keywords were derived from the research questions and refined through several iterations. Searches were then conducted using the six major databases mentioned earlier. A full list of search strings, keyword combinations, and databases searched as well as the overall search process are outlined in the Appendix.

Screening of articles

The results of the individual searches returned a total of 8,573 publication records: (IEEEXplore = 5,473, Springer Link = 250, ScienceDirect = 1,412, Google Scholar = 312, ACM Digital Library = 576, ResearchGate = 550). Following title and abstract screening, and the application of our inclusion/exclusion criteria (Table S3), 45 studies were included in the final analysis. A comprehensive description of the screening and exclusion process is available in the Appendix, along with figures and tables showing study selection by database, timeline, and final inclusion outcomes.

Taxonomy for study categorization and quality assessment

  • Taxonomy table: to facilitate comparative analysis, with classification dimensions (see Table 2).

  • Sensing technique: indicates if the study was relying on extracting CSI amplitude, phase, or both.

  • Use case: describes the health parameter being monitored by the study, such as respiration rate, heart rate, and sleep apnea.

  • AI integration level: specifies the degree of AI involvement, ranging from no AI integration to traditional machine learning or deep learning approaches.

  • Evaluation: defines the complexity of the environment in which the system was tested, including the number of people considered and whether the system was running in real-time.

  • Metrics reported: lists the metrics used to evaluate the system, including Mean Absolute Error (MAE), accuracy, recall, precision, specificity, and F1-score.

Table 2:
Taxonomy for study categorization.
Study Sensing technique Use case AI integration level Evaluation context Metrics reported
Yu et al. (2021) Amplitude and phase Sleep stage Deep learning Single-person, Real-time MAE, Accuracy, Recall, Precision,
F-score
Bao et al. (2023) Amplitude and phase Respiration rate Machine learning Single-person, Real-time MAE, Accuracy
Khan et al. (2023) Amplitude and phase Respiration rate Machine learning Single-person Accuracy
Kumar et al. (2022) Amplitude and phase Respiration rate and heart rate Deep learning and machine learning Single-person Accuracy
Zhang et al. (2024) Amplitude and phase Respiration rate Machine learning Multi-person Accuracy
Guo et al. (2023) Amplitude and phase Respiration rate Machine learning Single-person, Real-time MAE
Kontou, Smida & Anagnostou (2024) Amplitude Respiration rate Deep learning Single-person MAE, Accuracy
Hu et al. (2023) Amplitude and phase Respiration rate Deep learning Multi-person, Real-time MAE, Accuracy
Armenta-Garcia et al. (2021) Amplitude Respiration rate Machine learning Single-person, Real-time Accuracy, Recall, Precision, Specificity, F-score
Mosleh et al. (2022) Amplitude and phase Respiration rate Deep learning Single-person Accuracy, Recall, Precision, Specificity, F-score
Armenta-Garcia et al. (2023) Amplitude Respiration rate and heart rate Machine learning Single-person Accuracy, Recall, Precision, Specificity, F-score
Yang et al. (2021) Amplitude and phase Sleep Apnea Deep learning Single-person, Real-time Accuracy, Recall, Precision, F-Score
Zhang et al. (2023) Amplitude Respiration rate and heart rate No AI Single-person, Real-time Accuracy
Zhang et al. (2019) Phase Respiration rate No AI Multi-person Accuracy
Wang, Yang & Mao (2020b) Phase Respiration rate and heart rate No AI Multi-person MAE
Liu et al. (2021) Amplitude Respiration rate No AI Single-person MAE
Hussain et al. (2021) Amplitude and phase Respiration rate No AI Single-person, Real-time MAE
Wang, Yang & Wang (2022) Amplitude and phase Respiration rate No AI Single-person, Real-time Accuracy
Qiu et al. (2022) Amplitude and phase Respiration rate No AI Multi-person, Real-time MAE
Xie et al. (2022) CSI ratio (Amplitude and phase) Respiration rate No AI Single-person, Real-time MAE
Li et al. (2021) Amplitude and phase Respiration rate No AI Single-Person MAE
Wang et al. (2020) Amplitude Respiration rate No AI Multi-person Accuracy
Sun, Bian & Li (2024) Amplitude and phase Respiration rate No AI Single-person MAE, Accuracy
Zhuo et al. (2023) Amplitude and phase Respiration rate No AI Single-person Accuracy
Alzaabi, Arslan & Polydorides (2024) Amplitude Respiration rate No AI Single-person MAE
Zeng et al. (2019) Amplitude and phase Respiration rate No AI Single-person, Real-time Accuracy
Wang & Liu (2024) Amplitude Respiration rate No AI Single-person, Real-time Accuracy
Zeng et al. (2020) Amplitude Respiration rate No AI Multi-person MAE
Dou & Huan (2021) Amplitude and phase Respiration rate No AI Single-person MAE
Wang & Sun (2022) Phase Respiration rate No AI Single-person, Real-time Accuracy
Peng et al. (2024) Amplitude Respiration rate No AI Single-person, Real-time MAE
Atif et al. (2022) Amplitude Respiration rate No AI Single-person, Real-time Accuracy
Guo et al. (2023) Amplitude and phase Respiration rate No AI Single-person MAE, Accuracy
Gao et al. (2020) Amplitude and phase Respiration rate No AI Multi-person Accuracy
Gui et al. (2023) Amplitude Respiration rate No AI Single-person MAE
Gu et al. (2019) Amplitude Respiration rate and heart rate No AI Single-person, Real-time MAE, Accuracy
Guo et al. (2021) Amplitude and phase Sleep Apnea No AI Single-person Accuracy
Wang, Yang & Mao (2020c) Amplitude and phase Respiration rate No AI Single-person, Real-time MAE, Accuracy
Guan et al. (2023) Phase Respiration rate No AI Multi-person Accuracy
Ali et al. (2023) Amplitude Respiration rate No AI Single-person, Real-time MAE
Zhang et al. (2022) Amplitude and phase Respiration rate and heart rate No AI Single-person Accuracy
Li et al. (2020) Amplitude and phase Respiration rate No AI Single-person, Real-time None
Yin et al. (2022) Amplitude Respiration rate No AI Multi-person, Real-time MAE
Khamis et al. (2020) Amplitude and phase Respiratory cycle No AI Single-person, Real-time MAE
Zhuo, Zhong & Zhuo (2022) Amplitude and phase Respiration rate No AI Single-person, Real-time Accuracy
DOI: 10.7717/peerj-cs.3375/table-2

Table 3 assesses the quality of each study using seven criteria. Sample size defines the size of the dataset used. Validation method reflects the process used to train and test the data in AI studies, and how vital signs ground truth was obtained in non-AI articles. Benchmarking indicates whether the study compared its performance to previous work. Dataset, code, and algorithms availability, notes whether any of these resources are publicly accessible or included in the article. Reproducibility assesses how replicable the system’s performance is, considering factors such as dataset availability and the sufficiency of the data provided. Measurement method specifies the device, sensor, or method used to identify the ground truth data. Finally, overall quality ranks the study into one of three levels based on the information provided.

Table 3:
Quality assessment of the studies.
Study Sample size Validation method Benchmarking Dataset, Code, & Algorithms availability Reproducibility Measurement method Overall quality
Yu et al. (2021) 12 64–36 split Yes None Moderate PSG system Moderate
Bao et al. (2023) 5 10-fold
cross-validation
No None Moderate Respiration band Moderate
Khan et al. (2023) 1 80–20 split, 10-fold cross-validation No None Moderate Not mentioned Low
Kumar et al. (2022) 84 cross-validation Yes None Moderate Oximeter High
Zhang et al. (2024) 12 70–30 split No None Moderate Wi-side Moderate
Guo et al. (2023) Not mentioned Wearable Sensor No None Moderate Not mentioned Low
Kontou, Smida & Anagnostou (2024) 1 72% training, 18% validation, and 10% testing Yes None Moderate Not mentioned Moderate
Hu et al. (2023) 8 70–30 split No None Moderate Neulog respiration monitor belt logger sensor NUL-236, Pulse oximeter Moderate
Armenta-Garcia et al. (2021) 6 10-fold cross-validation No Algorithms Moderate Metronome Moderate
Mosleh et al. (2022) 1 10-fold cross-validation No Dataset (https://doi.org/10.18434/mds2-2963) High RespiPro: Calibrated breathing manikin Moderate
Armenta-Garcia et al. (2023) 17 10-fold cross-validation No Dataset, Code (https://github.com/AlbanyArmenta0711/WiFi-CSI-MiningTool) High Metronome + Triaxial Accelerometer for Breathing Rate, Polar H9 Sensor for Heart Rate High
Yang et al. (2021) Not mentioned Uknown No None Low Simulated patterns Low
Zhang et al. (2023) 10 Wearable sensor Yes None Moderate Accelerometer, pulse oximeter Moderate
Zhang et al. (2019) 8 Controlled breathing No None Moderate Metronome Moderate
Wang, Yang & Mao (2020b) 4 Wearable sensor No None Moderate Neulog Respiration Monitor Belt logger sensor NUL-236, Pulse Oximeter Low
Liu et al. (2021) 8 Wearable sensor No None Moderate Neulog Respiration Monitor Belt logger sensor NUL-236 Moderate
Hussain et al. (2021) 1 Manual counting No None Low Manual Counting Low
Wang, Yang & Wang (2022) Not mentioned Wearable sensor No None Moderate HKH-11C Digital Respiration Wave Sensor Low
Qiu et al. (2022) Not mentioned Controlled breathing Yes Algorithms Moderate Metronome Moderate
Xie et al. (2022) 1 Wearable sensor Yes None Moderate Respiration band Moderate
Li et al. (2021) 8 Public dataset Yes Dataset (https://doi.org/10.7910/DVN/X7AYXQ) High PSG data High
Wang et al. (2020) 12 Controlled breathing Yes Algorithms Moderate Metronome Moderate
Sun, Bian & Li (2024) 9 Wearable sensor No None Moderate Lepu Medical PalmECG PC-80D Moderate
Zhuo et al. (2023) 11 Controlled breathing Yes None Moderate Metronome Moderate
Alzaabi, Arslan & Polydorides (2024) 1 Wearable sensor Yes Dataset (doi.org/10.21227/d7d1-sg82) High Neulog Respiration Monitor Belt logger sensor NUL-236 High
Zeng et al. (2019) 12 Wearable sensor Yes None Moderate Neulog Respiration Monitor Belt logger sensor NUL-236 Moderate
Wang & Liu (2024) Not mentioned Wearable device No None Moderate Not mentioned Low
Zeng et al. (2020) 21 Wearable sensor Yes None Moderate Neulog Respiration Monitor Belt logger sensor NUL-236 Moderate
Dou & Huan (2021) Not mentioned Wearable sensor Yes None Moderate Neulog Respiration Monitor Belt logger sensor NUL-236 Moderate
Wang & Sun (2022) Not mentioned Wearable sensor No None Moderate HKH-11C Digital Respiration Wave Sensor Low
Peng et al. (2024) 5 Controlled breathing Yes None Moderate Metronome Moderate
Atif et al. (2022) 5 Non-wearable sensor Yes None Moderate Beddit Moderate
Guo et al. (2023) 11 Controlled breathing Yes None Moderate Metronome Moderate
Gao et al. (2020) 3 Controlled breathing Yes Algorithms Moderate Metronome Moderate
Gui et al. (2023) 15 Wearable device Yes None Moderate Phone attached to the sleeper’s chest recording breathing audio Moderate
Gu et al. (2019) 5 Controlled breathing Yes None Moderate Simulated Patterns Moderate
Guo et al. (2021) 5 Wearable device No None Moderate Accelerometer Moderate
Wang, Yang & Mao (2020c) Not mentioned Wearable sensor Yes Algorithms Moderate Neulog Respiration Monitor Belt logger sensor NUL-236 Moderate
Guan et al. (2023) 5 Wearable sensor Yes Algorithms Moderate HKH-11C Digital Respiration Wave Sensor Moderate
Ali et al. (2023) 5 Non-wearable sensor No None Moderate Xethru X4M200 Respiration sensor Moderate
Zhang et al. (2022) 2 Non-wearable sensor No None Moderate RestOn Smart Sleep Monitor Moderate
Li et al. (2020) 19 Controlled breathing No None Moderate Simulated Patterns Low
Yin et al. (2022) 10 Manual counting No None Moderate Manual Counting Low
Khamis et al. (2020) 1 Non-wearable sensor Yes None Moderate Xethru X2-M200 Respiration Sensor Moderate
Zhuo, Zhong & Zhuo (2022) 11 Controlled breathing No None Moderate Metronome Moderate
DOI: 10.7717/peerj-cs.3375/table-3

Data extraction and analysis

After the final primary studies that met the quality assessment standards were identified, data were extracted to help answer the research questions. These studies were organized into two comparative tables: one focusing on AI-based healthcare monitoring using CSI, and the other on deterministic CSI-based healthcare monitoring systems. In both tables, the first three columns summarize the essential details of each publication.

In both tables, the “Estimation Method” column refers to the fundamental algorithms or techniques used for to evaluate the data. The “Preprocessing Phase” column identifies the first steps that data undergoes before being fed into the system. The approach used to ensure that the credibility of study results is presented under “Validation Strategy.” The “Dataset” column details the source of the data, while the “Results” column summarizes the main conclusions of the research. The “Limitations” column points to the restrictions or difficulties encountered during the study. This table structure facilitates a systematic comparison of various strategies in CSI-based monitoring across various applications.

Review of existing literature

CSI-based healthcare monitoring (AI-based approach)

Tables 4 and 5 present a comparative overview on how researchers have leveraged Wi-Fi CSI to detect and measure crucial physiological parameters such as respiration and heart rates. Table 4 contain articles utilizing AI-based methods and Table 5 includes those employing non-AI techniques. With this division, we can observe the evaluation of CSI-based vital sign monitoring from traditional signal processing approaches to more advanced machine learning algorithms. By examining these studies, we gain insights into the accuracy of non-invasive, contactless health monitoring systems and can compare the results achieved using both non-AI and AI methods.

Table 4:
Comparative review of CSI-based vital sign monitoring studies using AI, including key objectives related to eHealth applications.
Ref. Author, Year Article type eHealth objective Estimation method Preprocessing phase Validation strategy Dataset Results Limitations
Yu et al. (2021) Yu et al., 2021 Journal (IEEE) Sleep stage monitoring in home
environments
CNN-BiLSTM with CRF CSI ratio calculation, MRC-PCA fusion,
Doppler-MUSIC for body
movement detection
Experiments with 12 subjects
over 19 nights
Collected by authors 81.8% accuracy
for four-stage sleep
classification
Only applicable to single person
situations, Wi-Fi devices need to be placed in specific positions, can be influenced
by changes in the environment, and cannot identify specific sleep disorders
Bao et al. (2023) Bao et al., 2023 Journal (IEEE) Contactless respiration monitoring for remote healthcare Support Vector Machine (SVM) Phase flipping detection, antenna and subcarrier selection and discrete wavelet transform (DWT) Five subjects participated in the experiments. The distance between the transmitter and receiver was 2.5 m. Collected by authors The framework reached an average accuracy of around 91.2% Lacks the feature of monitoring the heart rate, does not recognize multiple people, and has a low accuracy in Non-Line of Sight (NLOS)
Khan et al. (2023) Khan et al., 2023 Journal (IEEE) Decision support for patient respiration in hospital settings ML classification and regression processes DC offset removal, smoothing, bandpass filtering, Principal Component Analysis (PCA) K-fold
cross-validation (10-fold)
Collected by authors Fine Tree model: 96.9% accuracy for multi-class, 95.8% for binary classification Single subject data, short data collection period, no real-world testing, and cannot support multiple people monitoring.
Kumar et al. (2022) Kumar et al., 2022 Journal
(SpringerLink)
Activity-aware vital signs tracking in smart homes CNN for breathing rate and heart rate prediction and SVM with various kernels CSI data extraction, complex data separation into real and imaginary parts Experiments with 84 volunteers in four different positions (sleeping, sitting, standing, walking) Collected by authors Best accuracy with CNN and SVM 98.46% for sleeping position It is only applicable in indoor environments, there may be problems with the recognition of multiple subjects, and there is no assessment of the speed of work.
Zhang et al. (2024) Zhang et al., 2024 Journal (ACM) Multi-person respiration monitoring in residential settings PCA, Density-Based Spatial Clustering of Applications with Noise (DBSCAN) clustering, and Autocorrelation PCA-based denoising, and K-Means-based DBSCAN Experiments with 12 participants in various scenarios Collected by authors 98% accuracy at 7 m distance with 5 m Line of Sight (LOS) Affected by body motions, a smaller number of people that can be tracked
simultaneously, and
low sensing sensitivity with some smart devices
Guo et al. (2023) Guo et al., 2023 Journal (ACM) Fine-grained respiration analysis in home/clinical environments Multi-antenna
CSI-Subpopulation Genetic algorithm, and Mixed Cluster
Gaussian-Hidden Markov Model
Data segmentation, interpolation, and subcarrier selection Experiments with multiple participants in various scenarios, and comparing with respiration monitor belt Collected by authors Mean absolute error of respiration rate approximately 0.1 brpm, average UPD around 1.2 It declines with distance, works only inside, and does not consider monitoring of multiple people, or number of participants.
Kontou, Smida & Anagnostou (2024) Kontou et al., 2024 Journal (IEEE) Respiration rate monitoring in small home spaces Principal Component Analysis (PCA) for subcarrier selection, and Artificial Neural Network (ANN) for respiration frequency detection Hampel filter, smoothening filter, and detrending, normalization Experiments at different distances (up to 2 m), compared to ground truth from wearable sensor Collected by authors Average error of 4.7% in
breathing rate estimation compared to FFT method across
20 measurement datasets
The accuracy was only tested for single person and not beyond 2 m, offline processing, not in real time, and the targets were non-moving; accuracy drops as the target moves.
Hu et al. (2023) Hu et al., 2023 Conference article (ACM) Multi-person respiration monitoring in crowded indoor areas Near-field domination effect, Sparse Recovery Algorithm (SRA), and Temporal Convolutional Network (TCN) based autoencoder Data transformation pipeline including segmentation, resampling, transformation, and normalization Experiments with eight subjects for respiration monitoring Collected by authors Respiration monitoring: Mean error < 1 brpm Subjects need to be in the near field of their Wi-Fi devices (within ~0. 2 m), the performance may decrease with the distance between the subject and the device, it can be influenced by changes in the dynamics and layout of the environment, and specific Wi-Fi hardware and software modifications are necessary.
Armenta-Garcia et al. (2021) A.
Armenta-Garcia et al., 2021
Journal (IEEE) Real-time contactless respiration monitoring Support Vector Machines (SVM), K-Nearest Neighbor (K-NN) and Quadratic Discriminant Classifier (QDC) Hampel, Savitzky-Golay, and Band-pass filters 10-fold cross- validation. Participants were asked to control their breath according to a device. Collected by authors Accuracy
K-NN: 99.09%
QDC: 99.65% SVM: 99.21%
The evaluation of the study was done on stationary data instead of using data from different subjects with different body types. Moreover, it lacks heart rate estimation.
Mosleh et al. (2022) S. Mosleh et al., 2022 Journal (IEEE) AI-based respiratory pattern classification Long Short-Term Memory (LSTM) FFT and normalization Confusion Matrix Accuracy, Precision, Recall, Specificity, and F1-score Collected by authors (https://doi.org/10.18434/mds2-2963) Accuracy:
99.97% in
perfect settings
The evaluation was conducted under controlled conditions with significant hardware setup, limiting its applicability in diverse scenarios. Additionally, BreatheSmart categorizes data into a just nine classes. not provide measurements of respiration rate.
Armenta-Garcia et al. (2023) Jesus A. Armenta-Garcia et al., 2023 Journal (ScienceDirect) Multi-parameter estimation for respiration and heart rate Support Vector Machines (SVM), K-Nearest Neighbor (K-NN) and Quadratic Discriminant Classifier (QDC) Hamel, Savitzky-Golay, band-pass, FFT, and DWT Accuracy, Recall, Precision, Specificity, and F1-score Collected by authors (https://github.com/AlbanyArmenta0711/WiFi-CSI-MiningTool) Accuracy: Breathing rate: K-NN: 99.18% QDC: 91.35% SVM: 90.71%, Heart Rate: QDC: 55%
SVM: 55.06%
Breathing rate is categorized to limited number of classes with steps of 2. Heart rate is categorized to three classes only giving high-level overview, but not suitable for clinical or remote monitoring use. In addition, the heart rate estimation yielded very poor results.
Yang et al. (2021) X. Yang et al., 2022 Article (SpringerLink) Sleep apnea detection in clinical and home settings Back-Propagation (BP) neural network to learn and map features (mean, variance, range, quartile) Phase error elimination using linear fitting; noise cancellation with median absolute deviation; wavelet denoising (‘db3’ basis); subcarrier selection based on variance Accuracy, Precision, Recall, Macro F1- score Dataset composed of 450 sets of data for each state Accuracy rate
over 95.6% for detecting sleep apnea.
Does not address multiple-person status. Some confusion between apnea and unmanned states. Lacks user interface for alarms and trigger notifications.
DOI: 10.7717/peerj-cs.3375/table-4
Table 5:
Comparative review of CSI-based vital sign monitoring studies using deterministic approach, including key objectives related to ehealth applications.
Ref. Author, year Article type eHealth objectives Estimation method Preprocessing phase Validation strategy Dataset Results Limitations
Zhang et al. (2023) Zhang et al., 2023 Journal
(ResearchGate)
Breathing and heart rate monitoring during sleep Fast Fourier Transform
(FFT)
Subcarrier selection, and Data denoising using Hampel filter Experiments with 10 volunteers in different sleeping postures Collected by authors 96.887% accuracy for breathing rate 94.708% accuracy for heart rate Cannot retrieve the actual waveform of the heartbeat. The article Discards periods with interfering sleep activities and the study is confined to single person situations
Zhang et al. (2019) Zhang et al., 2019 Journal (ResearchGate) Contactless breath rate estimation across scenarios Fast Fourier Transform
(FFT)
Hardware and software correction for phase distortions Experiments with eight participants in different scenarios Collected by authors Over 99% median accuracy in most scenarios for breath rate estimation Performance declines in NLOS situations and is restricted to a 4 m range because of lost packets, in addition, data is not updated in real time.
Wang, Yang & Mao (2020b) Wang et al.,
2020
Journal
(Google Scholar)
Multi-subject respiratory and HR monitoring in indoor environments Peak Detection
and FFT
Environment detection, data calibration and subcarrier selection Experiments with four
persons over 3 months in various indoor
environments
Collected by authors Median error of 0.25 brpm for breathing rate and 1.19 beats per minute (bpm) for heart rate Performance drops with distance and through-wall conditions; furthermore, the need for a directional antenna for heart rate monitoring increases the system’s complexity and cost.
Liu et al. (2021) Liu et al.,
2021
Journal (Google Scholar) Respiration rate monitoring in indoor spaces Fast Fourier Transform
(FFT)
DBSCAN-based clustering, threshold selection Experiments with eight subjects in various indoor scenarios Collected by authors Average respiration rate error of 0.31 brpm in NLoS scenarios Does not function when there are other human movements, not effective when there are multiple targets, and operates only with single antenna smartphones.
Hussain et al. (2021) Hussain et al., 2021 Journal (IEEE) Monitoring driver/passenger respiration in parked vehicles Peak Detection Conjugate Multiplication, Phase Sanitization, and Butterworth Filtering Experiments in a stationary vehicle with an adult volunteer, and compared to manual breath counting Collected by authors Median percentage errors of 8.5% for phase and 7.4% for amplitude with three antennas Valid only for cases when the vehicle is parked, tested only on adult subjects, needs a specific antenna setup, and the performance depends on the position of the passengers.
Wang, Yang & Wang (2022) Wang et al.,
2022
Journal (IEEE) Accurate respiratory rate estimation using Wi-Fi in labs Autocorrelation
and BNR-based selection
Motion detection, Exponentially Weighted Moving Average (EWMA), CSI ratio calculation Experiments in a laboratory setting, compared with contact respiratory sensor Collected by authors Over 97% accuracy for respiratory rate estimation using CSI ratio model Applicable only to one-person cases, only proved in laboratories, higher complexity of algorithms, may decrease performance with the growth of distance between antennas
Qiu et al. (2022) Qiu et al.,
2022
Journal (IEEE) Breath monitoring in high-dynamic environments Power Delay Profile and Least Square Method Wire Direct
Connection-based denoising and Phase offset compensation
Experiments in lab with multiple scenarios and moving subjects Collected by authors Less than 0.5 brpm detection error in high-dynamic environments Must connect the wires directly to TX/RX pins, applicable only to a stationary target subject, the
performance may decrease in some extreme conditions, the system has been tested only in a laboratory environment.
Xie et al. (2022) Xie et al.,
2022
Journal (IEEE) Lab-based single person respiration tracking at varied distances Principal Component Analysis,
Doppler Frequency Extraction
PCA, Human presence detection, Respiratory signal detection Experiments in lab with different distances and scenarios Collected by authors Less than 0.7 brpm error within 6 m range, velocity measurement error less than 0.4 cm/s Single person only, cannot operate through the wall, may raise privacy issues, never tested in real life settings
Li et al. (2021) Li et al.,
2021
Journal (IEEE) Breathing rate estimation for sleep monitoring validation Power Spectral Density (PSD) detection Amplitude calibration, Phase calibration, and Delay domain filtering Experiments using public dataset with ground truth from polysomnography Collected
by authors
86% of estimation errors below 0.5 brpm for breathing rates between 0.2–0.3 Hz (https://doi.org/10.7910/DVN/X7AYXQ) Applicable only to single-person cases, not verified in actual-time or actual-world scenarios, and needs special Wi-Fi hardware configuration.
Wang et al. (2020) Wang et al.,
2020
Journal (IEEE) Indoor respiratory sensing and people identification An adaptive subcarrier combination method that boosts the Signal-to-Noise Ratio (SNR) of breathing signals Adaptive signal processing techniques to combine subcarriers and links for optimal breathing signal enhancement. Experiments conducted in an indoor lab setting and a car setting, with 12 participants Collected by authors Achieved over 86% accuracy for people counting and 85.78% accuracy for people recognition. The system does not assume a priori how many people or different breathing rates there are, the system is set for static conditions, it would not be very efficient in a moving car because of the vibrations.
Sun, Bian & Li (2024) Sun et al.,
2024
Journal (ResearchGate) Vital sign monitoring in semi-realistic indoor scenarios Discrete Wavelet Transform,
FFT
Down sampling, CSI ratio calculation, and SG filtering Experiments with nine participants in two real-world scenarios (meeting room and dormitory) Collected by authors 96.8% average
accuracy, 0.8 brpm
median error
Distance degrades performance and it only works indoors.
Zhuo et al. (2023) Zhuo et al., 2023 Journal (IEEE) Breath monitoring during sleep in varied positions Projection model, PCA-VMD fusion, and Peak detection CSI ratio extraction, and Savitzky-Golay filtering Experiments with 11 volunteers in six different sleep positions Collected by authors >99% median
accuracy across all
sleep positions
Works only for indoor spaces, problems with several people, and does not consider the question of real-time data processing.
Alzaabi, Arslan & Polydorides (2024) Alzaabi et al., 2024 Journal (IEEE) Laboratory-based respiratory rate monitoring for clinical feasibility Discrete Wavelet Transform, Principal Component Analysis, and Peak detection Interpolation, down sampling, and Discrete Wavelet Transform filtering Experiments with one participant Collected by authors Limits of agreement of [−1.29, 1.06] BrPM with −0.11 BrPM bias for 60-s window (doi.org/10.21227/d7d1-sg82) Restricted to a laboratory setting, only one participant, the subject is quasi-stationary and does not consider real-time processing.
Zeng et al. (2019) Zeng et al.,
2019
Journal (ACM) Extended-range non-contact respiration monitoring PCA for subcarrier selection, and
autocorrelation for respiration rate estimation
CSI ratio calculation, and Savitzky-Golay filter for smoothing Experiments with 12 participants in various scenarios Collected by authors Sensing range increased to 8–9 m with 98% detection rate Through-wall-twice: some difficulties are not solved, and multi-person sensing not addressed
Wang & Liu (2024) Wang et al.,
2024
Conference article (ACM) Short-range respiration monitoring with sensitivity to breathing strength Discrete Fourier Transform (DFT), and Fast Fourier Transform (FFT) Down sampling, and mean subtraction Experiments with multiple participants at different distances and breathing intensities Collected by authors Best accuracy at 10 cm distance; accuracy decreases with weak breathing Highest accuracy at 10 cm range; decreases in the case of weak breathing; lacks provisions for monitoring multiple people, or the number of subjects.
Zeng et al. (2020) Zeng et al.,
2020
Journal (ACM) Multi-person abnormal respiration event detection Blind Source Separation (BSS) and Independent Component Analysis (ICA) Savitzky-Golay filter to smooth the respiration signals and Cross-multiply and DACM algorithm to estimate the rotation direction of the respiration signals. Recruiting 21 participants with a range of natural respiration rates, including some with similar rates Collected by authors Achieves a mean absolute respiration rate error of only 0.73 brpm even with four people present The ICA method requires the number of persons as an input, but ICA cannot generally identify the actual number of source signals and While MultiSense can detect abnormal respiration events, it cannot identify which respiration pattern belongs to which person.
Dou & Huan (2021) Chendan Dou and Hao Huan, 2021 Journal (ResearchGate) Non-contact respiratory rate detection across different home environments Doppler spectral energy extraction, and peak detection for respiration rate Random phase elimination, multipath decomposition, and STFT Experiments in three indoor environments (living room, bedroom, balcony) Collected by authors Mean estimation errors of 0.16, 0.15, 0.14 brpm in three scenarios: over 90% of errors <0.4 brpm Tested only one person use case, only works indoors, and does not support any real time functionality.
Wang & Sun (2022) Yi Wang and Jianan Sun, 2022 Conference article (IEEE) Single-subject respiratory monitoring with improved signal modeling CSI ratio model,
FFT, and autocorrelation
Random phase, and offset elimination using CSI ratio Experiments at different distances with 30 tests at each distance Collected by authors Improved accuracy from 99.2% to 99.7% compared to original CSI ratio model Higher computational cost and time and was tested only for one person and not for multiple people.
Peng et al. (2024) Peng et al.,
2024
Journal (IEEE) Orientation-aware respiration monitoring in hall and lab conditions Dominant Frequency-to-Amplitude Ratio (DFAR) for subcarrier selection, and Zero-Peak (ZP) detection CSI ratio operation, and DC bias removal Experiments in two environments (hall and laboratory) with five subjects at different orientations and distances Collected by authors Error less than 0.7 brpm for body orientations from 0 to 180°, nearly double the angle range compared to Fresnel zone model Suitable for only one person environment, maximum sensing distance is up to 3. 5 m, performance decreases in through-wall scenarios more than 1 m, sensitive to large-scale people motion in the environment, and needs certain type of antennas
Atif et al. (2022) Atif et al.,
2022
Journal (ResearchGate) Indoor respiration tracking across diverse real-life scenarios Power Spectral Density (PSD) analysis after subcarrier selection and fusion Hampel filter for outlier removal, and bandpass filtering Experiments with five participants in different scenarios Collected by authors Maximum accuracy of 99.6% and minimum accuracy of 91.81% across different scenarios Valid only for single subject cases, decreases with the distance between the subject and the devices, depends on the type of antennas used, can be influenced by the surrounding environment and multipath interference, and has only been tested in indoor conditions.
Guo et al. (2023) Guo et al.,
2023
Journal (IEEE) Respiration monitoring and human identification using CSI in controlled indoor scenarios CSI ratio model, Discrete Wavelet Transform (DWT), and fake peak removal method Data interpolation, Savitzky-Golay filter, down sampling, and Hampel filter Experiments with 11 participants in three scenarios Collected by authors Average absolute error of 0.1 brpm for respiration rate, 97.52% average accuracy for human identification across 11 users Accuracy declines after intensive movements, depends on the position of the user relative to the devices, needs 4–6 breaths to identify the user, and is affected by the presence of other people with 1 m distance required, and the experiment was not conducted in various real-life scenarios.
Gao et al. (2020) Gao et al., 2023 Journal (IEEE) Multi-person respiration monitoring using CSI-based Angle of Arrival (AoA) estimation Two-dimensional Doppler-Angle of Arrival (AoA) map construction using super resolution method, followed by DBSCAN clustering CSI calibration Experiments with 1–3 persons, and, compared to metronome-guided breathing Collected by authors Accuracy of 98.8%, 98.4%, and 97.5% for 1, 2, and 3 persons respectively It also does not take into consideration a person’s ability to move closer or further apart from each other and how this affects the respiration rates; the ability to separate close angles depends on data, time, length, and the number of antennas.
Gui et al. (2023) L. Gui et al., 2023 Journal (IEEE) Sleep position-based respiration tracking in smart bedroom environments Peak-to-Peak interval Hampel and Butterworth low-pass filters Tested in controlled environment with different sleeping positions No dataset 0.13 and 0.16 brpm for supine and prone,
respectively
The article does not take security into consideration and does not provide real-time data visualization graphs. Moreover, it lacks heart rate estimation. It takes only one subcarrier and does not utilize phase.
Gu et al. (2019) Y. Gu et al., 2019 Conference article (IEEE) Breathing and heart rate monitoring during sleep using non-invasive CSI sensing FFT-based method Hampel, Butterworth band-pass, and FFT filters and Frensel diffraction-based setup Tested with five participants and different sleeping positions No dataset - Breathing: AE:
0.575 brpm
- Accuracy: 96.636%
- Heart: AE: 3.9 bpm
- Accuracy: 94.215%
The results varied slightly based on the participant body type. AI is not used to enhance the accuracy of the breathing and heart rate estimations. In addition, it only takes amplitude from one subcarrier only, limiting the amount of information gathered from CSI.
Guo et al. (2021) L. Guo et al., 2021 Journal (IEEE) Abnormal breathing rate detection in home-like environment Number of Peaks Conjugate multiplication, Hampel filter, FFT, and db4 wavelet decomposition Experiments conducted on five volunteers with smartphone acetometer used to validate the system results No Dataset Average correlation of 80 and accuracy of 91.2% The study focuses more on detecting whether the breathing rate is abnormal or not rather than estimating specific value. In addition, the proposed system does not work in real-time and does not take heart rate into consideration
Wang, Yang & Mao (2020c) X. Wang et al., 2020 Journal (IEEE) Non-invasive respiration monitoring using CSI for static settings (e.g., home, car) Peak-to-Peak interval EWMA and Moving Average filter Many experiments were conducted investigating different setups. NEULOG Respiration Sensor is used to get ground truth respiration rate No dataset Accuracy:
91.54%
Median Error:
0.25 brpm
The study does not test the system against different participants, which may affect the system performance with the body change. Moreover, the study lacks estimating heart rate and vital signs visualization.
Guan et al. (2023) L. Guan et al., 2023 Journal (IEEE) Multi-subject respiration tracking using CSI-based clustering FFT-based method FFT Density-Based Spatial Clustering Experiments conducted with five volunteers, using piezoelectric respiratory belt (HKH-11C) as respiration rate ground truth Collected by authors Accuracy: 97.04% The study requires extra hardware to make use of four antennas. Moreover, the accuracy of the system drops significantly as the distance between the targets and transceivers increases.
Ali et al. (2023) K. Ali et al., 2023 Journal (IEEE) Overnight respiration monitoring during sleep in residential environments Number of Peaks Butterworth Low-pass filter PCA
Band-pass filter Median filter Exponential moving average filter Density-Based Spatial Clustering
More than 550 h of experimentation data from five users. Xethru X4M200 Breath sensors was used as the ground truth respiration rate No Dataset Median error:
1.19 brpm
Through experimentation it appeared that the activities done by other people while the user is sleeping affects the system performance. Furthermore, the error rates vary significantly throughout different sleeping postures and distance from the device. Additionally, subtle movements such as user turning over while sleeping would cause outages on the system where it is not able to detect any breathing movement.
Zhang et al. (2022) R. Zhang et al., 2022 Journal (IEEE) Simultaneous breathing and heart rate detection in a simplified smart health monitoring setup FFT-based method Hampel, threshold-based segmentation, Discrete Wavelet Transform, and FFT Experiments were conducted with different setups and two participants, one male and one female No Dataset Breathing rate:
Accuracy: 96.1%

The proposed system is not real-time; thus the heavy preprocessing might cause some issues when deploying the system to monitor in real time. Experiments were only conducted on two participants, so the system was not tested thoroughly.
Peak-to-Peak interval The ground truth data was extracted from RestOn Smart Sleep Monitor Heart Rate:
Accuracy: 94.3%
Li et al. (2020) Fangyu Li et al., 2020 Journal (ScienceDirect) COVID-19 symptom detection through respiratory tracking in home-isolation settings Using Principal Component Analysis (PCA) to extract the respiration component, then applies a high-resolution spectrogram to estimate respiration rate A Hampel filter is used to remove outliers and a bandpass filter to suppress noise Following self-quarantine guidelines for COVID-19 patients, to evaluate the shortness of breath indicators, the participant pretends to have a shortness of breath between 20 and 22 s. Collected by authors Not much testing was done, authors just mention that preliminary results were satisfying Training is on a small dataset and vital sign monitoring is limited to RR only.
Yin et al. (2022) Y. Yin et al, 2021 Journal (IEEE) Real-time respiration tracking using smartphone and Wi-Fi signals in smart home applications DWT was used to select subcarriers relevant to respiration activity, and FTT for respiration rate calculation Hampel and moving average filters for noise reduction, and a bandpass filter to extract subcarriers with rich respiratory information. Each participant was asked to breath for 10 min. For ground truth data, one observer visually counted the number of breaths. Collected by authors An error rate of 0.34 breaths per minute was recorded The system faces challenges such as the “dead zone” issue, where sensing performance can degrade at certain locations
Khamis et al. (2020) Abdelwahed Khamis et al., 2022 Journal (ScienceDirect) Hybrid Wi-Fi and UWB-based respiratory cycle and inhalation/exhalation monitoring Processing techniques include mapping PD to chest displacement and a novel subcarrier filtering and selection method Hampel filter for outlier removal, moving average filter for high-frequency noise reduction and Savitzky-Golay filter to smoothen the data A real-world setup captured data from a single subject using Wi-Fi and UWB radar for ground truth. Breathing sessions lasted 10-15 min with specific exercises. The system’s performance was evaluated by timing errors in breath cycle estimation and inhalation/exhalation ratio accuracy. Collected by authors The system achieved a median timing error of less than 0.25 s Small datasets which limit generalizability & the system requires calibration which involves a pre-defined breathing sequence to adapt the system to individual users and environmental conditions
Zhuo, Zhong & Zhuo (2022) H. Zhuo et al., 2022 Conference article (IEEE) Accurate respiration estimation during sleep in controlled multi-position scenarios Peak-to-Peak interval Savitzky-Golay filtering for noise reduction; Hampel filter for DC component removal; Outlier removal; Wavelet denoising; Subcarrier selection based on variance Experimental validation in different sleeping positions using a metronome as ground truth; Comparison of different weights for periodicity and variance. Data collected from 11 people in various sleeping positions over 3 months Achieved 97.25% accuracy for respiration rate estimation. Effective in various sleeping positions Limited to single-person scenarios.
Does not include a user interface for real-life alerts.
Limited to only respiration rate estimation.
DOI: 10.7717/peerj-cs.3375/table-5

Armenta-Garcia et al. (2021) proposed BReML, a system that combines Fast Fourier Transform (FFT) method for estimating breathing rate with machine learning and compares three models: Support Vector Machines (SVM), K-Nearest Neighbor (K-NN), and Quadratic Discriminant Classifier (QDC). BReML filters CSI amplitude using Hampel, Savitzky Golay (SG), and Butterworth band-pass filters. Rather than using all subcarriers, it selects the 10 most sensitive ones to reduce computational power. FFT is then applied to extract additional features for the AI models, including an initial estimation of the breathing rate. Then Discrete Wavelet Transform (DWT) is also applied to generate extra features, which are fed into multiple classification models for comparison. The classification consists of five classes covering 12 to 20 breaths per minute (brpm) in steps of two. The proposed system achieved high accuracy, with 99.09%, 99.21%, and 99.65% for K-NN, SVM, and QDC, respectively. However, the system was not tested on subjects outside the training data and did not estimate the heart rate.

To extend the research in Armenta-Garcia et al. (2021) and estimate both breathing and heart rate, Armenta-Garcia et al. (2023) proposed a system following similar concept of Armenta-Garcia et al. (2021). However, they collected more data to train the system for heart and breathing rate estimation and made both the dataset and the system publicly available for further research. As for the class division, they divided respiration rate into five classes: <=10, 11–13, 14–16, 17–19, and >=20 with only three classes for heart rate: <= 60, 61–79, and >= 80. Nevertheless, the proposed system achieved an accuracy of 99.18%, 90.71%, and 91.35% using K-NN, SVM, and QDC, respectively, for breathing rate. On the other hand, the system did not perform well when estimating the heart rate with accuracy of 56.55%, 55.06%, and 55% for K-NN, SVM, and QDC, respectively. Additionally, the heart rate classification used only three broad categories with wide ranges, making it unsuitable for continuous health monitoring use.

Mosleh et al. (2022) explored employing two off-the-shelf routers in their experimental setup. They adopted a machine learning approach by introducing an innovative algorithm, BreatheSmart, which comprises seven layers that leverage the LSTM network. BreatheSmart processes both the phase and amplitude information of each subcarrier and subsequently assigns the data to one of the predefined respiration pattern classes. Data are acquired from three transmitting and three receiving antennas, utilizing all 56 subcarriers within the 2.4 GHz network spectrum. The results revealed BreatheSmart’s ability to correctly predict respiration pattern classification with up to 99.97% accuracy. However, the evaluation was conducted under controlled conditions, with significant hardware setup, limiting its applicability in diverse scenarios. Additionally, BreatheSmart does not provide specific measurements for respiration rate, because it categorizes data into a limited number of nine distinct classes.

To summarize, the articles in Table 4 all leverage AI classifiers to assign respiration rate to one of the predefined classes. Reference (Mosleh et al., 2022) stands out with an impressive accuracy of 99.97% using an LSTM-based model. Unlike other studies, it uses both CSI amplitude and phase for estimating the breathing rate, rather than relying on one only. In addition, it is the only study that does not use conventional filters, such as Hampel filter, to remove noise and outliers, relying only on FFT. However, it is noteworthy that the system was evaluated in a controlled environment with a breathing simulator, which yields constant patterns unlike human respiration. Moreover, the nine classes had varying steps, with some classes much further apart and no intermediate class. Both Armenta-Garcia et al. (2021) and Armenta-Garcia et al. (2023) used Hampel, Savitzky Golay, and band-pass filters and experimented with SVM, K-NN, and QDC. Despite this, Armenta-Garcia et al. (2021) achieved higher accuracy of 99.65% with QDC. Nevertheless, the study did not address heart rate estimation, whereas (Armenta-Garcia et al., 2023) attempted to estimate pulse rate but achieved poor results, with only 55.65% accuracy using K-NN.

Wi-Fi signal-based monitoring systems using a deterministic approach

Guan et al. (2023) advances takes breathing estimation in proposing a new method to estimate multi-person breathing rates simultaneously. The proposed system utilizes the variance of CSI amplitude to determine the most sensitive subcarrier across different antennas. FFT is then applied on the CSI phase to produce a frequency spectrum. To distinguish between different individuals, the study utilizes Density-Based Spatial Clustering to separate each person’s data, so that each cluster represents one individual. Thereby, the system finds the breathing rate of each person by identifying the maximum frequency within each cluster. The system demonstrated good performance, with a 97.04% accuracy; however, the accuracy drops significantly as the distance between the targets and transceivers increases. In addition, the study did not consider two individuals with the same breathing rate, so it categorizes them as one person.

Gui et al. (2023) explored a setup in which a router served as the TX and a PC equipped with three antennas functioned as the RX. They introduced a system designed to monitor respiration rate and detect sleep-related activities utilizing Wi-Fi CSI data. The system uses the peak-to-peak interval method to calculate respiration rates in the absence of movement, deriving the rate as 60 divided by the average of peak-to-peak intervals. To enhance accuracy, Gui et al. (2023) proposed a new dynamic subcarrier selection method and peak identification technique. Upon detecting an activity, the system suspends respiration rate monitoring to identify specific activities such as lying on the spine, prone, or in lateral positions. Upon settling into a new position, the system recalibrates by repeating the subcarrier selection process to ensure optimal selection for each posture. Experimental results demonstrated promising performance, with estimation errors of 0.13 and 0.16 breaths per minute for supine and prone sleeping postures, respectively. However, while the proposed system exhibited robust performance, it did not operate in real time and did consider multiple persons.

Guo et al. (2021) proposed a system for emergency detection based on three vectors: position, behavior, and breathing condition. Position refers to the patient’s location within the household. Behavior encompasses the patient’s movements, such as falling. Breathing indicates whether respiration is abnormal, including apnea, dyspnea, or tachypnea. Position and behavior data are derived from keypoint maps generated via a neural network algorithm trained using camera data sourced from the OpenPose library. Breathing condition, notably apnea detection, is determined using filtered CSI data via the DWT method. The system leverages both phase and amplitude components of the CSI data, as they offer complementary information. To construct the keypoint map, data are collected from two receivers positioned in different locations to ensure comprehensive coverage. Patient movement is required for accurate keypoint map generation, as the method relies on the dynamic component of the CSI. In tests involving the same volunteer and within the same room, the system achieved impressive results, accurately matching 80.7% of points exactly to the original keypoint maps, with 99.3% showing close alignment. Furthermore, apnea detection accuracy reached 91.2%. However, when tested with different volunteers who were not part of the neural network training or when obstructed by a wall, accuracy dropped significantly.

Overall, the articles in Table 5 relied on manual estimation of vital signs, implemented in one of three ways. First, the FFT method, as followed by Gu et al. (2019), Guan et al. (2023), Zhang et al. (2022), selects the frequency corresponding to the highest peak within the normal respiration or heart rate range. Second, peak counting over a specific interval, used by Guo et al. (2021), Ali et al. (2023), relies on identifying inhalation and exhalation phases, with each peak representing one breathing cycle. Third, the peak-to-peak interval method, conceptually similar to peak counting, calculates the average interval between consecutive peaks (time from one breath to the next). Dividing 60 by this average interval yields the number of breaths per minute. This method, followed by Gui et al. (2023), Wang, Yang & Mao (2020c), Zhang et al. (2022), has achieved the most success in estimating breathing rate; for instance, Gui et al. (2023) reported an average estimation error of 0.13 breaths per minute (brpm) in the supine position.

In addition to using common filters such as Hampel and Butterworth low-pass filters, Gui et al. (2023) also proposed a new subcarrier selection method based on the relative difference using FFT. They also introduced additional steps to remove false peaks. However, the system did not estimate heart rate or operate in real time, limiting its efficiency for real life use. Furthermore, Gui et al. (2023) did not account for multiple people breathing in the same room, unlike (Guan et al., 2023), which applied a clustering algorithm after FFT to group data corresponding to each individual. Although (Guan et al., 2023) achieved high accuracy of 97.04%, the system could misclassify multiple people with the same breathing rates as one person.

Regarding heart rate estimation, both Gu et al. (2019) and Zhang et al. (2022) went for FFT-based method for estimating the heart rate. Both studies used Hampel filter and achieved an accuracy of 94% even though (Gu et al., 2019) used Butterworth band-pass filter and only amplitude while (Zhang et al., 2022) used DWT and considered both phase and amplitude. Still, Zhang et al. (2022) managed to achieve a slightly better result of 94.3%.

Discussion

While preparing this literature review, no standardized performance metrics were reported by all research articles, as highlighted in Table 2. Less than half of the AI-based articles included recall, precision, and F1-score, while the rest did not. Besides four non-AI articles that reported both accuracy and MAE, half of the others reported only accuracy, and the other half reported only MAE. This poses a major challenge for proper comparative analysis. In this article, we propose that the gold standard for performance metrics should be accuracy and MAE for all articles, with recall, precision, and F1-score included for AI-based research. To address this challenge, we selected accuracy as the primary performance metric for all articles, given its prevalence, and considered MAE as the secondary performance metric when accuracy was not reported. It is also worth noting that all articles discussed vital signs either in stable conditions, where heart rate and breathing rate are not changing rapidly, or in cases of apnea detection, addressing specific patterns of sleep apnea.

Principal findings

Across the included studies, approximately 70% were classified as having moderate quality, primarily due to testing with a sufficient sample size and using reliable methods to obtain ground truth, such as the Neulog Respiration Monitor Belt logger sensor (NUL-236), rather than relying on error-prone methods like manual counting. High-quality studies also increased reproducibility by using public datasets, sharing their dataset or code, and comparing their systems’ performance to previous studies. In contrast, low-quality studies typically did not compare their results to previous studies, did not report their sample size or used minimal samples, or employed less accurate methods for measuring ground truth. None of the articles attempted to validate performance against publicly available datasets. Non-AI studies more often compared their system performance with older studies; over 55% of non-AI articles did so, compared to only 25% of AI articles. Out of the 45 reviewed articles, only four shared their dataset, and just one shared the code as well. More than 75% of the studies did not disclose algorithms, code, or data, highlighting the need for stronger community standards around openness in CSI-based vital signs monitoring.

As presented in Table 2, the taxonomy of studies in CSI-based vital signs sensing reveals key trends and observations that highlight the current state of the field and provide insight for future of research in this field. In terms of the sensing techniques, using both amplitude and phase is the most common approach, especially in AI-based studies monitoring respiration rate, as it provides a holistic view and enables AI to analyze complex patterns in both amplitude and phase simultaneously. Deterministic approaches more commonly use amplitude alone for estimating heart rate and respiration rate and phase achieved similar accuracies of 94.5% and 94.3%, respectively. Conversely, an AI-based study using both amplitude and phase showed higher accuracy (98.5%) for heart rate estimation. Other applications, such as predicting sleep stage and sleep apnea, also utilized both amplitude and phase. Studies predicting heart rate and breathing rate with AI only investigated a single-person context, whereas deterministic studies researched various contexts, including multi-person, single-person, and single-person in real time. Regarding reported metrics, only four deterministic articles reported both MAE and accuracy; the remainder reported either MAE or Accuracy only. Among AI studies, only one did not report accuracy, while 41.66% reported accuracy, recall, precision, and F1-score, with the rest reporting either accuracy, MAE, or both. Figure 2 displays a graphical summary of the categorization of studies.

Taxonomy radar plot for the reviewed studies.

Figure 2: Taxonomy radar plot for the reviewed studies.

Our literature survey on AI-based health monitoring systems using Wi-Fi signal shows considerable improvements in accuracy, real-time operation, and multi-person monitoring. Heart rate estimation has achieved accuracies of up to 96% (Sun, Bian & Li, 2024), while breathing rate estimation has reached very high precision, with errors of 0.13 brpm (Gui et al., 2023). Multi-person tracking systems have successfully monitored up to four people simultaneously, achieving accuracies above 86% (Zeng et al., 2019). Subsequent developments in AI approaches, including deep learning models (Yu et al., 2021; Kumar et al., 2022; Mosleh et al., 2022), have further fine-tuned overall system performance, enabling highly sophisticated health assessments and increasing robustness across diverse environments. These advancements pave the way for less invasive, more user-friendly monitoring systems, with the possibility for use in sleep monitoring (Yu et al., 2021) and early detection of respiratory problems (Yang et al., 2021). Taken together, these findings suggest an optimistic trend toward developing efficient, individualized health monitoring systems that could radically impact both hospital and outpatient care.

With these results in mind, we present a comprehensive summary of the studies leveraging AI techniques for CSI-based vital signs monitoring. Table 6 offers an in-depth analysis of the AI algorithms employed, the results achieved, the datasets used, and the unique contribution of each study. The table highlights the diverse approaches applied to vital signs monitoring using CSI, with the “Contribution” column showcasing novel aspects such as new methodologies, multi-person tracking, and real-time capabilities. This detailed overview helps us better understand the progress made in applying AI to Wi-Fi-based vital signs monitoring and identifies areas for future research. Figure 3 illustrates the most commonly used AI models in Wi-Fi vital signs monitoring.

Table 6:
Summary of AI algorithms applied in CSI-based vital sign monitoring.
Ref. AI algorithms Results Dataset Contribution
Yu et al. (2021) CNN-BiLSTM with CFR Accuracy 81.8% Dataset collected by authors, 12 subjects Implemented a Muti-stage sleep monitoring classification with real-time capability
Bao et al. (2023) Support Vector Machine (SVM) Accuracy 91.2% Collected by authors, five subjects Developed a remote healthcare application for real-time respiration monitoring
Kumar et al. (2022) CNN, SVM Accuracy 98.46% Collected by authors, 84 subjects Built a multi-position health monitoring for sleeping, sitting, standing and walking
Zhang et al. (2024) PCA, DBSCAN Clustering, Autocorrelation Accuracy 98% Collected by authors, 12 subjects Developed a multi-person respiration monitoring system using a single antenna
Guo et al. (2023) Multi-antenna CSI-Subpopulation Genetic algorithm, Mixed Cluster Gaussian-Hidden Markov Model Mean absolute error of 0.1 brpm Collected by authors Real-time respiration monitoring with multi-antennas
Kontou, Smida & Anagnostou (2024) Artificial Neural Network (ANN) Average error of 4.7% Collected by authors ANN-based respiration frequency detection method
Hu et al. (2023) Temporal Convolutional Network (TCN) Mean error of <1 brpm Collected by authors, eight subjects Multi-person respiration monitoring system
Armenta-Garcia et al. (2021) SVM, K-NN, Quadratic Discriminant Classifier Accuracy SVM 99.21% K-NN 99.09% QDC 99.65% Collected by authors Compared multiple ML algorithms for breathing rate estimation
Mosleh et al. (2022) Long Short-Term Memory (LSTM) Accuracy 99.97% Collected by authors (https://doi.org/10.18434/mds2-2963) Respiration monitoring using LSTM
Armenta-Garcia et al. (2023) SVM, K-NN, Quadratic Discriminant Classifier K-NN accuracy of 99.18% for breathing and 56.55% for heart rate Collected by authors, 17 subjects (https://github.com/AlbanyArmenta0711/WiFi-CSI-MiningTool) Classification of both heart and respiration rates
Yang et al. (2021) Back-propagation (BP) neural network Accuracy 95.6% Collected by authors, 450 sets for each state Developed sleep apnea detection system
DOI: 10.7717/peerj-cs.3375/table-6
Widely applied AI models in Wi-Fi vital signs monitoring research.

Figure 3: Widely applied AI models in Wi-Fi vital signs monitoring research.

Table 7 presents an AI technique comparison framework for Wi-Fi-based health monitoring, summarizing all models from the surveyed articles and highlighting their respective strengths and weaknesses.

Table 7:
AI technique comparison framework for Wi-Fi-based health monitoring.
AI Model Accuracy Computational efficiency Scalability Real-time suitability Multi-person support Strengths Weaknesses
CNN (Convolutional Neural Network) High (>95%) in activity recognition and vital sign detection Moderate to high as it requires GPUs for complex tasks Scalable for large datasets but requires hardware Limited due to high
computational demands
Limited without further optimization Excellent for image-like data and feature extraction Expensive and
may overfit without large datasets
LSTM
(Long Short-Term Memory)
High for sequential data like breathing and heart rate (95–99%) Low to moderate as it depends on model complexity Limited for large user environments due to memory usage Suitable for near real-time but latency can occur Limited without multi-user-specific adaptation Effective for sequential time-series data High memory consumption, struggles in multi-user environments
SVM
(Support Vector Machine)
Moderate (85–95%) in simpler tasks like binary classification High (lightweight, even for large datasets) Scalable for small to medium datasets Suitable for real-time processing Poorly designed and primarily for single-user cases Simple, easy to implement, efficient for small datasets Performance declines with increasing dataset size and complexity
RF
(Random Forest)
Moderate to high (90–95%) depending on features used Moderate (more efficient than deep learning models) Good for medium datasets, not ideal for very large datasets Generally suitable for real-time applications Limited by design to single or basic multi-user environments Handles missing data well, robust for diverse input Inefficient for extremely large or complex datasets
KNN (K-Nearest Neighbors) Moderate (80–90%) High for small datasets, low for large datasets Poor scalability with larger datasets High latency for large datasets Designed for single-user environments Simple to implement, no training phase needed Not suitable for high-dimensional datasets or multi-user setups
DOI: 10.7717/peerj-cs.3375/table-7

To synthesize the reported performance across different use cases, Table 8 summarizes the minimum, maximum, and average values of the reported metrics for each use case. Heart rate estimation had the lowest accuracy when first explored through CSI, at 56.55%. However, more recent studies have achieved accuracies up to 94.71%, with an average of 84.22% across five studies. Respiration rate detection demonstrated better overall performance, with a minimum accuracy of 78.6%, a maximum of 99.65%, and the highest average among all use cases at 95.7%. Fewer studies investigated sleep apnea, sleep stage, and respiration cycle detection, reporting average accuracies of 93.4%, 81.8%, and a MAE of 0.27 s, respectively.

Table 8:
Summary of performance metrics (Minimum, Maximum, and Average) reported across the different use cases.
Metrics reported No. of studies Minimum Maximum Average
Heart rate
Accuracy 5 56.55 94.71 84.22
F1-score 1 30.91 30.91 30.91
MAE 2 1.19 3.9 2.55
Precision 1 40.28 40.28 40.28
Recall 1 35.91 35.91 35.91
Specificity 1 68.03 68.03 68.03
Respiration rate
Accuracy 25 78.6 99.65 95.70
F1-score 3 76.82 99.11 91.29
MAE 20 0.1 1.19 0.50
Precision 3 97.96 99.59 98.94
Recall 3 97.95 99.28 98.63
Specificity 3 99.49 99.97 99.79
Respiratory cycle
MAE (second) 2 0.25 0.28 0.27
Sleep Apnea
Accuracy 2 91.2 95.6 93.40
F1-score 1 98 98 98.00
Precision 1 99.33 99.33 99.33
Recall 1 99.11 99.11 99.11
Sleep stage
Accuracy 1 81.8 81.8 81.80
F1-score 1 83.5 83.5 83.50
Precision 1 83.8 83.8 83.80
Recall 1 83.2 83.2 83.20
DOI: 10.7717/peerj-cs.3375/table-8

To visually illustrate the relationship between healthcare use cases and the evaluation metrics reported in the studies, we created a snake diagram connecting the use cases to their corresponding metrics, as shown in Fig. 4. Studies monitoring respiration rate are strongly associated with accuracy and MAE, with 25 and 20 studies reporting these metrics, respectively. However, metrics such as precision, recall, and specificity appear less frequently. For heart rate estimation, accuracy remains the most commonly reported metric, while MAE and the other matrixes are less frequently included. Among the smaller use cases, articles reporting respiratory cycles are unique in reporting only MAE in seconds, whereas the remaining use cases report accuracy, F1-score, Precision, and Recall.

Evaluation metrics reported based on the use cases.

Figure 4: Evaluation metrics reported based on the use cases.

Comparison to prior work

Several recent studies have examined Wi-Fi-based sensing for healthcare applications, but they differ significantly from this work in scope, methodology, and focus. For instance, Soto et al. (2022) present a survey on vital signs monitoring using Wi-Fi CSI, reviewing approximately 34 articles (not explicitly listed) without following a systematic protocol such as PRISMA. Their inclusion criteria and article selection process are not clearly defined, and the survey does not cover articles that use AI approaches to enhance estimation. Similarly, Miao et al. (2025) introduce a general overview of Wi-Fi sensing approaches for human activity recognition. Their review is not PRISMA-compliant, includes no database search methodology, and focuses more on wireless signal processing and sensing theory than healthcare-specific applications. Ahmad, Ullah & Choi (2024) offer a narrative explanation of contactless Wi-Fi sensing in healthcare, listing emerging trends and application scenarios rather than systematically reviewing recent methodologies or implementation results. Their work does not distinguish between CSI-based and RSS/radar-based systems, nor does it classify studies by performance, signal type, or clinical feasibility.

In contrast, our systematic review strictly follows PRISMA and Kitchenham standards, with clearly defined inclusion and exclusion criteria, screening procedures, and documented reasons for study inclusion. We conducted a systematic multi-database search (IEEE Xplore, ScienceDirect, SpringerLink, ACM Digital Library, ResearchGate, and Google Scholar), downloading 8,573 articles, from which 45 primary studies were selected following rigorous screening. This review focuses on CSI-based heart and respiratory rate monitoring in particular, contrasting AI-based and non-AI-based methods while emphasizing their reliability, performance, and feasibility for clinical or home settings. The included studies, published between 2019 to 2024, provide a contemporary synthesis of the field. Additionally, we present structured tabular summaries, performance metrics, setup requirements, and limitations, allowing healthcare professionals and engineers to interpret and apply the findings. This level of systematic depth and clinically relevant focus is currently absent in existing literature, highlighting the novelty and value of our work. Table 9 provides a comparison between our systematic review and prior works.

Table 9:
Benchmark with prior work.
Aspect Soto et al. (2022) Miao et al. (2025) Ahmad, Ullah & Choi (2024) This review
Review type Narrative survey Narrative survey Narrative survey Systematic review
PRISMA followed No No No Yes (PRISMA + Kitchenham)
Focus CSI-based vital signs monitoring (esp. respiration & HR) Wi-Fi-based Human Activity Recognition (HAR) Wi-Fi-based sensing using deep learning for human activity recognition and different healthcare applications CSI-based vital signs monitoring specifically respiration & heart rate estimation. Examines both (AI + non-AI) based approaches.
Years covered 2014–2022 Not mentioned 2016–2022 2019–2024
Databases Not specified Not specified Not explicitly stated (general literature review across academic sources like IEEE, ACM, and Elsevier). 6 (IEEE, ACM, Springer, ScienceDirect, etc.)
Study selection method Not clearly stated Not clearly stated Not clearly stated Transparent, structured screening & inclusion
# of Papers included Not explicitly mentioned Not explicitly mentioned Not explicitly mentioned 45 (post-screening)
Limitation/novelty Overview of respiration/HR estimation but lack covering articles that use AI approaches to aid their estimation process. Broad survey on HAR but not vital signs estimation. CSI monitoring in various applications, but predominantly on human activity recognition (HAR) rather than vital signs monitoring. First to conduct structured PRISMA-based review specifically comparing AI & non-AI CSI-based vital signs monitoring systems with performance, hardware, and deployment insights
DOI: 10.7717/peerj-cs.3375/table-9

Methodological limitations and reproducibility concerns

A number of methodological concerns were identified across surveyed literature, which restrict the ability to compare, reproduce, and reliably interpret results in Wi-Fi-based health monitoring studies.

  • Statistical significance and robustness

    Only a small proportion of studies reported formal measures of statistical significance, such as p-values, confidence intervals, or variance across repeated runs. In most cases, results were presented as single-point performance data without any statistical analysis, making it difficult to determine whether observed differences are meaningful or a result of chance.

  • Replication and external validation

    Reproducibility is a major issue. As Table 3 shows, only 10 studies provided publicly available code, algorithms, or datasets, while 35 studies shared no implementation resources. Even fewer studies tested their models on independent, external datasets, relying instead on in-house or proprietary datasets that were not released publicly. This lack of transparency prevents independent validation and fair comparisons within the filed. Consequently, our search did not include clinical databases such as PubMed or Embase, which may contain reports of early-stage trials or clinical evaluations. This omission limits our ability to assess the clinical readiness or real-world deployments of CSI-based sensing.

  • Unpredictability of evaluation methods

    As Tables 2 and 3 show, the approaches used to measure system performance vary widely across studies.

  • Metrics: Accuracy was the most frequently reported metric (29 studies), followed by MAE in 23 studies coming in second. Five studies reported F1-score as well as precision and recall as primary metrics. Figure 5 illustrates the frequency of these reported metrics. A clear trend emerges: non-AI studies tend to report accuracy and MAE, while AI-based studies tend to report accuracy and F1-score, and sometimes precision and recall, to account for class imbalance and better reflect overall model performance.

  • Validation methods: Seven studies used k-fold cross-validation or similar resampling methods, whereas most relied on a single train-test split. Using only one split increases the risk of overfitting and can exaggerate reported performance.

  • Testing environments: Most systems were tested in controlled conditions with a single user. Only 10 studies directly measured performance in multi-user scenarios or more dynamic, realist environments, an important step toward measurement of real-world applicability in the healthcare setting.

  • Benchmarking practices

    Among the 45 studies, 22 engaged in benchmarking, directly comparing their results against findings reported in previous literature. This practice helps position new approaches within the broader research landscape and provides a relative measure of performance improvements over existing methods. However, benchmarking alone does not fully confirm a model’s generalizability, as many of these studies did not evaluate their systems on entirely independent datasets or in substantially different real-world environments. The remaining studies relied solely on internal validation, which, while useful for development, can overestimate real-world performance when not supplemented with external testing.

Frequency of the metrics reported.

Figure 5: Frequency of the metrics reported.

The lack of reliable evaluation frameworks, limited use of high-quality statistical reporting, and restricted availability of code and data combine to form a fragmented evidence base. To advance this technology toward clinical application, the research community needs to adopt uniform evaluation methods, prioritize reproducibility through open-source principles, and embrace statistical rigor when reporting performance.

Limitations and recommendations

Despite the accuracy and potential of using AI-enhanced Wi-Fi sensing non-invasive health monitoring systems, a range of technical and practical challenges remain. These limitations are interrelated and directly inform our proposed recommendations for future research.

  • High computational demands and edge deployment constraints

    Most AI-based solutions (e.g., Convolutional Neural Networks (CNNs), Long Short-Term Memory (LSTMs)) are computationally intensive and rely on powerful GPUs (Graphics Processing Units) or multi-core CPUs (central processing units) to analyze high-resolution CSI data in real time. This poses challenges for portable or embedded systems such as home gateways, low-power Internet of Things devices, or wearable hubs. The computational overhead affects latency, increases cost, and reduces viability in resource-limited clinical environments. Research should focus on lightweight, optimized models through pruning, quantization, knowledge distillation, and architectural simplification, enabling high performance on low-power, edge-based systems without costly hardware.

  • Energy efficiency limitations

    Constant CSI acquisition and high-rate sampling can be very energy-intensive, particularly for systems intended to monitor patients continuously over long intervals. This results in frequent battery charging, higher maintenance demands, and lower patient compliance in mobile or battery-based healthcare devices. It also wastes energy and bandwidth by processing data during inactivity. Adaptive sensing and event-based monitoring, where data are gathered more frequently when an anomaly is detected, providing a roadmap toward using less power without loss of monitoring capabilities of critical interest.

  • Integration gaps with healthcare IT systems

    Although numerous studies have demonstrated technical feasibility in controlled environments, few show interoperability with hospital electronic health record (EHR) systems, remote patient monitoring dashboards or telemedicine platforms. Without integration into existing healthcare IT workflows, these systems remain stand-alone prototypes that cannot contribute to continuous patient history or automated clinical decision support. Interoperability requires standardized data formats, secure Application Programming Interfaces (APIs), and compliance with regulations like the Health Insurance Portability and Accountability Act and General Data Protection Regulation. Using Standardized APIs, secure communication protocols, and compliance frameworks will help fill this gap so that CSI-generated health data can be transported, stored and retrieved within current healthcare infrastructure.

  • Security and privacy risks

    The transmission of sensitive physiological data over Wi-Fi raises risks of interception, tampering, or unauthorized access. Breaches may occur not only from weak or absent encryption but also from the passive nature of monitoring, where explicit consent may not be obtained. Security-by-design principles, such as robust encryption, authentication, secure storage, and anonymization, are essential for compliance and building patient trust.

  • Lack of high-quality, standardized datasets

    Most studies use small, self-generated datasets with limited demographic diversity, environmental variation, or sensing conditions. Public datasets, when available, tend to be incomplete, restricted to amplitude or phase data, or often partially pre-processed. This limits reproducibility and hinders cross-study comparison, forcing researchers to recreate data collection pipelines. Standardized, well-documented datasets would establish a common foundation for rapid innovation, reproducibility, and stringent performance testing between research teams.

  • Limited scope of monitored vital signs

    Current research focuses on respiration and heart rate, omitting other important measures, such as blood pressure, oxygen saturation, or arrhythmia. This narrow scope limits clinical applicability, particularly in use cases such as intensive care unit monitoring where multiple vitals must be measured simultaneously. Future work should focus on the development of algorithms and sensing methods that would be able to derive more pertinent physiological data concerning CSI data, and in that way, these systems could be upgraded into holistic health monitoring systems.

  • Environmental sensitivity and performance degradation

    CSI-based monitoring is strongly influenced by environmental factors such as furniture formations, wall compositions, or Wi-Fi signal fluctuations. These can degrade performance, increase false positives, or cause missed detections. Although some techniques freeze measurements during disturbances, they fail under frequent or persistent changes. Environment-adaptive calibration and robust signal processing methods are needed to guarantee stable performance with a large change in environmental conditions.

  • Multi-person monitoring

    Overlapping CSI patterns of multiple individuals remain a major challenge. Current methods perform poorly in multi-user or unsynchronized movement scenarios, limiting applicability in hospital wards, senior care facilities, or family homes. Research into clustering, angle-of-arrival estimation, and deep learning-based separation could help achieve credible performance in multi-person environments.

In addition to system level constraints, it is important to note the methodological limitations revealed in the literature. Inconsistencies in statistical reporting, evaluation measures, and validation procedures, combined with insufficient sharing of codes and datasets discussed in the previous section, compromise reliable cross-study comparison and replication. Addressing both technical and methodological challenges in parallel to advance Wi-Fi-based health monitoring toward standardized, clinically deployable solutions.

By explicitly linking these shortcomings to actionable recommendations, this review not only highlights what current Wi-Fi-based health monitoring systems lack but also provides a viable guide for how future research and development should proceed. Solving these problems will be central to the transition from exciting prototypes to dependable, secure, and clinically integrated systems that can function productively across diverse environments and patient populations.

Future directions and implications

Based on the outlined limitations and recommendations, there are several promising directions for the continued development of Wi-Fi-based health monitoring systems. These directions are not only oriented toward technical refinement but also toward integrating the technology into healthcare requirements, policy systems, and user acceptance. From a research perspective, investigation into the following areas is worth consideration:

  • Edge-optimized architectures: Train highly efficient AI models that avoid major accuracy trade-offs while being deployable on low-power and embedded devices. Techniques such as model pruning, quantization, and on-device inference pipelines can support real-time monitoring without reliance on high-performance servers.

  • Energy-aware sensing strategies: Develop adaptive sensing methods where data collection and processing rates vary according to patient activity, health status, or environmental stability. This minimizes power consumption without compromising the necessary measurements.

  • Multi-person tracking: Seek efficient algorithms to differentiate concurring CSI designs in a common area. Hybrid signal separation, beamforming, and spatial filtering with AI are potential solutions for enabling multi-user monitoring in hospitals and home care.

  • Increased physiological coverage: Extend monitoring beyond heart rate and respiration to include blood pressure, oxygen saturation and early detection of arrhythmia. An integration of multi-parameter monitoring and predictive analytics can be used to deliver proactive interventions in healthcare.

  • Diverse and well-labeled datasets: Design standardized, diverse, and well-labeled datasets that capture different environments, demographics, and medical conditions, and make them publicly available. This would accelerate innovation, reproducibility, and fair benchmarking across research groups.

The implications of these directions are equally important from a practical and policy standpoint:

  • Clinical integration and workflow adaptation: Collaboration between technologists and healthcare providers is required to ensure interoperability with EHRs, clinical dashboards, and telemedicine platforms.

  • Regulatory and ethical regulation: Policymakers should establish clear deployment guidelines on deployment addressing data ownership, informed consent, and monitoring ethics, while also creating reimbursement models for remote patient monitoring.

  • Public awareness and acceptance: Communication plans should ensure that patients and caregivers are informed about the benefits, limitations, and security of Wi-Fi-based monitoring, thereby leading to user acceptance and participation.

Following these directions, the field can move from isolated experimental successes toward sustainable real-world healthcare solutions that are technically dependable, ethically responsible, and socially acceptable. This two-pronged approach, innovation paired with readiness, will be crucial in making Wi-Fi-based health monitoring reach its full potential in improving patient outcomes and streamlining healthcare delivery. To conclude our synthesis, we have compiled all the keywords and search terms used during our research and created a network map showing the most frequently utilized terms in the literature. Figure 6 presents this keyword network map.

Bibliographic network map based on the articles’ keywords.

Figure 6: Bibliographic network map based on the articles’ keywords.

Conclusion

Considering the findings of the present review on Wi-Fi signal-based health monitoring systems, this technology has high potential in non-invasive, continuous health monitoring. Key findings include high accuracy in identifying respiratory and heart rates, successful tracking of multiple people, and diverse applications such as monitoring sleep patterns or COVID-19 symptoms. AI techniques have also been incorporated to enhance their effectiveness and precision, enabling additional forms of health evaluation. The following is a summary of the major findings and contributions of this survey:

  • Accuracy improvements: AI-driven models, particularly CNN, LSTM, and hybrid models, demonstrate significant improvements in accuracy for heart and respiration monitoring, with some models achieving over 95% accuracy in controlled environments.

  • Non-invasive monitoring: Wi-Fi-based systems provide a contactless solution, enabling accessible and less intrusive monitoring in clinical and home settings, particularly for vulnerable populations such as elderly or chronic patients.

  • Diverse use cases: Wi-Fi-based sensing demonstrate versatility across healthcare applications, including fall detection, activity recognition, and real-time vital sign monitoring.

Nevertheless, some issues remain, such as data protection and privacy, enhancing the model’s performance in various environments, and multi-user cases. The following is a summary of this survey’s major challenges and limitations:

  • Multi-user cases: Current systems struggle with multi-person monitoring due to overlapping signals and interference, highlighting the need for stronger algorithms for distinguishing between individuals.

  • Privacy concerns: Given the sensitivity of health data, robust data protection and secure Wi-Fi transmission are critical. Strategies such as encryption and user anonymization should be implemented.

  • Real-time processing: Real-time monitoring remains challenging, optimization to reduce latency and computational overhead without compromising accuracy.

The importance of these findings is the ability to ensure a significant change to healthcare delivery, particularly in remote and continuous monitoring. Future research must consider the following areas to address the gaps identified in the current literature.

  • Develop a multi-person health monitoring system that can simultaneously track complex health parameters like heart and breathing rates in real time, with a focus on mitigating environmental interventions and improving overall system performance.

  • Implement robust data protection strategies, including advanced encryption methods, secure storage solutions, and privacy-preserving algorithms to safeguard sensitive health information.

  • Expand the scope of AI-based health monitoring by integrating additional vital signs such as blood pressure and body temperature, emphasizing the creation of predictive models that can anticipate and proactively alert potential health emergencies.

  • Contribute to the research ecosystem by developing high-quality, publicly available datasets that include annotated activities, precise vital sign measurements, and comprehensive phenotypical information to support more effective AI model training, validation, and future advancements in health monitoring technologies.

  • Develop standardized procedures of data collection, preprocessing, and performance assessment that will allow comparing and reproducing studies.

  • Collaborate with healthcare regulators and policymakers to define regulatory pathways, compliance requirements, and ethical principles of clinical deployment of Wi-Fi-based monitoring systems.

  • Share codes and datasets publicly in the research community to foster innovation, allow independent verification, and translate research findings to real life.

Appendix

Search strategy overview

The search strategy was developed to identify peer-reviewed articles focusing on Wi-Fi CSI) and AI-based non-invasive vital signs monitoring. The search was conducted across six databases: IEEE Xplore, ScienceDirect, SpringerLink, ACM Digital Library, Google Scholar, and ResearchGate.

The overall search strategy could be outlined below:

  • 1.

    Research questions formulation.

  • 2.

    Develop search phrases.

  • 3.

    Choose digital libraries and conducting searches.

  • 4.

    Apply specific inclusion/exclusion criteria to select articles relevant to our topics of interest.

  • 5.

    Gather essential details from the articles we chose.

  • 6.

    Analyze the research questions in depth, allowing us to conduct a comprehensive study on the existing methods, their advantages, and the potential for future research, as well as their limitations.

Figure S1 illustrates the visual breakdown of these steps. A detailed list of the keyword combinations, Boolean query strings, and individual database inputs used during the search process is provided below.

Boolean query and keywords

To retrieve relevant articles that address the research questions from various search resources, a query string was necessary. Initially, we determined the primary search terms derived from the research questions. Subsequently, we listed synonyms and alternative expressions for these primary terms. These were then linked to each corresponding primary term using the OR operator. Finally, all primary search terms were connected using the AND operator. The culmination of these processes produced the query string that was utilized to search in different databases. The Boolean logic used is shown below:

(“Vital signs” OR “Heart Rate” OR “Respiration Rate” OR “Body Temperature” OR “Breathing” OR “Cardiac”) AND (“Wi-Fi Signals” OR “Channel State Information” OR “CSI” OR “Wireless security” OR “Router” OR “Wireless” OR “CSI-based healthcare monitoring”) AND (“Health monitoring” OR “Remote monitoring” OR “non-invasive” OR “Healthcare” OR “Tracking” OR “ECG” OR “Health Monitoring Application” OR “Patient-monitoring” OR “Sleep monitoring” OR “Home monitoring” OR “Secure Remote Monitoring” OR “Smart Health Monitoring” OR “Signal Processing in Healthcare” OR “Monitoring” OR “Detection” OR “Abnormality” OR “AI-driven Healthcare” OR “COTS device" OR “COTS router” OR “Wireless sensing” OR “Contactless Monitoring” OR “Multi-Person Detection” OR “Wi-Fi device”)

After experimenting with several combinations and permutations of key terms, we finalized our search terms, which are presented in Table S1. Searches were conducted using six major academic databases listed in Table S2.

Screening and selection process

Screening rationale and initial results

We opted to focus our search on research articles published in the last five years up to August 2024. Following this, we removed duplicates, non-journal sources, and articles out of our scope left us with 851 unique records.

In the next step, we screened the articles identified using the inclusion-exclusion criteria as shown in Table S3. This refinement process narrowed our selection down to 269 records that met our criteria. Additionally, we eliminated records that were inaccessible, studies that merely offered discussions without demonstrating model performance and outcomes, and those unrelated to our defined research questions. Moreover, we excluded articles that relied on frequency Modulated Continuous Wave (FMCW) radar systems. Despite the higher proven accuracy of these systems, they require specialized hardware which is often costly. In contrast, the CSI-based approach is considered more convenient and cost effective as it utilizes off-the-shelf Wi-Fi devices. CSI is increasingly supported by modern Wi-Fi chipsets and can be extracted with open-source tools, making it both practical and scalable for clinical and real-world deployment. We also excluded articles that used RSS for sensing, as RSS provides only basic signal strength information, which is not detailed enough to detect small body movements like breathing or heart rate. Furthermore, any studies incorporating wearable technology were excluded, since it contradicts our primary goal of contactless vital sign monitoring. The remaining records underwent further screening to identify articles pertinent to our research inquiries. Moreover, assessing metrics of accuracy, F1-score, specificity, sensitivity, and area under curve (AUC) for machine learning or deep learning models was essential for our review. Consequently, we discarded studies that failed to report on model performance.

Final selection outcome and visuals

Overall, this phase concluded with the identification of 45 trustworthy research articles from esteemed journals that align with our established research questions.

  • Figure S2 displays the detailed steps involved in this process and the breakdown of methodology, including search term design, screening criteria, inclusion/exclusion filters, and thematic classification for research conducted across global datasets.

  • Figure S3 presents a flowchart based on PRISMA guidelines illustrating identification, screening, eligibility, and inclusion of 45 peer-reviewed studies related to AI-based Wi-Fi signal monitoring systems in healthcare.

  • Table S4 reveals the origin of these articles.

  • Figure S4 presents the distribution of 45 eligible studies related to CSI-based health monitoring using Wi-Fi signals, highlighting the increasing trend in publications post-2020.

  • Figure S5 shows the distribution of these articles among all the researched databases. The data indicate that scientific research on Wi-Fi signal-based vital signs health monitoring is uncommon and has only recently seen significant growth.

Abbreviations

The nomenclature used in this article is listed in Table 10.

Table 10:
Nomenclature used in this article.
Acronym Term Definition
1. ANN Artificial Neural Network A computing system that constitutes animal brains.
2. BiLSTM Bidirectional Long Short-Term Memory A type of recurrent neural network architecture that processes data in both forward and backward directions.
3. CNN Convolutional Neural Network A deep learning algorithm designed to process structured grid data like images.
4. CSI Channel State Information Refers to the estimation of channel properties such as amplitude and phase in a wireless communication medium.
5. DACM Direction-based Adaptive Combination Method An algorithm for combining multiple signals based on their directions.
6. DBSCAN Density-Based Spatial Clustering of Applications with Noise An unsupervised learning method used for model construction; utilized for separating high-density clusters from low-density clusters.
7. DWT Discrete Wavelet Transform A signal processing technique that decomposes a signal into its wavelet components.
8. ECG Electrocardiogram A test that records the electrical activity of the heart over a period.
9. ENN Ensemble Neural Network A learning paradigm where multiple neural networks are combined to improve performance.
10. EWMA Exponentially Weighted Moving Average chart A type of control chart used to monitor attributes and describe a time series.
11. FFT Fast Fourier Transformation An efficient algorithm to compute the Discrete Fourier Transform (DFT) and its inverse, transforming a signal into the frequency domain.
12. FMCW Frequency Modulated Continuous Wave A radar technique using frequency modulation of a continuous wave signal to measure distance and velocity via frequency shift.
13. K-NN K-Nearest Neighbor A classification algorithm that classifies data points according to the k nearest neighbors.
14. MAE Mean Absolute Error A measure of errors between paired observations.
15. NLoS Non-Line of Sight Refers to a signal path between transmitter and receiver being obstructed by physical objects.
16. PCA Principal Component Analysis A dimensionality reduction method used to simplify large datasets.
17. PSD Power Spectral Density A measure of a signal’s power content against frequency, used to describe broadband random signals.
18. RSS Received Signal Strength The power level of a radio signal as received by a device, used to assess wireless connection quality.
19. SNR Signal-to-Noise Ratio A measure of signal strength relative to background noise.
20. SVM Support Vector Machine A classification algorithm that finds a hyperplane with the maximum margin to separate data points of different classes.
DOI: 10.7717/peerj-cs.3375/table-10

Supplemental Information

Literature search strategy flow.

DOI: 10.7717/peerj-cs.3375/supp-1

Detailed steps in the systematic review process.

DOI: 10.7717/peerj-cs.3375/supp-2

PRISMA-Based article selection workflow.

DOI: 10.7717/peerj-cs.3375/supp-3

Temporal Distribution of selected studies (2019–2024).

DOI: 10.7717/peerj-cs.3375/supp-4

Database-wise distribution of final studies.

DOI: 10.7717/peerj-cs.3375/supp-5

Considered Databases.

DOI: 10.7717/peerj-cs.3375/supp-7

The Inclusion-Exclusion Criteria for the Literature Selection.

DOI: 10.7717/peerj-cs.3375/supp-8

Summary of Search Results Across Digital Databases.

DOI: 10.7717/peerj-cs.3375/supp-9