Update of machine learning for ultrasound diagnosis of metabolic dysfunction-associated steatotic liver disease: a bright future for deep learning

Jiawen Li; Jianhui Chen; Xiaohong Zeng; Guorong Lyu; Shu Lin; Shaozheng He

doi:10.7717/peerj.19645

Update of machine learning for ultrasound diagnosis of metabolic dysfunction-associated steatotic liver disease: a bright future for deep learning

Jiawen Li¹, Jianhui Chen¹, Xiaohong Zeng², Guorong Lyu¹, Shu Lin ^3,4, Shaozheng He ¹

1Department of Ultrasound, the Second Affiliated Hospital of Fujian Medical University, Quanzhou, Fujian Province, China

2Department of Health Care, the Second Affiliated Hospital of Fujian Medical University, Quanzhou, Fujian Province, China

3Centre of Neurological and Metabolic Research, the Second Affiliated Hospital of Fujian Medical University, Quanzhou, Fujian Province, China

4Group of Neuroendocrinology, Garvan Institute of Medical Research, Sydney, New South Wales, Australia

DOI: 10.7717/peerj.19645

Published: 2025-06-30
Accepted: 2025-06-02
Received: 2024-12-16

Academic Editor: Daniel van den Hoek

Subject Areas: Bioengineering, Gastroenterology and Hepatology, Internal Medicine, Nutrition, Obesity
Keywords: Non-alcoholic fatty liver disease, Metabolic dysfunction-associated steatotic liver disease, Ultrasound, Artificial intelligence, Machine learning, Deep learning

Copyright: © 2025 Li et al.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.

Cite this article: Li J, Chen J, Zeng X, Lyu G, Lin S, He S. 2025. Update of machine learning for ultrasound diagnosis of metabolic dysfunction-associated steatotic liver disease: a bright future for deep learning. PeerJ 13:e19645 https://doi.org/10.7717/peerj.19645

The authors have chosen to make the review history of this article public.

Abstract

Metabolic dysfunction-associated steatotic liver disease (MASLD) is the most common liver disease and the burden is increasing around the world. Ultrasound diagnosis of MASLD is the preferred method due to its convenience, absence of radiation, and high accuracy. The application of artificial intelligence (AI) in MASLD diagnosis has been explored across electronic medical records, laboratory tests, ultrasound and radiographic imaging, and liver histopathological data. Notably, AI’s application in ultrasound diagnosis has garnered significant attention. Deep learning (DL), known for its exceptional image recognition and classification capabilities, has become a focal point in ultrasound research. This paper reviews and analyzes the application of various machine learning (ML) algorithms in the ultrasound diagnosis of MASLD, highlighting the advantages and potential of AI in this field. It is intended for clinicians, AI researchers, and healthcare innovators, aiming to enhance diagnostic accuracy, expand MASLD screening in primary care, and support early diagnosis, prevention, and treatment.

Introduction

Non-alcoholic fatty liver disease (NAFLD) is the most prevalent liver disease worldwide, with a global prevalence of 38% (Wong et al., 2023). NAFLD is characterized by the accumulation of fat in more than 5% of hepatocytes, in association with metabolic risk factors, particularly obesity and type 2 diabetes, and the absence of excessive alcohol consumption (≥30 g/day in men and ≥20 g/day in women) or other chronic liver diseases. The condition encompasses a spectrum of liver disorders, ranging from simple steatosis, which may or may not involve mild inflammation, to non-alcoholic steatohepatitis (NASH) (Powell, Wong & Rinella, 2021). It is increasingly recognized that NAFLD is a multisystem disease, where insulin resistance and associated metabolic dysfunction play a key role in its development and in the progression of liver-related morbidities, including cirrhosis, liver failure, and hepatocellular carcinoma (HCC), as well as extrahepatic complications such as cardiovascular disease (CVD), type 2 diabetes mellitus (T2DM), chronic kidney disease (CKD), and certain types of extrahepatic cancers (Byrne & Targher, 2015; Devarbhavi et al., 2023). Although the progression of NAFLD can lead to cirrhosis, liver failure, and HCC, most deaths in NAFLD patients are due to CVD (Fig. 1). In 2023, three major multinational liver associations recommended replacing the term NAFLD with metabolic dysfunction-associated steatotic liver disease (MASLD), and the term metabolic dysfunction-associated steatohepatitis (MASH) was proposed to replace NASH. The diagnostic criteria for MASLD involve the presence of hepatic steatosis (identified through imaging or biopsy) alongside at least one cardiometabolic risk factor (CMRF), with no other identifiable causes of liver steatosis (Rinella et al., 2023a). Emerging evidence suggests a near-perfect concordance between the definitions of NAFLD and MASLD—approximately 99% of individuals with NAFLD meet MASLD criteria (Targher, Byrne & Tilg, 2024).

Figure 1: Evolution of NAFLD.

Download full-size image

DOI: 10.7717/peerj.19645/fig-1

Auxiliary screening methods for MASLD mainly include ultrasonography, magnetic resonance imaging (MRI), computed tomography (CT), laboratory tests, and pathological tests. Ultrasound technology is the first choice for imaging examination of MASLD because it is real-time, inexpensive, non-invasive, and non-radioactive. Accurate ultrasound diagnosis of MASLD requires an experienced physician, as it has a subjective component and can be affected by instrument noise and speckle. Ultrasound-based AI has already been used for various applications, such as thyroid, breast, and liver-related diseases. Currently, AI for ultrasound in MASLD is a research hotspot, which has evolved from quantitative ultrasound (QUS) to machine learning (ML) to deep learning (DL), with increasing diagnostic accuracy. In this review, we will discuss the application of ultrasound-based machine learning in the diagnosis of MASLD, focusing on the research and prospects of deep learning, with a view to its wide clinical application.

Survey Methodology

To conduct an unbiased review of machine learning applications in ultrasound diagnosis of metabolic dysfunction-associated steatotic liver disease (MASLD), the following approach was used:

Search engines and databases: PubMed, Web of Science, Scopus.

Search terms: (“machine learning” OR “deep learning”) AND (“ultrasound” OR “ultrasonography”) AND (“NAFLD” OR “MASLD”), including relevant synonyms, related terms, and MeSH terms.

Inclusion criteria: Peer-reviewed original English articles from 2010 to 2024, focusing on machine learning in ultrasound for MASLD diagnosis or evaluation, with quantitative diagnostic data (e.g., accuracy, sensitivity, specificity).

Exclusion criteria: Review articles, editorials, non-original research, studies using non-ultrasound imaging modalities (e.g., MRI, CT), or not validating machine learning models for MASLD.

Potential bias: Reliance on indexed databases may exclude non-English or non-indexed studies. Publication and reviewer bias may affect the selection of studies.

Screening process: Two independent reviewers screened titles and abstracts, and full-text articles were assessed based on the criteria. Disagreements were resolved through discussion, with a third reviewer consulted if needed.

Contributions: This review differs from existing literature by providing a comprehensive comparison of various machine learning models applied in the ultrasound diagnosis of metabolic MASLD, with a particular focus on the advancements in deep learning technologies. This review covers a wide range of models, from classical machine learning techniques to advanced deep learning architectures, highlighting their respective strengths and challenges. Additionally, the review offers insights into the future clinical applications of deep learning in MASLD diagnosis and outlines key areas for future research.

Metabolic dysfunction-associated steatotic liver disease

Although MASLD can be diagnosed by imaging tests such as ultrasound, CT, or MRI, diagnosing MASH still requires a liver biopsy to identify features such as inflammation, hepatocyte ballooning, Mallory-Denk bodies, and early fibrosis (Neuschwander-Tetri, 2017). Furthermore, early identification, prevention, and treatment of MASLD can significantly reduce its serious consequences.

Conventional ultrasound is the first choice for diagnosing fatty liver disease. This non-invasive technique is frequently used to screen for MASLD and is recommended by the European Guidelines for MASLD Management as the first-line imaging method for patients at risk (Miele et al., 2020). The typical ultrasound features of MASLD include liver echogenicity higher than the right kidney, with varying degrees of distal attenuation and intrahepatic vascular blurring. Based on these characteristics, the degree of steatosis can be subjectively classified as mild, moderate, or severe. Traditional B-mode ultrasonography has high diagnostic accuracy for moderate and severe hepatic steatosis (≥20% hepatic fat) but low diagnostic accuracy for mild steatosis (<20% hepatic fat) (Dasarathy et al., 2009; Lee et al., 2010) and does not reliably detect individuals with a high body mass index (BMI) (>40 kg/m²). Ultrasound is inexpensive, versatile, non-invasive, and non-radioactive. Current European practice guidelines recommend identifying hepatic steatosis by ultrasound, as it is more widespread and cheaper than the gold standard and MRI (European Association for the Study of the Liver (EASL), 2016).

Quantitative ultrasound in diagnosis of MASLD

Advances in ultrasound technology have significantly enhanced the diagnosis of MASLD, including traditional B-mode ultrasound, Doppler, elastography, and quantitative ultrasound (QUS). QUS analyzes raw, unprocessed radiofrequency (RF) data returned from tissue by inferring the mechanical properties of the tissue through the interaction of the ultrasound beam with the tissue and applying appropriate modeling (Tamaki, Ajmera & Loomba, 2021). QUS can model various physical properties, including sound backscattering, sound attenuation, and speed of sound, to produce objective, absolute indicators for quantifying hepatic steatosis.

Hepatorenal index

In conventional ultrasound, the diagnosis of fatty liver is typically made by comparing the echoes of the liver and the right kidney on the same image. However, this method is affected by various factors including gain, depth, power, and patient anatomy. The hepatorenal index (HRI), a semi-quantitative biomarker of steatosis, improves upon this method by dividing the signal intensity of the liver by that of the renal cortex on the same ultrasound image (Pirmoazen et al., 2022). Higher HRI values indicate increased liver echogenicity, corresponding to higher steatosis levels. Johnson et al. (2021) demonstrated that an HRI of ≥1.4 is associated with a positive predictive value of over 95% for ≥10% steatosis. However, no corresponding guidelines currently exist. HRI is usually obtained by manually setting the region of interest (ROI) to avoid vascular and focal lesions. However, it cannot be used in the presence of ectopic kidneys or the absence of the right kidney.

Attenuation coefficient

The attenuation of sound waves in fatty liver tissue differs from that in normal liver parenchyma. The attenuation coefficient (AC) quantifies the rate of energy loss of sound waves as they propagate through tissue, which depends on the wave’s frequency and the tissue characteristics (Ozturk et al., 2023). Several algorithms are available for estimating AC, including the controlled attenuation parameter (CAP), which is the most widely studied ultrasound technique for quantifying fatty liver. CAP is measured in dB/m, with higher values indicating more severe liver steatosis. CAP is particularly sensitive in detecting fatty livers with more than 10% steatosis (Pirmoazen et al., 2020). Karlas et al. (2017) demonstrated that the diagnostic performance of CAP was reflected by an AUC value ranging from 0.823 to 0.882. However, CAP measurements are obtained without visualizing the liver, meaning they may be influenced by masses, vessels, or uneven steatosis, which can affect the accuracy of the results. Its accuracy is also diminished in the presence of obesity, ascites, and advanced fibrosis. Therefore, CAP should be combined with other ultrasound techniques to improve diagnostic reliability. In response to these limitations, researchers have developed methods for measuring liver fat content in real time on conventional ultrasound images, with the ability to accurately localize the region of interest. These methods, such as ultrasound-guided attenuation parameters (UGAP), attenuation coefficients (ATT), and attenuation imaging (ATI), utilize similar principles of attenuation measurement. For example, ATI has demonstrated greater accuracy than CAP, with an AUC ranging from 0.79 to 0.97 (Zeng et al., 2023). Despite their promising performance, UGAP, ATT, and ATI techniques are limited by factors such as operator dependence, image quality, and a lack of standardization, which can affect diagnostic accuracy.

Backscattering coefficient

The backscattering coefficient (BSC) is a quantitative value that reflects the scattering of an ultrasound pulse back to the echo probe (Wear et al., 2022). The number of scattered backward-facing ultrasound pulses increases with higher levels of liver fat because fat vesicles in liver cells enhance the scattered ultrasound signal. Lin et al. (2015) demonstrates that the backscatter coefficient (BSC) shows strong diagnostic performance in identifying hepatic steatosis, with an AUC of 0.98 for diagnosing steatosis defined by MRI-PDFF ≥5%. However, its sensitivity and specificity may vary depending on factors like BMI, and it requires specialized equipment and software for analysis.

Speed of sound

Speed of sound (SoS) can be used to characterize tissue properties based on changes in ultrasonic echo velocity in various media (Ferraioli & Monteiro, 2019). The speed of sound decreases with increasing liver fat content. Dioguardi Burgio et al. (2018) demonstrated that the novel ultrasound-based SoS provides high diagnostic performance for detecting and grading hepatic steatosis, with an AUC of 0.882 for detecting any grade of steatosis and 0.989 for moderate to severe steatosis. However, SoS is susceptible to confounding factors such as inflammation, parenchymal edema, and temperature changes.

Speckle statistics

Speckle statistics, also known as ultrasound envelope statistical parametric imaging (Park et al., 2022), analyze the speckle pattern caused by the scattering of ultrasound signals from tissue microstructures, which appears in the ultrasound image. Speckle statistics are based on the parameterization of ultrasound speckle patterns using an established statistical model that describes the scattering properties of the tissue. Acoustic structure quantification (ASQ) was introduced as a more advanced approach that builds upon the principles of speckle statistics. ASQ improves upon traditional methods by directly analyzing the backscattered ultrasound signals, offering more precise and reliable quantification of tissue composition. Lin et al. (2019) indicate that, among different scanning planes, the intercostal approach using ASQ achieves the highest AUC value of 0.92. ASQ improves precision over traditional speckle-based methods, but it may be less accessible and more dependent on equipment and scanning conditions compared to CAP.

Shear wave elastography indicators

Shear wave elastography (SWE) is an imaging technology that generates shear waves in tissues by emitting acoustic radiation pulses and converts the propagation speed of these waves into corresponding tissue hardness (Ozturk et al., 2021). Real-time, two-dimensional shear wave elastography images can be obtained through SWE, and the value of Young’s modulus (measured in kPa) of the liver can be quantitatively detected, reflecting the absolute hardness of the liver. The higher the SWE value, the harder the tissue (Taru et al., 2023). The modulus of elasticity or stiffness of diseased tissues often differs from that of normal tissues, allowing SWE to distinguish between normal and abnormal tissues. SWE is primarily used to assess liver fibrosis, an important component of MASLD and other liver diseases. It is useful in identifying patients with advanced fibrosis or cirrhosis (Castera, Friedrich-Rust & Loomba, 2019). However, for assessing steatosis, SWE showed poor correlation with steatosis grades and did not distinguish between different steatosis grades effectively (Wang et al., 2025). On the other hand, SWE could be used alongside other diagnostic tools like CAP to provide a comprehensive assessment of liver health.

Multimodal ultrasound improves the accuracy of MASLD diagnosis

The development of quantitative ultrasound holds significant promise for liver disease diagnosis, yet the establishment of standardized values remains a critical challenge, influenced by various patient and technical factors. For example, Shi et al. (2019) incorporated shear wave attenuation, shear wave absorption, elasticity, dispersion slope, and echo attenuation, achieving an impressive AUC of 0.93. While this result suggests a high degree of accuracy, the study’s reliance on a complex model may limit its clinical applicability. Similarly, Labyed & Milkowski (2020) developed the ultrasound-derived fat fraction (UDFF) by combining attenuation and backscattering coefficients, achieving an AUC range of 0.83 to 0.94 for liver fat quantification. Dillman et al. (2022) supported these findings and demonstrated similar results. While UDFF holds significant promise as a reliable, non-invasive tool for liver fat quantification, challenges related to performance variability and the need for methodological standardization must be addressed.

Moreover, quantitative ultrasound techniques have increasingly been integrated with clinical data to improve diagnostic outcomes. Notably, Newsome et al. (2020) introduced the FibroScan-AST (FAST) score, a composite score that combines vibration-controlled transient elastography (VCTE), CAP, and aspartate aminotransferase (AST), which demonstrated the best predictive properties for MASH and advanced fibrosis. In external validation, the FAST score achieved AUCs between 0.74 and 0.95, with sensitivity potentially compromised at lower AUC thresholds. More recently, the American Association for the Study of Liver Diseases (AASLD) (Rinella et al., 2023b) recommended a stepwise approach, beginning with the FIB-4 index score, followed by liver stiffness measurement (LSM) via VCTE, as the initial method for identifying high-risk MASH. While this recommendation aligns with current evidence, it highlights the challenge of developing a universally applicable screening protocol, as LSM’s effectiveness varies with liver condition and technology.

Quantitative ultrasound shows strong potential for diagnosing MASLD, but challenges remain in achieving consistency due to variability in equipment, operator skill, and patient factors. While techniques like UDFF and the FAST score show high accuracy, their performance varies, emphasizing the need for better standardization. Combining ultrasound with clinical data is promising, but creating universally applicable protocols remains challenging. Further research is needed to enhance these methods for wider clinical use.

AI in diagnosis of MASLD based on ultrasound (Table 1)

The diagnosis of MASLD using artificial intelligence (AI) is primarily achieved through machine learning (ML). ML can be further divided into supervised learning, unsupervised learning, and deep learning. Supervised learning involves creating predictive models based on input and output data, while unsupervised learning focuses on grouping and pattern recognition using only input data (De Bruijne, 2016). Deep learning, a subset of ML, utilizes neural networks to analyze large datasets (Dinani, Kowdley & Noureddin, 2021) (Fig. 2). Supervised learning techniques include linear regression, logistic regression, decision trees, K-nearest neighbors, support vector machines, random forests, naive Bayes classification, and gradient boosting, among others. Traditional ML methods can classify and diagnose conditions using input data, which can be either features or raw data. Features are quantifiable data variables derived from expert knowledge that accurately describe the data from the ROI. When raw data is used as input, the algorithm must identify the features autonomously. To compare the performance of different ML methods and the diagnoses made by diagnostic sonographers, several classification metrics are employed. Key indicators include accuracy, sensitivity, specificity, and the area under the curve (AUC) from receiver operating characteristic (ROC) analysis. Below are some major ML methods and related studies.

Table 1:

ML algorithm for diagnosis of MASLD patients based on ultrasound images.

Author	Years		Classification categories	Parameters in the model	ML algorithm	Results	Reference standard	Data preprocessing /augmentation
Ribeiro, Tato Marinho & Sanches (2014)	2014	74 patients	Normal, steatosis	Textural features	CAD (Bayes)	Acc = 0.933 Sen = 0946 Spec = 0.921	/	Radiofrequency image estimation; image decomposition; speckle and despeckle separation; normalization and standardization; feature set comparison
Hwang & Cho (2023)	2023	89 children	Normal, steatosis	ATI	Bayes classifier	Sen = 0.894 Spec = 1	/	Envelope estimation; speckle decomposition
Basavarajappa et al. (2021)	2021	21 rats	Normal liver and mild, severe NAFLD	Multiparametric ultrasound	SVM	Acc = 0.92	Pathology	Z-score normalization; principal component analysis
Nagy et al. (2015)	2015	228 patients	None, mild, moderate, and severe steatosis	CVL	SVM	AUC = 0.923 Sen = 0.813 Spec = 0.89	Biopsy	Intensity histogram analysis
Tang et al. (2018)	2018	60 rats	None, mild, moderate, and severe steatosis	Elastography +QUS	RF	AUC = 0.66 (mild) AUC = 0.84 (moderate) AUC = 0.87 (severe)	Pathology	Echo envelope extraction
Destrempes et al. (2022)	2022	82 patients	None, mild, moderate, and severe steatosis	Elastography +QUS	RF	AUC = 0.90(mild) AUC = 0.81(moderate) AUC = 0.78 (severe)	Biopsy	Echo envelope extraction; compensation for time gain compensation; sliding window technique; Winsorization
Mihai Mihailescu (2013)	2013	120 patients	Stage of steatosis	Minimum and maximum attenuation, median gray levels	RF	Acc = 0.908	/	Robust brightness estimation
Acharya et al. (2012)	2012	100 images	Normal, fatty liver	Textural features	CAD (DT)	Acc = 0.933	/	Image standardization
Subramanya et al. (2014)	2014	53 images	Normal, fatty liver	Texture features	CAD (SVM)	Acc = 0.849	/	Feature combination; feature selection
Saba et al. (2016)	2016	62 patients	Normal, fatty liver	Texture features	CAD (BPN)	Acc = 0.976 Sen = 0981 Spec = 0.972	/	Standardization; feature combination and Scaling
Acharya et al. (2016)	2016	150 images	Normal, fatty liver and cirrhosis	Texture features	CAD (PNN)	Acc = 0.973 Sen = 1 Spec = 0.960	/	Morphological processing; image resizing and contrast enhancement; curvelet transform; feature reduction
Kuppili et al. (2017)	2017	63 patients	Normal, fatty liver	Texture features	CAD (ELM)	Acc = 0.968 AUC = 0.97	Biopsy	Standardization; data subsampling;
Biswas et al. (2018)	2018	63 patients	Normal, fatty liver	Features	CNN	Acc = 0.92	Biopsy	Image optimization; image cropping and border stripping

DOI: 10.7717/peerj.19645/table-1

Notes:

Sen: Sensitivity
Spec: Specificity
Acc: Accuracy
AUC: Area Under Curve

Figure 2: Deep learning.

Download full-size image

DOI: 10.7717/peerj.19645/fig-2

Bayes classifier

The Bayes classifier is a probabilistic method widely used in medical diagnostics due to its ability to handle uncertainty and small datasets. In Ribeiro, Tato Marinho & Sanches (2014), a Bayes classifier was used to detect hepatic steatosis from ultrasound images, achieving 93.33% accuracy, with 94.59% sensitivity and 92.11% specificity. Similarly, Hwang & Cho (2023) applied a Bayes latent model to identify MASLD predictors, finding that ultrasound attenuation imaging (ATI) was the most effective for predicting hepatic steatosis, with an AUC of 0.923, a sensitivity of 90.2% and specificity of 85.4%. Both studies demonstrate the Bayes classifier’s potential in accurately diagnosing MASLD and hepatic steatosis non-invasively. However, the effectiveness of Bayes classifiers may be limited when there is a strong correlation or high dimensionality among features.

Support vector machine

Support vector machine (SVM) is a type of linear classifier used for supervised learning, where it separates data into different classes by finding the optimal hyperplane. Basavarajappa et al. (2021) applied SVM to six ultrasound imaging measurements, including H-mode ultrasound, and found it achieved the highest accuracy with H-mode data. Similarly, Nagy et al. (2015) used SVM to classify hepatic steatosis in 228 patients, showing that the coefficient of variation of luminance was most effective for distinguishing mild and moderate-to-severe steatosis. In summary, SVM excels in high-dimensional classification tasks but requires careful feature selection and parameter tuning to avoid overfitting or underfitting, and can be computationally intensive with large datasets. Despite these challenges, SVM remains a powerful tool in the diagnosis of MASLD due to its robustness and ability to handle complex data.

Random Forest

Random Forest (RF) is a classifier that uses multiple decision trees to improve accuracy and reduce overfitting by averaging the results from several trees. Tang et al. (2018) used RF to classify liver conditions in rats, finding that combining QUS and shear-wave elastography improved accuracy over elastography alone, though human data are still needed to confirm these results. Destrempes et al. (2022) demonstrated this view on human data. Mihai Mihailescu (2013) compared RF and SVM in assessing MASLD severity, demonstrating that RF performed better in terms of accuracy without the need for feature selection. In conclusion, RF is effective in improving classification accuracy, especially when combining multiple features. Its main advantages are high accuracy and resistance to overfitting. However, it can be computationally expensive and may require careful tuning of hyperparameters for optimal results.

Deep learning

Deep learning (DL) is a type of machine learning (ML) that builds on the development of artificial neural networks. It outperforms traditional ML, brings ML closer to Artificial Intelligence, and holds the promise of being far superior to previous techniques (Table 2). DL is not affected by image variations, does not ROIs, is not limited to expert-defined features, and can be trained on a large number of images. The algorithm uses multilayer neural networks for detection, classification, and segmentation of biomedical images. Convolutional neural networks (CNNs) are a special type of neural network in deep learning, primarily consisting of convolutional layers, pooling layers, and fully connected layers (Hosny et al., 2018) (Fig. 3). They are mainly used to process data with a grid structure, extract feature maps, and perform feature aggregation. Currently, CNNs are the most popular type of architecture in medical imaging.

Table 2:

DL algorithm for diagnosis of MASLD patients based on ultrasound images.

Author	Years	Sample size	Classification categories	DL algorithm	Results	Reference standard	Data preprocessing/ augmentation	Pros and cons
Cao et al. (2019)	2019	240 participants	Normal liver and mild, moderate, and severe NAFLD	CNN	AUC = 0.958	/	Image resizing; envelope signal quantification; data expansion	High diagnostic accuracy, especially for severe NAFLD cases. Struggles with mild/moderate distinction.
Yang et al. (2023)	2023	928 participants (1,856 images)	None, mild, moderate, and severe steatosis	2S-NNet	AUC≥0.90 (mild) AUC ≥ 0.85 (moderate) AUC≥0.85 (severe)	/	Image stitching	A novel two-section deep learning model. Offers a solution for large-scale population screening.
Cha et al. (2021)	2021	294 participants	Normal and mild fatty liver	DCNN	ICC = 0.734	/	Image resizing	A novel automated HRI quantification method. Unvalidated generalizability to severe disease.
Zsombor et al. (2023)	2023	102 patients	None, mild, moderate, and severe steatosis	CNN	AUC = 0.758 (mild) AUC = 0.803 (moderate/severe)	MRI-PDFF	Data normalization; various image transformations such as rotations and flips	Easy to implement in clinical practice. Lacks histology validation for comparison.
Nguyen et al. (2021)	2021	60 rabbits	Normal, steatosis	CNN	Acc = 0. 738	Pathology	Removal of noisy data and outliers; resizing; normalization; regularization techniques	Simplifying clinical workflow. Accuracy affected by liver fibrosis. Small dataset leading to variability in test results.
Han et al. (2020)	2020	204 participants	NAFLD, no NAFLD	1D-CNN	Acc = 0. 96 AUC = 0.958	MRI-PDFF	remove noise and artifacts; normalization; random transformations (e.g., scaling, rotation)	Robust to system setting changes. Potential saturation effect for high fat fractions. Limited generalizability due to single platform and operator
Sanabria et al. (2022)	2022	31 patients	None, mild, moderate, and severe steatosis	2D-CNN, 3D-CNN	AUC≥0.90	MRI-PDFF	Logarithmic compression; denoising; normalization; extracting multiple patches	Uses raw data for enhanced diagnostics. Limited by available ultrasound machines and small patient sample.
Jeon et al. (2023)	2023	173 participants	Normal liver and mild, moderate, and severe NAFLD	2D-CNN	AUC = 0.97	MRI-PDFF	Envelope extraction; logarithmic compression; noise reduction; normalization; extracting multiple patches	QUS parametric maps and B-mode images for diagnosing. Limited by single-center data, saturation in severe steatosis.
Vianna et al. (2023)	2023	199 patients	None, mild, moderate, and severe steatosis	VGG16	AUC = 0.98 (mild) AUC = 0.67 (moderate) AUC = 0.66 (severe)	Biopsy	Image cropping; image resizing; standardization	Outperformed most radiologists in detecting steatosis. Limited generalizability due to single-center study and no data augmentation.
Liu et al. (2024)	2024	710 participants	None, mild, moderate to severe steatosis	VGG16	AUC = 0.85 (mild) AUC = 0.95 (moderate to severe)	/	Image resizing; normalization; random transformations (e.g., scaling, rotation)	The new multi-input model showed significant improvement.
Che et al. (2021)	2021	55 patients	Normal, fatty liver	2D-ResNet	AUC = 0.978	Biopsy	Image cropping; image resizing; random transformations (e.g., scaling, rotation)	Utilizes advanced feature fusion techniques and multi-scale analysis. Limited by dataset size. Dependence on quality data.
Chou et al. (2021)	2021	2070 patients (21855 images)	None, mild, moderate, and severe steatosis	ResNet-50 v2	AUC = 0.974 (mild) AUC = 0.971 (moderate) AUC = 0.981 (severe)	/	Image cropping; image resizing; normalization; random transformations (e.g., scaling, rotation)	A large dataset for better accuracy. Limited by variations in image quality, motion artifacts, and regional bias from a single hospital dataset.
Zamanian et al. (2021)	2021	55 patients	Normal, fatty liver	ResNet101+ SVM	Acc 0.986 AUC = 0.9998	Biopsy	Image resizing; color modifications; random transformations (e.g., scaling, rotation)	Enhances model generalization and reduces overfitting for improved accuracy. Relies heavily on data augmentation
Hardy et al. (2023)	2023	55 patients	Normal, fatty liver	ResNet-50	AUC = 0.904	Biopsy	Center cropping; linear scaling; random resized crop	Enhances classification accuracy with synthetic data. Limited by small dataset size.
Constantinescu et al. (2020)	2020	60 patients	Normal, steatosis	Inception-v3	Acc = 0.932 AUC = 0.93	/	Image cropping; image rescaling; random transformations (e.g., scaling, rotation); dropout, activity regularization, kernel regularization	Reduces the need for large datasets by using fine-tuning. Limited by small dataset size and no comparison with expert diagnoses.
Santhosh Reddy, Bharath & Rajalakshmi (2018)	2018	157 images	Normal, fatty liver	VGG-16	Acc = 0.96	/	Image cropping; image resizing; random transformations (e.g., scaling, rotation)	Improves diagnosis efficiency with limited data. Limited by small dataset size and no comparison with expert diagnoses.
Byra et al. (2021)	2021	135 participants	Normal, fatty liver	ResNet-50	AUC 0.91	MRI-PDFF	Image cropping; image resizing; image shiftin; horizontal flipping	Achieves high diagnostic performance with multiple liver views. Limited by a small dataset and potential bias.
Li et al. (2022)	2022	3,310 patients	None, mild, moderate, and severe steatosis	ResNet18	AUC = 0.85 (mild) AUC = 0.91 (moderate) AUC = 0.93 (severe)	Biopsy	Automatic cropping; viewpoint filtering; multi-scanner inclusion; multi-view Inclusion	Generalizable across multiple scanners and viewpoint. The model may be sensitive to the imaging protocol and dataset biases.

DOI: 10.7717/peerj.19645/table-2

Notes:

AUC: Area Under Curve
Acc: accuracy
ICC: Intraclass Correlation Coefficient

Figure 3: Convolutional neural network.

Download full-size image

DOI: 10.7717/peerj.19645/fig-3

Deep learning based on feature acquisition.

DL has shown promise in extracting features from ultrasound images to aid in diagnosing MASLD. Cao et al. (2019) analyzed liver images from 240 patients using envelope signals, gray signals, and a DL index derived from three image processing technologies. The DL index outperformed the envelope and gray values in diagnostic performance. Similarly, Yang et al. (2023) developed and validated a DL system using ultrasound images from 928 subjects. Their system, which employed a two-segment neural network (2S-NNet), classified the severity of hepatic steatosis based on three diagnostic features: bright liver, blurred intrahepatic catheter, and impaired visibility of more than half of the diaphragm. Both studies found high accuracy for diagnosing MASLD, though accuracy was lower for mild and moderate cases. However, the studies have limitations: the limited sample size of Cao et al. (2019) further restricts the broader applicability of their findings. Yang et al. (2023) lacked secondary validation (e.g., MRI or biopsy), and sonographer training was not standardized, which could affect the results. Therefore, while DL shows potential for diagnosing MASLD, further validation with standardized protocols is necessary to confirm its reliability.

Deep learning based on hepatorenal index.

Many DL studies of MASLD have utilized the HRI. Cha et al. (2021) analyzed 294 ultrasound images from living liver donors and found that an HRI automatically quantified using a deep convolutional neural network (DCNN) showed high consistency with measurements made by ultrasound diagnosticians. However, this study focused only on normal and mild fatty liver, excluding moderate and severe cases, which limits its broader applicability. In contrast, Zsombor et al. (2023), in a single-center prospective study, diagnosed mild and moderate steatohepatitis in 102 subjects using an AI-based liver and kidney index (AI-HRI) calculated by DCNN. Their results showed a higher AUC, higher sensitivity, and lower specificity compared to manually measured indices from previous studies. AI-HRI also outperformed grayscale ultrasound in detecting mild steatosis. Nevertheless, AI-HRI should not replace grayscale ultrasound, as a combination of both methods is necessary to enhance diagnostic accuracy. Furthermore, the sample size in this study was limited, and further validation in a multicenter setting is needed to confirm the findings.

Deep learning based on raw data.

The current study indicates that diagnosing MASLD using raw data may offer advantages over traditional quantitative ultrasound (QUS) indicators. Nguyen et al. (2021) developed a rabbit MASLD model and showed that a one-dimensional CNN processing raw radiofrequency signals achieved 74% accuracy, outperforming the 59% accuracy of a QUS-SVM model. However, as this study was animal-based, its findings require further validation in clinical settings. Han et al. (2020) reported a 96% classification accuracy using a one-dimensional CNN with raw ultrasound radiofrequency signals from 102 subjects. Additionally, studies by Sanabria et al. (2022) and Jeon et al. (2023) highlighted that two-dimensional CNNs outperform one-dimensional models for classifying hepatic steatosis using RF data, showing superior diagnostic performance over traditional methods. The combination of raw radiofrequency data and deep learning holds significant promise for improving MASLD diagnosis by providing more accurate liver fat quantification. However, its clinical applicability is still limited by challenges such as data standardization, validation, and the need for large-scale patient datasets.

Transfer learning in deep learning.

Deep learning has achieved great success in image recognition due to its ability to learn hierarchical features from large datasets. However, training deep models from scratch requires vast amounts of labeled data and significant computational resources. Transfer learning addresses this issue by leveraging pre-trained models, that have been trained on large, well-established datasets like ImageNet (Morid, Borjali & Del Fiol, 2021). These pre-trained models learn general features (e.g., edge detection, texture recognition) that can be reused for new, related tasks (Cheplygina, de Bruijne & Pluim, 2019). As a result, transfer learning allows deep learning models to perform effectively even on smaller, domain-specific datasets.

Pre-trained models like VGG-16 (Vianna et al., 2023; Liu et al., 2024), ResNet (Che et al., 2021; Chou et al., 2021; Zamanian et al., 2021; Hardy et al., 2023) and Inception (Constantinescu et al., 2020) have been applied to liver condition classification, improving diagnostic accuracy with less labeled data. For instance, Byra et al. (2018) demonstrated that a ResNet-v2 model, fine-tuned for liver ultrasound images, outperformed traditional methods, showing higher AUC scores in identifying fatty liver. Similarly, Santhosh Reddy, Bharath & Rajalakshmi (2018) utilized a VGG-16 pre-trained model and achieved 90.6% accuracy in classifying fatty liver from ultrasound images, showcasing the effectiveness of transfer learning in improving diagnostic performance in smaller datasets. Byra et al. (2021) and Li et al. (2022) both demonstrated that multi-view ultrasound enhances diagnostic accuracy, with Byra et al. (2021) finding the right posterior portal vein to be the most accurate, while Li et al. (2022) showed that combining multi-view data from 3,310 patients using ResNet-18 achieved performance comparable to CAP measurements and high consistency across ultrasound scanners.

In recent years, different teams have optimized datasets (e.g., expanding dataset size, acquiring ultrasound images with different instruments) and compared various pre-trained models to identify the most suitable model, but no unified conclusion has been reached. Despite its success, transfer learning is not without limitations. One challenge is the domain gap, where the source data (such as general images used to train pre-trained models) may differ significantly from the target data (e.g., medical images). This discrepancy can result in suboptimal performance if not properly addressed, as the model might struggle to generalize from one domain to another (Hosseinzadeh Taher et al., 2025). Additionally, fine-tuning is needed to avoid overfitting with smaller datasets (Xie et al., 2021). The effectiveness of pre-trained models depends on their alignment with the specific task, requiring adjustments for optimal performance in medical imaging (Atasever et al., 2023).

Despite these challenges, the increasing availability of pre-trained models and the advancement of transfer learning techniques continue to enhance diagnostic accuracy, solidifying transfer learning as a leading approach in medical image analysis. In conclusion, transfer learning is a transformative technique within deep learning. By reusing knowledge from pre-trained models, transfer learning significantly improves performance while reducing the need for large-scale training datasets. As the field evolves, integrating transfer learning with emerging technologies, such as synthetic data generation and multi-modal learning, promises to further enhance the accuracy and robustness of diagnostic systems.

Automated machine learning

Recently, a new method in machine learning, known as automated machine learning (AutoML), has been proposed. AutoML automates key aspects of the machine learning process, including data preprocessing, optimal algorithm selection, and hyperparameter tuning, significantly reducing the time required to build machine learning models (Wever et al., 2021). This automation allows for faster model development, making it more accessible to non-experts and streamlining the workflow for experienced practitioners. Tahmasebi et al. (2023) conducted a study involving 120 subjects, both with and without MASLD, using MRI-PDFF as the reference criterion. They employed AutoML Vision to develop a supervised machine learning model for assessing MASLD, achieving a sensitivity of 72.2% and a specificity of 94.6%. This study highlights AutoML’s potential in medical diagnostics, demonstrating its ability to build accurate models for complex tasks like assessing MASLD. However, despite these promising results, there is still a scarcity of relevant studies utilizing AutoML in this field, which suggests a need for further research to fully explore its capabilities and limitations.

Computer-aided design based on feature input

Computer-aided design (CAD) techniques have become increasingly integral in enhancing the diagnosis of non-alcoholic fatty liver disease (MASLD) through ultrasound imaging. The incorporation of ML in CAD systems has shown significant promise in analyzing liver ultrasound images with greater accuracy and efficiency than traditional methods. As a result, researchers have focused on developing the most effective CAD systems, exploring various classifiers to identify the best-performing models for diagnosing MASLD.

Early studies, such as those by Acharya et al. (2012), employed the Symtosis CAD method to extract features from liver ultrasound images, combining texture analysis, higher-order spectra, and wavelet transforms. This technique, paired with a decision tree (DT) classifier, accurately distinguished between normal and fatty liver images. Similarly, Subramanya et al. (2014) used a SVM-based CAD system to diagnose fatty liver, finding that Laws ratio features yielded the best classification results.

As technology advanced, CAD systems evolved, incorporating more sophisticated algorithms and a broader range of features. Saba et al. (2016) applied Symtosis to extract additional features, including Haralick, Gabor, and Fourier transforms, improving the system’s ability to capture subtle liver texture changes. Their study showed that a back propagation neural network (BPNN) classifier outperformed traditional DT classifiers. Acharya et al. (2016) introduced a curvelet-wave transform method with a probabilistic neural network (PNN), which showed high classification accuracy using only six features. These advancements marked a shift toward more efficient and robust models, reducing the complexity of the feature set required for accurate classification.

Further improvements were made in more recent studies. Kuppili et al. (2017) and Biswas et al. (2018) explored the combination of Symtosis with extreme learning machines (ELM) and CNNs, both of which led to faster and more accurate liver image classification. These models capitalized on the ability of CNNs to automatically learn and refine features from raw data, removing the need for extensive manual feature extraction.

While CAD systems using various classifiers have been developed for diagnosing hepatic steatosis, several challenges remain. Factors such as image quality, feature selection, and the complexity of classification algorithms can all impact the performance of these systems. The application of machine learning techniques like SVM, CNN, and PNN has significantly improved liver image classification accuracy. The future of ultrasonic CAD holds great promise, particularly in enhancing the early detection and management of MASLD, offering more reliable and efficient diagnostic tools.

Summary and Prospectives

The application of AI in MASLD has gained significant attention. Currently, ultrasound is the most widely used tool for MASLD screening, and AI-assisted ultrasound has shown considerable promise. A meta-analysis (Decharatanachart et al., 2021) demonstrated that AI-assisted ultrasound achieved a sensitivity of 0.97, specificity of 0.98, positive predictive value (PPV) of 0.98, negative predictive value (NPV) of 0.95, and area under the curve (AUC) of 0.98, outperforming AI-assisted clinical datasets. Additionally, neural networks have shown superior performance compared to non-neural network models. However, most studies to date rely on retrospective, single-center data, which introduces selection bias, limits generalizability, and results in variability in diagnostic tools and operational procedures.

AI offers key advantages over traditional statistical models, such as identifying complex patterns, integrating multiple factors, and creating predictive models. These models aid risk stratification, improve diagnostic accuracy, and enhance patient outcomes (Dinani, Kowdley & Noureddin, 2021). In MASLD diagnosis, AI utilizes diverse data sources, including electronic health records, laboratory tests, imaging, and liver histopathology data (Li et al., 2021). AI-assisted ultrasound improves diagnostic accuracy, reduces reliance on operator experience, and minimizes subjectivity. Additionally, AI can enhance diagnostic efficiency, lower costs, and reduce the burden on sonographers without replacing them. Its integration is expected to significantly impact primary care, telemedicine, clinical decision support systems, and early intervention for disease progression.

Despite its promising benefits, current research on AI in MASLD diagnosis faces several limitations. First, many studies rely on retrospective, single-center data, typically characterized by small sample sizes and limited representativeness, leading to potential selection bias and reduced generalizability. Variability in diagnostic tools and operational procedures across settings also contributes to inconsistencies in model performance. Furthermore, AI models are often overfitted to training data, performing well on the data they were trained on but poorly when applied to new, unseen data. The absence of external validation exacerbates this issue. To address these challenges, AI models must undergo robust validation in multi-center, prospective studies involving diverse patient populations to ensure their consistent performance across various settings.

Another significant challenge is the lack of interpretability, commonly referred to as the “black box” problem. Many advanced AI models, particularly deep learning systems, process data through complex algorithms that are difficult for human experts to interpret. While these models often yield accurate predictions, they do so without providing clear explanations of how decisions are made. This lack of transparency is particularly problematic in healthcare, where clinicians must understand and justify diagnostic decisions. As a result, clinicians may hesitate to rely on AI recommendations, especially when they cannot validate the underlying logic behind a diagnosis or treatment suggestion. Furthermore, this lack of interpretability raises significant legal and ethical concerns. If an AI model makes a diagnostic error that harms a patient, determining accountability—whether it lies with the developer, the clinician using the tool, or the institution implementing it—becomes difficult. Enhancing explainable AI, which aims to make AI models more transparent and understandable, is crucial for addressing these concerns and promoting the adoption of AI in clinical settings.

The ethical implications of AI in healthcare are multifaceted, raising concerns about data privacy, bias, and accountability. AI systems require large volumes of sensitive patient data to function effectively, raising significant concerns about how this data is collected, stored, and utilized. Ensuring compliance with data protection regulations is critical for safeguarding patient privacy. As AI models become increasingly integrated into clinical decision-making, secure handling of data is essential to maintain patient trust and ensure system integrity. One of the most pressing ethical concerns is the potential for AI models to inherit biases from the data on which they are trained. If training datasets are not diverse or representative of global populations, AI systems could perpetuate or even exacerbate existing healthcare disparities. To mitigate these biases, it is essential that AI training datasets reflect a broad range of ethnicities, ages, genders, and socioeconomic backgrounds. The integration of AI in clinical practice also raises complex questions about responsibility and accountability. In the event of diagnostic errors or adverse patient outcomes caused by AI-assisted tools, it may be unclear who should be held accountable. This lack of clarity could undermine trust in AI systems and complicate medical malpractice frameworks. These concerns underscore the need for clear ethical guidelines and frameworks to govern AI deployment in healthcare.

The future potential of AI in MASLD diagnosis is vast, particularly with the integration of multi-modal data. In addition to traditional ultrasound images, AI is expected to incorporate clinical data, laboratory test results, and multi-omics information to further enhance diagnostic accuracy. Future research should focus on multi-center, prospective studies to gather high-quality, representative data and address current limitations in data quality. Moreover, there is a need to develop more transparent and interpretable models to improve clinicians’ trust in AI. Beyond early screening for MASLD, AI-assisted ultrasound may expand to diagnose other liver diseases, monitor disease progression, and evaluate treatment efficacy. With continued advancements, AI has the potential to play a crucial role in primary care and telemedicine, providing accessible, cost-effective diagnostic tools that support early intervention and precision treatment for liver diseases globally.

[1] Acharya UR, Raghavendra U, Fujita H, Hagiwara Y, Koh JEW, Jen Hong T, Sudarshan VK, Vijayananthan A, Yeong CH, Gudigar A, Ng KH. 2016. Automated characterization of fatty liver disease and cirrhosis using curvelet transform and entropy features extracted from ultrasound images. Computers in Biology and Medicine 79:250-258

[2] Acharya UR, Sree SV, Ribeiro R, Krishnamurthi G, Marinho RT, Sanches J, Suri JS. 2012. Data mining framework for fatty liver disease classification in ultrasound: a hybrid feature extraction paradigm. Medical Physics 39:4255-4264

[3] Atasever S, Azginoglu NUH, Terzi DS, Terzi R. 2023. A comprehensive survey of deep learning research on medical image analysis with focus on transfer learning. Clinical Imaging 94:18-41

[4] Basavarajappa L, Baek J, Reddy S, Song J, Tai H, Rijal G, Parker KJ, Hoyt K. 2021. Multiparametric ultrasound imaging for the assessment of normal versus steatotic livers. Scientific Reports 11:2655

[5] Biswas M, Kuppili V, Edla DR, Suri HS, Saba L, Marinhoe RT, Sanches JM, Suri JS. 2018. Symtosis: a liver ultrasound tissue characterization and risk stratification in optimized deep learning paradigm. Computer Methods and Programs in Biomedicine 155:165-177

[6] Byra M, Han A, Boehringer AS, Zhang YN, O’Brien WD, Erdman JW, Loomba R, Sirlin CB, Andre M. 2021. Liver fat assessment in multiview sonography using transfer learning with convolutional neural networks. Journal of Ultrasound in Medicine 41:175-184

[7] Byra M, Styczynski G, Szmigielski C, Kalinowski P, Michałowski Ł, Paluszkiewicz R, Ziarkiewicz-Wróblewska B, Zieniewicz K, Sobieraj P, Nowicki A. 2018. Transfer learning with deep convolutional neural network for liver steatosis assessment in ultrasound images. International Journal of Computer Assisted Radiology and Surgery 13:1895-1903

[8] Byrne CD, Targher G. 2015. NAFLD: a multisystem disease. Journal of Hepatology 62:S47-S64

[9] Cao W, An X, Cong L, Lyu C, Zhou Q, Guo R. 2019. Application of deep learning in quantitative analysis of 2-dimensional ultrasound imaging of nonalcoholic fatty liver disease. Journal of Ultrasound in Medicine 39:51-59

[10] Castera L, Friedrich-Rust M, Loomba R. 2019. Noninvasive assessment of liver disease in patients with nonalcoholic fatty liver disease. Gastroenterology 156:1264-1281

[11] Cha DI, Kang TW, Min JH, Joo I, Sinn DH, Ha SY, Kim K, Lee G, Yi J. 2021. Deep learning-based automated quantification of the hepatorenal index for evaluation of fatty liver by ultrasonography. Ultrasonography 40:565-574

[12] Che H, Brown LG, Foran DJ, Nosher JL, Hacihaliloglu I. 2021. Liver disease classification from ultrasound using multi-scale CNN. International Journal of Computer Assisted Radiology and Surgery 16:1537-1548

[13] Cheplygina V, De Bruijne M, Pluim JPW. 2019. Not-so-supervised: a survey of semi-supervised, multi-instance, and transfer learning in medical image analysis. Medical Image Analysis 54:280-296

[14] Chou T-H, Yeh H-J, Chang C-C, Tang J-H, Kao W-Y, Su IC, Li C-H, Chang W-H, Huang C-K, Sufriyana H, Su EC-Y. 2021. Deep learning for abdominal ultrasound: a computer-aided diagnostic system for the severity of fatty liver. Journal of the Chinese Medical Association 84:842-850

[15] Constantinescu EC, Udriştoiu A-L, Udriştoiu ŞC, Iacob AV, Gruionu LG, Gruionu G, Săndulescu L, Săftoiu A. 2020. Transfer learning with pre-trained deep convolutional neural networks for the automatic assessment of liver steatosis in ultrasound images. Medical Ultrasonography 23:135-139

[16] Dasarathy S, Dasarathy J, Khiyami A, Joseph R, Lopez R, McCullough AJ. 2009. Validity of real time ultrasound in the diagnosis of hepatic steatosis: a prospective study. Journal of Hepatology 51:1061-1067

[17] De Bruijne M. 2016. Machine learning approaches in medical image analysis: from detection to diagnosis. Medical Image Analysis 33:94-97

[18] Decharatanachart P, Chaiteerakij R, Tiyarattanachai T, Treeprasertsuk S. 2021. Application of artificial intelligence in non-alcoholic fatty liver disease and liver fibrosis: a systematic review and meta-analysis. Therapeutic Advances in Gastroenterology 14:17562848211062807

[19] Destrempes F, Gesnik M, Chayer B, Roy-Cardinal M-H, Olivié D, Giard J-M, Sebastiani G, Nguyen BN, Cloutier G, Tang A. 2022. Quantitative ultrasound, elastography, and machine learning for assessment of steatosis, inflammation, and fibrosis in chronic liver disease. PLOS ONE 17:e0262291

[20] Devarbhavi H, Asrani SK, Arab JP, Nartey YA, Pose E, Kamath PS. 2023. Global burden of liver disease: 2023 update. Journal of Hepatology 79:516-537

[21] Dillman JR, Thapaliya S, Tkach JA, Trout AT. 2022. Quantification of hepatic steatosis by ultrasound: prospective comparison with MRI proton density fat fraction as reference standard. American Journal of Roentgenology 219:784-791

[22] Dinani AM, Kowdley KV, Noureddin M. 2021. Application of artificial intelligence for diagnosis and risk stratification in NAFLD and NASH: the state of the art. Hepatology 74:2233-2240

[23] Dioguardi Burgio M, Imbault M, Ronot M, Faccinetto A, Van Beers BE, Rautou P-E, Castera L, Gennisson J-L, Tanter M, Vilgrain V. 2018. Ultrasonic adaptive sound speed estimation for the diagnosis and quantification of hepatic steatosis: a pilot study. Ultraschall in Der Medizin—European Journal of Ultrasound 40:722-733

[24] European Association for the Study of the Liver (EASL). 2016. EASL–EASD–EASO clinical practice guidelines for the management of non-alcoholic fatty liver disease. Journal of Hepatology 64:1388-1402

[25] Ferraioli G, Monteiro LBS. 2019. Ultrasound-based techniques for the diagnosis of liver steatosis. World Journal of Gastroenterology 25:6053-6062

[26] Han A, Byra M, Heba E, Andre MP, Erdman JW, Loomba R, Sirlin CB, O’Brien WD. 2020. Noninvasive diagnosis of nonalcoholic fatty liver disease and quantification of liver fat with radiofrequency ultrasound data using one-dimensional convolutional neural networks. Radiology 295:342-350

[27] Hardy R, Klepich J, Mitchell R, Hall S, Villareal J, Ilin C. 2023. Improving nonalcoholic fatty liver disease classification performance with latent diffusion models. Scientific Reports 13:21619

[28] Hosny A, Parmar C, Quackenbush J, Schwartz LH, Aerts HJWL. 2018. Artificial intelligence in radiology. Nature Reviews Cancer 18:500-510

[29] Hosseinzadeh Taher MR, Haghighi F, Gotway MB, Liang J. 2025. Large-scale benchmarking and boosting transfer learning for medical image analysis. Medical Image Analysis 102:103487

[30] Hwang SM, Cho KY. 2023. Noninvasive assessment of paediatric hepatic steatosis by using attenuation imaging. European Radiology 33:8353-8365

[31] Jeon SK, Lee JM, Joo I, Yoon JH, Lee G. 2023. Two-dimensional convolutional neural network using quantitative US for noninvasive assessment of hepatic steatosis in NAFLD. Radiology 307:e221510

[32] Johnson SI, Fort D, Shortt KJ, Therapondos G, Galliano GE, Nguyen T, Bluth EI. 2021. Ultrasound stratification of hepatic steatosis using hepatorenal index. Diagnostics 11:1443

[33] Karlas T, Petroff D, Sasso M, Fan J-G, Mi Y-Q, De Lédinghen V, Kumar M, Lupsor-Platon M, Han K-H, Cardoso AC, Ferraioli G, Chan W-K, Wong VW-S, Myers RP, Chayama K, Friedrich-Rust M, Beaugrand M, Shen F, Hiriart J-B, Sarin SK, Badea R, Jung KS, Marcellin P, Filice C, Mahadeva S, Wong GL-H, Crotty P, Masaki K, Bojunga J, Bedossa P, Keim V, Wiegand J. 2017. Individual patient data meta-analysis of controlled attenuation parameter (CAP) technology for assessing steatosis. Journal of Hepatology 66:1022-1030

[34] Kuppili V, Biswas M, Sreekumar A, Suri HS, Saba L, Edla DR, Marinhoe RT, Sanches JM, Suri JS. 2017. Extreme learning machine framework for risk stratification of fatty liver disease using ultrasound tissue characterization. Journal of Medical Systems 41:152

[35] Labyed Y, Milkowski A. 2020. Novel method for ultrasound-derived fat fraction using an integrated phantom. Journal of Ultrasound in Medicine 39:2427-2438

[36] Lee SS, Park SH, Kim HJ, Kim SY, Kim MY, Kim DY, Suh DJ, Kim KM, Bae MH, Lee JY, Lee SG, Yu ES. 2010. Non-invasive assessment of hepatic steatosis: prospective comparison of the accuracy of imaging examinations. Journal of Hepatology 52:579-585

[37] Li B, Tai D-I, Yan K, Chen Y-C, Chen C-J, Huang S-F, Hsu T-H, Yu W-T, Xiao J, Le L, Harrison AP. 2022. Accurate and generalizable quantitative scoring of liver steatosis from ultrasound images via scalable deep learning. World Journal of Gastroenterology 28:2494-2508

[38] Li Y, Wang X, Zhang J, Zhang S, Jiao J. 2021. Applications of artificial intelligence (AI) in researches on non-alcoholic fatty liver disease (NAFLD): a systematic review. Reviews in Endocrine and Metabolic Disorders 23:387-400

[39] Lin SC, Heba E, Wolfson T, Ang B, Gamst A, Han A, Erdman JW, O’Brien WD, Andre MP, Sirlin CB, Loomba R. 2015. Noninvasive diagnosis of nonalcoholic fatty liver disease and quantification of liver fat using a new quantitative ultrasound technique. Clinical Gastroenterology and Hepatology 13:1337-1345

[40] Lin Y-H, Wan Y-L, Tai D-I, Tseng J-H, Wang C-Y, Tsai Y-W, Lin Y-R, Chang T-Y, Tsui P-H. 2019. Considerations of ultrasound scanning approaches in non-alcoholic fatty liver disease assessment through acoustic structure quantification. Ultrasound in Medicine & Biology 45:1955-1969

[41] Liu Y, Yu W, Wang P, Huang Y, Li J, Li P. 2024. Deep learning with ultrasound images enhance the diagnosis of nonalcoholic fatty liver. Ultrasound in Medicine & Biology 50:1724-1730

[42] Miele L, Zocco MA, Pizzolante F, De Matthaeis N, Ainora ME, Liguori A, Gasbarrini A, Grieco A, Rapaccini G. 2020. Use of imaging techniques for non-invasive assessment in the diagnosis and staging of non-alcoholic fatty liver disease. Metabolism: Clinical and Experimental 112:154355

[43] Mihai Mihailescu D. 2013. Computer aided diagnosis method for steatosis rating in ultrasound images using random forests. Medical Ultrasonography 15:184-190

[44] Morid MA, Borjali A, Del Fiol G. 2021. A scoping review of transfer learning research on medical image analysis using ImageNet. Computers in Biology and Medicine 128:104115

[45] Nagy G, Munteanu M, Gordan M, Chira R, Iancu M, Crisan D, Mircea PA. 2015. Computerized ultrasound image analysis for noninvasive evaluation of hepatic steatosis. Medical Ultrasonography 17:431-436

[46] Neuschwander-Tetri BA. 2017. Non-alcoholic fatty liver disease. BMC Medicine 15:45

[47] Newsome PN, Sasso M, Deeks JJ, Paredes A, Boursier J, Chan W-K, Yilmaz Y, Czernichow S, Zheng M-H, Wong VW-S, Allison M, Tsochatzis E, Anstee QM, Sheridan DA, Eddowes PJ, Guha IN, Cobbold JF, Paradis V, Bedossa P, Miette V, Fournier-Poizat C, Sandrin L, Harrison SA. 2020. FibroScan-AST (FAST) score for the non-invasive identification of patients with non-alcoholic steatohepatitis with significant activity and fibrosis: a prospective derivation and global validation study. The Lancet Gastroenterology & Hepatology 5:362-373

[48] Nguyen TN, Podkowa AS, Park TH, Miller RJ, Do MN, Oelze ML. 2021. Use of a convolutional neural network and quantitative ultrasound for diagnosis of fatty liver. Ultrasound in Medicine & Biology 47:556-568

[49] Ozturk A, Kumar V, Pierce TT, Li Q, Baikpour M, Rosado-Mendez I, Wang M, Guo P, Schoen S, Gu Y, Dayavansha S, Grajo JR, Samir AE. 2023. The future is beyond bright: the evolving role of quantitative US for fatty liver disease. Radiology 309:e233146

[50] Ozturk A, Olson MC, Samir AE, Venkatesh SK. 2021. Liver fibrosis assessment: MR and US elastography. Abdominal Radiology 47:3037-3050

[51] Park J, Lee JM, Lee G, Jeon SK, Joo I. 2022. Quantitative evaluation of hepatic steatosis using advanced imaging techniques: focusing on new quantitative ultrasound techniques. Korean Journal of Radiology 23:13-29

[52] Pirmoazen AM, Khurana A, El Kaffas A, Kamaya A. 2020. Quantitative ultrasound approaches for diagnosis and monitoring hepatic steatosis in nonalcoholic fatty liver disease. Theranostics 10:4277-4289

[53] Pirmoazen AM, Khurana A, Loening AM, Liang T, Shamdasani V, Xie H, El Kaffas A, Kamaya A. 2022. Diagnostic performance of 9 quantitative ultrasound parameters for detection and classification of hepatic steatosis in nonalcoholic fatty liver disease. Investigative Radiology 57:23-32

[54] Powell EE, Wong VW-S, Rinella M. 2021. Non-alcoholic fatty liver disease. The Lancet 397:2212-2224

[55] Ribeiro RT, Tato Marinho R, Sanches JM. 2014. An ultrasound-based computer-aided diagnosis tool for steatosis detection. IEEE Journal of Biomedical and Health Informatics 18:1397-1403

[56] Rinella ME, Lazarus JV, Ratziu V, Francque SM, Sanyal AJ, Kanwal F, Romero D, Abdelmalek MF, Anstee QM, Arab JP, Arrese M, Bataller R, Beuers U, Boursier J, Bugianesi E, Byrne CD, Castro Narro GE, Chowdhury A, Cortez-Pinto H, Cryer DR, Cusi K, El-Kassas M, Klein S, Eskridge W, Fan J, Gawrieh S, Guy CD, Harrison SA, Kim SU, Koot BG, Korenjak M, Kowdley KV, Lacaille F, Loomba R, Mitchell-Thain R, Morgan TR, Powell EE, Roden M, Romero-Gómez M, Silva M, Singh SP, Sookoian SC, Spearman CW, Tiniakos D, Valenti L, Vos MB, Wong VW-S, Xanthakos S, Yilmaz Y, Younossi Z, Hobbs A, Villota-Rivas M, Newsome PN, NAFLD Nomenclature consensus group. 2023a. A multisociety Delphi consensus statement on new fatty liver disease nomenclature. Journal of Hepatology 79:1542-1556

[57] Rinella ME, Neuschwander-Tetri BA, Siddiqui MS, Abdelmalek MF, Caldwell S, Barb D, Kleiner DE, Loomba R. 2023b. AASLD practice guidance on the clinical assessment and management of nonalcoholic fatty liver disease. Hepatology 77:1797-1835

[58] Saba L, Dey N, Ashour AS, Samanta S, Nath SS, Chakraborty S, Sanches J, Kumar D, Marinho R, Suri JS. 2016. Automated stratification of liver disease in ultrasound: an online accurate feature classification paradigm. Computer Methods and Programs in Biomedicine 130:118-134

[59] Sanabria SJ, Pirmoazen AM, Dahl J, Kamaya A, El Kaffas A. 2022. Comparative study of raw ultrasound data representations in deep learning to classify hepatic steatosis. Ultrasound in Medicine & Biology 48:2060-2078

[60] Santhosh Reddy D, Bharath R, Rajalakshmi P. 2018. A novel computer-aided diagnosis framework using deep learning for classification of fatty liver disease in ultrasound imaging. In: IEEE 20th international conference on e-health networking. Piscataway. IEEE.

[61] Shi X, Ye W, Liu F, Zhang R, Hou Q, Shi C, Yu J, Shi Y. 2019. Ultrasonic liver steatosis quantification by a learning-based acoustic model from a novel shear wave sequence. BioMedical Engineering OnLine 18:121

[62] Subramanya MB, Kumar V, Mukherjee S, Saini M. 2014. A CAD system for B-mode fatty liver ultrasound images using texture features. Journal of Medical Engineering & Technology 39:123-130

[63] Tahmasebi A, Wang S, Wessner CE, Vu T, Liu JB, Forsberg F, Civan J, Guglielmo FF, Eisenbrey JR. 2023. Ultrasound-based machine learning approach for detection of nonalcoholic fatty liver disease. Journal of Ultrasound in Medicine 42:1747-1756

[64] Tamaki N, Ajmera V, Loomba R. 2021. Non-invasive methods for imaging hepatic steatosis and their clinical importance in NAFLD. Nature Reviews Endocrinology 18:55-66

[65] Tang A, Destrempes F, Kazemirad S, Garcia-Duitama J, Nguyen BN, Cloutier G. 2018. Quantitative ultrasound and machine learning for assessment of steatohepatitis in a rat model. European Radiology 29:2175-2184

[66] Targher G, Byrne CD, Tilg H. 2024. MASLD: a systemic metabolic disorder with cardiovascular and malignant complications. Gut 73:691-702

[67] Taru M-G, Neamti L, Taru V, Procopciuc LM, Procopet B, Lupsor-Platon M. 2023. How to identify advanced fibrosis in adult patients with non-alcoholic fatty liver disease (NAFLD) and non-alcoholic steatohepatitis (NASH) using ultrasound elastography—a review of the literature and proposed multistep approach. Diagnostics 13:788

[68] Vianna P, Calce S-I, Boustros P, Larocque-Rigney C, Patry-Beaudoin L, Luo YH, Aslan E, Marinos J, Alamri TM, Vu K-N, Murphy-Lavallée J, Billiard J-S, Montagnon E, Li H, Kadoury S, Nguyen BN, Gauthier S, Therien B, Rish I, Belilovsky E, Wolf G, Chassé M, Cloutier G, Tang A. 2023. Comparison of radiologists and deep learning for US grading of hepatic steatosis. Radiology 309:e230659

[69] Wang K, Bao J, Wang M, Yu Y, Wang M. 2025. Prospective comparative diagnostic performance of quantitative ultrasound parameters for the measurement of hepatic steatosis in a biopsy-proven metabolic dysfunction associated steatotic liver disease cohort. British Journal of Radiology 98:160-169

[70] Wear KA, Han A, Rubin JM, Gao J, Lavarello R, Cloutier G, Bamber J, Tuthill T. 2022. US backscatter for liver fat quantification: an AIUM-RSNA QIBA pulse-echo quantitative ultrasound initiative. Radiology 305:526-537

[71] Wever M, Tornede A, Mohr F, Hullermeier E. 2021. AutoML for multi-label classification: overview and empirical evaluation. IEEE Transactions on Pattern Analysis and Machine Intelligence 43:3037-3054

[72] Wong VW-S, Ekstedt M, Wong GL-H, Hagström H. 2023. Changing epidemiology, global trends and implications for outcomes of NAFLD. Journal of Hepatology 79:842-852

[73] Xie X, Niu J, Liu X, Chen Z, Tang S, Yu S. 2021. A survey on incorporating domain knowledge into deep learning for medical image analysis. Medical Image Analysis 69:101985