Prognostic value of a modified pathological staging system for gastric cancer based on the number of retrieved lymph nodes and metastatic lymph node ratio

Guiru Jia; Dagui Zhou; Xiao Tang; Jianpei Liu; Purun Lei

doi:10.7717/peerj.18165

Prognostic value of a modified pathological staging system for gastric cancer based on the number of retrieved lymph nodes and metastatic lymph node ratio

Guiru Jia, Dagui Zhou, Xiao Tang, Jianpei Liu , Purun Lei

Department of Gastrointestinal Surgery, Third Affiliated Hospital of Sun Yat-Sen University, Guangzhou, Guangdong, China

DOI: 10.7717/peerj.18165

Published: 2024-10-01
Accepted: 2024-09-02
Received: 2023-07-13

Academic Editor: Mehmet Burak Ateş

Subject Areas: Gastroenterology and Hepatology, Oncology, Pathology
Keywords: Gastric cancer, Staging system, Lymph node ratio, Examined lymph node, Prognosis

Copyright: © 2024 Jia et al.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.

Cite this article: Jia G, Zhou D, Tang X, Liu J, Lei P. 2024. Prognostic value of a modified pathological staging system for gastric cancer based on the number of retrieved lymph nodes and metastatic lymph node ratio. PeerJ 12:e18165 https://doi.org/10.7717/peerj.18165

The authors have chosen to make the review history of this article public.

Abstract

Aim

The prognosis for gastric cancer (GC) remains grim, underscoring the importance of accurate staging and treatment. Given the potential benefits of using lymph node ratio (LNR) for improved prognostication and treatment planning, it is critical to incorporate examined lymph nodes (ELN) count in an integrated GC staging system.

Methods

Patients data from the Surveillance, Epidemiology, and End Results (SEER) database between 2010 and 2015 was utilized as training set. The Mantel-Cox survival test was used to calculate chi-square values for 40 LNR segments with a 0.025 interval, defining a novel LNR-based N (rN) classification based on the cutoff points. A revised AJCC (rAJCC) staging system was established by replacing the 8th AJCC N staging with a rN classification. The relationship between the ELN count and prognosis or positive lymph node detection was conducted by using multivariable models. The series of the odds ratios and hazard ratios were fitted with a locally weighted scatterplot smoothing (LOWESS) smoother, and the structural break points were determined by Chow test to clarify an optimal minimum ELN count. The integrated GC staging system incorporated both rAJCC system and the ideal ELN count. Discriminatory ability and prognostic homogeneity of the rAJCC and integrated staging system was compared with AJCC staging system in the SEER validation set (2016–2017), the Cancer Genome Atlas Program (TCGA) database, and the Third Affiliated Hospital of Sun Yat-sen University database.

Results

The current study found that LNR and ELN count are both significantly associated with the prognosis of GC patients (HR = 0.98, p < 0.001 and HR = 2.51, p < 0.001). Four peaks of the chi-square value were identified as LNR cut-off points at 0.025, 0.175, 0.45 and 0.6 to define a novel rN stage. In comparison to the 8th AJCC staging system, the rAJCC staging system demonstrated significant prognostic advantages and discriminatory ability in the training set (5-Y OS AUC: 71.7 vs. 73.0; AIC: 57,290.7 vs. 57,054.9). The superiority of the rAJCC staging system was confirmed in all validation sets. Using a LOWESS smoother and Chow test, a threshold ELN count of 30 was determined to maximum improvement in the prognosis of node-negative patients without downgrading due to potential metastasis, while also maximizing the detection efficiency of at least one involved lymph node. The integrated staging system, combining the refined rAJCC classification with an optimized ELN count threshold, has demonstrated superior discriminatory performance compared to the standalone rAJCC or the traditional AJCC system.

Conclusion

The development of a novel GC staging system, which integrated the LNR-based N classification and the minimum ELN count, has exhibited superior prognostic accuracy, holding promise as a valuable asset in the clinical management of GC. However, it is crucial to recognize the limitations from the retrospective database, which should be addressed in subsequent analyses.

Introduction

Gastric cancer (GC) is the fifth most common cancer globally (Bray et al., 2024) and the sixth most common malignancy in China (Zheng et al., 2024). Radical resection, in conjunction with adjuvant treatment based on pathological staging, provides a potential for a cure, especially in cases of early-stage gastric cancer (Smyth et al., 2020; Rosa et al., 2022). However, the prognosis remains poor, highlighting the need for accurate staging and appropriate treatment (Degiuli et al., 2021).

The tumor-node-metastasis (TNM) staging system by the American Joint Committee on Cancer (AJCC) for GC globally evaluates primary tumor invasiveness and size (T), regional lymph node involvement (N), and absence of distant metastasis (M) (In et al., 2018). Lymph node involvement is a significant factor in predicting patients’ recurrence and survival, but assessing it solely based on the number of positive lymph nodes can lead to inaccuracy due to incomplete lymphadenectomy (Kinami, Saito & Takamura, 2022; Zeng et al., 2023b). The lymph node ratio (LNR) denotes the proportion of metastatic regional lymph nodes (LN) to the total number of examined lymph nodes (ELN) obtained from the specimen (Yamashita et al., 2016). LNR has been shown to be a better prognostic indicator than the number of positive lymph nodes or the total number of ELNs in multiple malignancies, including gastric cancer (Kano et al., 2020; Kotecha et al., 2022; Ergenç et al., 2023). The utilization of an LNR-based modified staging system has demonstrated a higher accuracy in predicting survival compared to the 8th edition of the AJCC staging system (Huang et al., 2020; Yin et al., 2024). However, all previous studies have overlooked the significance of total ELN in accurately staging cancer (Gu et al., 2020; Zeng et al., 2023b). While AJCC guidelines suggest assessing at least 16 LNs per patient, it is uncertain how many ELN are needed for reliable stage assignment and strong prognostic value (In et al., 2018). A higher ELN count can indicate a more thorough lymphadenectomy and aid in detecting metastatic LNs (Macalindong et al., 2018). Therefore, a combined assessment of both ELN count and lymph node involvement evaluation is required.

The Surveillance, Epidemiology, and End Results (SEER) Program collects cancer information that encompass 50% of the U.S. population, including patients with gastric cancer (Daly & Paquette, 2019). However, there were no studies that incorporated both ELN and LNR in the revised staging system for gastric cancer. In the current study, the SEER database was used for establishing a staging system by replacing the 8th AJCC N classification with the LNR classification and incorporating minimum ELN count. Data from the Cancer Genome Atlas Program (TCGA) and Third Affiliated Hospital of Sun Yat-sen University was used for external validation for the novel system.

Methods

Study population and data collection

Clinical data from the US Surveillance, Epidemiology, and End Results (SEER) Program from 2010–2015 (https://seer.cancer.gov/) was extracted and analyzed as training set, data from 2016–2017 was adopted as internal validation set. Data from The Cancer Genome Atlas Program (TCGA) (https://portal.gdc.cancer.gov/) and prognosis data from the gastrointestinal surgery department, Third Affiliated Hospital of Sun Yat-sen University were applied as external validation sets.

Screening criteria for gastric cancer cases were as follow: exclusion of cases with only autopsy or death certificate, cases where initial tumor location was not stomach, patients with stage 0 and stage IV, cases without radical surgery, non-adenocarcinoma cases, death cases within 1 month after operation, and cases with unknown lymph node information and AJCC TNM stage.

The study analyzed various factors such as age of diagnosis (<50 years, 50–69 years, >69 years), gender, race (white, black, other), AJCC T stage (T1–T4b), AJCC TNM stage (I–III), primary tumor location (stomach body, antrum/pylorus, cardia/fundus, greater gastric recurve, lesser gastric recurve, overlapping area, NOS), clinical features such as tumor size (≥5 cm, <5 cm, unknown), tumor grade (I–IV), chemotherapy, radiotherapy, number of lymph nodes retrieved and number of metastases, and lymph node positive rate. The populations of American Indian/Alaskan and Asian/Pacific Islander were classified as “other” due to small sample sizes. Tumor grade was also analyzed, with grades I–IV representing highly differentiated, moderately differentiated, poorly differentiated, and signed-ring cell carcinoma, respectively. Overall survival (OS) is the time from cancer diagnosis to death from any cause, while disease-specific survival (DSS) is the time from cancer diagnosis to death specifically due to the disease.

Statistical analysis

The chi-square test was used to compare differences between categorical variables, while the t-test was used for continuous variables. Univariate and multivariate Cox regression were used to examine the association between prognosis and the covariates. The Kaplan-Meier method is used to compare overall OS and DSS among groups. Hazard ratios (HR) for mortality were reported following adjusting for covariates including age, year of diagnosis, gender, race, tumor features (site, size, and grade).

The study used Mantel-Cox survival test to calculate chi-square values for 40 segments of LNR with a 0.025 interval, identified four peaks as cutoff points, and defined LNR-based N (rN) classification based on these cutoff points. Bootstraps with 1,000 resamples was conducted to internally validate the rN staging system. Patients included in the study were then redistributed according to the revised AJCC (rAJCC) staging system. We compared the performance of the rAJCC system with that of the 8th AJCC staging system in terms of discriminatory ability and the prognostic homogeneity. These comparisons were assessed using the area under the receiver operating characteristic curve (AUC), Akaike’s information criterion (AIC), and Bayesian Information Criterion (BIC). A lower AIC value or a higher AUC indicates a stronger discriminatory capacity of the staging system.

The proper threshold of ELN count analysis was conducted in two steps. COX regression firstly performed based on OS and DSS to examine the HR of each ELN count in patients with negative LNs. Patients with only one ELN were used as a reference, HR value for each ELN count group for both OS and DSS was analyzed using locally weighted scatterplot smoothing (LOWESS) scatter curve fitting. The minimum count of ELNs was determined by Chow test at which the slope of the curve changes significantly. Logistic regression analysis was then performed based on patients’ data with negative or only one LN metastasis, using node status as the outcome variable. Patients with only one ELN was defined as reference. LOWESS and Chow test help draw the odds ratio curve of each ELN count group and defined the ideal cutoff LN count for maximum detection efficacy of at least one involved LN.

The minimum ELN count and the rAJCC staging system merged to develop the integrated GC staging system. We compared the performance of the integrated GC staging system to that of the 8th AJCC staging system in terms of discriminatory ability and prognostic homogeneity. These aspects were evaluated using metrics such as AUC, AIC and BIC.

A significance level of p < 0.05 was used for all data analysis. Statistical analyses were conducted using IBM SPSS Statistics for Windows v. 20.0 (IBM Corp., Armonk, NY, USA) and R version 3.6.2 software (Bell Laboratories, Murray Hill, NJ, USA).

The current study is performed following the principles from the declaration of Helsinki. Patient from our center was informed before database enrollment individually, data from the TCGA database were publicly available and de-identified. The study was approval by the institutional review board of the Third Affiliated Hospital of Sun Yat-sen University for human data analysis as No. II2023-062-01.

Results

Patient characteristics

The study analyzed data from 8,137 patients in the SEER database, 277 patients in the TCGA database, and 719 patients from the authors’ center. The SEER patients were divided into training (2010–2015) and validation (2016–2017) sets based on years of diagnosis. The selection process is shown in Fig. 1. The datasets utilized in our analysis were designated as follows: the training set as dataset 1, the validation set as dataset 2, the TCGA set as dataset 3, and the data from the author’s institution as dataset 4.

Figure 1: Flow chart for patient selection.

Download full-size image

DOI: 10.7717/peerj.18165/fig-1

The majority of patients in the training set, validation set, TCGA set, and data from author’s center were over 50 years old (90.35%, 90.15%, 92.42% and 79.28%) and had moderate to poor differentiation (85.72%, 84.15%, 97.12% and 90.41%). In the datasets derived from the SEER database, namely dataset 1 and dataset 2, the majority of patients identified as White, comprising 65.67% and 62.88%, respectively. This demographic distribution was mirrored in the TCGA dataset 3, where the White ethnicity constituted 68.23% of the patient population. In contrast, all patients from our single-center study were of Chinese ethnicity, with no representation from other ethnic groups.

Regarding gastric tumor size, approximately half of the patients in the SEER and TCGA datasets had tumors measuring less than 5 cm, with percentages of 56.54%, 56.66%, and 48.38%, respectively. In our single-center study, this proportion was higher, at 63.00%. The primary tumor sites included the body, antrum/pylorus, cardia/fundus, greater curvature, lesser curvature, overlapping regions, and stomach without detailed specification. Notably, the cardia/fundus was the most common primary tumor site in the SEER and TCGA datasets, whereas in our single-center study, the antrum/pylorus was the predominant site. Most patients were diagnosed with T3 or T4 gastric cancer, accounting for 59.49%, 61.09%, 69.32%, and 67.32% of cases. Furthermore, patients found to have node-positive disease according to the 8th AJCC staging criteria, account for 56.53%, 57.39%, 69.31%, and 60.08% in each group. The median ELN counts were higher in the data from our center (35 [IQR26-45]) than in the other sets (16 [IQR10-24], 18 [IQR12-27], 17[IQR10-31]), while the PLN counts were similar across all sets (1 [IQR 0, 4], 0 [IQR 0, 3], 2 [IQR 0, 7] and 2 [IQR 0, 7]). In terms of treatment modalities, across dataset 1, dataset 2, and dataset 3, 56.95%, 61.42%, 35.02% and 59.25% of patients, respectively, received postoperative chemotherapy. Radiotherapy was administered to 37.77%, 33.26%, and 10.83% of patients in the respective datasets, none underwent radiotherapy in our single center. The median follow-up time was longest in the training set (50 months, [19, 77]) and data from author’s center (45 months, [20, 73]). Detailed clinicopathologic characteristics can be found in Table S1.

Univariate and multivariate analysis

Prognosis data and related variables from the training set were analyzed. As shown in Table 1, univariate analysis revealed that male patients (1.13 [1.05–1.21], p = 0.001), age over 69 years old (1.54, [1.36–1.75], p < 0.001), non-white/black patients (p < 0.001), degree of differentiation (p < 0.001), tumor size ≥5 cm (1.68, [1.57–1.80], p < 0.001), tumor locate at cardia/fundus (1.26, [1.11–1.44], p < 0.001), depth of tumor invasion (p < 0.001), N stage (p < 0.001), AJCC pathological classification (p < 0.001), chemotherapy (1.28, [1.20–1.37], p < 0.001), radiotherapy (1.28, [1.19–1.37], p < 0.001), ELN count (0.99, [0.99–1.00], p < 0.001), PLN count (1.06, [1.06–1.06], p < 0.001), and LNR (7.84, [7.05–8.72], p < 0.001) were significantly associated with overall survival of patients diagnosed with gastric cancer patients in dataset 1.

Table 1:

Univariate and multivariate Cox analyses of overall survival (OS) in dataset 1.

(OS) Characteristics	Univariate analysis		Multivariate analysis
(OS) Characteristics	HR (95% CI)	p-value	HR (95% CI)	p-value
Sex
Female	1	Ref.	1	Ref.
Male	1.13 [1.05–1.21]	0.001	1.14 [1.06–1.23]	0.001
Age
<50	1	Ref.	1	Ref.
50–69	1.09 [0.96–1.23]	0.190	1.18 [1.04–1.34]	0.009
>69	1.54 [1.36–1.75]	<0.001	1.88 [1.66–2.14]	<0.001
Year
2010	1	Ref.	/	/
2011	1.09 [0.97–1.21]	0.137	/	/
2012	0.99 [0.89–1.11]	0.882	/	/
2013	1.01 [0.91–1.14]	0.799	/	/
2014	0.99 [0.89–1.12]	0.927	/	/
2015	1.04 [0.92–1.18]	0.506	/	/
Race
White	1	Ref.	1	Ref.
Black	0.99 [0.90–1.10]	0.875	1.11 [1.00–1.23]	0.054
Other	0.75 [0.69–0.81]	<0.001	0.78 [0.71–0.85]	<0.001
Unknown	0.20 [0.08–0.53]	0.001	0.26 [0.10–0.70]	0.008
Grade
I	1	Ref.	1	Ref.
II	1.55 [1.33–1.82]	<0.001	1.28 [1.09–1.50]	0.002
III	2.24 [1.93–2.61]	<0.001	1.51 [1.29–1.76]	<0.001
IV	2.18 [1.64–2.90]	<0.001	1.44 [1.08–1.92]	0.013
Unknown	1.23 [0.98–1.55]	0.068	1.04 [0.83–1.31]	0.744
Tumor size
<5 cm	1	Ref.	1	Ref.
≥5 cm	1.68 [1.57–1.80]	<0.001	1.10 [1.02–1.19]	0.014
Unknown	1.14 [1.01–1.27]	0.029	1.09 [0.97–1.22]	0.154
Primary site
Body	1	Ref.	1	Ref.
Antrum/pylorus	1.14 [1.00–1.30]	0.052	1.04 [0.91–1.19]	0.525
Cardia/fundus	1.26 [1.11–1.44]	<0.001	1.39 [1.22–1.59]	<0.001
Greater curvature	1.21 [0.99–1.48]	0.057	1.01 [0.83–1.24]	0.920
Lesser curvature	1.10 [0.94–1.28]	0.235	1.07 [0.92–1.25]	0.382
Overlapping regions	1.39 [1.17–1.65]	<0.001	1.03 [0.87–1.23]	0.742
Stomach NOS	1.31 [1.10–1.56]	0.002	1.14 [0.96–1.36]	0.141
8th AJCC T stage
T1	1	Ref.	1	Ref.
T2	1.58 [1.39–1.79]	<0.001	1.26 [1.08–1.47]	0.003
T3	2.68 [2.43–2.95]	<0.001	1.58 [1.33–1.88]	<0.001
T4a	4.20 [3.75–4.71]	<0.001	2.01 [1.66–2.44]	<0.001
T4b	4.83 [4.14–5.63]	<0.001	2.41 [1.93–3.02]	<0.001
8th AJCC TNM stage
I	1	Ref.	1	Ref.
II	1.93 [1.74–2.14]	<0.001	1.34 [1.14–1.58]	<0.001
III	3.87 [3.53–4.24]	<0.001	1.70 [1.40–2.07]	<0.001
Chemotherapy
No/Unknown	1	Ref.	1	Ref.
Yes	1.28 [1.20–1.37]	<0.001	0.76 [0.70–0.84]	<0.001
Radiotherapy
No/Unknown	1	Ref.	1	Ref.
Yes	1.28 [1.19–1.37]	<0.001	0.98 [0.90–1.06]	0.576
The number of ELNs	0.99 [0.99–1.00]	<0.001	0.98 [0.98–0.99]	<0.001
The number of pLNs	1.06 [1.06–1.06]	<0.001	1.03 [1.02–1.04]	<0.001
LNR	7.84 [7.05–8.72]	<0.001	2.51 [2.04–3.08]	<0.001

DOI: 10.7717/peerj.18165/table-1

The following factors were independently correlated with poorer OS using multivariate analysis, male patients (HR = 1.14, p = 0.001); patients between 50 to 69 years old (HR = 1.18, p = 0.009) and over 69 years old (HR = 1.88, p < 0.001); non-white/black patients (HR = 0.78, p < 0.001); tumor diameter ≥5 cm (HR = 1.10, p = 0.014); tumors locate at cardia/fundus area (HR = 1.39, p < 0.001); with advancing of pT and pTNM classification (p < 0.05). Furthermore, chemotherapy could help improve the overall prognosis (HR = 0.76, p < 0.001). Higher ELN count was significantly associated with better prognosis of GC (HR = 0.98, p < 0.001), while increasing PLN count and LNR were adverse prognostic factors (HR = 1.03, p < 0.001 and HR = 2.51, p < 0.001). The detailed data was shown in Table 1.

LNR classification and revised GC staging evaluation

LNR was then divided into 40 segments using a 0.025 interval, and chi-square values were calculated through Mantel-Cox survival test between adjacent segments in patients from the training set. Four peaks of the chi-square value were identified as cut-off points at 0.025, 0.175, 0.45, and 0.6 (p < 0.05), as shown in Table S2. The rN classification was used to distinguish five groups: rN0 (0 ≤ LNR < 0.025), rN1 (0.025 ≤ LNR < 0.175), rN2 (0.175 ≤ LNR < 0.45), rN3a (0.45 ≤ LNR < 0.6), and rN3b (0.6 ≤ LNR ≤ 1) based on LNR values. An 8th AJCC N classification was replaced with the corresponding rN classification to develop a rAJCC staging system with rI, rII, and rIII stages.

According to the 8th AJCC, N classification was divided into N0, N1, N2, N3a, and N3b with percentages of 2,761 (43.47%), 1,558 (24.53%), 1,044 (16.44%), 717 (11.29%) and 271(4.27%), respectively. In the revised classification system, the rN categories were distributed as 3,192 (50.26%) for N0, 1,204 (18.96%) for N1, 1,023 (16.11%) for N2, 328 (5.16%) for N3a, and 604 (9.51%) for N3b.

Patients were initially classified into AJCC stage categories of I, II, and III with percentages of 28.99%, 28.61%, and 42.40% respectively. The patients were then regrouped using rAJCC staging system resulting in 32.96% classified as rI, 35.18% as rII, and 31.87% as rIII. atients from SEER validation set, TCGA validation set, and our single center were regrouped subsequently according to the modified N classification.

The revised LNR-based rAJCC system, as illustrated in Fig. 2, it has validated the clinical relevance and effectiveness. The rAJCC system effectively stratifies patient prognoses across all evaluated datasets based on the LNR-based N classification (panels a, c, e, g), exhibiting statistically significant differences (p < 0.001). When patients were regrouped according to the rAJCC staging system, survival plots (panels b, d, f, h) remained distinct and showed significant statistical differences (p < 0.001). The detailed OS data and comparison results were shown in Table S3.

Figure 2: Comparison of Kaplan-Meier survival curves for four datasets depicted according to the rN classification or rAJCC staging system.
The Kaplan-Meier survival curves of patients with gastric cancer in the (A) dataset 1, (C) dataset 2, (E) dataset 3 and (G) dataset 4 were depicted according to the rN classification. The Kaplan-Meier survival curves of patients with gastric cancer in the (B) dataset 1, (D) dataset 2, (F) dataset 3 and (H) dataset 4 were depicted according to the rAJCC staging system.

Download full-size image

DOI: 10.7717/peerj.18165/fig-2

When evaluating the performance of two models, a lower AIC value suggests a model that fits the data well while balancing model complexity, whereas a lower BIC value implies a more stringent control over complexity, thus reducing the risk of overfitting. In our analysis, the rAJCC staging system demonstrated superior statistical performance over the 8th AJCC staging system, with both AIC (57,042.6 vs. 57,278.3) and BIC (57,054.9 vs. 57,290.7) values being lower in the primary training dataset. In the current study, we chose to depict the AUC across all time intervals using box plots, rather than focusing solely on the 3-year and 5-year AUCs. This methodological choice provides a more nuanced view of the rAJCC staging system’s predictive capabilities over time, as opposed to the 8th edition AJCC staging system for gastric cancer, across multiple datasets. The rAJCC staging system showed a significantly higher AUC (73%, CI [71.7–74.2%]) compared to the 8th AJCC staging system (71.7%, CI [70.4–72.9%]) in primary training dataset 1 indicating greater discriminatory power, as illustrated in Fig. 2 and Table S4.

Further validation of the model using validation set from SEER (2016–2017), data from The Cancer Genome Atlas (TCGA), and a single-center dataset consistently demonstrated the rAJCC staging system’s enhanced discriminatory power. This finding is corroborated by the visual representation in Fig. 3 and the numerical data presented in Table 2.

Figure 3: Performance of the rAJCC staging systems compared with the 8th AJCC staging system in four datasets.
Performance of the rAJCC staging systems compared with the 8th AJCC staging system in the (A) dataset 1, (B) dataset 2, (C) dataset 3 and (D) dataset 4.

Download full-size image

DOI: 10.7717/peerj.18165/fig-3

Table 2 :

Comparison of the performance of the 8th AJCC and rAJCC classifications in the datasets.

Staging system	AIC	BIC
Dataset 1
AJCC	5,7278.3	57,290.7
rAJCC	5,7042.6	57,054.9
Dataset 2
AJCC	8,291.7	8,300.4
rAJCC	8,222.2	8,230.9
Dataset 3
AJCC	1,061.6	1,066.9
rAJCC	1,056.4	1,061.8
Dataset 4
AJCC	1,966.4	1,972.6
rAJCC	1,947.5	1,953.7

DOI: 10.7717/peerj.18165/table-2

Ideal number of ELNs analysis based on LN involvement status and survival

The ELN count was assumed to be similar between radical total and distal gastric resection due to insufficient data on the surgical procedures, despite the theoretical difference in LN numbers harvested from the different lymphadenectomy ranges. The optimal threshold for ELN count analysis was established through a two-step approach. COX regression was initially applied, based on OS and DSS, to assess the HR for each ELN count in patients with negative LNs, using patients with a single ELN as the reference. The HR values for each ELN count group were analyzed with LOWESS scatter curve fitting. The Chow test identified the minimum ELN count where the curve’s slope indicated a significant change. Logistic regression was then conducted on data from patients with negative or single LN metastasis, defining node status as the outcome variable and patients with a single ELN as the reference. The LOWESS and Chow test were utilized to plot the odds ratio curve for each ELN count group, identifying the ideal cutoff LN count for maximizing the detection efficacy of at least one involved LN.

Patients with negative LN was analyzed first, multivariate analysis have proved a greater number of LN was examined, the risk of potentially positive LN decreases thus improved the prognosis. HR of each ELN count compared with only one ELN (as reference) was analyzed using Cox proportional hazards regression model to determine the effect of ELN number on OS/DSS, after adjusting for other significant prognostic factors. LOWESS analysis was utilized to visualize the survival curves. Structural break ELN count was determined by Chow test as significant curvature change (p < 0.05) in the HR curve, which was minimum ELN count to obtain survival benefit, as shown in Figs. 4A, 4C. In the current study, curve of OS revealed 30 ELNs at least to guarantee survival benefit while which was 30 in DSS curve in Figs. 4B, 4D.

Figure 4: Ideal number of ELNs analysis based on HR/OR values of each ELN count on OS/DSS.
(A, C) The HR values of the number of retrieved lymph nodes (LNs) from the Cox multivariate regression were fitted by the Lowess curves. (B, D) Describing the slope change of Lowess curves with Chow tests. (E) The OR values of the number of retrieved LNs from the Logistic multivariate regression were fitted by the Lowess curve. (F) Describing the slope change of Lowess curve ‘e’ with Chow tests.

Download full-size image

DOI: 10.7717/peerj.18165/fig-4

With regard to the correlation between ELN count and positive LNs identified, a greater ELN count was associated with a greater number of PLNs. Patients from SEER (2010–2015) cohort was then redistributed into node-negative and only one positive LN group. LN status was then assessed by correlating the ELN number and the proportion of each node stage category (node negative vs. one node positive) by using a binary logistic regression model after adjusting for other potential confounders. The curves of odds ratios (ORs; LN involvement) were fitted by using a LOWESS smoother. The minimum ELN count in detecting at least one involved LN was clarified using Chow test (p < 0.05) as 29 lymph nodes. Both approaches indicated a minimum ELN of 30 was necessary. All curved and Chow test results were shown in Fig. 4.

Integration and validation of both ideal ELN count and LNR GC staging system

Patients with an ELN count of 30 or more were identified from each dataset, were categorized as Internal validation set-1, Internal validation set-2, External validation set-1, and External validation set-2, and subsequently reassessed using the rAJCC staging system. All validation datasets, both internal and external, showed a significantly different prognosis (p < 0.001) and improved HR performance compared to the 8th edition AJCC system in patients with ELN > 30, as illustrated in Fig. 5 (panels a, b, c, d) and detailed in Table S5. The AUC of the novel classification significantly exceeded that of the 8th AJCC classification, with a clear advantage in terms of AIC and BIC, as depicted in Fig. 5 (panels e, f, g, h), and summarized in Table 3. Furthermore, for patients with ELN < 30, the rAJCC staging system still showed superiority than the 8^th AJCC system, as shown in Fig. S1.

Figure 5: Comparison of Kaplan-Meier survival curves for four validation sets depicted according to the rAJCC staging system.
The Kaplan-Meier survival curves of patients with gastric cancer in the (A) Internal validation set-1, (B) Internal validation set-2, (C) External validation set-1 and (D) External validation set-2 were depicted according to the rAJCC staging system.Performance of the rAJCC staging systems compared with the 8th AJCC staging system in the (E) Internal validation set-1, (F) Internal validation set-2, (G) External validation set-1 and (H) External validation set-2.

Download full-size image

DOI: 10.7717/peerj.18165/fig-5

Table 3:

Comparison of the performance of the 8th AJCC and rAJCC staging systems in the validation sets in patient with ELN count ≥ 30.

Staging system	AIC	BIC
Internal validation set-1
AJCC	6,168.6	6,177.0
rAJCC	6,143.5	6,151.9
Internal validation set-2
AJCC	1,270.2	1,275.7
rAJCC	1,257.5	1,263.0
External validation set-1
AJCC	185.5	188.1
rAJCC	172.6	175.2
External validation set-2
AJCC	978.6	983.6
rAJCC	966.0	971.1

DOI: 10.7717/peerj.18165/table-3

Furthermore, the integrated model, which combines the ELN count with the rAJCC staging system, was compared with the original rAJCC model without ELN count integration. Given the inability to evaluate long-term survival rates for patients registered between 2016 and 2017 in the SEER database, the comparative analysis was restricted to the TCGA database and our single-center dataset. The findings indicated that the integrated model achieved a higher AUC, outperforming the rAJCC system in terms of predictive accuracy, as demonstrated in Fig. S2.

Discussion

The current study found that LNR and ELN count are both associated with the prognosis of gastric cancer patients, consistent with previous studies. Higher ELN count suggests less residual micro-metastatic disease and can aid in precise staging, leading to more effective adjuvant therapy and improved outcomes. LNR was also shown to be a more accurate prognostic factor than AJCC N classification. LNR could be easily flattered when ELN was inadequate, but previous studies did not analyze ELN count as a parameter in revised staging system construction. An integration GC staging system for nonmetastatic GC was developed using LNR and ELN count incorporated into the 8th AJCC staging system. Results indicate that the integration GC staging system provides better prognosis and discriminatory capacity compared to the 8th AJCC staging system, as evidenced by improved AUC square and better AIC/BIC value.

LNR was analyzed variously in previous studies. The classification applied by Zeng et al. (2023a), and He et al. (2022) divided lymph node status into three categories: low LNR (0%–20%/25%), middle-LNR (20/25%–50%), high-LNR (>50%). Kano et al. (2020) and Jiang et al. (2022) classifications stratified patients into three subgroups with different cut-offs: N0 (0%–10/30%), N1 (10/30%–25/45%), N3 (>30/45%). The classification applied by Zhang et al. (2014) and Chen et al. (2022) stratified patients into four groups: N0 (0%), N1 (1–20%), N2 (21–50%/69%) and N3 (>50%/70%). In this study, we initially conducted statistical analyses using data from dataset 1 to precisely determine the threshold values for the LNR to define a novel N stage. Subsequently, the 8th edition of the AJCC N classification was replaced with the rN classification, leading to the development of an enhanced rAJCC staging system. Patients were then reclassified using this rAJCC staging system. Survival data analysis demonstrated that all pairwise comparisons revealed significant differences in prognosis for patients with the modified rN classification and the rAJCC staging system. The system’s discriminatory power was meticulously assessed through the application of the AIC, BIC and AUC. The AIC and the BIC are recognized as essential tools in the statistical analysis of regression models, including both linear and logistic regression. These criteria play a pivotal role in the model selection process, providing a quantitative framework for comparing the fit of different models. While the AUC is a relevant metric for evaluating the performance of classification models. In addition, our analysis deviated from conventional methods by utilizing a box plot representation of the AUC across all time intervals derived from the rAJCC staging system. This approach differs from the traditional comparison of solely the 3-year and 5-year AUCs, offering a more comprehensive perspective on the system’s predictive capabilities over time. The median AUC was chosen as the evaluative metric, providing a cumulative measure of predictive performance across all considered time points. By using the median, our analysis is less affected by extreme values or outliers that could skew the results if only specific time intervals were examined. This approach allows for a more refined assessment of the model’s predictive power, reflecting its performance throughout the entire follow-up period. In the context of model comparison using AIC and BIC, the rAJCC model exhibited lower values for both, signifying its superiority. The median AUC, as an aggregate measure of predictive performance, also displayed a higher value, indicating enhanced discrimination capability across all datasets. However, relying solely on LNR to determine pathological staging may be inaccurate due to incomplete lymphadenectomy, requiring a combined assessment of lymph node dissection.

ELN count is an independent prognostic factor in multiple cancers, including GC, and higher ELN counts are associated with more accurate nodal staging and improved survival. The count of ELNs can significantly impact the survival outcomes of patients with gastric cancer (GC). This is because it directly affects the accuracy of staging, allowing for more personalized postoperative treatment and ultimately leading to improved survival rates. Additionally, ELN count can serve as a valuable indicator of surgical quality, as a higher count reduces the likelihood of residual positive lymph nodes (PLNs) and nodal micro-metastases. Ultimately, this lowers the risk of postoperative recurrence. Various factors can affect the number of ELNs in gastrectomy, such as the surgical technique, extent of surgery, diligence and thoroughness of pathologic examination, condition of specimens, and innate number of LNs for each patient. While AJCC recommends a minimum of 16 ELNs for accurate staging, there is no consensus on the optimal threshold number of ELNs to address both stage migration and long-term survival.

Research has suggested that patients with a higher number of examined lymph nodes may have better survival due to stage migration and more accurate selection for adjuvant systemic therapy. Zhao et al. (2023). recommended ≥24 ELNs in patients with advanced GC and demonstrated that the AJCC recommendation of ≥16 ELNs was insufficient for determining the N stage. Guo et al. (2021). further pointed out that ≥27 ELNs was associated with a maximum survival advantage in patients with GC undergoing surgery, using the SEER database and 144 patients from China. Zhang et al. (2020) and Huang et al. (2021) reported a minimum of 31 and 33 ELN counts would improve the prognosis according to data from SEER database and institutions from China. The current study aimed to determine the optimal cutoff value of ELN by considering two requirements: ensuring maximum improvement in the prognosis of node-negative patients without downgrading due to potential metastasis and maximizing the detection efficiency of at least one involved LN. In this study, COX regression was used to examine the prognostic value of ELN count in patients with negative LNs. Patients with only one ELN were used as a reference, and the HR value for each ELN count group for both OS and DSS was analyzed using LOWESS scatter curve fitting. The minimum count of ELNs was determined by Chow test at which the slope of the curve changes significantly. Logistic regression analysis was performed on patients with negative lymph nodes and only one LN metastasis from the SEER database, using node status as the outcome variable. The reference value was set as patients with only one ELN. LOWESS and chow test help draw the curve and defined as the ideal cutoff LN count for maximum detection efficacy of at least one involved LN. Both approaches suggested that 30 or more ELNs contributed to maximum improvement in the prognosis of node-negative patients and earlier detection of at least one LN metastasis.

The integration of revised staging system incorporating LNR and ELN count was conducted and evaluated. Patients with more than 30 ELNs were included and regrouped according to the cut-offs of LNR as mentioned above. The integrated system presents valuable criteria for identifying patients with a favorable prognosis, thereby preventing overtreatment. Conversely, patients with poorer survival prospects can be treated with a more intensive approach. Additionally, the integrated staging scheme is likely to be clinically feasible since the enhancement in prognostic accuracy does not result in increased complexity. For patients with ELN < 30, the current rAJCC system can provide pathological staging for risk and prognosis assessment, but additional analysis is needed to determine the risk of inadequate lymphadenectomy and the need for adjuvant therapy.

The log odds of positive lymph nodes (LODDS) is a new method for assessing lymph node status in cancer (Que et al., 2023). Studies have shown that LNR and LODDS perform similarly in predicting prognosis for gastric cancer patients when an adequate number of lymph nodes are harvested (Cao et al., 2019), but LODDS can be influenced by the total number of retrieved nodes (Lai, Zheng & Li, 2022; Díaz Del Arco et al., 2024). Although LODDS may have better predictive value as a continuous variable for disease-specific survival, it is difficult to interpret and use in clinical practice (Lu et al., 2017; Wang et al., 2023). Therefore, LNR was used as the parameter in the current analysis.

The current study has several limitations. Despite our diligent efforts to guarantee the accuracy and quality of data retrieved from the SEER database, there still exist concerns regarding data miscoding and insufficiency. In addition, surgical procedures, surgical instruments, surgical skills, examinations of lymph nodes and adjuvant chemotherapies changed during the evolution of the cohort, which may have influenced patients’ prognosis; Furthermore, the incidence and treatment protocol for locally advanced GC of the same TNM category differs between Asian and Western, which may explain the lower 5-year OS rate in the SEER cohort compared with the Chinese cohort.

Conclusion

However, it is important to acknowledge that the accuracy and quality of data derived from large databases may introduce limitations. Despite these challenges, the integration of molecular biomarkers and genetic profiling, alongside personalized therapeutics and a global data collection strategy, is expected to yield a more accurate and effective staging system in the future.

Supplemental Information

Supplementary Figures and Tables.

DOI: 10.7717/peerj.18165/supp-1

Download

[1] Bray F, Laversanne M, Sung H, Ferlay J, Siegel RL, Soerjomataram I, Jemal A. 2024. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: A Cancer Journal for Clinicians 74(3):229-263

[2] Cao H, Tang Z, Yu Z, Wang Q, Li Z, Lu Q, Wu Y. 2019. Comparison of the 8th union for international cancer control lymph node staging system for gastric cancer with two other lymph node staging systems. Oncology Letters 17:1299-1305

[3] Chen JX, Sun JW, Wang Y, Pan T, Zhuang LP, Lin LZ, Lv BC. 2022. Lymph node ratio-based the ypTNrM staging system for gastric cancer after neoadjuvant therapy: a large population-based study. Surgery Today 52(5):783-794

[4] Daly MC, Paquette IM. 2019. Surveillance, Epidemiology, and End Results (SEER) and SEER-medicare databases: use in clinical research for improving colorectal cancer outcomes. Clinics in Colon and Rectal Surgery 32(01):61-68

[5] Degiuli M, Reddavid R, Tomatis M, Ponti A, Morino M, Sasako M, Rebecchi F, Garino M, Vigano L, Scaglione D, Locatelli L, Mello Teggia P. 2021. D2 dissection improves disease-specific survival in advanced gastric cancer patients: 15-year follow-up results of the Italian Gastric Cancer Study Group D1 versus D2 randomised controlled trial. European Journal of Cancer 150:10-22

[6] Díaz Del Arco C, Estrada Muñoz LM, Sánchez Pernaute A, Ortega Medina L, García Gómez De Las Heras S, García Martínez R, Fernández Aceñero MJ. 2024. Prognostic role of the log odds of positive lymph nodes in Western patients with resected gastric cancer: a comparison with the 8th edition of the TNM staging system. American Journal of Clinical Pathology 161(2):186-196

[7] Ergenç M, Uprak TK, Akın Mİ, Hekimoğlu EE, Çelikel ÇA, Yeğen C. 2023. Prognostic significance of metastatic lymph node ratio in gastric cancer: a Western-center analysis. BMC Surgery 23:209

[8] Gu P, Deng J, Wang W, Wang Z, Zhou Z, Xu H, Liang H. 2020. Impact of the number of examined lymph nodes on stage migration in node-negative gastric cancer patients: a Chinese multi-institutional analysis with propensity score matching. Annals of Translational Medicine 8(15):938

[9] Guo S, Shang M, Dong Z, Zhang J, Wang Y, Zhao Y. 2021. The assessment of the optimal number of examined lymph nodes and prognostic models based on lymph nodes for predicting survival outcome in patients with stage N3b gastric cancer. Asia-Pacific Journal of Clinical Oncology 17(2):e117-e124

[10] He Z, Li D, Xu Y, Wang H, Gao J, Zhang Z, Chen K. 2022. Prognostic significance of metastatic lymph node ratio in patients with gastric cancer after curative gastrectomy: a single-center retrospective study. Scandinavian Journal of Gastroenterology 57(7):832-841

[11] Huang Z, Chen Y, Zhang W, Liu H, Wang Z, Zhang Y. 2020. Modified gastric cancer AJCC staging with a classification based on the ratio of regional lymph node involvement: a population-based cohort study. Annals of Surgical Oncology 27(5):1480-1487

[12] Huang L, Zhang X, Wei Z, Xu A. 2021. Importance of examined lymph node number in accurate staging and enhanced survival in resected gastric adenocarcinoma-the more, the better? a cohort study of 8,696 cases from the US and China, 2010–2016. Frontiers in Oncology 10:394

[13] In H, Ravetch E, Langdon-Embry M, Palis B, Ajani JA, Hofstetter WL, Kelsen DP, Sano T. 2018. The newly proposed clinical and post-neoadjuvant treatment staging classifications for gastric adenocarcinoma for the American Joint Committee on Cancer (AJCC) staging. Gastric Cancer 21:1-9

[14] Jiang Q, Zeng X, Zhang C, Yang M, Fan J, Mao G, Shen Q, Yin Y, Liu W, Tao K, Zhang P. 2022. Lymph node ratio is a prospective prognostic indicator for locally advanced gastric cancer patients after neoadjuvant chemotherapy. World Journal of Surgical Oncology 20:209

[15] Kano K, Yamada T, Yamamoto K, Komori K, Watanabe H, Hara K, Shimoda Y, Maezawa Y, Fujikawa H, Aoyama T, Tamagawa H, Yamamoto N, Cho H, Shiozawa M, Yukawa N, Yoshikawa T, Morinaga S, Rino Y, Masuda M, Ogata T, Oshima T. 2020. Association between lymph node ratio and survival in patients with pathological stage II/III gastric cancer. Annals of Surgical Oncology 27:4235-4247

[16] Kinami S, Saito H, Takamura H. 2022. Significance of lymph node metastasis in the treatment of gastric cancer and current challenges in determining the extent of metastasis. Frontiers in Oncology 11:11

[17] Kotecha K, Singla A, Townend P, Merrett N. 2022. Association between neutrophil-lymphocyte ratio and lymph node metastasis in gastric cancer: a meta-analysis. Medicine 101(25):E29300

[18] Lai H, Zheng J, Li Y. 2022. Comparison of four lymph node staging systems in gastric adenocarcinoma after neoadjuvant therapy—a population-based study. Frontiers in Surgery 9:264

[19] Lu J, Wang W, Zheng CH, Fang C, Li P, Xie JW, Wang JB, Lin JX, Chen QY, Cao LL, Lin M, Huang CM, Zhou ZW. 2017. Influence of total lymph node count on staging and survival after gastrectomy for gastric cancer: an analysis from a two-institution database in China. Annals of Surgical Oncology 24(2):486-493

[20] Macalindong SS, Kim KH, Nam BH, Ryu KW, Kubo N, Kim JY, Eom BW, Yoon HM, Kook MC, Choi IJ, Kim YW. 2018. Effect of total number of harvested lymph nodes on survival outcomes after curative resection for gastric adenocarcinoma: findings from an eastern high-volume gastric cancer center. BMC Cancer 18:71

[21] Que SJ, Zhong Q, Chen QY, Truty MJ, Yan S, Bin MY, Ding FH, Zheng CH, Li P, Bin WJ, Lin JX, Lu J, Cao LL, Lin M, Tu RH, Lin JL, Zheng HL, Huang CM. 2023. A novel ypTLM staging system based on LODDS for gastric cancer after neoadjuvant therapy: multicenter and large-sample retrospective study. World Journal of Surgery 47(7):1762-1771

[22] Rosa F, Schena CA, Laterza V, Quero G, Fiorillo C, Strippoli A, Pozzo C, Papa V, Alfieri S. 2022. The role of surgery in the management of gastric cancer: state of the art. Cancers 14(22):5542

[23] Smyth EC, Nilsson M, Grabsch HI, van Grieken NC, Lordick F. 2020. Gastric cancer. The Lancet 396(10251):635-648

[24] Wang L, Ge J, Feng L, Wang Z, Wang W, Han H, Qin Y. 2023. Establishment and validation of a prognostic nomogram for postoperative patients with gastric cardia adenocarcinoma: a study based on the Surveillance, Epidemiology, and End Results database and a Chinese cohort. Cancer Medicine 12(12):13111-13122

[25] Yamashita K, Hosoda K, Ema A, Watanabe M. 2016. Lymph node ratio as a novel and simple prognostic factor in advanced gastric cancer. European Journal of Surgical Oncology 42(9):1253-1260

[26] Yin K, Jin X, Pan Y, Zi M, Zheng Y, Ma Y, Pang C, liu K, Chen J, Wei Y, Liu D, Cheng X, Yuan L. 2024. Revolutionizing T3-4N0-2M0 gastric cancer staging with an innovative pathologic N classification system. Journal of Gastrointestinal Surgery: Official Journal of the Society for Surgery of the Alimentary Tract 28(8):1283-1293

[27] Zeng Y, Cai F, Wang P, Wang X, Liu Y, Zhang L, Zhang R, Chen L, Liang H, Ye Z, Deng J. 2023a. Development and validation of prognostic model based on extragastric lymph nodes metastasis and lymph node ratio in node-positive gastric cancer: a retrospective cohort study based on a multicenter database. International Journal of Surgery (London, England) 109(4):794-804

[28] Zeng Y, Chen LC, Ye ZS, Deng JY. 2023b. Examined lymph node count for gastric cancer patients after curative surgery. World Journal of Clinical Cases 11(9):1930-1938