Whole genome sequencing analysis identifies recurrent structural alterations in esophageal squamous cell carcinoma

Munmee Dutta; Hidewaki Nakagawa; Hiroaki Kato; Kazuhiro Maejima; Shota Sasagawa; Kaoru Nakano; Aya Sasaki-Oku; Akihiro Fujimoto; Raúl Nicolás Mateos; Ashwini Patil; Hiroko Tanaka; Satoru Miyano; Takushi Yasuda; Kenta Nakai; Masashi Fujita

doi:10.7717/peerj.9294

Munmee Dutta¹,², Hidewaki Nakagawa³, Hiroaki Kato⁴, Kazuhiro Maejima³, Shota Sasagawa³, Kaoru Nakano³, Aya Sasaki-Oku³, Akihiro Fujimoto⁵, Raúl Nicolás Mateos¹,², Ashwini Patil², Hiroko Tanaka², Satoru Miyano²,⁶, Takushi Yasuda⁴, Kenta Nakai¹,², Masashi Fujita³

¹ Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan

² Human Genome Center, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan

³ Laboratory for Cancer Genomics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan

⁴ Department of Surgery, Faculty of Medicine, Kindai University, Osaka, Japan

⁵ Department of Drug Discovery Medicine, Kyoto University Graduate School of Medicine, Kyoto, Japan

⁶ Health Intelligence Center, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan

Published: 2020-06-26
Accepted: 2020-05-14
Received: 2019-10-07

Academic Editor: Minjun Chen

Subject Areas: Bioinformatics, Genomics, Gastroenterology and Hepatology, Oncology, Translational Medicine
Keywords: Esophageal squamous cell carcinoma, Whole genome sequencing, Coding mutation, Mutational signature, Structural variation, Copy number alteration, Druggable gene, LRP1B, FGFR1, FAT1

Copyright: © 2020 Dutta et al.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.

Cite this article: Dutta M, Nakagawa H, Kato H, Maejima K, Sasagawa S, Nakano K, Sasaki-Oku A, Fujimoto A, Mateos RN, Patil A, Tanaka H, Miyano S, Yasuda T, Nakai K, Fujita M. 2020. Whole genome sequencing analysis identifies recurrent structural alterations in esophageal squamous cell carcinoma. PeerJ 8:e9294 https://doi.org/10.7717/peerj.9294

The authors have chosen to make the review history of this article public.

Abstract

Esophageal squamous cell carcinoma (ESCC) is the predominant type of esophageal cancer in the Asian region, including Japan. A previous study reported the mutational landscape of Japanese ESCCs using exome sequencing. However, somatic structural alterations had yet to be explored. To provide a comprehensive mutational landscape, we performed whole genome sequencing (WGS) analysis of biopsy specimens from 20 ESCC patients in a Japanese population. WGS analysis identified non-silent coding mutations of TP53, ZNF750 and FAT1 in ESCC. We detected six mutational signatures in ESCC, one of which showed a significant association with smoking status. Recurrent structural variations, many of which were chromosomal deletions, affected genes such as LRP1B, TTC28, CSMD1, PDE4D, SDK1 and WWOX in 25%–30% of tumors. Somatic copy number amplifications at 11q13.3 (CCND1), 3q26.33 (TP63/SOX2), and 8p11.23 (FGFR1) and deletions at 9p21.3 (CDKN2A) were identified. Overall, this multi-dimensional view of genomic alterations improves the understanding of ESCC development at the molecular level and has future prognostic and therapeutic implications for ESCC in Japan.

Introduction

Esophageal cancer is the eighth most aggressive cancer type and the sixth most common cause of cancer-related death worldwide (Zhang et al., 2015). Esophageal cancer has two major subtypes: esophageal squamous cell carcinoma (ESCC) and esophageal adenocarcinoma (EAC). The incidence rate of ESCC is high in Asian regions including Japan, China and India (Sawada et al., 2016; Chattopadhyay et al., 2010). On the other hand, EAC predominates in Western countries. Alcohol drinking and tobacco smoking are the two main risk factors for ESCC development (Zhang et al., 2015). Additionally, micronutrient deficiency and genetic variants that impair the activity of alcohol-metabolizing enzymes also promote ESCC (Chang et al., 2017). Despite advances in the diagnosis and treatment of ESCC, the survival rate is still poor.

In Japan, ESCC is conventionally treated with standard neo-adjuvant chemotherapy followed by surgical resection. Preoperative chemotherapy with cisplatin and fluorouracil is considered a standard treatment option for patients with advanced-stage ESCC (Baba et al., 2014; Yoshida et al., 2018). The response rate to this standard therapy is moderate (35–40%) (Yoshida et al., 2018). The varying response among patients might be partly attributed to the genetic heterogeneity of tumors.

Although ESCC is common in both China and Japan, ESCC in the two countries shares some characteristics and differs in others. Smoking and alcohol drinking are recognized risk factors for ESCC in both countries. In Japan, ESCC is the tenth most common cancer type, while it is the fourth most frequent cancer type in China (Sawada et al., 2016). ESCC incidence and mortality rates are higher in China than in Japan (Yingsong et al., 2013). The incidence of esophageal cancer is higher in males than in females; for example, 16,241 male cases and 3,778 female cases were recorded in Japan in 2018 (source of data: World Health Organization [WHO] Global Cancer Observatory database) (Yingsong et al., 2013).

Recently, several studies in China and Japan characterized somatic mutations in ESCC using whole-exome sequencing (WES). These WES studies reported frequent mutations of TP53, CDKN2A, NOTCH1, RB1, ERBB2 and NFE2L2 in ESCC (Sawada et al., 2016; Song et al., 2014; Qin et al., 2016). While somatic mutations are important, structural variations (SVs) and somatic copy number alterations (CNAs) would also affect the development of ESCC. SVs can rearrange large genomic segments, which may affect treatment and the prediction of its outcome in patients. However, a systematic characterization of somatic mutations, mutational signatures, SVs and CNAs together has not been reported in Japanese ESCC at the whole-genome level.

In this study, we investigated 20 ESCC samples from a Japanese population using WGS. We comprehensively analyzed somatic mutations in important genes, mutational signatures and their association with clinical features. Our SV and CNA analysis also identified potential target genes and regions in ESCC. The characterization of the mutational landscape in Japanese ESCC will improve our understanding of the disease and provide potential targets for precision treatment and therapeutic prevention.

Materials & Methods

Clinical samples

Tumor and normal samples were obtained from 20 patients at Kindai University Hospital, Osaka, Japan. All patients agreed to participate in the study and provided written informed consent following ICGC guidelines. The study was approved by the Institutional Review Board at Kindai University Hospital and RIKEN (approval number 25-031). Personal history of ESCC in these patients was unavailable. All patients except one (OK047) had not received any cancer treatment before sample collection; OK047 received cisplatin-based chemotherapy before sample collection. Tumor tissues in the esophagus were collected by biopsy and histologically confirmed as ESCC. The patients were treated with neoadjuvant chemotherapy after the samples were collected. The clinico-pathological data are available in Table 1 and Table S1.

Table 1: Summary of the clinical information of the ESCC samples used in this study.

Characteristic	Category	Number of patients
Sex	Male	13
	Female	7
Histology	Squamous cell carcinoma	19
	Basaloid squamous cell carcinoma	1
Tumor location	Cervical esophagus	1
	Upper thoracic esophagus	2
	Middle thoracic esophagus	13
	Lower thoracic esophagus	3
	Abdominal esophagus	1
Tumor stage	II	2
	III	14
	IV	4
Age	≥60	18
	<60	2
Smoking status	Smoker	14
	Non-smoker	6
Alcohol drinking status	Drinker	16
	Non-drinker	4
Response to chemotherapy	Responder	10
	Non-responder	10

DOI: 10.7717/peerj.9294/table-1

Whole genome sequencing

We performed WGS of the 20 pairs of matched tumor and normal samples. Tumor DNA was extracted from the ESCC samples, and normal DNA was extracted from blood lymphocytes. The libraries were prepared using the TruSeq Nano DNA Library Prep Kit (Illumina) following the manufacturer’s protocol. Paired-end sequencing of 101- or 126-bp reads was performed using HiSeq2000/2500. Fig. S1 shows a schematic representation of the WGS analysis pipeline used in this study. Sequence reads were mapped to the human reference genome GRCh37 using BWA. We removed PCR duplicates using Picard (http://broadinstitute.github.io/picard/).

Somatic mutation calling and mutation signature profiling

Somatic single nucleotide variations (SNVs) and short insertions/deletions (INDELs) were called as previously described (Fujimoto et al., 2016). Functional annotation of the detected SNVs and INDELs was performed with Annovar (Wang, Li & Hakonarson, 2010). We applied the dNdScv method (Martincorena et al., 2017) to search for genes with significantly recurrent mutations (q-value < 0.05). We further summarized and visualized the annotated variants using the MAFtools package (Mayakonda et al., 2018) of the R software (https://www.r-project.org/). For the detection of mutational signatures in the 20 ESCCs, we used SignatureAnalyzer (https://software.broadinstitute.org/cancer/cga/msp). The identified mutational signatures were compared with the COSMIC mutational signatures (version 2) using cosine similarity.
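The cosine similarity used to compare an extracted signature with a reference signature can be illustrated with a minimal sketch. The vectors below are hypothetical four-channel toy profiles chosen for illustration only; real signature profiles have 96 channels, and the function name is our own rather than part of any package cited above.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length, non-negative profiles."""
    if len(a) != len(b):
        raise ValueError("profiles must have the same number of channels")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for an all-zero profile")
    return dot / (norm_a * norm_b)


# Hypothetical toy profiles (not data from this study).
extracted = [0.10, 0.40, 0.30, 0.20]
rescaled = [0.20, 0.80, 0.60, 0.40]   # same shape, different scale
different = [0.70, 0.10, 0.10, 0.10]  # different shape

same_shape_score = cosine_similarity(extracted, rescaled)
different_score = cosine_similarity(extracted, different)
```

Because the measure depends only on the shape of the profile and not on its scale, two signatures with proportional channel weights score 1, whereas dissimilar profiles score closer to 0.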

Structural variation calling

Somatic SVs were called by merging the calls of two tools: an in-house pipeline (Fujimoto et al., 2016) and the Genomon2 structural variation (SV) detection tool (https://github.com/Genomon-Project/GenomonSV). In the Genomon2 SV detection, SVs were called using a minimum junction number of 2, a maximum control variant read pair count of 10 and a minimum overhang size of 50. The SVs were then filtered using a minimum allele frequency of 0.07, a maximum control variant read pair count of 1, a control depth threshold of 10 and a minimum overhang size of 100. The inversion size threshold was set to 1,000 and the simple repeats were removed. SVs were categorized into four classes based on the mapping information for a read pair: intrachromosomal deletion, inversion and tandem duplication, and interchromosomal translocation.
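The filtering step described above can be sketched as a simple predicate over candidate calls. This is an illustrative sketch only: the record layout and field names are hypothetical and do not correspond to the GenomonSV output format, and whether each threshold is inclusive is an assumption made here.

```python
from dataclasses import dataclass
from typing import Optional

# Thresholds quoted in the text for the filtering step.
MIN_ALLELE_FREQ = 0.07
MAX_CONTROL_VARIANT_READ_PAIRS = 1
MIN_CONTROL_DEPTH = 10
MIN_OVERHANG_SIZE = 100
MIN_INVERSION_SIZE = 1_000


@dataclass
class SVCall:
    # Hypothetical record layout, not the GenomonSV output format.
    sv_type: str  # "deletion", "inversion", "tandem_duplication" or "translocation"
    tumor_allele_freq: float
    control_variant_read_pairs: int
    control_depth: int
    overhang_size: int
    size: Optional[int] = None  # None for interchromosomal translocations


def passes_filter(call: SVCall) -> bool:
    """Return True if a candidate SV satisfies all thresholds (assumed inclusive)."""
    if call.tumor_allele_freq < MIN_ALLELE_FREQ:
        return False
    if call.control_variant_read_pairs > MAX_CONTROL_VARIANT_READ_PAIRS:
        return False
    if call.control_depth < MIN_CONTROL_DEPTH:
        return False
    if call.overhang_size < MIN_OVERHANG_SIZE:
        return False
    if call.sv_type == "inversion" and call.size is not None:
        if call.size < MIN_INVERSION_SIZE:
            return False
    return True


# Hypothetical candidate calls for illustration.
good_deletion = SVCall("deletion", 0.25, 0, 30, 120, size=5_000)
low_af_deletion = SVCall("deletion", 0.03, 0, 30, 120, size=5_000)
short_inversion = SVCall("inversion", 0.25, 0, 30, 120, size=400)
kept = [c for c in (good_deletion, low_af_deletion, short_inversion) if passes_filter(c)]
```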

The breakage-fusion-bridge (BFB) events were identified based on fold-back inversions and loss of telomere (Hermetz et al., 2014). We applied the following criteria to infer BFB: (i) the inversion is a single inversion (either forward or reverse), i.e., without any reciprocal partner; (ii) the inversion must show a copy number change versus the adjacent position; and (iii) the two ends of the fold-back inversion must be separated by <20 kb. To detect kataegis, we used the Maftools package (Mayakonda et al., 2018); kataegis is defined as a cluster of six or more consecutive mutations localized in a small region with an average inter-variant distance of less than or equal to 1 kb. Chromothripsis was inferred in ESCC samples based on the criteria provided by a previous study (Korbel & Campbell, 2013). Chromothripsis was identified in samples that showed clustering of SV breakpoints, usually more than 10 breakpoints within 50 kb, with regular oscillation of copy number states.
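The three fold-back inversion criteria listed above can be expressed as a small predicate. The sketch below is not the pipeline used in this study: the record fields are hypothetical, and treating any non-zero difference in copy number as a "copy number change" is an assumption.

```python
from dataclasses import dataclass

MAX_FOLDBACK_SPAN = 20_000  # criterion (iii): ends separated by <20 kb


@dataclass
class InversionCall:
    # Hypothetical fields used only for this illustration.
    chrom: str
    pos1: int
    pos2: int
    has_reciprocal_partner: bool
    copy_number_at_breakpoint: float
    copy_number_adjacent: float


def is_foldback_inversion(inv: InversionCall, min_cn_change: float = 0.0) -> bool:
    """Apply criteria (i) to (iii) to a single inversion call."""
    unpaired = not inv.has_reciprocal_partner                       # (i)
    cn_change = abs(inv.copy_number_at_breakpoint
                    - inv.copy_number_adjacent) > min_cn_change     # (ii)
    close_ends = abs(inv.pos2 - inv.pos1) < MAX_FOLDBACK_SPAN       # (iii)
    return unpaired and cn_change and close_ends


# Hypothetical examples (coordinates are arbitrary).
foldback = InversionCall("11", 1_000_000, 1_005_000, False, 6.0, 2.0)
paired = InversionCall("11", 1_000_000, 1_005_000, True, 6.0, 2.0)
too_wide = InversionCall("11", 1_000_000, 1_050_000, False, 6.0, 2.0)
no_cn_change = InversionCall("11", 1_000_000, 1_005_000, False, 2.0, 2.0)
```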

Copy number alteration calling

Somatic CNAs were called by analyzing the read depth of matched tumor and normal samples using the VarScan2 software (Koboldt et al., 2012). The thresholds used to call CNAs with the CopyNumber function were p-value 0.001, minimum segment size 100 and maximum segment size 1000. The output of this step was filtered using the copyCaller function. The raw CNAs were segmented by the circular binary segmentation method implemented in the R package DNAcopy. The GISTIC2.0 algorithm was used to identify significantly recurrent amplified and deleted regions (Mermel et al., 2011).

Identification of druggable genes

To identify druggable genes across the ESCC samples, we used the online drug-gene interaction database (DGIdb) (Griffith et al., 2013). We used the genetically altered genes as target data to determine their druggability. The target genes were detected by the different methods used in this study: SNV/INDEL analysis, SV analysis and CNA analysis. The database provides two options for examining gene-drug interactions: by gene name or by drug name. Here, we identified gene-drug interactions by providing the gene names. A gene that had at least one interaction with a drug was considered a druggable gene.
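The druggability rule stated above (at least one recorded gene-drug interaction) can be sketched as follows. The interaction table is a hypothetical stand-in for a DGIdb query result; the drug names and the genes GENE_X and GENE_Y are placeholders, and no real DGIdb API call is made.

```python
def druggable_genes(altered_genes, interactions):
    """Return the altered genes that have at least one recorded drug interaction."""
    return sorted(
        gene for gene in altered_genes
        if len(interactions.get(gene, ())) >= 1
    )


# Hypothetical lookup table standing in for database results.
interactions = {
    "FGFR1": ["drug_A", "drug_B"],
    "CCND1": ["drug_C"],
    "GENE_X": [],  # present in the table but with no interactions
}
altered = ["FGFR1", "CCND1", "GENE_X", "GENE_Y"]  # GENE_Y is absent from the table

result = druggable_genes(altered, interactions)
```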

Results

Whole genome sequencing of ESCC samples

To identify the mutational events and driver genes that contribute to the development of ESCC in the Japanese population, we performed WGS of 20 pairs of tumor and matched blood samples. The samples were collected by biopsy before neo-adjuvant chemotherapy (except OK047). Among the 20 patients, 10 responded well to the therapy, while the remaining 10 showed a poor response. The average genome coverage was 43.4× for the tumor samples and 34.3× for the blood samples after removal of polymerase chain reaction (PCR) duplicates. The WGS data were computationally analyzed to call somatic alterations of the following types: single nucleotide variations (SNVs), small insertions and deletions (INDELs), structural variations (SVs), and copy number alterations (CNAs). In all, WGS analysis identified 104,534 somatic SNVs, 10,523 somatic INDELs, and 2,641 somatic SVs in the 20 tumors (Table S2). In addition, WGS analysis detected 10 significant copy number altered regions in ESCC.

Recurrent coding mutations in ESCC

We investigated the somatic SNVs and short INDELs in the protein-coding regions and their splice sites in the 20 ESCCs. Our WGS analysis identified recurrently mutated genes, including previously known esophageal cancer-associated oncogenes and tumor-suppressor genes. We evaluated the somatic alterations with the dNdScv method (Martincorena et al., 2017) to find significantly recurring mutations across the ESCC genomes. We identified significant mutations in the TP53 and ZNF750 genes in ESCC, consistent with the findings of previous studies (Fig. 1 and Fig. S2). TP53 mutations were the most frequent and were found in 55% of the samples, followed by ZNF750 (15%). In addition, other genes that were mutated in ESCC include FAT1 (10%), PTCH1 (10%), EP300 (5%), FAT2 (5%), FBXW7 (5%), KMT2D (5%), NFE2L2 (5%), NOTCH1 (5%), PIK3CA (5%), RB1 (5%), RIPK4 (5%) and TP63 (5%). These genes were previously reported in ESCC by different studies (Zhang et al., 2015; Sawada et al., 2016; Chang et al., 2017; Qin et al., 2016; Li et al., 2018; TCGA, 2017).

Figure 1: The landscape of somatic alterations in ESCCs from 20 Japanese patients.
(A) Potential driver mutations by SNVs/INDELs across the 20 ESCC patients, with different mutation types coded by different colors. Two genes marked with an asterisk are significantly mutated genes (q < 0.05) detected by the dNdScv method. The other 12 genes are those recurrently mutated in previous ESCC studies. (A) shows the number of mutations in all 20 ESCC cases. The number and type of mutations for each mutated gene are also shown. Mutation types are labelled in the legend. (B) The recurrent copy number amplified and deleted regions with the important cancer-related genes detected in ESCC patients. The legend shows the frequency across the ESCC patients. (C) Clinical features such as gender, smoking status, alcohol drinking status, age and tumor stage of the ESCC patients.

DOI: 10.7717/peerj.9294/fig-1

The mutational signatures of ESCC

To understand the mutational mechanisms of ESCC in Japan, we analyzed the mutational signatures of the 20 ESCC tumors. A Bayesian variant of the non-negative matrix factorization method was applied to trinucleotide substitution patterns, which extracted six mutational signatures (Figs. 2A–2F). The mutational signatures in ESCC were then compared with the signatures of the Catalogue of Somatic Mutations in Cancer (COSMIC) database (Alexandrov et al., 2013) (Table S3).
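The trinucleotide substitution patterns referred to above form a 96-class catalogue: six pyrimidine-centred substitution types, each combined with four possible 5’ and four possible 3’ neighbouring bases. The following sketch enumerates these classes and folds purine-centred mutations onto the pyrimidine strand; the function names and the bracket notation are our own choices for illustration and are not taken from SignatureAnalyzer.

```python
from itertools import product

BASES = "ACGT"
SUBSTITUTIONS = ["C>A", "C>G", "C>T", "T>A", "T>C", "T>G"]
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def mutation_classes():
    """Enumerate the 96 classes as 5'base[ref>alt]3'base strings."""
    return [
        f"{five}[{sub}]{three}"
        for sub in SUBSTITUTIONS
        for five, three in product(BASES, repeat=2)
    ]


def classify(context, alt):
    """Map a trinucleotide context (reference base in the middle) and an
    alternate base to its class, using the pyrimidine strand as reference."""
    ref = context[1]
    if ref in "AG":  # purine reference: use the reverse complement strand
        context = context.translate(COMPLEMENT)[::-1]
        alt = alt.translate(COMPLEMENT)
        ref = context[1]
    return f"{context[0]}[{ref}>{alt}]{context[2]}"


CLASSES = mutation_classes()
example_w6 = classify("CTT", "G")       # CpTpT-to-CpGpT, as described for Signature W6
example_revcomp = classify("AGA", "T")  # G>T on one strand equals C>A on the other
```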

Figure 2: Mutational signatures of 20 ESCC whole genomes.
(A–F) Characterization of the six mutational signatures identified across the ESCC genomes. Patterns of substitutions for each of the signatures W1–W6 in ESCC. The mutational signatures are presented according to the 96 substitution classes defined by the substitution class and the sequence context immediately 5’ and 3’ to the mutated base. We used the SignatureAnalyzer method to determine the six distinct mutational signatures (SNVs) from the 20 ESCC samples. (A) Signature W1. (B) Signature W2. (C) Signature W3. (D) Signature W4. (E) Signature W5. (F) Signature W6. (G) Mutation burden and contribution of the six mutational signatures (W1–W6) across the ESCC genomes. (H) Boxplot showing the association between mutational signatures and the smoking habit of patients with ESCC. Mutational signature W4 displays a significant association (p-value = 0.01474) with the smoking status of ESCC patients. P-value was calculated using the Wilcoxon rank sum test. ns: P > 0.05.

DOI: 10.7717/peerj.9294/fig-2

The identified Signature W1 was highly similar to COSMIC signature 18 (cosine similarity 0.933), but the biological aetiology of this signature is not known. Signature W2 was characterized by C>A and C>T mutations and found in almost 85% of the ESCC patients (Fig. 2G). Signature W2 was similar to COSMIC signature 1 (cosine similarity 0.850), which is an age-dependent signature. Signature W4, mainly represented by T>C mutations, was similar to COSMIC signatures 5 and 16 (cosine similarity 0.868 and 0.9, respectively). Signature W5, which was characterized by C>T and C>G mutations, displayed high similarity with COSMIC signatures 2 and 13 (cosine similarity 0.811 and 0.835, respectively). COSMIC signatures 2 and 13 have been attributed to the hyperactivity of the APOBEC family of cytidine deaminases (Alexandrov et al., 2013). One of our samples, OK101, was a basaloid squamous cell carcinoma of the esophagus, a rare type of malignancy, and was a hypermutator in our cohort (Fig. 2G). Somatic SNVs of this sample were dominated by Signature W5. Signature W6 was characterized by CpTpT-to-CpGpT mutations and was highly similar to COSMIC signature 17 (cosine similarity 0.945). COSMIC signature 17 is often found in esophageal and stomach cancers, but its aetiology is unknown. Signature W3, which was defined by C>A, C>T and T>C mutations, had low similarity with all of the COSMIC signatures (all cosine similarity <0.7).

Previous studies did not find signatures associated with smoking alone in ESCC, despite smoking being a major risk factor for this cancer (Sawada et al., 2016; Zhang et al., 2015). We therefore examined the association of the mutational signatures with clinical features in ESCC. We found that signature W4 was significantly elevated in smoking patients compared to non-smoking patients (P = 0.01474, t-test) (Fig. 2H). Mutations of W4 may be caused by the carcinogenic chemicals in tobacco smoke. Signature W4 shared similarity with COSMIC signatures 5 and 16. Smoking-associated mutational signatures were found at higher levels in squamous cell carcinomas of the lung and head & neck (Wang et al., 2019). However, unlike in liver cancer, there was no difference in total mutation burden between the smoker and non-smoker groups (P = 0.7, t-test). In liver cancer patients, a similar signature showed a higher mutation rate in smokers than in non-smokers (Alexandrov et al., 2016). The other signatures showed no association with smoking, gender, response to chemotherapy or alcohol drinking status (P > 0.05).

Structural variations in ESCC

WGS analysis detected a total of 2,641 SVs in the 20 ESCC samples, with an average of 132 SVs per tumor. The number of SVs ranged from 0 to 515 across the 20 ESCC cases, which shows the heterogeneous nature of the tumor genomes (Fig. S3). In particular, OK007 and OK008 had a high number of SVs compared to the other ESCC cases, affecting most of their chromosomes and indicating genomic instability in these samples (Figs. 3A–3L and Fig. S4). Deletions were the most abundant type of SV across the samples (Fig. 3M). We found 1,090 (41.27%) deletions, followed by 793 (30.03%) inversions, 391 (14.80%) translocations and 367 (13.90%) tandem duplications.
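The proportions quoted above follow directly from the per-class counts, as the short arithmetic check below shows. Only the counts stated in the text are used; the variable names are our own.

```python
# SV counts per class, as reported in the text.
sv_counts = {
    "deletion": 1090,
    "inversion": 793,
    "translocation": 391,
    "tandem duplication": 367,
}
n_tumors = 20

total_svs = sum(sv_counts.values())
mean_svs_per_tumor = total_svs / n_tumors
sv_percentages = {
    sv_type: round(100 * count / total_svs, 2)
    for sv_type, count in sv_counts.items()
}

for sv_type, pct in sv_percentages.items():
    print(f"{sv_type}: {sv_counts[sv_type]} ({pct:.2f}%)")
print(f"total: {total_svs}, mean per tumor: {mean_svs_per_tumor:.2f}")
```

The four classes sum to the reported total of 2,641 SVs, and the mean of about 132 per tumor matches the value given in the text.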

We found numerous cancer-associated genes affected by SVs in ESCC (Fig. 3N and Table S4). Consistent with a report on Chinese ESCC (Chang et al., 2017), LRP1B (30%) and TTC28 (30%) were the most commonly affected genes in Japanese ESCC. The tumor suppressor gene LRP1B was mostly affected by deletions, whereas TTC28 was mostly affected by interchromosomal translocations. We also identified SDK1, a novel gene, as affected by SVs in 25% of the ESCC samples. SDK1 is a cell adhesion molecule that plays an active role in cancer development. Somatic mutations in this gene were reported in adrenocortical carcinoma (Juhlin et al., 2015). Recurrent SVs in CSMD1, WWOX, ERC1, PDE4D and SHANK2 were also identified in five tumor samples. The CUB and Sushi multiple domains 1 (CSMD1) gene is a tumor suppressor reported to be associated with poor prognosis in many cancer types, including breast cancer, gastric cancer, head and neck squamous cell carcinoma (HNSCC), and hepatocellular carcinoma (Deng et al., 2012; Zhang et al., 2019; Jung et al., 2018). Deletion of WWOX, a tumor suppressor gene, is frequent in esophageal adenocarcinoma (32%) and stomach adenocarcinoma (30.2%) and is also observed in other human cancer types such as colon adenocarcinoma, bladder urothelial carcinoma and lung adenocarcinoma (Hussain et al., 2019). Structural rearrangements of ERC1 were previously reported in Chinese ESCC (Chang et al., 2017). ERC1 was also found to be a prognostic biomarker in HNSCC (Szczepanski et al., 2013). Previously, a genome-wide association study in the Chinese Han population identified that SNP rs10052657 in PDE4D on 5q11 was associated with ESCC risk (Wu et al., 2011). Homozygous deletion of PDE4D was also identified in breast, lung and gastric cancers, which established it as a tumor-promoting gene (Lin et al., 2013a; Lin et al., 2013b).

Breakage-Fusion-Bridge (BFB) is a mechanism supported by previous studies in cancer (Cheng et al., 2016; Yang et al., 2017). The BFB event is characterized by a special type of structural rearrangement called ‘fold-back’ inversion. A fold-back inversion can be defined as a somatic structural variant with a single inverted breakpoint exhibiting a copy-number change (Campbell et al., 2010). We used this information to identify BFB in each ESCC genome. In total, we detected 101 fold-back inversions across the 20 ESCCs, of which chromosome 11 had the highest number (30) (Fig. S5A). BFB events were present in a total of 14 ESCC cases (70%) in this study. Moreover, fold-back inversions were observed on chromosome 11 around the amplification of the CCND1 locus (69455873-69469242) in eight ESCC cases (Fig. S5B). Notably, our CNA analysis identified amplification of CCND1 in fourteen patients, and 57% of those amplifications (8/14) resulted from BFB. In addition, oncogenes such as FGFR1 (1/20) (Fig. S5C) and EGFR (1/20) were also detected in amplified regions that were affected by BFB events. In all, this analysis provided important insight into BFB in ESCC, and targeting the amplified oncogenes in therapies may benefit ESCC patients in the future.

Kataegis loci, which are localized hyper-mutation clusters, were identified in 30% of the ESCC patients (6/20) (Fig. S6A and Table S5). In total, 11 kataegis loci were detected in six cases, of which four had SVs in their close vicinity. Previous studies reported kataegis in ESCC (Chang et al., 2017; Cheng et al., 2016), and in breast cancer it was observed in more than 50% of the cases (Nik-Zainal et al., 2012; D’Antonio et al., 2016). Furthermore, chromothripsis (Korbel & Campbell, 2013), a phenomenon that affects chromosomes with more than ten structural rearrangements with regular oscillation of copy number changes, was found in one ESCC patient (OK008) on chromosomes 12 and 14 (Fig. S6B). Chromothripsis-associated structural rearrangements have been reported in many cancers, usually at low frequency; however, chromothripsis was found in more than 40% of glioblastomas and lung adenocarcinomas (Korbel & Campbell, 2013; Cortes-Ciriano et al., 2020).
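The working definition of a kataegis locus given in the Methods (six or more consecutive mutations with an average inter-variant distance of at most 1 kb) can be illustrated with a deliberately simplified scan over sorted positions on one chromosome. This greedy sketch is not the algorithm implemented in Maftools; the function name, parameters and example coordinates are hypothetical.

```python
def find_kataegis_loci(positions, min_mutations=6, max_mean_distance=1_000):
    """Greedy scan for runs of consecutive mutations whose mean
    inter-variant distance is <= max_mean_distance.

    Returns a list of (start, end, n_mutations) tuples.
    """
    pos = sorted(positions)
    n = len(pos)
    loci = []
    i = 0
    while i + min_mutations <= n:
        j = i + min_mutations - 1
        # Mean distance between neighbours in pos[i..j] is span / (j - i).
        if (pos[j] - pos[i]) / (j - i) > max_mean_distance:
            i += 1
            continue
        # Extend the run while the mean distance stays within the limit.
        while j + 1 < n and (pos[j + 1] - pos[i]) / (j + 1 - i) <= max_mean_distance:
            j += 1
        loci.append((pos[i], pos[j], j - i + 1))
        i = j + 1
    return loci


# Hypothetical positions: one dense cluster followed by scattered mutations.
clustered = [10_000, 10_200, 10_400, 10_600, 10_800, 11_000,
             500_000, 900_000, 1_500_000]
scattered = [0, 5_000, 10_000, 15_000, 20_000, 25_000]

clustered_loci = find_kataegis_loci(clustered)
scattered_loci = find_kataegis_loci(scattered)
```

Because the criterion is based on an average, a very dense run can absorb a somewhat more distant neighbouring mutation; a production tool would typically handle this with a more careful segmentation step.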

Somatic copy number alterations in ESCC

We called the somatic CNAs from the whole genome of the 20 pairs of matched tumor and normal samples to investigate CNAs in ESCC. GISTIC2.0 was then used to identify the recurrently amplified and deleted regions (Mermel et al., 2011). We identified four frequent amplified regions (3q26.33, 8p11.23, 11q13.3 and 14q21.1) and six deleted regions (1q21.1, 4q35.2, 5q13.2, 9p21.3, 10p12.33 and 21p11.2) (Figs. 4A and 4B, Tables S6 and S7).

Figure 4: Somatic copy number alterations in specific regions detected by GISTIC 2.0 in ESCC.
Significant regions of recurrent amplification (A) and deletion (B) across samples are shown. Numbers in the left bar in both (A) and (B) refer to the chromosome number. GISTIC scores are presented on top, and q-values (x-axis) indicating the false discovery rate at each locus are shown on a log scale in both (A) and (B). (C) Common deletions at the 9p21.3 region across the ESCC samples, shown in the Integrative Genomics Viewer (IGV).

DOI: 10.7717/peerj.9294/fig-4

The copy number amplification of 3q26.33 was found in 63% of the samples and included the important cancer driver genes PIK3CA, SOX2, FGF12 and TP63. Notably, SVs in TP63 were also found in 15% of the ESCC samples in this study. 11q13.3 was the most frequently amplified region (74%) in our dataset, consistent with the observation in Chinese ESCC (Ying et al., 2012). The 11q13.3 gain harbored many important genes such as CCND1, FGF3, FGF4 and FGF19, which establishes this region as a prominent target in ESCC. Amplification of 8p11.23 involved FGFR1, which plays an active role in cell growth and differentiation. Earlier studies reported FGFR1 as a potential drug target in many human cancers (Von Loga et al., 2015; Chang et al., 2014; Lin et al., 2014).

Copy number deletion of 9p21.3 was found in 79% of the samples and was the most common deletion in ESCC. This region contains CDKN2A, an essential regulator of the cell cycle (Fig. 4C). 10p12.33 was deleted in 79% of the samples and harbored the gene MRC1, an M2 macrophage antigen known to be associated with tumor development, invasion, metastasis and angiogenesis (Weber et al., 2016; Fang et al., 2017). Deletion of 4q35.2 was found in 63% of the samples. This region includes the tumor suppressor FAT1, which was recurrently deleted in colorectal cancers, glioblastoma and HNSCC (Morris et al., 2013). Notably, recurrent SNVs/INDELs in FAT1 were observed in our study as well. Overall, this study detected many copy number altered peaks and important ESCC-associated driver genes such as FGFR1, PIK3CA, CCND1, CDKN2A and MRC1, which have the potential to be used as therapeutic targets in the future (Du et al., 2017; Lin et al., 2014; Padhi et al., 2017; Weber et al., 2016; Weber et al., 2014).

Assessment of druggable genes in ESCC

To examine the druggability of the genes identified by the CNA analysis, we analyzed the genes with the drug-gene interaction database (Griffith et al., 2013; Cotto et al., 2018). We found that 67 of the 442 genes had at least one drug interaction (Table S8). HTR1A, PIK3CA, FGFR1, ADRB3, PIK3R1, MTNR1A, CDKN2A, CCND1, CDK7, and ANO1 were the top druggable genes, with a large number of drug interactions. Notably, amplification of CCND1 (74%), PIK3CA (63%) and FGFR1 (37%) and deletion of CDKN2A (79%) and CDK7 (74%) were observed in a considerable number of samples in our analysis. In all, CNA analysis identified potentially druggable genes including CCND1, FGFR1, PIK3CA and CDKN2A, which might be used for the treatment of ESCC in the future.

The druggability of genes detected by our SNV analysis was also examined. We identified 212 of the 1,111 genes as having at least one drug interaction (Table S9). PIK3CA, TP53, BRAF, NOTCH1, FGFR2, F11, MTOR, LRP2, and RB1 were the most prominent druggable genes, with a large number of drug interactions. In the SV analysis, 149 of 958 genes were druggable (Table S10). LRP1B, PDE4D, WWOX, GPHN, KCNB2, FHIT, CDKN2A, TP53 and BRCA2 were the important druggable genes, with a large number of drug interactions. LRP1B (30%), PDE4D (25%), WWOX (25%), GPHN (20%) and KCNB2 (20%) were frequently affected by SVs in ESCC.

We also combined the analyses of druggable genes identified by all the mutational events (SNVs, SVs and CNAs) in ESCC. Some genes, such as TP53, NOTCH1, RB1, JAK2, ACAN and SCN9A, were altered by both SNVs and SVs in ESCC (Fig. 5 and Table S11). Genes such as PIK3CA, F11, EPHB3 and TLR3 were altered by both SNVs and CNAs. We also found that some genes, such as CDKN2A, ANO1 and NDUFB5, were altered by both SVs and CNAs. However, we found no druggable gene that was altered by all three types of mutational events in our analysis.
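The overlaps described above amount to pairwise and three-way set intersections. The sketch below uses only the example genes named in this section, so each set is an illustrative subset rather than the full lists in Tables S8 to S11.

```python
# Illustrative subsets built from the druggable genes named in the text.
snv_genes = {"TP53", "NOTCH1", "RB1", "JAK2", "ACAN", "SCN9A",
             "PIK3CA", "F11", "EPHB3", "TLR3", "BRAF", "FGFR2", "MTOR", "LRP2"}
sv_genes = {"TP53", "NOTCH1", "RB1", "JAK2", "ACAN", "SCN9A",
            "CDKN2A", "ANO1", "NDUFB5", "LRP1B", "PDE4D", "WWOX",
            "GPHN", "KCNB2", "FHIT", "BRCA2"}
cna_genes = {"PIK3CA", "F11", "EPHB3", "TLR3", "CDKN2A", "ANO1", "NDUFB5",
             "HTR1A", "FGFR1", "ADRB3", "MTNR1A", "CCND1", "CDK7"}

snv_and_sv = snv_genes & sv_genes
snv_and_cna = snv_genes & cna_genes
sv_and_cna = sv_genes & cna_genes
all_three = snv_genes & sv_genes & cna_genes

print("SNV and SV:", sorted(snv_and_sv))
print("SNV and CNA:", sorted(snv_and_cna))
print("SV and CNA:", sorted(sv_and_cna))
print("All three:", sorted(all_three))
```

With these subsets the three-way intersection is empty, in line with the observation that no druggable gene was altered by all three types of events.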

Figure 5: Venn diagram representing the druggable genes altered by different mutation events in ESCC.
The figure shows the common druggable genes, genes having interaction with at least one drug target. These genes were affected by SNVs, SVs and CNAs in ESCC.

DOI: 10.7717/peerj.9294/fig-5

Discussion

In this study, we performed a comprehensive whole genome sequencing analysis to explore the mutational landscape of ESCC in Japanese patients. Previously, several studies reported mutations in ESCC, mostly using whole exome data, which are limited to coding mutations. Here, we provided a whole-genome level analysis of mutational signatures, SNVs, SVs and CNAs in ESCC. Consistent with previous studies, this study confirmed the frequent mutation of TP53, ZNF750, FAT1, PTCH1, EP300, FAT2, FBXW7, KMT2D, NFE2L2, NOTCH1, PIK3CA, RB1, RIPK4 and TP63 in ESCC patients (Chang et al., 2017; Sawada et al., 2016; Qin et al., 2016). Recurrent mutations in most of these genes were previously reported in HNSCC (Lin et al., 2018; Hazawa et al., 2017; TCGA, 2015) and lung squamous cell carcinoma (Choi et al., 2017; Li et al., 2015).

By analyzing mutational signatures, we showed distinct signature profiles in ESCC and their correlation with environmental risk factors and clinical features. Importantly, among the six identified mutational signatures, a significant association was shown between signature W4 and the smoking status of ESCC patients. Signature W4 showed high similarity with COSMIC signatures 5 and 16 (cosine similarity 0.868 and 0.9, respectively). A WGS study showed an association of COSMIC signature 16 with alcohol drinking and smoking status in ESCC patients of Chinese origin (Chang et al., 2017). Smoking tobacco increases the chances of cancer development. Tobacco smoke is composed of many carcinogens that are capable of damaging DNA. Tobacco smoking has been linked to many types of cancer, such as lung cancer, liver cancer, esophageal cancer and gastric cancer (Alexandrov et al., 2016). COSMIC signature 5, which is also characterized by C>T and T>C mutations, was reported in smoking-associated lung cancer. The smoking-associated signature was more commonly found in male patients with a smoking history (65%) than in female patients, and 62% of those patients were non-responders to chemotherapy. In contrast, the female patients who showed this signature (30%) were mostly non-smokers (86%). However, 84% of the non-smoking female patients who showed signature W4 responded well to chemotherapy.

A recent WGS analysis of UK patients suggested C>A/T as a distinct mutational pattern with evidence of ageing in EAC (Secrier et al., 2016). We confirmed that C>T and T>C substitutions were also the dominant mutation types, with age and smoking imprints, in ESCC. Further, Alexandrov et al. (2016) previously described these mutation patterns as characteristic of the smoking and alcohol drinking signatures. On the other hand, the mutational signatures identified here showed high similarity not only to COSMIC signatures associated with alcohol drinking and smoking but also to signatures related to age and APOBEC activity. APOBEC family enzymes deaminate cytidine to uracil within DNA, which leads to mutation clusters in different cancers (Harris, Petersen-Mahrt & Neuberger, 2002). Previous studies reported the association of PIK3CA mutations with the APOBEC signature in Chinese and Japanese ESCC (Chang et al., 2017; Sawada et al., 2016; Zhang et al., 2015). In this study, we found that the APOBEC signature was positively associated with amplification of PIK3CA (P = 0.00003969, Wilcoxon rank sum test). The APOBEC signature was observed in all the ESCC tumor samples (100%). APOBEC-mediated genomic damage may play a major role in ESCC development.
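For readers unfamiliar with the Wilcoxon rank sum test named above, the sketch below shows how such a comparison between two sample groups can be computed. It is an illustration under stated assumptions, not the authors' analysis: the function name and the exposure values are hypothetical, and the p-value uses a tie-corrected normal approximation without continuity correction, whereas statistical packages often report an exact p-value for small samples.

```python
import math

def rank_sum_test(x, y):
    """Two-sided Wilcoxon rank sum (Mann-Whitney U) test, normal approximation.

    Returns (U statistic for x, two-sided p-value). Tied values receive
    mid-ranks and the variance is tie-corrected.
    """
    n1, n2 = len(x), len(y)
    if n1 == 0 or n2 == 0:
        raise ValueError("both groups need at least one observation")
    pooled = sorted([(v, 0) for v in x] + [(v, 1) for v in y])
    n = n1 + n2
    rank_sum_x = 0.0
    tie_term = 0.0
    i = 0
    while i < n:
        j = i
        while j < n and pooled[j][0] == pooled[i][0]:
            j += 1
        t = j - i                       # size of this group of tied values
        mid_rank = (i + 1 + j) / 2.0    # average of ranks i+1 .. j
        rank_sum_x += mid_rank * sum(1 for k in range(i, j) if pooled[k][1] == 0)
        tie_term += t ** 3 - t
        i = j
    u = rank_sum_x - n1 * (n1 + 1) / 2.0
    mean_u = n1 * n2 / 2.0
    var_u = n1 * n2 / 12.0 * ((n + 1) - tie_term / (n * (n - 1)))
    if var_u == 0:
        return u, 1.0
    z = (u - mean_u) / math.sqrt(var_u)
    return u, math.erfc(abs(z) / math.sqrt(2.0))

# Hypothetical signature exposures per sample (not data from this study).
amplified = [0.42, 0.38, 0.51, 0.47, 0.44]
not_amplified = [0.12, 0.20, 0.15, 0.25, 0.18]
u_stat, p_value = rank_sum_test(amplified, not_amplified)
print(u_stat, p_value)
```

The test is rank-based, so it makes no assumption that signature exposures are normally distributed, which is a common reason for choosing it with small cohorts.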

In the present study, we analyzed SVs using WGS of 20 ESCCs, which is the first such report in a Japanese population. A recent WGS analysis of 94 ESCCs from China showed LRP1B and TTC28 as the genes most commonly affected by SVs (Chang et al., 2017). We confirmed this finding in Japanese ESCCs as well. LRP1B was mostly affected by deletions and TTC28 by interchromosomal translocations in all the cases. Further, this analysis identified SVs in a novel gene, SDK1, in 5 of the 20 ESCC tumor samples. An epigenomic study identified SDK1 as an epigenomic driver in hepatocellular carcinoma (Gentilini et al., 2017). In addition, SVs were found in other genes such as WWOX, CSMD1, ERC1, PDE4D, SHANK2 and TP63. This study found two patterns of structural rearrangements across the 20 ESCC genomes: samples with only a few rearrangements, and samples with multiple complex rearrangements. These patterns reflect the heterogeneous nature of ESCCs. Moreover, this study identified breakage-fusion-bridge, kataegis and chromothripsis in 75% of the ESCC patients (15/20). Chromothripsis is a phenomenon in which multiple rearrangements affect one or more chromosomes in a single event (Rode et al., 2016; Korbel & Campbell, 2013). It is believed that chromothripsis plays an important role in the development of cancer, and in recent years it has emerged as a significant biomarker in different cancers. Chromothripsis may promote cancer development either by increasing copy numbers of oncogenes or by deleting important tumor suppressors. Thus, identifying chromothripsis in tumors using strict criteria is crucial for cancer genomics. Further studies with larger sample sizes are required.

Moreover, we identified multiple significantly amplified and deleted regions in ESCC genomes. These regions harbored many genes that may be used as therapeutic targets. The 11q13.3 region, which contains CCND1, CTTN, SHANK2 and three FGF-family genes, was frequently amplified in patients who responded to chemotherapy. CCND1, a key regulator of the G1 phase of the cell cycle, is an important target of chemotherapy response in HNSCC (Feng et al., 2011). It has been associated with poor prognosis in many solid cancers including HNSCC (Feng et al., 2011). Another amplified region, 3q26.33, harbored important cancer-causing genes such as PIK3CA, SOX2, FGF12 and TP63. Alterations of PIK3CA, which lead to dysfunction of cell cycle control, were also frequently found in colorectal cancer, HNSCC, gastric cancer and breast cancer (Mei et al., 2016; DeMello et al., 2018; Azizi Tabesh et al., 2017; Du et al., 2017). PIK3CA was reported to be associated with drug sensitivity in many cancer types including ESCC (Du et al., 2017; Yokota et al., 2018). Furthermore, amplification of TP63, accompanied by over-expression, was also commonly found in squamous cell carcinomas of the lung and of the head and neck (TCGA, 2015; Ohnami et al., 2017). TP63 is a transcription factor and plays an important role in tumorigenesis, apoptosis and embryogenesis (Ohnami et al., 2017; Candi et al., 2014). Deletion of the CDKN2A gene is more frequent in Japanese than in Chinese ESCC. We confirmed deletion of CDKN2A in 9p21.3 and MRC1 in 10p12.33 as the most common deletions (79%) in our dataset. Several previous studies reported recurrent mutations and loss of CDKN2A in many human cancer types such as pancreatic cancer and oral squamous cell carcinoma (Zhen et al., 2015; Padhi et al., 2017). A previous study of a Chinese population also reported deletion of CDKN2A, but only 30% of the Chinese samples showed deletion of this gene. Our gene-drug interaction analysis also detected druggable genes, for example FGFR1, PIK3CA, CCND1, CDKN2A, CDK7 and ANO1, that have the potential to interact with at least one drug.

There were some limitations in this study. The main bottleneck was the sample size. We had 20 pairs of matched normal and tumor samples, which was not enough to identify significantly recurrently mutated genes at high frequency. Secondly, although the patients' chemotherapy response information was available, the small sample size did not allow us to associate the identified genomic alterations with chemotherapy response at a significant level. In future, it will be important to associate genomic alterations with chemotherapy response in order to select the right patients for effective neoadjuvant chemotherapy.

Conclusions

In summary, this analysis identified frequent genomic mutations, mutational signatures and their significant association with an environmental risk factor, druggable CNAs and genes, and structural rearrangements in ESCC. This comprehensive analysis provides insights into ESCC development at the molecular level and identifies targets for the future diagnosis and treatment of ESCC.

Supplemental Information

Workflow of the whole genome sequencing analysis

We performed WGS on 20 ESCC samples. BWA was used for alignment to the reference genome. We used the RIKEN in-house pipeline to call SNVs/INDELs. For mutational signature profiling, we used SignatureAnalyzer. To call somatic CNAs from the WGS of ESCC samples, we used VarScan2, and DNAcopy was used for segmentation. For the structural variation analysis, we used the RIKEN in-house pipeline and Genomon2. Finally, we merged both SV lists in order to identify distinct SVs in ESCC.
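The workflow does not state how the two SV lists were merged. One common approach, sketched here purely as an assumption, is to take the union of both call sets and collapse calls whose type and chromosomes match and whose breakpoints lie within a small tolerance. The `SV` record, the function names and the 100 bp tolerance are all hypothetical.

```python
from typing import NamedTuple

class SV(NamedTuple):
    chrom1: str
    pos1: int
    chrom2: str
    pos2: int
    sv_type: str  # e.g. "DEL", "INV", "TRA"

def same_event(a, b, tol):
    """True if two calls share type and chromosomes and both breakpoints agree within tol bp."""
    return (a.sv_type == b.sv_type
            and a.chrom1 == b.chrom1 and a.chrom2 == b.chrom2
            and abs(a.pos1 - b.pos1) <= tol
            and abs(a.pos2 - b.pos2) <= tol)

def merge_sv_lists(first, second, tol=100):
    """Union of two SV call lists; a call matching an already kept call is dropped."""
    merged = []
    for call in list(first) + list(second):
        if not any(same_event(call, kept, tol) for kept in merged):
            merged.append(call)
    return merged

# Hypothetical calls from two callers (coordinates are made up).
caller_a = [SV("chr2", 1000, "chr2", 5000, "DEL")]
caller_b = [SV("chr2", 1040, "chr2", 4980, "DEL"), SV("chr22", 100, "chr6", 200, "TRA")]
for sv in merge_sv_lists(caller_a, caller_b):
    print(sv)
```

Because the first matching call is the one kept, the order of the two input lists decides which caller's coordinates survive for a shared event. The same comparison also removes near-duplicate calls within a single list.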

DOI: 10.7717/peerj.9294/supp-1

Amino acid changes in TP53 and ZNF750

Distribution of TP53 (A) and ZNF750 (B) mutations in ESCC. P53_TAD, transactivation domain; P53, DNA-binding domain; P53_tetramer, tetramerization domain.

DOI: 10.7717/peerj.9294/supp-2

Frequency of SVs across the ESCC samples

The figure shows that SVs affect most chromosomes in OK007 and OK008.

DOI: 10.7717/peerj.9294/supp-3

Somatic structural variations in ESCC. Related to Figure 3A

The inner ring represents the SVs: red for intrachromosomal interactions, and green for interchromosomal interactions. The second ring next to SVs displays the CNAs: red for amplifications and blue for deletions. The outer ring shows the chromosome ideogram.

DOI: 10.7717/peerj.9294/supp-4

Evidence of Breakage-Fusion-Bridge (BFB) in ESCC cases

(A) The number of fold-back inversions identified in 20 ESCC patients. (B) The figure represents the amplification of CCND1 due to the BFB effect in one ESCC sample, and (C) shows amplification of FGFR1 as a result of BFB in another sample.

DOI: 10.7717/peerj.9294/supp-5

Kataegis and Chromothripsis in ESCC

(A) In the rainfall plot, each dot represents a single somatic mutation. The X-axis shows the chromosomes in order and the Y-axis shows the genomic distance between adjacent mutations. The blue cursor indicates the position of hyper-mutation on chromosome 1. (B) Chromothripsis on chromosome 14 in one ESCC patient. The cluster of SV breakpoints in the middle panel (red bars) is present in conjunction with regular copy number gains in the bottom panel.
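The quantity plotted on the Y-axis of a rainfall plot, and a simple scan for hyper-mutated clusters, can be sketched as follows. This is an illustration rather than the method used for this figure: the thresholds (at least six consecutive mutations with a mean spacing of at most 1,000 bp) follow a heuristic commonly used for kataegis in the literature and are an assumption here, as are the function names and the toy positions.

```python
def intermutation_distances(positions):
    """Distances between consecutive mutation positions on one chromosome."""
    ordered = sorted(positions)
    return [b - a for a, b in zip(ordered, ordered[1:])]

def find_clusters(positions, min_mutations=6, max_mean_distance=1000):
    """Return (start, end) spans of mutation runs with small mean spacing.

    Greedy scan: a run of min_mutations is accepted if its mean
    inter-mutation distance is at most max_mean_distance, then extended
    while the mean over the whole run stays within that limit.
    """
    ordered = sorted(positions)
    clusters = []
    i = 0
    while i + min_mutations <= len(ordered):
        j = i + min_mutations - 1  # index of the last mutation in the candidate run
        if (ordered[j] - ordered[i]) / (j - i) > max_mean_distance:
            i += 1
            continue
        while (j + 1 < len(ordered)
               and (ordered[j + 1] - ordered[i]) / (j + 1 - i) <= max_mean_distance):
            j += 1
        clusters.append((ordered[i], ordered[j]))
        i = j + 1
    return clusters

# Hypothetical mutation positions on one chromosome (not data from this study).
toy_positions = [100, 300, 500, 700, 900, 1100, 5_000_000, 9_000_000]
print(intermutation_distances(toy_positions))
print(find_clusters(toy_positions))
```

On a rainfall plot these distances are usually drawn on a log scale, so a cluster appears as a vertical streak of dots falling well below the rest.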

DOI: 10.7717/peerj.9294/supp-6

Tables S1–11

DOI: 10.7717/peerj.9294/supp-7

[1] Alexandrov LB, Ju YS, Haase K, Van Loo P, Martincorena I, Nik-Zainal S, Totoki Y, Fujimoto A, Nakagawa H, Shibata T, Campbell PJ, Vineis P, Phillips DH, Stratton MR. 2016. Mutational signatures associated with tobacco smoking in human cancer. Science 354:618-622

[2] Alexandrov LB, Nik-Zainal S, Wedge DC, Aparicio SAJR, Behjati S, Biankin AV, Bignell GR, Bolli N, Borg A, Borresen-Dale A-L, Boyault S, Burkhardt B, Butler AP, Caldas C, Davies HR, Desmedt C, Eils R, Eyfjord JE, Foekens JA, Greaves M, Hosoda F, Hutter B, Ilicic T, Imbeaud S, Imielinski M, Jager N, Jones DTW, Jones D, Knappskog S, Kool M, Lakhani SR, Lopez-Otin C, Martin S, Munshi NC, Nakamura H, Northcott PA, Pajic M, Papaemmanuil E, Paradiso A, Pearson JV, Puente XS, Raine K, Ramakrishna M, Richardson AL, Richter J, Rosenstiel P, Schlesner M, Schumacher TN, Span PN, Teague JW, Totoki Y, Tutt ANJ, Valdes-Mas R, vanBuuren MM, Van ’t Veer L, Vincent-Salomon A, Waddell N, Yates LR, Zucman-Rossi J, Futreal PA, McDermott U, Lichter P, Meyerson M, Grimmond SM, Siebert R, Campo E, Shibata T, Pfister SM, Campbell PJ, Stratton MR. 2013. Signatures of mutational processes in human cancer. Nature 500:415-421

[3] Azizi Tabesh G, Izadi P, Fereidooni F, Emami Razavi AN, Tavakkoly Bazzaz J. 2017. The high frequency of PIK3CA mutations in Iranian breast cancer patients. Cancer Investigation 35:36-42

[4] Baba Y, Watanabe M, Yoshida N, Baba H. 2014. Neoadjuvant treatment for esophageal squamous cell carcinoma. World Journal of Gastrointestinal Oncology 6:121-128

[5] Campbell PJ, Yachida S, Mudie LJ, Stephens PJ, Pleasance ED, Stebbings LA, Morsberger LA, Latimer C, McLaren S, Lin M-L, McBride DJ, Varela I, Nik-Zainal SA, Leroy C, Jia M, Menzies A, Butler AP, Teague JW, Griffin CA, Burton J, Swerdlow H, Quail MA, Stratton MR, Iacobuzio-Donahue C, Futreal PA. 2010. The patterns and dynamics of genomic instability in metastatic pancreatic cancer. Nature 467:1109-1113

[6] Candi E, Agostini M, Melino G, Bernassola F. 2014. How the TP53 family proteins TP63 and TP73 contribute to tumorigenesis: regulators and effectors. Human Mutation 35:702-714

[7] Chang J, Liu X, Wang S, Zhang Z, Wu Z, Zhang X, Li J. 2014. Prognostic value of FGFR gene amplification in patients with different types of cancer: a systematic review and meta-analysis. PLOS ONE 9:e105524

[8] Chang J, Tan W, Ling Z, Xi R, Shao M, Chen M, Luo Y, Zhao Y, Liu Y, Huang X, Xia Y, Hu J, Parker JS, Marron D, Cui Q, Peng L, Chu J, Li H, Du Z, Han Y, Tan W, Liu Z, Zhan Q, Li Y, Mao W, Wu C, Lin D. 2017. Genomic analysis of oesophageal squamous-cell carcinoma identifies alcohol drinking-related mutation signature and genomic alterations. Nature Communications 8:15290

[9] Chattopadhyay I, Singh A, Phukan R, Purkayastha J, Kataki A, Mahanta J, Saxena S, Kapur S. 2010. Genome-wide analysis of chromosomal alterations in patients with esophageal squamous cell carcinoma exposed to tobacco and betel quid from high-risk area in India. Mutation Research 696:130-138

[10] Cheng C, Zhou Y, Li H, Xiong T, Li S, Bi Y, Kong P, Wang F, Cui H, Li Y, Fang X, Yan T, Li Y, Wang J, Yang B, Zhang L, Jia Z, Song B, Hu X, Yang J, Qiu H, Zhang G, Liu J, Xu E, Shi R, Zhang Y, Liu H, He C, Zhao Z, Qian Y, Rong R, Han Z, Zhang Y, Luo W, Wang J, Peng S, Yang X, Li X, Li L, Fang H, Liu X, Ma L, Chen Y, Guo S, Chen X, Xi Y, Li G, Liang J, Yang X, Guo J, Jia J, Li Q, Cheng X, Zhan Q, Cui Y. 2016. Whole-genome sequencing reveals diverse models of structural variations in esophageal squamous cell carcinoma. American Journal of Human Genetics 98:256-274

[11] Choi M, Kadara H, Zhang J, Parra ER, Rodriguez-Canales J, Gaffney SG, Zhao Z, Behrens C, Fujimoto J, Chow C, Kim K, Kalhor N, Moran C, Rimm D, Swisher S, Gibbons DL, Heymach J, Kaftan E, Townsend JP, Lynch TJ, Schlessinger J, Lee J, Lifton RP, Herbst RS, Wistuba II. 2017. Mutation profiles in early-stage lung squamous cell carcinoma with clinical follow-up and correlation with markers of immune function. Annals of Oncology 28:83-89

[12] Cortes-Ciriano I, Lee JJ-K, Xi R, Jain D, Jung YL, Yang L, Gordenin D, Klimczak LJ, Zhang C-Z, Pellman DS, Park PJ. 2020. Comprehensive analysis of chromothripsis in 2,658 human cancers using whole-genome sequencing. Nature Genetics 52(3):331-341

[13] Cotto KC, Wagner AH, Feng Y-Y, Kiwala S, Coffman AC, Spies G, Wollam A, Spies NC, Griffith OL, Griffith M. 2018. DGIdb 3.0: a redesign and expansion of the drug-gene interaction database. Nucleic Acids Research 46:D1068-D1073

[14] D’Antonio M, Tamayo P, Mesirov JP, Frazer KA. 2016. Kataegis expression signature in breast cancer is associated with late onset, better prognosis, and higher HER2 levels. Cell Reports 16:672-683

[15] DeMello RA, Castelo-Branco L, Castelo-Branco P, Pozza DH, Vermeulen L, Palacio S, Salzberg M, Lockhart AC. 2018. What will we expect from novel therapies to esophageal and gastric malignancies? American Society of Clinical Oncology Educational Book 38:249-261

[16] Deng N, Goh LK, Wang H, Das K, Tao J, Tan IB, Zhang S, Lee M, Wu J, Lim KH, Lei Z, Goh G, Lim Q, Tan AL, Yu D, Poh S, Riahi S, Bell S, Shi MM, Linnartz R, Cheong HC, Rha SY, Boussioutas A, Grabsch H. 2012. A comprehensive survey of genomic alterations in gastric cancer reveals systematic patterns of molecular exclusivity and co-occurrence among distinct therapeutic targets. Gut 61:673-684

[17] Du P, Huang P, Huang X, Li X, Feng Z, Li F, Liang S, Song Y, Stenvang J, Brunner N, Yang H, Ou Y, Gao Q, Li L. 2017. Comprehensive genomic analysis of Oesophageal Squamous Cell Carcinoma reveals clinical relevance. Scientific Reports 7:15324

[18] Fang J, Li X, Ma D, Liu X, Chen Y, Wang Y, Lui VWY, Xia J, Cheng B, Wang Z. 2017. Prognostic significance of tumor infiltrating immune cells in oral squamous cell carcinoma. BMC Cancer 17:375

[19] Feng Z, Guo W, Zhang C, Xu Q, Zhang P, Sun J, Zhu H, Wang Z, Li J, Wang L, Wang B, Ren G, Ji T, Tu W, Yang X, Qiu W, Mao L, Zhang Z, Chen W. 2011. CCND1 as a predictive biomarker of neoadjuvant chemotherapy in patients with locally advanced head and neck squamous cell carcinoma. PLOS ONE 6(10):e26399

[20] Fujimoto A, Furuta M, Totoki Y, Tsunoda T, Kato M, Shiraishi Y, Tanaka H, Taniguchi H, Kawakami Y, Ueno M, Gotoh K, Ariizumi S-I, Wardell CP, Hayami S, Nakamura T, Aikata H, Arihiro K, Boroevich KA, Abe T, Nakano K, Maejima K, Sasaki-Oku A, Ohsawa A, Shibuya T, Nakamura H, Hama N, Hosoda F, Arai Y, Ohashi S, Urushidate T, Nagae G, Yamamoto S, Ueda H, Tatsuno K, Ojima H, Hiraoka N, Okusaka T, Kubo M, Marubashi S, Yamada T, Hirano S, Yamamoto M, Ohdan H, Shimada K, Ishikawa O, Yamaue H, Chayama K, Miyano S, Aburatani H, Shibata T, Nakagawa H. 2016. Whole-genome mutational landscape and characterization of noncoding and structural mutations in liver cancer. Nature Genetics 48:500-509

[21] Gentilini D, Scala S, Gaudenzi G, Garagnani P, Capri M, Cescon M, Grazi GL, Bacalini MG, Pisoni S, Dicitore A, Circelli L, Santagata S, Izzo F, Di Blasio AM, Persani L, Franceschi C, Vitale G. 2017. Epigenome-wide association study in hepatocellular carcinoma: identification of stochastic epigenetic mutations through an innovative statistical approach. Oncotarget 8:41890-41902

[22] Griffith M, Griffith OL, Coffman AC, Weible JV, McMichael JF, Spies NC, Koval J, Das I, Callaway MB, Eldred JM, Miller CA, Subramanian J, Govindan R, Kumar RD, Bose R, Ding L, Walker JR, Larson DE, Dooling DJ, Smith SM, Ley TJ, Mardis ER, Wilson RK. 2013. DGIdb: mining the druggable genome. Nature Methods 10:1209-1210

[23] Harris RS, Petersen-Mahrt SK, Neuberger MS. 2002. RNA editing enzyme APOBEC1 and some of its homologs can act as DNA mutators. Molecular Cell 10:1247-1253

[24] Hazawa M, Lin D-C, Handral H, Xu L, Chen Y, Jiang Y-Y, Mayakonda A, Ding L-W, Meng X, Sharma A, Samuel S, Movahednia MM, Wong RW, Yang H, Tong C, Koeffler HP. 2017. ZNF750 is a lineage-specific tumour suppressor in squamous cell carcinoma. Oncogene 36:2243-2254

[25] Hermetz KE, Newman S, Conneely KN, Martin CL, Ballif BC, Shaffer LG, Cody JD, Rudd MK. 2014. Large inverted duplications in the human genome form via a fold-back mechanism. PLOS Genetics 10:e1004139

[26] Hussain T, Liu B, Shrock MS, Williams T, Aldaz CM. 2019. WWOX the FRA16D gene: a target of and a contributor to genomic instability. Genes, Chromosomes & Cancer 58:324-338

[27] Juhlin CC, Goh G, Healy JM, Fonseca AL, Scholl UI, Stenman A, Kunstman JW, Brown TC, Overton JD, Mane SM, Nelson-Williams C, Backdahl M, Suttorp A-C, Haase M, Choi M, Schlessinger J, Rimm DL, Hoog A, Prasad ML, Korah R, Larsson C, Lifton RP, Carling T. 2015. Whole-exome sequencing characterizes the landscape of somatic mutations and copy number alterations in adrenocortical carcinoma. The Journal of Clinical Endocrinology and Metabolism 100:E493-E502

[28] Jung AR, Eun Y-G, Lee YC, Noh JK, Kwon KH. 2018. Clinical significance of CUB and sushi multiple domains 1 inactivation in head and neck squamous cell carcinoma. International Journal of Molecular Sciences 19(12):3996

[29] Koboldt DC, Zhang Q, Larson DE, Shen D, Mclellan MD, Lin L, Miller CA, Mardis ER, Ding L, Wilson RK. 2012. VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing. Genome Research 22:568-576

[30] Korbel JO, Campbell PJ. 2013. Criteria for inference of chromothripsis in cancer genomes. Cell 152:1226-1236

[31] Li C, Gao Z, Li F, Li X, Sun Y, Wang M, Li D, Wang R, Li F, Fang R, Pan Y, Luo X, He J, Zheng L, Xia J, Qiu L, He J, Ye T, Zhang R, He M, Zhu M, Hu H, Shi T, Zhou X, Sun M, Tian S, Zhou Y, Wang Q, Chen L, Yin G, Lu J, Wu R, Guo G, Li Y, Hu X, Li L, Asan, Wang Q, Yin Y, Feng Q, Wang B, Wang H, Wang M, Yang X, Zhang X, Yang H, Jin L, Wang C-Y, Ji H, Chen H, Wang J, Wei Q. 2015. Whole exome sequencing identifies frequent somatic mutations in cell-cell adhesion genes in Chinese patients with lung squamous cell carcinoma. Scientific Reports 5:14237

[32] Li XC, Wang MY, Yang M, Dai HJ, Zhang BF, Wang W, Chu XL, Wang X, Zheng H, Niu RF, Zhang W, Chen KX. 2018. A mutational signature associated with alcohol consumption and prognostically significantly mutated driver genes in esophageal squamous cell carcinoma. Annals of Oncology 2:938-944

[33] Lin D, Hao J, Nagata Y, Xu L, Shang L, Meng X, Sato Y, Okuno Y, Varela AM, Ding L, Garg M, Liu L, Yang H, Yin D, Shi Z, Jiang Y, Gu W, Gong T, Zhang Y, Xu X, Kalid O, Shacham S, Ogawa S, Wang M, Koeffler HP. 2014. Genomic and molecular characterization of esophageal squamous cell carcinoma. Nature Genetics 46:467-473

[34] Lin S-C, Lin L-H, Yu S-Y, Kao S-Y, Chang K-W, Cheng H-W, Liu C-J. 2018. FAT1 somatic mutations in head and neck carcinoma are associated with tumor progression and survival. Carcinogenesis 39:1320-1330

[35] Lin Y, Totsuka Y, He Y, Kikuchi S, Qiao Y, Ueda J, Wei W, Inoue M, Tanaka H. 2013b. Epidemiology of esophageal cancer in Japan and China. Journal of Epidemiology 23:233-242

[36] Lin D-C, Xu L, Ding L-W, Sharma A, Liu L-Z, Yang H, Tan P, Vadgama J, Karlan BY, Lester J, Urban N, Schummer M, Doan N, Said JW, Sun H, Walsh M, Thomas CJ, Patel P, Yin D, Chan D, Koeffler HP. 2013a. Genomic and functional characterizations of phosphodiesterase subtype 4D in human cancers. Proceedings of the National Academy of Sciences of the United States of America 110:6109-6114

[37] Martincorena I, Raine KM, Gerstung M, Dawson KJ, Haase K, Van Loo P, Davies H, Stratton MR, Campbell PJ. 2017. Universal patterns of selection in cancer and somatic tissues. Cell 171:1029-1041

[38] Mayakonda A, Lin D-C, Assenov Y, Plass C, Koeffler HP. 2018. Maftools: efficient and comprehensive analysis of somatic variants in cancer. Genome Research 28:1747-1756

[39] Mei ZB, Duan CY, Li CB, Cui L, Ogino S. 2016. Prognostic role of tumor PIK3CA mutation in colorectal cancer: a systematic review and meta-analysis. Annals of Oncology 27:1836-1848

[40] Mermel CH, Schumacher SE, Hill B, Meyerson ML, Beroukhim R, Getz G. 2011. GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers. Genome Biology 12:R41

[41] Morris LGT, Kaufman AM, Gong Y, Ramaswami D, Walsh LA, Turcan S, Eng S, Kannan K, Zou Y, Peng L, Banuchi VE, Paty P, Zeng Z, Vakiani E, Solit D, Singh B, Ganly I, Liau L, Cloughesy TC, Mischel PS, Mellinghoff IK, Chan TA. 2013. Recurrent somatic mutation of FAT1 in multiple human cancers leads to aberrant Wnt activation. Nature Genetics 45:253-261

[42] Nik-Zainal S, Alexandrov LB, Wedge DC, Van Loo P, Greenman CD, Raine K, Jones D, Hinton J, Marshall J, Stebbings LA, Menzies A, Martin S, Leung K, Chen L, Leroy C, Ramakrishna M, Rance R, Lau KW, Mudie LJ, Varela I, McBride DJ, Bignell GR, Cooke SL, Shlien A, Gamble J, Whitmore I, Maddison M, Tarpey PS, Davies HR, Papaemmanuil E, Stephens PJ, McLaren S, Butler AP, Teague JW, Jonsson G, Garber JE, Silver D, Miron P, Fatima A, Boyault S, Langerod A, Tutt A, Martens JWM, Aparicio SAJR, Borg A, Salomon AV, Thomas G, Borresen-Dale A-L, Richardson AL, Neuberger MS, Futreal PA, Campbell PJ, Stratton MR. 2012. Mutational processes molding the genomes of 21 breast cancers. Cell 149:979-993

[43] Ohnami S, Ohshima K, Nagashima T, Urakami K, Shimoda Y, Saito J, Naruoka A, Hatakeyama K, Mochizuki T, Serizawa M, Ohnami S, Kusuhara M, Yamaguchi K. 2017. Comprehensive characterization of genes associated with the TP53 signal transduction pathway in various tumors. Molecular and Cellular Biochemistry 431:75-85

[44] Padhi SS, Roy S, Kar M, Saha A, Roy S, Adhya A, Baisakh M, Banerjee B. 2017. Role of CDKN2A/p16 expression in the prognostication of oral squamous cell carcinoma. Oral Oncology 73:27-35

[45] Qin H-D, Liao X-Y, Chen Y-B, Huang S-Y, Xue W-Q, Li F-F, Ge X-S, Liu D-Q, Cai Q, Long J, Li X-Z, Hu Y-Z, Zhang S-D, Zhang L-J, Lehrman B, Scott AF, Lin D, Zeng Y-X, Shugart YY, Jia W-H. 2016. Genomic characterization of esophageal squamous cell carcinoma reveals critical genes underlying tumorigenesis and poor prognosis. American Journal of Human Genetics 98:709-727

[46] Rode A, Maass KK, Willmund KV, Lichter P, Ernst A. 2016. Chromothripsis in cancer cells: an update. International Journal of Cancer 138:2322-2333

[47] Sawada G, Niida A, Uchi R, Hirata H, Shimamura T, Suzuki Y, Shiraishi Y, Chiba K, Imoto S, Takahashi Y, Iwaya T, Sudo T, Hayashi T, Takai H, Kawasaki Y, Matsukawa T, Eguchi H, Sugimachi K, Tanaka F, Suzuki H, Yamamoto K, Ishii H, Shimizu M, Yamazaki H, Yamazaki M, Tachimori Y, Kajiyama Y, Natsugoe S, Fujita H, Mafune K, Tanaka Y, Kelsell DP, Scott CA, Tsuji S, Yachida S, Shibata T, Sugano S, Doki Y, Akiyama T, Aburatani H, Ogawa S, Miyano S, Mori M, Mimori K. 2016. Genomic landscape of esophageal squamous cell carcinoma in a Japanese population. Gastroenterology 150:1171-1182

[48] Secrier M, Li X, de Silva N, Eldridge MD, Contino G, Bornschein J, MacRae S, Grehan N, O’Donovan M, Miremadi A, Yang T-P, Bower L, Chettouh H, Crawte J, Galeano-Dalmau N, Grabowska A, Saunders J, Underwood T, Waddell N, Barbour AP, Nutzinger B, Achilleos A, Edwards PAW, Lynch AG, Tavare S, Fitzgerald RC. 2016. Mutational signatures in esophageal adenocarcinoma define etiologically distinct subgroups with therapeutic relevance. Nature Genetics 48:1131-1141

[49] Song Y, Li L, Ou Y, Gao Z, Li E, Li X, Zhang W, Wang J, Xu L, Zhou Y, Ma X, Liu L, Zhao Z, Huang X, Fan J, Dong L, Chen G, Ma L, Yang J, Chen L, He M, Li M, Zhuang X, Huang K, Qiu K, Yin G, Guo G, Feng Q, Chen P, Wu Z, Wu J, Ma L, Zhao J, Luo L, Fu M, Xu B, Chen B, Li Y, Tong T, Wang M, Liu Z, Lin D, Zhang X, Yang H, Wang J, Zhan Q. 2014. Identification of genomic alterations in oesophageal squamous cell cancer. Nature 509:91-95

[50] Szczepanski MJ, DeLeo AB, Luczak M, Molinska-Glura M, Misiak J, Szarzynska B, Dworacki G, Zagor M, Rozwadowska N, Kurpisz M, Krzeski A, Kruk-Zagajewska A, Kopec T, Banaszewski J, Whiteside TL. 2013. PRAME expression in head and neck cancer correlates with markers of poor prognosis and might help in selecting candidates for retinoid chemoprevention in pre-malignant lesions. Oral Oncology 49:144-151

[51] TCGA. 2015. Comprehensive genomic characterization of head and neck squamous cell carcinomas. Nature 517:576-582

[52] TCGA. 2017. Integrated genomic characterization of oesophageal carcinoma. Nature 169-175

[53] Von Loga K, Kohlhaussen J, Burkhardt L, Simon R, Steurer S, Burdak-Rothkamm S, Jacobsen F, Sauter G, Krech T. 2015. FGFR1 amplification is often homogeneous and strongly linked to the squamous cell carcinoma subtype in esophageal carcinoma. PLOS ONE 10:e0141867

[54] Wang K, Li M, Hakonarson H. 2010. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Research 38:e164

[55] Wang J, Linxweiler M, Yang W, Chan TA, Morris LGT. 2019. Immunomodulatory and immunotherapeutic implications of tobacco smoking in squamous cell carcinomas and normal airway epithelium. Oncotarget 10:3835-3839

[56] Weber M, Buttner-Herold M, Hyckel P, Moebius P, Distel L, Ries J, Amann K, Neukam FW, Wehrhan F. 2014. Small oral squamous cell carcinomas with nodal lymphogenic metastasis show increased infiltration of M2 polarized macrophages–an immunohistochemical analysis. Journal of Cranio-Maxillo-Facial Surgery 42:1087-1094

[57] Weber M, Iliopoulos C, Moebius P, Buttner-Herold M, Amann K, Ries J, Preidl R, Neukam FW, Wehrhan F. 2016. Prognostic significance of macrophage polarization in early stage oral squamous cell carcinomas. Oral Oncology 52:75-84

[58] Wu C, Hu Z, He Z, Jia W, Wang F, Zhou Y, Liu Z, Zhan Q, Liu Y, Yu D, Zhai K, Chang J, Qiao Y, Jin G, Liu Z, Shen Y, Guo C, Fu J, Miao X, Tan W, Shen H, Ke Y, Zeng Y, Wu T, Lin D. 2011. Genome-wide association study identifies three new susceptibility loci for esophageal squamous-cell carcinoma in Chinese populations. Nature Genetics 43:679-684

[59] Yang B, Luo L, Luo W, Zhou Y, Yang C, Xiong T, Li X, Meng X, Li L, Zhang X, Wang Z, Wang Z. 2017. The genomic dynamics during progression of lung adenocarcinomas. Journal of Human Genetics 62:783-788

[60] Ying J, Shan L, Li J, Zhong L, Xue L, Zhao H, Li L, Langford C, Guo L, Qiu T, Lu N, Tao Q. 2012. Genome-wide screening for genetic alterations in esophageal cancer by aCGH identifies 11q13 amplification oncogenes associated with nodal metastasis. PLOS ONE 7:e39797

[61] Yingsong L, Yukari T, Yutong H, Shogo K, Youlin Q, Junko U, Wenqiang W, Manami I, Hideo T. 2013. Epidemiology of Esophageal Cancer in Japan and China. Journal of Epidemiology 23(4):233-242

[62] Yokota T, Serizawa M, Hosokawa A, Kusafuka K, Mori K, Sugiyama T, Tsubosa Y, Koh Y. 2018. PIK3CA mutation is a favorable prognostic factor in esophageal cancer: molecular profile by next-generation sequencing using surgically resected formalin-fixed, paraffin-embedded tissue. BMC Cancer 18:826

[63] Yoshida K, Suetsugu T, Imai T, Matsuhashi N, Yamaguchi K. 2018. Recent advancements in esophageal cancer treatment in Japan. 253-265

[64] Zhang X, Kang C, Li N, Liu X, Zhang J, Gao F, Dai L. 2019. Identification of special key genes for alcohol-related hepatocellular carcinoma through bioinformatic analysis. PeerJ 7:e6375

[65] Zhang L, Zhou Y, Cheng C, Cui H, Cheng L, Kong P, Wang J, Li Y, Chen W, Song B, Wang F, Jia Z, Li L, Li Y, Yang B, Liu J, Shi R, Bi Y, Zhang Y, Wang J, Zhao Z, Hu X, Yang J, Li H, Gao Z, Chen G, Huang X, Yang X, Wan S, Chen C, Li B, Tan Y, Chen L, He M, Xie S, Li X, Zhuang X, Wang M, Xia Z, Luo L, Guo J, Chen X, Zhang Y, Li Q, Liu L, Li Y, Zhang X. 2015. Genomic analyses reveal mutational signatures and frequently altered genes in esophageal squamous cell carcinoma. The American Journal of Human Genetics 96:597-611

[66] Zhen DB, Rabe KG, Gallinger S, Syngal S, Schwartz AG, Goggins MG, Hruban RH, Cote ML, McWilliams RR, Roberts NJ, Cannon-Albright LA, Li D, Moyes K, Wenstrup RJ, Hartman A-R, Seminara D, Klein AP, Petersen GM. 2015. BRCA1, BRCA2, PALB2, and CDKN2A mutations in familial pancreatic cancer: a PACGENE study. Genetics in Medicine 17:569-577

