A unified simulation model for understanding the diversity of cancer evolution

Atsushi Niida; Takanori Hasegawa; Hideki Innan; Tatsuhiro Shibata; Koshi Mimori; Satoru Miyano

doi:10.7717/peerj.8842

A unified simulation model for understanding the diversity of cancer evolution

Atsushi Niida ¹, Takanori Hasegawa², Hideki Innan³, Tatsuhiro Shibata^1,6, Koshi Mimori⁴, Satoru Miyano⁵

1Laboratory of Molecular Medicine, Human Genome Center, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan

2Division of Health Medical Data Science, Health Intelligence Center, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan

3SOKENDAI, The Graduate University for Advanced Studies, Hayama, Japan

4Department of Surgery, Kyushu University Beppu Hospita, Beppu, Japan

5Laboratory of DNA Information Analysis, Human Genome Center, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan

6Division of Cancer Genomics, National Cancer Center Research Institute, Tokyo, Japan

DOI: 10.7717/peerj.8842

Published: 2020-04-08
Accepted: 2020-03-02
Received: 2019-11-08

Academic Editor: Leonardo Gollo

Subject Areas: Computational Biology, Oncology, Computational Science
Keywords: Cancer, Evolution, Simulation

Copyright: © 2020 Niida et al.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.

Cite this article: Niida A, Hasegawa T, Innan H, Shibata T, Mimori K, Miyano S. 2020. A unified simulation model for understanding the diversity of cancer evolution. PeerJ 8:e8842 https://doi.org/10.7717/peerj.8842

The authors have chosen to make the review history of this article public.

Abstract

Because cancer evolution underlies the therapeutic difficulties of cancer, it is clinically important to understand the evolutionary dynamics of cancer. Thus far, a number of evolutionary processes have been proposed to be working in cancer evolution. However, there exists no simulation model that can describe the different evolutionary processes in a unified manner. In this study, we constructed a unified simulation model for describing the different evolutionary processes and performed sensitivity analysis on the model to determine the conditions in which cancer growth is driven by each of the different evolutionary processes. Our sensitivity analysis has successfully provided a series of novel insights into the evolutionary dynamics of cancer. For example, we found that, while a high neutral mutation rate shapes neutral intratumor heterogeneity (ITH) characterized by a fractal-like pattern, a stem cell hierarchy can also contribute to shaping neutral ITH by apparently increasing the mutation rate. Although It has been reported that the evolutionary principle shaping ITH shifts from selection to accumulation of neutral mutations during colorectal tumorigenesis, our simulation revealed the possibility that this evolutionary shift is triggered by drastic evolutionary events that occur in a short time and confer a marked fitness increase on one or a few cells. This result helps us understand that each process works not separately but simultaneously and continuously as a series of phases of cancer evolution. Collectively, this study serves as a basis to understand in greater depth the diversity of cancer evolution.

Introduction

Cancer is regarded as a disease of evolution; during tumorigenesis, a normal cell evolves to a malignant population by means of mutation accumulation and adaptive Darwinian selection. Evolution allows cancer cells to adapt to a new environment and acquire malignant phenotypes such as metastasis and therapeutic resistance. Therefore, it is clinically important to understand cancer evolutionary dynamics. The view of cancer as an evolutionary system was established by Nowell (1976). By combining this view with a series of discoveries of onco- and tumor suppressor genes (hereinafter, collectively referred to as “driver genes”), Fearon & Vogelstein (1990) proposed a multistep model for colorectal carcinogenesis. Since then, cancer evolution has generally been described as “linear evolution,” where driver mutations are acquired linearly in a step-wise manner, generating a malignant clonal population.

However, this simple view of cancer evolution has been challenged since the advent of the next generation sequencing technology (Yates & Campbell, 2012; McGranahan & Swanton, 2017; Niida et al., 2018). Deep sequencing demonstrated that subclonality prevails in both blood and solid tumors, and multiregion sequencing of various types of solid tumor more dramatically unveiled intratumor heterogeneity (ITH), which results from the branching process in a cancer cell population along with mutation accumulation. These genomic studies also found that subclones often harbor mutations in known driver genes, suggesting that at least a part of ITH is subject to Darwinian selection. In some types of cancer, such as renal cell carcinoma (Turajlic et al., 2018) and low-grade glioma (Suzuki et al., 2015), this Darwinian selection-driven branching process is especially prominent; we observed convergent evolution in which different subclonal mutations are acquired in the same driver gene or pathway.

Other types of tumors, however, show no clear enrichment of driver mutations in subclonal mutations. Consistently with this observation, several studies employing mathematical modeling have suggested that the accumulation of neutral mutations that do not affect the growth or survival of cancer cells mainly shapes ITH; that is, “neutral evolution” is the major contributor of ITH in multiple types of cancers (Uchi et al., 2016; Sottoriva et al., 2015; Ling et al., 2015; Niida, Iwasaki & Innan, 2019). The evolutionary principles shaping ITH differ not only among cancer types but also between stages of tumorigenesis. We and others have recently reported that ITH in the early stage of colorectal tumorigenesis involves selection, whereas the accumulation of neutral mutations plays the central role in shaping IHT in the later stages (Saito et al., 2018; Cross et al., 2018).

In addition to single nucleotide mutation- and small indel-driven drivers, recent studies have demonstrated that, in multiple types of cancers, more drastic chromosome- and/or genome-wide evolutionary events producing copy number alterations and chromosomal rearrangements may have occurred in a short time at the early stage of cancer evolution (Gao et al., 2016; Baca et al., 2013). Such large-scale events could confer a marked fitness increase on one or a few cells, which expand to constitute the tumor mass uniformly. This type of evolution is referred to as “punctuated evolution” after the term “punctuated equilibrium”, which was proposed for species evolution by Gould and Eldredge to challenge the long-standing paradigm of gradual Darwinian evolution (Gould & Eldredge, 1972; Jay Gould & Eldredge, 1993), although the underlying molecular mechanisms that cause rapid bursts of change are very different.

Collectively, at least four scenarios of cancer evolution were proposed (Davis, Gao & Navin, 2017). In this paper, we term the four scenarios as the linear-replacing, punctuated-replacing, driver-branching, and neutral-branching processes (Figs. 1A–1D). The linear-replacing process applies when newly arisen clones repeatedly spread and replace the entire population very quickly. A special case of the linear-replacing process is the punctuated-replacing process, where a number of drastic changes occur in a very short time and a very fit clone spreads and replaces the entire population very quickly. In the driver-branching process, multiple subclones having distinct driver mutations coexist to shape ITH, whereas, in the neutral-branching process, there are no significant driver mutations when accumulating mutations that constitute ITH.

Figure 1: Illustrating the scenarios in cancer evolution.
(A–D) The four typical evolutionary processes. Red stars indicate normal driver events, which are assumed to be single nucleotide mutations and small indels, while a green star indicates more drastic chromosome- and/or genome-wide evolutionary events producing copy number alterations and chromosomal rearrangements. (E) Our model explaining the temporal shift of evolutionary principles shaping ITH during colorectal tumorigenesis.

Download full-size image

DOI: 10.7717/peerj.8842/fig-1

To obtain an understanding of cancer evolutionary dynamics, many mathematical models of cancer evolution have been developed (Beerenwinkel et al., 2014; Altrock, Liu & Michor, 2015); in particular, agent-based simulation models are commonly employed for this purpose (Sottoriva et al., 2015; Waclaw et al., 2015; Uchi et al., 2016; Iwasaki & Innan, 2017; Minussi et al., 2019; Poleszczuk, Hahnfeldt & Enderling, 2015). In agent-based simulation models, each cell in a tumor corresponds to an agent; the cells can divide to produce new cells, die, or migrate, and each cell’s behavior can be stochastically determined from its own state and/or the environment surrounding the cell. By applying sensitivity analysis to the simulation models, (i.e., examining the simulation results while changing the parameters of the models), it is possible to identify the factors affecting the cancer evolutionary dynamics (Niida, Hasegawa & Miyano, 2019). However, to the best of our knowledge, there exists no simulation work aiming to reproduce and analyze the four above-stated evolutionary processes in a unified manner.

In this paper, we introduce a unified agent-based simulation model, which is simple but sufficient to reproduce the four evolutionary processes (Figs. 1A–1D). Although the unified model is formulated in ‘Materials & Methods’, the ‘Results’ section presents a family of simulation models, each of which constitutes submodels of the unified model. While constructing the submodels, we explore the conditions leading to, and the ITH pattern from the four processes. The ‘Results’ section is composed of four parts. In the first part, we introduce the driver model, which contains only driver mutations, and examine the conditions leading to the linear-replacing and driver-branching processes. In the second part, the neutral model, which contains only neutral mutations, is introduced to address the conditions leading to a neutral pattern of ITH. We show that, although a high neutral mutation rate is necessary for the neutral pattern of ITH, a stem cell hierarchy can also contribute to the neutral pattern by apparently increasing the mutation rate. In the third part, we present a combination of these two models as a composite model and reproduce realistic ITH patterns, which are generated by mixing the neutral pattern with the pattern from the linear-replacing or driver-branching processes. In the final part, we build the punctuated model by incorporating the punctuated-replacing process into the composite model. Our simulation based on the punctuated model demonstrates that the punctuated-replacing process triggers an evolutionary shift from the driver- to the neutral-branching process that is commonly observed during colorectal tumorigenesis (Fig. 1E). This result helps us understand that each process works not separately but simultaneously and continuously as a series of “phases” of cancer evolution.

Materials & Methods

Simulation model

Although we described a family of simulation models in the Results section, we here formulate the unified model, which encompasses these models. Starting from a stem cell without mutations, the following time steps are repeated until the number of population size p reaches P or the number of time steps t reaches T. For each time step, each cell is subject to cell division with a probability g and cell death with a probability d.g depends on a base division rate g₀, the increase in the cell division probability per driver mutation f, the number of driver mutations accumulated in the cell n_d, population size p, and the carrying capacity p_c: g = g₀f^n_d(1 − p∕p_c).d depends on the base death rate d₀, the decrease in the cell death probability per driver mutation, and the number of driver mutations accumulated in the cell n_d: d = d₀e^−n_d. When the cell is a differentiated cell, d₀ is replaced by $d_{0}^{d}$ , which is the base death rate for differentiated cells: $d = d_{0}^{d} e^{- n_{d}}$ . The order of the trials of cell division and death is flipped with probability 0.5. We also assumed that cell death occurs only in the case where p > 1, to prevent the simulation from halting before clonal expansion.

In a cell division, the cell is replicated into two daughter cells. If the parent cell is a stem cell, one of the two daughter cells is differentiated with a probability 1 − s; that is, s expresses the probability of symmetrical division. For each of the two daughter cells, we introduce k_d driver and k_n neutral mutations. k_d and k_n are sampled from Poison distributions, the parameters of which are m_d∕2 and m_n∕2, respectively: k_d ∼ Pois(m_d∕2) and k_n ∼ Pois(m_n∕2). Note that this means that each cell division generates m_d driver and m_n neutral mutations on average. We assumed each mutation acquired by different division events occurs at different genomic positions and each cell can accumulate N_d driver and N_n neutral mutations at maximum. When each of the two daughter cells has N_d driver mutations, we further attempted to introduce an explosive driver mutation; the explosive driver mutation is introduced with a probability m_e and sets the carrying capacity p_c of the cell to infinite. The pseudocode for the unified model is provided as Algorithm 1. The variables and parameters employed in the unified model are listed in Tables 1 and 2. The simulation code used in this study is available from https://github.com/atusiniida/canevosim.

Table 1:

Variables.

Symbol	Description
k_d	Number of driver mutations obtained in a cell division
n_d	Number of driver mutations accumulated in a cell
k_n	Number of neutral mutations obtained in a cell division
p	Population size
t	Number of time steps
g	Cell division probability
d	Cell death probability

DOI: 10.7717/peerj.8842/table-1

Table 2:

Parameters.

Symbol	Description
m_d	Expected number of driver mutations generated per cell division
m_n	Expected number of neutral mutations generated per cell division
m_e	Probability of acquiring an explosive mutation
N_d	Maximum number of driver mutations accumulated in a cell
N_n	Maximum number of neutral mutations accumulated in a cell
f	Increase of the cell division probability per driver mutation
e	Decrease of the cell death probability per driver mutation
g₀	Base cell division probability
d₀	Base cell death probability for stem cells
$d_{0}^{d}$	Base cell death probability for differentiated cells
s	Symmetrical division probability
p_c	Carrying capacity
P	Maximum population size
T	Maximum number of time steps

DOI: 10.7717/peerj.8842/table-2

 
____________________________ 
Algorithm 1 Unified model___________________________________________________________________________________________ 
  1:  prepare a stem cell without mutations 
  2:  while p < P or t < T do 
 3:       for each cell do 
 4:            g = g0fnd(1 − p∕pc) 
  5:            d = d0e−nd 
  6:            if the cell is a differentiated cell then 
 7:                  d = dd0e−nd 
  8:            if rand < 0.5 then 
 9:                  if rand() < g then 
10:                       divide(the cell) 
11:                  if p > 1 and rand() < d then  
12:                       kill the cell (accordingly, p = p − 1 ) 
13:                       # in the case that the cell is replicated, kill one of the two daughter cells 
14:            else 
15:                  if p > 1 and rand() < d then 
16:                       kill the cell (accordingly, p = p − 1) 
17:                  if rand() < g then 
18:                       divide(the cell) 
19:       t = t + 1 
20: 
21: 
22:  function rand() 
23:       return a random number ranging from 0 to 1 
24: 
25:  function divide(a cell) 
26:       replicate the cell into two daughter cells (accordingly, p = p + 1) 
27:       if the parent cell is a stem cell then 
28:            if rand() > s then 
29:                  differentiate one of the daughter cells 
30:       for each of the daughter cells do 
31:            introduce kd ∼ Pois(md∕2) driver mutations 
32:            introduce kn ∼ Pois(mn∕2) neutral mutations 
33:            if nd = ∑ 
  kd reaches the upper limit Nd then 
34:                  if rand() < me  then 
35:                       set pc of the cell to infinite    _______________________________________________________________________

Post-processing of simulation results

To evaluate the simulation results quantitatively, we calculated summary statistics based on 1,000 cells randomly sampled from each simulated tumor. these summary statistics are listed in Table 3. time and population size indicate the numbers of time steps and cells, respectively, when the simulation is complete. mutation count per cell represents the mean number of mutations accumulated in each of the randomly sampled 1,000 cells. By combining the mutations of the 1,000 cells, we defined the mutations that occur in 95% or more of the 1,000 cells as clonal mutations, and the others as subclonal mutations. The numbers of clonal, subclonal, and both types of mutations were then defined as clonal mutation count, subclonal mutation count, and total mutation count, from which clonal mutation proportion and subclonal mutation proportion were further calculated. The degree of ITH was also measured by Shannon and Simpson indices, which were calculated based on the proportions of different subclones (i.e., cell subpopulations with different mutations) after removing mutations having a frequency less of than 5% or 10%: Shannon index 0.05, Shannon index 0.1, Simpson index 0.05, and Simpson index 0.1. Similarly, after removing mutations having a frequency of less than 5% or 10%, we also checked whether multiple subclones harboring different driver mutations coexist, which is represented as binary statistics, driver-branching 0.05, and driver-branching 0.1. When the simulated tumor had differentiated cells or subclones with explosive driver mutations, the proportion of the subpopulation was calculated as subpopulation proportion.

Table 3:

Summary statistics.

Name	Description
time	Number of time steps when simulation is finished
population size	Number of cells when simulation is finished
mutation count per cell	Mean number of mutations accumulated in each cell
clonal mutation count	Number of clonal mutations
subclonal mutation count	Number of subclonal mutations
total mutation count	clonal mutation count + subclonal mutation count
clonal mutation proportion	clonal mutation count / total mutation count
subclonal mutation proportion	subclonal mutation count / total mutation count
Shannon index 0.1	Shannon index calculated with
	a mutation frequency cutoff of 0.1
Shannon index 0.05	Shannon index calculated with
	a mutation frequency cutoff of 0.05
Simpson index 0.1	Simpson index calculated with
	a mutation frequency cutoff of 0.1
Simpson index 0.05	Simpson index calculated with
	a mutation frequency cutoff of 0.05
driver-branching 0.05	Binary statistic indicating that multiple subclones
	harboring different driver mutations coexist,
	calculated with a mutation frequency cutoff of 0.05
driver-branching 0.1	Binary statistic indicating that multiple subclones
	harboring different driver mutations coexist,
	calculated with a mutation frequency cutoff of 0.1
subpopulation proportion	proportion of differentiated cells
	or subclones with explosive driver mutations

DOI: 10.7717/peerj.8842/table-3

The single-cell mutation profiles of the 1,000 cells are represented as a binary matrix, the row and column indices of which are mutations and samples, respectively. To interpret the simulation results intuitively, we also visualized the binary matrix by utilizing the heatmap function in R after the following pre-processing, if necessary. When the number of rows was less than 10, empty rows were added to the matrix so that the number of rows was 10. When the number of rows was more than 300, we extracted the 300 rows with the highest mutation occurrence so that the number of rows was 300. In the neutral and neutral-s models, we exceptionally set the maximum row number to 1,000 in order to visualize low-frequency mutations. The visualized matrix is accompanied by a left-side blue bar indicating the driver mutations. When the simulated tumor had differentiated cells or subclones with explosive driver mutations, the subpopulation is indicated by the purple bar on the top of the visualized matrix.

Sensitivity analysis based on MASSIVE

To cover a sufficiently large parameter space in the sensitivity analysis, we employed a supercomputer, SHIROKANE4 (at Human Genome Center, The Institute of Medical Science, The University of Tokyo). The simulation and post-processing steps for different parameter settings were parallelized on Univa Grid Engine. For each model, we employed a full factorial design involving four parameters (i.e, we tested every combination of candidate values of the four parameters) while other parameters were fixed. The parameter values used for our analysis are listed in Table 2. For each parameter setting, 50 Monte Carlo trials were performed and the summary statistics were averaged over the 50 trials. The averaged summary statistics calculated for each parameter setting were visualized by interactive heat maps on a web-based visualization tool, the MASSIVE viewer. The MASSIVE viewer also displays single-cell mutation profiles from 5 of the 50 trials with the same parameter setting. For details, please refer to our methodological report (Niida, Hasegawa & Miyano, 2019). All the results in this study can be interactively explored in the MASSIVE viewer on our website (https://www.hgc.jp/∼niiyan/canevosim). Parameter values used for the MASSIVE analysis are provided in Table S1.

Results

Driver model

First, we constructed the “driver” model, which contains only driver genes, aiming to study the two Darwinian selection processes: linear-replacing and driver-branching. We employed an agent-based model where each cell in a tumor is represented by an agent. The model starts from one cell without mutations. In a unit time, a cell divides into two daughter cells with a probability g. This model assumes that immortalized cell, which just divides without dying. In each cell division, each of the two daughter cells acquires k_d driver mutations. Here, k_d is sampled from a Poisson distribution with the parameter m_d∕2, i.e., k_d ∼ Pois(m_d∕2), which means that one cell division generates m_d mutations on average. We assumed that driver mutations acquired by different division events occur at different genomic positions and each cell can accumulate N_d mutations at maximum. In this study, we assumed that all mutations are driver mutations, which increase the cell division rate. When the cell acquires mutations, the cell division rate increases f fold per mutation; that is, when a cell has n_d (=∑k_d) mutations in total, the cell division probability g is defined as g = g₀f^n_d, where g₀ is a base division probability. In each time step, every cell is subject to a cell division trial, which is repeated until population size p reaches P or the number of time steps t reaches T.

To examine the manner in which each parameter affects the evolutionary dynamics of the simulation model, we performed a sensitivity analysis utilizing MASSIVE (Niida, Hasegawa & Miyano, 2019), for which we employed a supercomputer. MASSIVE first performs a very large number of agent-based simulations with a broad range of parameter settings. The results are then intuitively evaluated by the MASSIVE viewer, which interactively displays heat maps of summary statistics and single-cell mutation profiles from the simulations with each parameter setting. In Figs. 2A–2C and Fig. S1, the heat maps of three representative summary statistics, the proportion of clonal mutations ( clonal mutation proportion), a measure for ITH ( Shannon index 0.05), and an indicator for the occurrence of the driver-branching process ( driver-branching 0.05), are presented for a part of the parameter space examined. To calculate clonal mutation proportion, we defined the mutations having a frequency of 95% or more as clonal mutations. Shannon index 0.05 is the Shannon index calculated based on the proportions of different subclones (i.e., cell subpopulations with different mutations) after removing the mutations having a frequency less of than 5%. The Shannon index is commonly used to measure species richness in community ecology, and it has a positive correlation with diversity. Similarly, after removing mutations having a frequency of less than 5%, we also checked whether multiple subclones harboring different driver mutations coexist, which is represented as a binary statistic, driver-branching 0.05. For each parameter setting, 50 Monte Carlo trials were performed and the summary statistics were averaged over the 50 trials. To examine ITH visually, we sampled 1,000 cells from a simulated tumor and obtained a single-cell mutation profile matrix. The mutation profile matrix was visualized after reordering its rows and columns based on hierarchical clustering. The rows and columns index mutations and samples, respectively (Figs. 2D–2F). All the results can be interactively explored in the MASSIVE viewer on our website (https://www.hgc.jp/∼niiyan/canevosim/driver).

The results of the MASSIVE sensitivity analysis demonstrated that the strength of driver mutations f is the most prominent determinant of the Darwinian selection processes (Fig. 2). A smaller value of f (e.g., f = 10^0.3), which indicates weaker driver mutations, is generally associated with the driver-branching process, which is characterized by large driver-branching 0.05, corresponding to parameter setting D in Figs. 2A–2C. However, in the case of a low mutation rate (e.g., m_d = 10⁻³), a small f value is insufficient to cause expansions of multiple clones, corresponding to parameter setting F in Figs. 2A–2C. When the value of f is large (e.g., f = 10^0.9), driver-branching 0.05 is small, but the clonal mutation proportion is large, which suggests that the linear-replacing process generates a homogeneous tumor, corresponding to parameter setting E in Figs. 2A–2C. By considering these results with time-course snapshots of the simulations, mechanisms driving the linear-replacing and driver-branching processes were intuitively interpreted (Fig. 3). Under the assumption of weak driver mutations, before a clone that has acquired the first driver mutation becomes dominant, other clones that have acquired different mutations expand, leading to the driver-branching process (Figs. 3A and 3B). In contrast, under the assumption of strong driver mutations, a clone that has acquired the first driver mutation rapidly expands to obtain more driver mutations serially, leading to the linear-replacing process (Figs. 3C and 3D).

Figure 3: Time-course snapshots of simulations based on the driver model.
Growth curve (A) and time-course snapshots of mutation profiles (B) simulated from the driver model with N_d = 3, P = 10⁶, f = 10^0.3, and m_d = 10^−1.5 (corresponding to parameter setting D in Figs. 2A–2C). Growth curve (C) and time-course snapshots of mutation profiles (D) simulated from the driver model with N_d = 3, P = 10⁶, f = 10^0.9, and m_d = 10^−1.5 (corresponding to parameter setting E in Figs. 2A–2C). The time points when snapshots were obtained are indicated by empty circles on the growth curves.

Download full-size image

DOI: 10.7717/peerj.8842/fig-3

The linear-replacing process is very similar to the fixation and selective sweep described in the standard population genetics framework (Maynard Smith & Haigh, 1974; Ohta & Kimura, 1975). Note that, in a strict sense, fixation does not occur under the assumption that cancer cells are immortal (Sidow & Spies, 2015; Ohtsuki & Innan, 2017; Niida, Iwasaki & Innan, 2019); even if a tumor appears to be monoclonal in a mutation profile for 1,000 randomly sampled cells, it is possible that minor clones having less fitness coexist in the actual population. In the driver-branching process, we observe various subclones that coexist in the population. They could compete with each other depending on their fitness. If different subclones obtain distinct driver mutations with very similar fitness effects independently, the competition between them will be neutral so that none of them can be fixed and they will keep competing. This situation is similar to the phenomenon called “clonal interference” in an asexual population (Gerrish & Lenski, 1998).

In actual tumors, driver mutations can not only increase the growth rate but also decrease the death rate. To test the effect of driver mutations decreasing the death rate, we also created a modified version of the driver model, the “driver-d” model. In the driver-d model, each cell divides with a constant probability g₀ and dies with a probability d. Driver mutation decreases the cell death probability by f fold: d = d₀e^−n_d, where d₀ is the base death probability. Moreover, we assumed that cell death occurs only in the case of p > 1, to prevent the simulation from halting before clonal expansion. We applied the MASSIVE analysis to the driver-d model to find that, if a high mutation rate is assumed (i.e., m_d = 10⁻²), the driver-branching process is pervasive, irrespective of the strength of the driver mutations (Fig. S2; https://www.hgc.jp/∼niiyan/canevosim/driver_d). This observation is presumably ascribed to the fact that a driver mutation that decreases the death rate cannot provide a cell with the strong growth advantage necessary for the linear-replacing process. Even if the mutation rate is low (i.e., m_d = 10⁻⁴), multiple clones appear after the simulation proceeds to reach a sufficient population size. We also examined the evolutionary dynamics of the driver-d models with different mutation rates by taking time-course snapshots of the simulations (Fig. S3).

In both the driver and driver-d models, we do not consider spatial information. However, it should be noted that, by simulating tumor growth on a one-dimensional lattice, we demonstrated that the spatial bias of a resource necessary for cell divisions could prompt the driver-branching process (Niida, Hasegawa & Miyano, 2019).

Neutral model

Next, we examined the neutral-branching process by analyzing the “neutral” model, where we considered only neutral mutations that do not affect cell division and death. In a unit time, a cell divides into two daughter cells with a constant probability g₀ without dying. Similarly to driver mutations in the driver model, in each cell division, each of the two daughter cells acquires k_n ∼ Pois(m_n∕2) neutral mutations. We assumed that neutral mutations acquired by different division events occur at different genomic positions and each cell can accumulate N_n mutations at maximum. In this study, we set N_n = 1000, which is sufficiently large that no cell reaches the upper limit, except in a few exceptional cases. The simulation started from one cell without mutations and ended when population size p reached P or time t reached T.

The MASSIVE analysis of the neutral model demonstrated that, as expected, the mutation rate is the most important factor for the neutral-branching process (Fig. 4; https://www.hgc.jp/∼niiyan/canevosim/neutral_s; note that the neutral model is included by the neutral-s model, which is described below). When the mean number of mutations generated by per cell division, m_n, was less than 1, the neutral model just generated sparse mutation profiles with relatively small values of the ITH score, Shannon index 0.05. In contrast, when m_n exceeded 1, the mutation profiles presented extensive ITH, which are characterized by a fractal-like pattern and large values of the ITH score (hereinafter, this type of ITH is referred to as “neutral ITH”). According to these results, it is intuitively supposed that neutral ITH is shaped by neutral mutations that trace the cell lineages in the simulated tumors. Note that the mutation profiles were visualized after filtering out low-frequency mutations. Under the assumption of a high mutation rate, more numerous subclones having different mutations should be observed if we count the mutations existing with lower frequencies.

Figure 4: Sensitivity analysis of the neutral model.
(A) Heap map obtained by calculating Shannon index 0.05 while changing the neutral mutation rate m_n and the maximum population size P. (B–H) Single-cell mutations profiles obtained for seven parameter settings, which are indicated on the heat map in A.

Download full-size image

DOI: 10.7717/peerj.8842/fig-4

To verify this speculation, we counted the number of subclones generated from a simulated tumor, while varying the frequency cutoffs for filtering out mutations. Figure S4 shows the plot of the relationship between the number of subclones and the frequency cutoffs. As expected, the results indicate that the simulated tumor presents an increasing number of subclones as the frequency cutoff is lowered. The linearity of the log–log plot demonstrates that the power law is hidden in the mutation profile, consistently with its fractal-like pattern (Brown et al., 2002). Note that, although the ITH score does not depend on population size P and the fractal-like pattern shaped in the earliest stage appears to be subsequently unchanged in the time-course snapshots (Fig. 5), these are also because low-frequency mutations were filtered out before visualization; the simulated tumor in fact expands neutral ITH by accumulating numerous low-frequency mutations as it grows.

Figure 5: Time-course snapshots of simulations based on the neutral model.
Growth curve (A) and time-course snapshots of mutation profiles (B) simulated from the driver model with P = 10⁶ and m_n = 10 (corresponding to parameter setting H in Fig. 4A). The time points when snapshots were obtained are indicated by empty circles on the growth curves.

Download full-size image

DOI: 10.7717/peerj.8842/fig-5

Thus far, several theoretical and computational studies have shown that a stem cell hierarchy can boost the neutral-branching process (Sottoriva et al., 2010; Solé et al., 2008), which prompted us to extend the neutral model to the “neutral-s” model such that it contains a stem cell hierarchy (Fig. S5). The neutral-s model assumes that two types of cells exist: stem and differentiated. Stem cells divide with a probability g₀ without dying. For each cell division of stem cells, a symmetrical division generating two stem cells occurs with a probability s, while an asymmetrical division generating one stem cell and one differentiated cell occurs with a probability 1 − s. A differentiated cell symmetrically divides to generate two differentiated cells with a probability g₀ but dies with a probability $d_{0}^{d}$ . The means of accumulating neutral mutations in the two types of cell is the same as that in the original neutral model, which means that the neutral-s model is equal to the original neutral model when s = 0 or $d_{0}^{d} = 0$ . For convenience, we define $δ = {log}_{10} (d_{0}^{d} ∕ g_{0})$ and hereinafter use δ instead of $d_{0}^{d}$ .

The MASSIVE analysis of the neutral-s model confirmed that the incorporation of the stem cell hierarchy boosts the neutral-branching process (https://www.hgc.jp/∼niiyan/canevosim/neutral_s). To obtain the heat map in Fig. 6A, the ITH score was measured while $d_{0}^{d}$ and δ were changed, but m_n = 0.1 and P = 1, 000 were constantly set. In the heat map, a decrease of s leads to an increase in the ITH score when δ ≥ 0 (i.e., $d_{0}^{d} \geq g_{0}$ ). A smaller value of s means that more differentiated cells are generated per stem cell division, and δ ≥ 0 means that the population of the differentiated cells cannot grow in total, which is a valid assumption for typical stem cell hierarchy models. That is, this observation indicates that the stem cell hierarchy can induce neutral ITH even with a relatively low mutation rate setting (i.e., m_n = 0.1), with which the original neutral model cannot generate neutral ITH.

Figure 6: Sensitivity analysis of the neutral-s model.
(A) Heat map obtained by calculating Shannon index 0.05 while changing the relative death rate of differentiated cells $δ = {log}_{10} (d_{0}^{d} ∕ g_{0})$ and the symmetrical division rate s. The neutral mutation rate m_n and the maximum population size P set to 10⁻¹ and 10⁵, respectively. (B–J) Single-cell mutation profiles obtained for nine parameter settings, indicated on the heat map presented in A.

Download full-size image

DOI: 10.7717/peerj.8842/fig-6

The underlying mechanism boosting the neutral-branching process can be explained as follows. We here consider only stem cells for an approximation, because differentiated cells do not contribute to tumor growth with δ ≥ 0. While one cell grows to a population of P cells, let cell divisions synchronously occur across x generations during the clonal expansion. Then, (1 + s)^x = P holds, because the mean number of stem cells generated per cell division is estimated as 1 + s. Solving the equation for x gives x = logP∕log(1 + s); that is, it can be estimated that, during the clonal expansion, each of the P cells experiences logP∕log(1 + s) cell divisions and accumulates m_nlogP∕2log(1 + s) mutations on average. We confirmed that the expected mutation count based on this formula is well fit with the values observed in our simulation, except in the exceptional cases where the mutation counts reached the upper limit, N_n = 1, 000 (Fig. S6). These arguments mean that a tumor with a stem cell hierarchy accumulates more mutations until reaching a fixed population size than does a tumor without a stem cell hierarchy. That is, a stem cell hierarchy increases the apparent mutation rate by log2∕log(1 + s) folds, which induces the neutral-branching process even with relatively low mutation rate settings.

Similarly, we can also show that the introduction of cell death to the neutral model boosts the neutral-branching process. In the neutral model having a non-zero death rate d₀, we estimate that the mean number of cells generated per cell division is 2 − d₀∕g₀. Through arguments similar to the one above, we can also show that the apparent mutation rate is increased by log2∕log(2 − d₀∕g₀). Collectively, although the mutation rate is the most important determinant for generating neutral ITH, the introduction of cell death as well as stem cell hierarchy also contribute to the neutral-branching process by increasing the apparent mutation rate.

Combining the driver and neutral model

We now present the “composite” model that was constructed by combining the driver and neutral model, aiming to reproduce ITH more similar to those in real tumors. In a unit time, a cell divides into two daughter cells with a constant probability g without dying. In each cell division, each of the two daughter cells acquires k_d ∼ Pois(m_d∕2) driver mutations and k_n ∼ Pois(m_n∕2) neutral mutations. For each type of mutation, N_d and N_n mutations can be accumulated at maximum. For a cell that has n_d (=∑k_d) mutations, cell division probability g is defined as g = g₀f^n_d, where g₀ is a base division probability. The simulation started from one cell without mutations and ended when the population size p reached P or time t reached T. As expected from the MASSIVE analyses of the driver and neutral model that were performed separately, our MASSIVE analysis of the composite model confirmed that, depending on the parameter setting, behaviors of the composite model and the resultant mutation profiles are roughly categorized into the following six classes (Fig. 7; https://www.hgc.jp/∼niiyan/canevosim/composite):

Figure 7: Six classes of mutation profiles simulated by the composite model.
Our sensitivity analysis demonstrated that, depending on the parameter setting, behaviors of the composite model are roughly categorized into the six classes. Representative mutation profiles of the six classes are presented.

Download full-size image

DOI: 10.7717/peerj.8842/fig-7

With small m_d and small m_n, i.e., with low driver and neutral mutation rates, no evolutionary process involving driver and neutral mutations occurs.
With large m_d, small m_n, and small f (i.e., with high driver and low neutral mutation rates, and weak driver mutations), the driver-branching occurs while the neutral-branching process does not occur.
With large m_d, small m_n, and large f (i.e., with high driver and low neutral mutation rates, and strong driver mutations), the linear-replacing process occurs while the neutral-branching process does not occur..
With small m_d and large m_n (i.e., with low driver and high neutral mutation rates), the neutral-branching process occurs while no evolutionary process involving driver mutations occurs.
With large m_d, large m_n, and small f (i.e., with low driver and high neutral mutation rates, and weak driver mutations), the driver-branching and neutral-branching processes occur simultaneously.
With large m_d, large m_n, and large f (i.e., with high driver and high neutral mutation rates, and strong driver mutations), the linear-replacing and neutral-branching processes occur simultaneously.

Note that, because tumors having high driver mutation rates must have high neutral mutation rates also, the linear-replacing and driver-branching processes must in general be accompanied by the neutral-branching process. Therefore, the last three behaviors are supposed to constitute the process that can actually occur in real tumors (note that, since different processes work simultaneously and continuously as a series of phases of cancer evolution in real tumors as described below, the situation is not so simple).

Adding the punctuated-replacing process

Previously, we analyzed multiregion sequencing data of advanced colorectal cancer and precancerous lesions jointly to demonstrated that the evolutionary principle generating ITH shifts from the driver- to neutral-branching process during colorectal tumorigenesis (Saito et al., 2018). We also demonstrated that the number of copy number alterations drastically increases during the progression from colorectal precancerous lesions to advanced colorectal cancer, which prompted us to suspect that the punctuated-replacing process underlies the evolutionary shift from branching to the neutral-branching process (Fig. 1E). To examine this possibility, we additionally incorporated the punctuated-replacing process into the composite model to build the “punctuated” model.

For the models considered thus far, we assumed that a cell can infinitely grow without a decrease in their growth speed. However, it is more natural to assume that there exists a limit of population size because of the resource limitation and that the growth speed gradually slows down as the population size approaches the limit. The limit of population sizes is called the carrying capacity and employed in the well-known logistic equation (Verhulst, 1838). By mimicking the logistic equation, we modified the division probability as g = g₀f^n_d(1 − p∕p_c), where p_c is the carrying capacity. To reproduce the punctuated-replacing process, we additionally employ an “explosive” driver mutation, which negates the effect of the carrying capacity. After a cell accumulates driver mutations up to the maximum N_d, the explosive driver mutation is introduced at a probability m_e after cell division. For a cell that has the explosive driver mutation, the carrying capacity p_c is set to infinite; That is, it is assumed that the explosive driver mutation rapidly evolves the cell so that it can conquer the growth limit and attain infinite proliferation ability.

Next, we searched for parameter settings that lead the punctuated model to reproduce the punctuated-replacing process. The MASSIVE analysis confirmed that, with sufficiently large m_e (i.e., m_e > 10⁻⁴), the punctuated-replacing process is reproducible in the punctuated model (https://www.hgc.jp/∼niiyan/canevosim/punctuated; note that, for simplicity, we omitted neutral mutations by setting m_n = 0 in the MASSIVE analysis). We also examined time-course snapshots of simulations conducted with these parameter settings. In the example shown in Figs. 8A and 8B, we observed that multiple subclones having different driver genes coexist; that is, the driver-branching process, with which the neutral-branching process occurs simultaneously, is prominent during the early phase of the simulation. Note that a growth curve plot indicates that, as the population size approaches the carrying capacity, the growth speed slows down; however, the tumor regrows after the appearance of a clone that has acquired an explosive driver mutation. The clone with the explosive driver mutation is then subjected to a selective sweep, which causes subclonal driver mutations in the clone to shift to clonal mutations. Then, only neutral mutations are accumulated as subclonal mutations; That is, ITH is finally generated by the neutral-branching process.

Figure 8: Time-course snapshots of simulations based on the punctuated model.
Growth curve (A) and time-course snapshots of mutation profiles (B) simulated from the punctuated model with P = 10⁶, p_c = 10^3.5, m_d = 10⁻¹, m_p = 10^0.5, and m_e = 10⁻⁴. Growth curve (C) and time-course snapshots of mutation profiles (D) simulated from the punctuated model with P = 10⁶, p_c = 10^3.5, m_d = 10⁻¹, m_p = 10^0.5, and m_e = 10⁻³. The time points when snapshots were obtained are indicated by empty circles on the growth curves.

Download full-size image

DOI: 10.7717/peerj.8842/fig-8

We also found that two subclones having different subclonal driver mutations sometimes appear by obtaining two independent explosive driver mutations (Figs. 8C and 8D). This observation recalls to mind the multiverse model, which was proposed for glioblastoma evolution (Lee et al., 2017). The multiverse model is derived from the Big-Bang model, a model for jointly describing punctuated and the neutral-branching process during colorectal tumorigenesis (Sottoriva et al., 2015). The Big-Bang model assumes that a single clone explosively expands from a precancerous lesion while generating neutral ITH, consistently with our evolutionary shift model. However, in the multiverse model, it is assumed that multiple subclones are subject to explosive expansion. Collectively, our simulation based on the punctuated model not only supports our hypothesis that the punctuated-replacing process underlies the evolutionary shift during colorectal tumorigenesis, but also can reproduce multiple types of punctuated models proposed thus far.

Our simulation based on the punctuated model also demonstrated a dramatic evolution of cancer, during which multiple processes could go on simultaneously and continuously, and we observed different phases along the evolution. Consequently, the mutation profile records the history of the processes such that a series of multiple phases arise with different patterns of mutation profiles. It is possible that we infer the history from the mutation profile at the end point to some degree; for example, the accumulation of clonal driver mutations suggests that the tumor has been subjected to the linear- or punctuated-replacing process. However, our result emphasizes the importance of having a time-series data to fully understand the detailed process behind cancer evolution (Sato et al., 2019).

Discussion

In the ‘Results’ section, we introduced a family of simulation models that reproduce the four types of cancer evolutionary processes: linear-replacing, driver-branching, neutral-branching, and punctuated-replacing. Our sensitivity analysis of these models successfully identified the conditions leading to each of the evolutionary processes. For example, under the assumption of a sufficiently high mutation rate, the driver-branching process occurs with strong driver mutations, whereas linear evolution occurs with weak driver mutations. However, a major concern about our sensitivity analysis is whether the ranges of parameter values examined is realistic. Although dependent on tumor types, the number of driver mutations were previously estimated as in the low single digits for most tumor types, consistently with our settings for d. As the increase in the cell division probability per driver mutation f, which is interpreted as the strength of driver mutations, we examined values ranging from 10^0.1 to 10^1.0. Although the value of f has not been the subject to extensive experimental determination, it has been reported that the induction of K-ras^G12D in murine small intestine increases growth rate from one cell cycle per 24 hr to one cell cycle per 15 hr, from which f is estimated as 10^0.204 (Snippert et al., 2014).

The driver mutation rate m_d and population size P appear to be problematic. Although the driver mutation rate was previously estimated as ∼3.4 × 10⁻⁵ per cell division (Bozic et al., 2010), our sensitivity analysis examined values from 10⁻⁴ to 10⁻¹, which are above the estimated value by orders of magnitude. It should also be noted that, in our simulation, it was assumed that a tumor contains 10⁶ cells at maximum, whereas the number of cancer cells in one gram of tumor tissue is reportedly 10⁹ or one order less (Del Monte, 2009). Clearly, for m_d and P, the parameter space we examined does not cover those for a real tumor. However, the results of the MASSIVE analysis allow the behaviors of the driver model to be envisioned in a realistic parameter space. When P is small, neither the linear-replacing process nor the driver-branching process occurs. As P increases, we observe the linear-replacing or driver-branching process with smaller m_d, although the range of f that leads to the driver-branching process shifts to larger values. Moreover, as shown by the sensitivity analysis of the neutral-s model, the presence of a stem cell hierarchy increases the apparent mutation rate. Therefore, a real tumor having a stem cell hierarchy apparently should have a higher m_d value. Collectively, it is natural to assume that a real tumor having large P and small m_d can be similarly generated by the linear-replacing or driver-branching process, although, in such cases, the actual value of f might be larger than those that we examined.

The sensitivity analysis of the neutral model showed that neutral ITH is generated if the expected number of neutral mutations generated per cell division, m_n, exceeds 1. In a recent report, the estimated somatic mutation rate was given as 2.66 × 10⁻⁹ mutations per base pair per mitosis. Given that most mutations are neutral on the human genome comprised of 3 × 10⁹ bases, even a cell division of normal cells generates more than 1 neutral mutation. As cancer cells should have higher mutation rates, which can be further accelerated by stem cell hierarchies, it is reasonable to assume that a tumor in general satisfies the conditions to generate neutral ITH. However, not every tumor necessarily has neutral ITH; neutral ITH is distorted by natural selection if the tumor additionally satisfies the conditions for the driver-branching process, as shown by the analysis employing the composite model.

A highlight of this work is that the punctuated model demonstrated that the punctuated-replacing process triggers the evolutionary shift from branching to the neutral-branching process. For carrying capacity p_c and the probability of acquiring an explosive mutation m_e in the punctuated model, the parameter values that we examined are clearly outside realistic ranges. Similarly to P, p_c should take a larger value. Although it cannot easily to be experimentally determined, m_e also appears to be overestimated; although the human body in fact potentially harbors numerous precancerous lesions (Brunner et al., 2019; Yokoyama et al., 2019), which are assumed not to have acquired explosive driver mutations yet, only a tiny fraction of cases progress to advanced stages by acquiring explosive driver mutations. However, it is intuitively understandable that the behaviors of the punctuated model, as well as of the driver model, are not dependent on precise values of these parameters, and in our opinion, our analysis is sufficient to provide a semi-quantitative understanding of cancer evolution.

The models we introduced in the Results section can be described collectively as the unified model, a formal description of which is provided in the ‘Materials & Methods’. The unified model is very simple but sufficient to reproduce the linear-replacing, driver-branching, neutral-branching, and punctuated-replacing processes. Of course, the unified model harbors many limitations, which should be addressed in future studies. Our current version of the model completely ignores spatial information, which potentially influences evolutionary dynamics. Recently reported studies have shown that spatial structures regulate evolutionary dynamics in tumors (Noble et al., 2019; West et al., 2019). We also determined that resource bias prompts the driver-branching process, by simulating tumor growth on a one-dimensional lattice (Niida, Hasegawa & Miyano, 2019). Moreover, Iwasaki & Innan (2017) recently developed a realistic simulator called tumopp to show that the three-dimensional pattern of ITH is affected by the local cell competition and asymmetric stem cell division. Although our model assumed that driver mutations independently have effects of equal strength, different driver mutations should have different strengths and might work synergistically (Castro-Giner, Ratcliffe & Tomlinson, 2015). Similarly, although we assumed that the punctuated-replacing process occurs only once in the course of cancer evolution, it is possible that a tumor is confronted with different types of resource limitations during the tumor progression and undergoes the punctuated-replacing process multiple times to conquer them (Aktipis et al., 2013).

Conclusion

Although the unified model harbors the above-described limitations, the application of sensitivity analysis to the model has successfully provided a number of insights into cancer evolutionary dynamics. In our opinion, the unified model serves as a starting point for constructing more realistic simulation models to understand in greater depth the diversity of cancer evolution, which is being unveiled by the ever-growing amount of cancer genomics data.

Supplemental Information

Sensitivity analysis of the driver model

While changing the driver mutation rate m_d, the strength of driver mutations f, and the maximum population size P, heat maps of the summary statistics were prepared for the proportion of clonal mutations, clonal mutation proportion (A), a measure for ITH, Shannon index 0.05 (B), and an indicator for the occurrence of the driver-branching process, driver-branching 0.05 (C). N_d was set to 3.

DOI: 10.7717/peerj.8842/supp-1

Download

Sensitivity analysis of the driver-d model

While changing the driver mutation rate m_d , the strength of driver mutations e, and the maximum population size P, heat maps of the summary statistics were prepared for the proportion of clonal mutations, clonal mutation proportion (A), a measure for ITH, Shannon index 0.05 (B), and an indicator for the occurrence of the driver-branching process, driver-branching 0.05 (C). N_d was set to 3.

DOI: 10.7717/peerj.8842/supp-2

Download

Time-course snapshots of simulations based on the driver-d model

Growth curve (A) and time-course snapshots of mutation profiles (B) simulated from the driver model with N_d = 3, P = 10⁶, e = 10^0.5, and m_d = 10⁻⁴ (a low mutation rate setting). Growth curve (C) and time-course snapshots of mutation profiles (D) simulated from the driver model with N_d = 3, P = 10⁶, e = 10^0.5, and m_d = 10⁻² (a high mutation rate setting). The time points when the snapshots were obtained are indicated by empty circles on the growth curves.

DOI: 10.7717/peerj.8842/supp-3

Download

Self-similarity of neutral ITH

(A) Illustrative explanation of the preparation of the log-log plot presented in (B). After mutations having frequencies less than r are filtered out, the number of subclones c is counted based on the mutation profiles. (B) Log-log plot for r and c obtained from a simulation with P = 10⁵ and m_n = 10. Similar linearity holds when m_n ≥ 1.

DOI: 10.7717/peerj.8842/supp-4

Download

Schema of the neutral-s model

Stem cells divide with a probability g_o without dying. For each cell division of stem cells, a symmetrical division generating two stem cells occurs with probability s, while an asymmetrical division generating one stem cell and one differentiated cell occurs with probability 1 − s. A differentiated cell symmetrically divides to generate two differentiated cells d with probability g₀ but dies with probability d₀.

DOI: 10.7717/peerj.8842/supp-5

Download

Observed and expected mutation counts from the neutral-s model

The observed mutation counts (obs) were prepared from values of mutation count per cell in the MASSIVE analysis, while the expected mutation counts (exp) were analytically estimated as m_n log P∕2log(1 + s) under the assumption that δ ≥ 0. Each cross representing each parameter setting was plotted in log10 scale for different values of δ. Positioning on the dashed line indicates the equality of the observed and expected mutation counts.

DOI: 10.7717/peerj.8842/supp-6

Download

Parameter values used for the MASSIVE analysis

DOI: 10.7717/peerj.8842/supp-7

Download

[1] Aktipis CA, Boddy AM, Gatenby RA, Brown JS, Maley CC. 2013. Life history trade-offs in cancer evolution. Nature Reviews Cancer 13(12):883-892

[2] Altrock PM, Liu LL, Michor F. 2015. The mathematics of cancer: integrating quantitative models. Nature Reviews Cancer 15(12):730-745

[3] Baca S, Prandi D, Lawrence M, Mosquera J, Romanel A, Drier Y, Park K, Kitabayashi N, MacDonald T, Ghandi M, Van AE, Kryukov G, Sboner A, Theurillat J, Soong T, Nickerson E, Auclair D, Tewari A, Beltran H, Onofrio R, Boysen G, Guiducci C, Barbieri C, Cibulskis K, Sivachenko A, Carter S, Saksena G, Voet D, Ramos A, Winckler W, Cipicchio M, Ardlie K, Kantoff P, Berger M, Gabriel S, Golub T, Meyerson M, Lander E, Elemento O, Getz G, Demichelis F, Rubin M, Garraway L. 2013. Punctuated evolution of prostate cancer genomes. Cell 153(3):666-677

[4] Beerenwinkel N, Schwarz RF, Gerstung M, Markowetz F. 2014. Cancer evolution: mathematical models and computational inference. Systematic Biology 64(1):e1-e25

[5] Bozic I, Antal T, Ohtsuki H, Carter H, Kim D, Chen S, Karchin R, Kinzler KW, Vogelstein B, Nowak MA. 2010. Accumulation of driver and passenger mutations during tumor progression. Proceedings of the National Academy of Sciences of the United States of America 107(43):18545-18550

[6] Brown JH, Gupta VK, Li B-L, Milne BT, Restrepo C, West GB. 2002. The fractal nature of nature: power laws, ecological complexity and biodiversity. Philosophical Transactions of the Royal Society of London. Series B: Biological Sciences 357(1421):619-626

[7] Niida A, Nagayama S, Miyano S, Mimori K. 2018. Understanding intratumor heterogeneity by combining genome analysis and mathematical modeling. Cancer Science 109(4):884-892

[8] Brunner S, Roberts N, Wylie L, Moore L, Aitken S, Davies S, Sanders M, Ellis P, Alder C, Hooks Y, Abascal F, Stratton M, Martincorena I, Hoare M, Campbell P. 2019. Somatic mutations and clonal dynamics in healthy and cirrhotic human liver. Nature 574(7779):538-542

[9] Castro-Giner F, Ratcliffe P, Tomlinson I. 2015. The mini-driver model of polygenic cancer evolution. Nature Reviews Cancer 15(11):680-685

[10] Cross W, Kovac M, Mustonen V, Temko D, Davis H, Baker A-M, Biswas S, Arnold R, Chegwidden L, Gatenbee C, Anderson AR, Koelzer VH, Martinez P, Jiang X, Domingo E, Woodcock DJ, Feng Y, Kovacova M, Maughan T, Adams R, Bach S, Beggs A, Brown L, Buffa F, Cazier J.-B., Blake A, Wu C-H, Chatzpili E, Richman S, Dunne P, Harkin P, Higgins G, Hill J, Holmes C, Horgan D, Kaplan R, Kennedy R, Lawler M, Leedham S, McDermott U, McKenna G, Middleton G, Morton D, Murray G, Quirke P, Salto-Tellez M, Samuel L, Schuh A, Sebag-Montefiore D, Seymour M, Sharma R, Sullivan R, Tomlinson I, West N, Wilson R, Jansen M, Rodriguez-Justo M, Ashraf S, Guy R, Cunningham C, East JE, Wedge DC, Wang LM, Palles C, Heinimann K, Sottoriva A, Leedham SJ, Graham TA, Tomlinson I. PM, Consortium TS. 2018. The evolutionary landscape of colorectal tumorigenesis. Nature Ecology & Evolution 2(10):1661-1672

[11] Davis A, Gao R, Navin N. 2017. Tumor evolution: Linear, branching, neutral or punctuated? Biochimica et Biophysica Acta (BBA)-Reviews on Cancer 1867(2):151-161

[12] Del Monte U. 2009. Does the cell number 109 still really fit one gram of tumor tissue? Cell Cycle 8(3):505-506

[13] Fearon ER, Vogelstein B. 1990. A genetic model for colorectal tumorigenesis. Cell 61(5):759-767

[14] Gao R, Davis A, McDonald T, Sei E, Shi X, Wang Y, Tsai P, Casasent A, Waters J, Zhang H, Meric-Bernstam F, Michor F, Navin N. 2016. Punctuated copy number evolution and clonal stasis in triple-negative breast cancer. Nature genetics 48(10):1119-1130

[15] Gerrish PJ, Lenski RE. 1998. The fate of competing beneficial mutations in an asexual population. Genetica 102:127

[16] Gould NE-SJ, Eldredge N. 1972. Punctuated equilibria: an alternative to phyletic gradualism. In: Schopf TJM, ed. Models in paleobiology. San Francisco: Cooper & Co.. 82-115

[17] Iwasaki WM, Innan H. 2017. Simulation framework for generating intratumor heterogeneity patterns in a cancer cell population. PLOS ONE 12(9):e0184229

[18] Jay Gould S, Eldredge N. 1993. Punctuated equilibrium comes of age. Nature 366(6452):223-227

[19] Lee J-K, Wang J, Sa JK, Ladewig E, Lee H-O, Lee I-H, Kang HJ, Rosenbloom DS, Camara PG, Liu Z, Van Nieuwenhuizen P, Jung SW, Choi SW, Kim J, Chen A, Kim K-T, Shin S, Seo YJ, Oh J-M, Shin YJ, Park C-K, Kong D-S, Seol HJ, Blumberg A, Lee J-I, Iavarone A, Park W-Y, Rabadan R, Nam D-H. 2017. Spatiotemporal genomic architecture informs precision oncology in glioblastoma. Nature Genetics 49(4):594-599

[20] Ling S, Hu Z, Yang Z, Yang F, Li Y, Lin P, Chen K, Dong L, Cao L, Tao Y, Hao L, Chen Q, Gong Q, Wu D, Li W, Zhao W, Tian X, Hao C, Hungate EA, Catenacci DV, Hudson RR, Li WH, Lu X, Wu CI. 2015. Extremely high genetic diversity in a single tumor points to prevalence of non-Darwinian cell evolution. Proceedings of the National Academy of Sciences of the United States of America 112(47):E6496-E6505

[21] Maynard Smith J, Haigh J. 1974. The hitchhiking effect of a favourable gene. Genetics Research., Cambridge University Press 23:23-35

[22] McGranahan N, Swanton C. 2017. Clonal heterogeneity and tumor evolution: past, present, and the future. Cell 168(4):613-628

[23] Minussi DC, Henz B, Dos Santos Oliveira M, Filippi-Chiela EC, Oliveira MM, Lenz G. 2019. esiCancer: evolutionary in silico cancer simulator. Cancer Research 79(5):1010-1013

[24] Niida A, Hasegawa T, Miyano S. 2019. Sensitivity analysis of agent-based simulation utilizing massively parallel computation and interactive data visualization. PLOS ONE 14(3):e0210678

[25] Niida A, Iwasaki WM, Innan H. 2018. Neutral theory in cancer cell population genetics. Molecular Biology and Evolution 35(6):1316-1321

[26] Noble R, Burri D, Kather JN, Beerenwinkel N. 2019. Spatial structure governs the mode of tumour evolution. bioRxiv 586735

[27] Nowell PC. 1976. The clonal evolution of tumor cell populations. Science 194(4260):23-28

[28] Ohta T, Kimura M. 1975. The effect of a selected linked locus on heterozygosity of neutral alleles (the hitch-hiking effect) Genetics Research., Cambridge University Press 25:313-326

[29] Ohtsuki H, Innan H. 2017. Forward and backward evolutionary processes and allele frequency spectrum in a cancer cell population. Theoretical Population Biology 117:43-50

[30] Poleszczuk J, Hahnfeldt P, Enderling H. 2015. Evolution and phenotypic selection of cancer stem cells. PLOS Computational Biology 11(3):e1004025

[31] Saito T, Niida A, Uchi R, Hirata H, Komatsu H, Sakimura S, Hayashi S, Nambara S, Kuroda Y, Ito S, Eguchi H, Masuda T, Sugimachi K, Tobo T, Nishida H, Daa T, Chiba K, Shiraishi Y, Yoshizato T, Kodama M, Okimoto T, Mizukami K, Ogawa R, Okamoto K, Shuto M, Fukuda K, Matsui Y, Shimamura T, Hasegawa T, Doki Y, Nagayama S, Yamada K, Kato M, Shibata T, Mori M, Aburatani H, Murakami K, Suzuki Y, Ogawa S, Miyano S, Mimori K. 2018. A temporal shift of the evolutionary principle shaping intratumor heterogeneity in colorectal cancer. Nature Communications 9(1):2884

[32] Sato K, Niida A, Masuda T, Shimizu D, Tobo T, Kuroda Y, Eguchi H, Nakagawa T, Suzuki Y, Mimori K. 2019. Multiregion genomic analysis of serially transplanted patient-derived xenograft tumors. Cancer Genomics-Proteomics 16(1):21-27

[33] Sidow A, Spies N. 2015. Concepts in solid tumor evolution. Trends in Genetics 31(4):208-214

[34] Snippert HJ, Schepers AG, Van Es JH, Simons BD, Clevers H. 2014. Biased competition between Lgr5 intestinal stem cells driven by oncogenic mutation induces clonal expansion. EMBO Reports 15(1):62-69

[35] Solé RV, Rodríguez-Caso C, Deisboeck TS, Saldaña J. 2008. Cancer stem cells as the engine of unstable tumor progression. Journal of Theoretical Biology 253(4):629-637

[36] Sottoriva A, Kang H, Ma Z, Graham T, Salomon M, Zhao J, Marjoram P, Siegmund K, Press M, Shibata D, Curtis C. 2015. A Big Bang model of human colorectal tumor growth. Nature Genetics 47(3):209

[37] Sottoriva A, Verhoeff JJ, Borovski T, McWeeney SK, Naumov L, Medema JP, Sloot PM, Vermeulen L. 2010. Cancer stem cell tumor model reveals invasive morphology and increased phenotypical heterogeneity. Cancer Research 70(1):46-56

[38] Suzuki H, Aoki K, Chiba K, Sato Y, Shiozawa Y, Shiraishi Y, Shimamura T, Niida A, Motomura K, Ohka F, Yamamoto T, Tanahashi K, Ranjit M, Wakabayashi T, Yoshizato T, Kataoka K, Yoshida K, Nagata Y, Sato-Otsubo A, Tanaka H, Sanada M, Kondo Y, Nakamura H, Mizoguchi M, Abe T, Muragaki Y, Watanabe R, Ito I, Miyano S, Natsume A, Ogawa S. 2015. Mutational landscape and clonal architecture in grade II and III gliomas. Nature Genetics 47(5):458-468

[39] Turajlic S, Xu H, Litchfield K, Rowan A, Horswell S, Chambers T, O’Brien T, Lopez J, Watkins T, Nicol D, Stares M, Challacombe B, Hazell S, Chandra A, Mitchell T, Au L, Eichler-Jonsson C, Jabbar F, Soultati A, Chowdhury S, Rudman S, Lynch J, Fernando A, Stamp G, Nye E, Stewart A, Xing W, Smith J, Escudero M, Huffman A, Matthews N, Elgar G, Phillimore B, Costa M, Begum S, Ward S, Salm M, Boeing S, Fisher R, Spain L, Navas C, Gronroos E, Hobor S, Sharma S, Aurangzeb I, Lall S, Polson A, Varia M, Horsfield C, Fotiadis N, Pickering L, Schwarz R, Silva B, Herrero J, Luscombe N, Jamal-Hanjani M, Rosenthal R, Birkbak N, Wilson G, Pipek O, Ribli D, Krzystanek M, Csabai I, Szallasi Z, Gore M, McGranahan N, Van LP, Campbell P, Larkin J, Swanton C. 2018. Deterministic evolutionary trajectories influence primary tumor growth: TRACERx renal. Cell 173(3):595-610

[40] Uchi R, Takahashi Y, Niida A, Shimamura T, Hirata H, Sugimachi K, Sawada G, Iwaya T, Kurashige J, Shinden Y, Iguchi T, Eguchi H, Chiba K, Shiraishi Y, Nagae G, Yoshida K, Nagata Y, Haeno H, Yamamoto H, Ishii H, Doki Y, Iinuma H, Sasaki S, Nagayama S, Yamada K, Yachida S, Kato M, Shibata T, Oki E, Saeki H, Shirabe K, Oda Y, Maehara Y, Komune S, Mori M, Suzuki Y, Yamamoto K, Aburatani H, Ogawa S, Miyano S, Mimori K. 2016. Integrated multiregional analysis proposing a new model of colorectal cancer evolution. PLOS Genetics 12(2):e1005778

[41] Verhulst P-F. 1838. Notice sur la loi que la population suit dans son accroissement. Correspondance Mathématique et Physique 10:113-126