Genetic differentiation and genetic structure of mixed-ploidy Camellia hainanica populations

Hailang Tong; Hongda Deng; Zhiqiang Han

doi:10.7717/peerj.14756

Genetic differentiation and genetic structure of mixed-ploidy Camellia hainanica populations

Hailang Tong^1,2, Hongda Deng^1,2, Zhiqiang Han ^1,2

1Central South University of Forestry and Technology, The College of Forestry, Changsha, China

2Central South University of Forestry and Technology, The Laboratory of Forestry Genetics, Changsha, China

DOI: 10.7717/peerj.14756

Published: 2023-02-22
Accepted: 2022-12-27
Received: 2022-09-12

Academic Editor: Sushil Sarswat

Subject Areas: Agricultural Science, Genetics, Plant Science, Forestry, Population Biology
Keywords: Camellia hainanica, Polyploidy, Genetic differentiation, Genetic diversity, Genetic structure

Copyright: © 2023 Tong et al.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.

Cite this article: Tong H, Deng H, Han Z. 2023. Genetic differentiation and genetic structure of mixed-ploidy Camellia hainanica populations. PeerJ 11:e14756 https://doi.org/10.7717/peerj.14756

Abstract

Camellia hainanica, which is common in China’s Hainan Province, is an important woody olive tree species. Due to many years of geographic isolation, C. hainanica has not received the attention it deserves, which limits the exploitation of germplasm resources. Therefore, it is necessary to study population genetic characteristics for further utilization and conservation of C. hainanica. In this study, 96 individuals in six wild Camellia hainanica populations were used for ploidy analysis of the chromosome number, and the genetic diversity and population structure were investigated using 12 pairs of SSR primers. The results show complex ploidy differentiation in C. hainanica species. The ploidy of wild C. hainanica includes tetraploid, pentaploid, hexaploid, heptaploid, octoploid and decaploid species. Genetic analysis shows that genetic diversity and genetic differentiation among populations are low. Populations can be divided into two clusters based on their genetic structure, which matches their geographic location. Finally, to further maintain the genetic diversity of C. hainanica, ex-situ cultivation and in-situ management measures should be considered to protect it in the future.

Introduction

C. oleifera is the most important woody oil crop in China, and has excellent potential for development. This tree species plays an essential role in ensuring national grain and oil safety, socialist economic construction and ecological civilization construction (Lin et al., 2022; Gong et al., 2020). C. hainanica is a new species of the Camellia Sect. Oleifera was discovered in Hainan Province in 2020, and is an essential genetic resource for breeding. Due to its long-term geographical isolation, C. hainanica has not received due attention. The research and utilization of C. hainanica remain in its infancy, which means that the systematic evaluation of this resource has not been carried out domestically and overseas.

Species genetic structure and genetic differentiation are an important part of biodiversity conservation. Genetic variation underlies the evolutionary potential of species and is critical to the adaptability of populations (Du et al., 2016; Toro & Caballero, 2005). Many plant species are rich in diversity due to their wide distribution and a large number of individuals (Han et al., 2020). The study of their genetic variation is of great significance for the effective selection and maintenance of the diversity of germplasm resources (Belaj et al., 2012). Morphological traits have been applied to assess genetic variability, and the stability of results may be affected by environmental conditions. As an alternative, molecular markers are more stable and reliable for evaluating germplasm resources and have been used to evaluate the genetic variability of various species (Mafakheri et al., 2020; Rasul, Grundler & Tahir, 2022). Simple sequence repeats (SSRs) are the most frequently used molecular markers in population genetics. Due to the advantages of high polymorphism, specificity, reproducibility and comprehensive genome coverage, SSRs have been used to analyse population genetic diversity, structure and differentiation in species, such as Cunninghamia lanceolata (Hang et al., 2019), Pinus massoniana (Feng et al., 2014), Populus tomentosa (Du et al., 2012), and Pinus sylvestris (Pazouki et al., 2016).

The above research are all concerning diploids, but polyploidy is common in C. hainanica, mainly including octaploids and decaploids; however there are different reports of ploidy, such as on tetraploids, pentaploids, hexaploids and heptaloids (Ye et al., 2021). The uncertainty about polyploid inheritance patterns makes population genetic analysis more complex and challenging (Meirmans & Tienderen, 2013). For polyploid genetic analysis, a method has been developed to handle SSR data of polyploids for population genetic analysis (Dufresne et al., 2014), and the genetic diversity, structure and differentiation of polyploid plants were analysed using SSRs based on this method (Huang et al., 2018).

In this study, we used wild C. hainanica as material to determine the somatic chromosome count and the genetic structure and differentiation across different provenances by SSRs. The primary purpose of our study was to quantify the level of genetic diversity, and assess the genetic differentiation and structure of C. hainanica populations. Additionally, we provide support for the evaluation and utilization of genetic resources in C. hainanica.

Materials and Methods

Plant sampling

Wild C. hainanica samples were collected from major georeferenced sampling sites covering the entire longitude and latitude range of Hainan Province. The latitude and longitude of each sampling site were determined using a handheld GPS (GarmineTrex Handheld GPS; Garmin). We used ArcGIS (Esri) programs to draw a population distribution map based on population distribution coordinates. Plant individuals are randomly selected from each population, and the number of samples chosen depended on the size of the population. Ninety-six plant individuals from six wild C. hainanica populations were assigned. We collect fresh leaf material from each individual and stored it at −80 °C for DNA extraction.

DNA extraction, amplification, and microsatellite genotyping

The DNeasy Plant Mini Kit (Tiangen Biotech Co. Ltd., Beijing, China) was used to extract the DNA of the leaf sample, and the operation of this application was strictly conducted according to the manufacturer’s instructions. Likewise, a NanoDrop 2000 spectrophotometer (Thermo Scientific, USA) and agarose gel assay were used to measure the quality of the DNA samples. Twelve highly polymorphic SSR markers were used to genotype 96 samples of C. hainanica (Table 1). The target DNA fragment was amplified using the fluorescently labelled TP-M13-SSR polymerase chain reaction (PCR) method (Schuelke, 2000). The universal M13 primers fluorescently labelled with 6-carboxy-x-rhodamine, 6-carboxy-fluorescein, tetramethyl-6-carboxyrhodamine, or 5-hexachloro-fluorescein and the forward primer with a universal M13 primer tail (5-TGTAAAACGACGGC CAGT-3) at the 5′ end were used for the previously described PCR method. The PCR procedure was as follows: denaturation for 2 min at 95 °C; 12 cycles of 30 s at 95 °C, 30 s at 64–59 °C (−0.5 °C per cycle), 1 min at 72 °C; then, 24 cycles of 30 s at 95 °C, 30 s at 65 °C, 1 min at 72 °C, and a extension of 2 min at 72 °C (Cui et al., 2021). PCR amplification was performed in a Thermal Cycler (Bio-Rad, Hercules, CA, USA), and an ABI 3730XL DNA Analyser (Applied Biosystems, USA) was used for the detection of PCR products. The fluorescence detection data from capillary electrophoresis were used for SSR analysis with the GeneScan™ 500 LIZ^{^®} Size Standard as an internal standard, and allele analysis for each SSR locus was performed using GeneMapper version 3.7 (Applied Biosystems, Foster City, CA, USA).

Table 1:

The primary simple sequence repeat primers used in the study.

Locus	Sequence (5′–3′)	Repetitive unit	Repetitive time	Annealing temperature (°C)	No. of Alleles (A)	PIC	SPI (PIC ×A)	Allele range size (bp)
Loc1	F: CCCAGCACCCAGAATCAGAA	CCT	6	66	7	0.810	5.67	180–201
Loc1	R: TCGTCTCTCAATTGCGCGAT
loc2	F: GCGTATGGAAAAGCTGAGAA	TC	10	57	28	0.945	26.46	156–292
loc2	R: GAAGCAAACCACTGAGGTGA
Loc3	F: TTGTAGGCTGCCCAATCTCC	GAA	6	66	10	0.754	7.54	278–305
Loc3	R: CCATCTCAACCATCACCCCC
Loc4	F: TCCGAAACCGTGCCTCATTT	GAA	5	65	15	0.828	12.42	243–310
Loc4	R: ATCAAGCCCGGTTGTTGTCA
Loc5	F: GCCGTCTGTCTTCATCCGAA	CTC	5	66	9	0.711	6.399	170–294
Loc5	R: ACCACAGCTCAAACACACCA
Loc6	F: TACCTCTCACAGCCTCCTCC	TCT	5	67	7	0.564	3.948	245–269
Loc6	R: CCCAACTACCCCCACTGTTC
Loc7	F: GATCTGTGTCTCTCTGTTCCC	TG	32	55	9	0.760	6.84	187–215
Loc7	R: CCACACATCATCTTTTCCTC
Loc8	F: GAAGTTTGTTGAGAGTGCTGC	TA	22	55	14	0.875	12.25	156–204
Loc8	R: ACAGATCTAAATTTGGGGGG
Loc9	F: CTCACTACAGCRGCAACCGC	TC	11	55	9	0.731	6.579	170–260
Loc9	R: CCTGAATCTAGTGGGGCTTC
Loc10	F: TCACACCAGCCCCAAAAAGT	CAC	5	65	17	0.857	14.569	174–309
Loc10	R: CTAGAAACCCCGCGAGTCTC
Loc11	F: GCCGAAACCCAGGATCAGAA	CCG	5	66	10	0.788	7.88	203–293
Loc11	R: CTCAACCTCCTCAGCTGGTG
Loc12	F: GATGACAGGCCTGCGAAATG	TG	8	59	8	0.705	5.64	204–350
Loc12	R: TCAACGAAGCATACACAACGT
Mean			10.000	61.833	11.917	0.777

DOI: 10.7717/peerj.14756/table-1

Chromosome counts and ploidy levels

The young leaves of C. hainanica were collected and placed in an ice box, and celllysates (Precise-P, Sysmex) and DAPI staining solution (4′,6-diamidino-2-phenylindole) were placed in an ice box to precool. Approximately 0.5 g of fresh leaf tissue was chopped using a sharp razor blade in a 55 mm plastic Petri dish containing 300 of µL celllysate and 10 µL of PVP aqueous solution (28.57%, mass fraction). The separation buffer containing the nuclei was filtered into a 1.5 mL centrifuge tube using a 40 µm nylon filter, and 200 µL of DAPI staining solution was immediately added. After incubation for 30 s in an ice bath, the fluorescence intensity of DAPI-stained nuclei was determined using a Partec CyFlow^{^®} Space flow cytometer (Partec, Munich, Germany). Each DAPI-stained nuclei solution was tested for three replicates, and each repetition comprising 5000–10000 counts was measured. Camellia fluviatilis (2n =2x =30), a diploid relative species, was used as the control (Huang et al., 2018). Somatic cell chromosome counting was further validated using carbol fuchsin staining (Ye et al., 2020).

Genetic analysis

The nonamplified (null) alleles at each microsatellite locus were detected using Micro-Checker v2.2.3 software (Van Oosterhout et al., 2004). The allele data from Genemarker were imported into POLYSAT version 1.4 (Clark & Jasieniuk, 2011), and the data were exported into the GenoDive format (Meirmans & Van Tienderen, 2004) to calculate the number of alleles, effective number of alleles, observed heterozygosity, expected heterozygosity and inbreeding coefficient for every population and every locus. In addition, Hardy-Weinberg equilibrium tests and various estimates of genetic differentiation between populations were performed using GenoDive version 2.0, and analysis of molecular variance (AMOVA) was performed to estimate sources of genetic variation between populations and within individuals (Meirmans, Liu & Van Tienderen, 2018a).

Population structure

STRUCTURE version 2.3.3 (https://web.stanford.edu/group/pritchardlab/structure_software/release_versions/v2.3.3/html/structure.html) is more reliable than other clustering methods for cluster analysis of mixed ploidy populations (Stift, Kolar & Meirmans, 2019) and was used to estimate the C. hainanica population structure. Population genetic structure analysis based on the Bayesian clustering method was performed using an admixture model with independent allele frequencies (Meirmans & Liu, 2018b). We set the number of groups (K) from 1 to 6 and the length of the burn-in period of MCMC (Markov chain Monte Carlo) to 100,000 steps followed by 1,000,000 MCMC steps. Then, we calculated the lnP (D) value corresponding to each K value10 times, and calculate the average lnP (D). In addition, an ad hoc algorithm was used to calculate the K-1 ΔK value. Finally, the K value corresponding to the highest ΔK according to the fluctuation of all ΔK value peaks was the optimal number of subpopulations for the genetic structure of the population (Pritchard, Stephens & Donnelly, 2000; Evanno, Regnaut & Goudet, 2005). The bar charts for the data derived from the STRUCTURE analysis were created using DISTRUCT version 1.1 software packages (Rosenberg, 2003). Principal coordinate analysis (PCA) was performed using POLYSAT version 1.4 (Clark & Jasieniuk, 2011), and the genetic distance between samples was calculated using the Bruvo distance (Bruvo, Michiels & Schulenburg, 2004).

Results

Ploidy analysis

Flow cytometry was used to determine the ploidy of 96 plants from six different regions in Hainan Province. After staining with DAPI (100 µL, 50 µg/mL), the relative DNA content of each plant was determined. The results indicated that the relative DNA content in hexaploid plants was 1.5-fold higher than that of in tetraploid plants (Figs. 1A, 1C). The ploidy was estimated at tetraploid, pentaploid, hexaploid, heptaploid, octoploid, and decaploid for all samples (Fig. 1). Moreover, a chromosome tabletting technique indicated that the tetraploid plants contained 60 chromosomes, the pentaploid plants contained 75 chromosomes (2n=5x=75) (Fig. 1B), the hexaploid plants contained 90 chromosomes (2n=6x=90) (Fig. 1C), the heptaploid plants contained 105 chromosomes (2n=7x=105) (Fig. 1D), the octoploid plants contained 120 chromosomes (2n=8x=120) (Fig. 1E), and the decaploid plants contained 150 chromosomes (2n=10x=150) (Fig. 1F). These results indicated that the ploidy of wild C. hainanica from different provenances in Hainan Province includes tetraploid, pentaploid, hexaploid, heptaploid, octoploid and decaploid, and that there is complex ploidy differentiation in the C. hainanica species.

Figure 1: Chromosome counting and histograms presenting the flow cytometric analysis results, Scale bar =10 µm.
(A) Tetraploid plants with 2n=4x=60; (B) pentaploid plants with 2n=5x=75; (C) hexaploid plants with 2n=6x=90; (D) heptaploid plants with 2n=7x=105; (E) octoploid plants with 2n=8x=120; (F) decaploid plants with 2n=10x=150. A triangle (Δ) represents the fluorescence intensity detected by flow cytometry.

Download full-size image

DOI: 10.7717/peerj.14756/fig-1

Sample assessment

The mixed ploidies were recognized in a few populations via the ploidy test of every individual. We found that the numbers of tetraploid, pentaploid, hexaploid, heptaploid, octoploid and decaploid plants were 1, 2, 9, 13, 29, and 42, respectively, in six wild C. hainanica populations, which showed that the higher the ploidy was, the greater the number of plants. In addition, octaploids and decaploids accounted for approximately 74% of all samples, and the number of tetraploid, pentaploid, hexaploid and heptaploid plants was lower than that of octaploids and decaploids in each population (Fig. 2).

Figure 2: The ploidy composition of populations from different populations.
(A) Tunchang; (B) Wuzhishan; (C) Qionghai; (D) Qiongzhong; (E) Chengmai; (F) Dingan; The ordinate on the left represents the number of individuals in each population, and the ordinate on the right represents the percentage of different ploidies in each population. The dot represents the number of individuals with different ploidies, and the square represents the proportion of varying ploidy plants in the population.

Download full-size image

DOI: 10.7717/peerj.14756/fig-2

Genetic diversity and inbreeding coefficient estimates of C. hainanica

High polymorphism SSR markers with a mean number of alleles equal to 7.931 and a mean effective number of alleles equal to 4.217 were used to estimate the genetic diversity and inbreeding coefficient. Low genetic diversity was found for the C. hainanica populations, and heterozygosity decreased with decreasing latitude from Chengmai, Dingan, Tunchang, Qionghai, Qiongzhong to Wuzhishan, as shown in the arrow direction (Fig. 3), while Gis values increased accordingly (Table 2). The Chengmai population had the highest genetic diversity estimates, and the observed heterozygosity and expected heterozygosity were 0.558 and 0.752, respectively. Meanwhile, the Wuzhishan population had the lowest genetic diversity estimates, and the observed heterozygosity and expected heterozygosity were 0.335 and 0.674, respectively. Positive inbreeding coefficient Gis values were found for the whole population, which showed significant loss of heterozygosity. However, the overall observed heterozygosity and expected heterozygosity were not similar for the same population, the observed heterozygosity was lower than the expected heterozygosity, and the genetic diversity estimates decreased with increasing Gis values accordingly.

Figure 3: Geographic locations of the sampling sites and corresponding ploidy assignments (pie charts) for the 96 individuals in Hainan Province.
Each pie chart is divided into several coloured sections according to the proportion of individuals of different ploidies collected in its population (blue: tetraploid; red: pentaploid; grey: hexaploid; yellow: heptaploid; purple: octoploid; green: decaploid); The size of the pie charts represents the number of selected individuals in each population; the red circles represent the size of the population. The arrow direction indicates a gradual decrease in the genetic diversity of the population.

Download full-size image

DOI: 10.7717/peerj.14756/fig-3

Genetic differentiation coefficient of C. hainanica

Population genetic differentiation analysis showed that the degree of genetic differentiation among populations was low (Fst<0.100) (Table 3). The coefficient of genetic differentiation was the lowest between the Chengmai and Dingan populations (Fst =0.016), while Fst was the highest between the Tunchang, Qionghai and Qiongzhong Wuzhishan populations (Fst =0.066, 0.064, 0.091). The results showed that there was moderate genetic differentiation among populations. The results of the AMOVA calculation showed that (Table 4), the genetic variation within each population accounted for 85% of the total genetic variation (P = 0.001). The genetic variation within a population is much higher than the genetic variation among populations.

Genetic structure of C. hainanica

The C. hainanica populations come from different provenances (Fig. 4), and there may be subpopulations. Therefore, the genetic structure of these populations was estimated by STRUCTURE software. When the Bayesian clustering method was used to analyse the C. hainanica population, Ln Pr(K) increased gradually with increasing K value (Fig. 4B), and the optimal number of subpopulations could not be selected by this method. ΔK values corresponding to K = 1 − 6 were counted, and the ΔK statistic value reached the maximum value when K = 2 (Fig. 4C). Therefore, the individuals in all populations could be divided into 2 groups (Fig. 4A). The individuals from the Tunchang, Wuzhishan and Qiongzhong regions were divided into Cluster 1, while the individuals from the Qionghai, Chengmai and Dingan regions were divided into Cluster 2. We performed principal coordinate analysis (PCA). The results of PCA showed that C. hainanica samples from Tunchang, Wuzhishan and Qiongzhong were separate from those from Qionghai, Chengmai and Dingan, which verified the STRUCTURE analysis (Fig. S1). The results indicated clear genetic differentiation between the Tunchang, Wuzhishan, and Qiongzhong regions and the Qionghai, Chengmai, and Dingan regions. In addition, we found lower genetic differentiation between the Qionghai, Chengmai, and Dingan regions and the Tunchang, Wuzhishan, and Qiongzhong regions.

Table 2:

Indices of genetic diversity C. hainanica population.

Population	Num	Eff_num	H_o	H_e	G_is
Chengmai	7.167	5.128	0.558	0.752	0.240
Dingan	9.083	4.320	0.497	0.734	0.241
Tunchang	8.167	4.160	0.466	0.694	0.283
Qionghai	8.250	4.082	0.439	0.694	0.328
Qiongzhong	7.333	3.925	0.413	0.690	0.402
Wuzhishan	7.583	3.689	0.335	0.674	0.502
Overall	7.931	4.217	0.451	0.706	0.333

DOI: 10.7717/peerj.14756/table-2

Notes:

Num: Number of alleles
Eff_num: Effective number of alleles
Ho: Observed heterozygosity
He: Expected heterozygosity
Gis: Inbreeding coefficient

Table 3:

The coefficient of genetic differentiation (F_st) among populations.

Site	Dingan	Tunchang	Qionghai	Qiongzhong	Wuzhishan
Chengmai	0.016	0.030	0.044	0.037	0.064
Dingan		0.030	0.021	0.019	0.037
Tunchang			0.066	0.064	0.091
Qionghai				0.012	0.031
Qiongzhong					0.011

DOI: 10.7717/peerj.14756/table-3

Table 4:

Analysis of molecular variance (AMOVA) based on 12 SSR markers.

Source of variation	SSD	df	MS	VC	PV	F-value	P-value
Within population	351.038	90	3.9	3.9	0.85
Among population	73.534	5	14.707	0.688	0.15	0.15	0.001

DOI: 10.7717/peerj.14756/table-4

Notes:

df: degrees of freedom
PV: percentage of variation
SS: sum of squares
VC: variance components

The distribution and genetic structure of 6 populations of C. hainanica. — Figure 4: The distribution and genetic structure of 6 populations of *C. hainanica*.
(A) The distribution of *C. hainanica* populations. The size of the circles represents the number of individuals in each population. The red pie chart represents Cluster 1 and the blue pie chart represents Cluster 2, when K = 2. Bayesian inference analysis for determining the most likely number of clusters (K) for the distribution of (B) the likelihood L(K) values and (C) Δ K values is presented for K = 2–6. (D) STRUCTURE plots are presented for K = 2 and K = 4. Each vertical bar represents a population and its assignment proportion into population clusters (K).

Download full-size image

DOI: 10.7717/peerj.14756/fig-4

Discussion

Ploidy of C. hainanica

The flow cytometry and somatic chromosome count results indicated that wild C. hainanica samples had an inconsistent ploidy at different provenances, and the ploidy of the samples in the same provenance was different. Complex intraspecies multiploidy is common in C. hainanica, and previous research supports this conclusion (Ye et al., 2021). The reason may be the fusion of an undiminished gamete with a normal gamete (Zhang et al., 2021). The C. hainanica samples used in our study contained mixed-ploidy, which verified previous findings that there is complex intraspecific polyploidy in the genus Camellia (Zhuang, 2008). Polyploid species of the genus Camellia are more common in the western, northern, and northeastern regions, but are rarely found in the south (Ming, 2000). However, C. hainanica was found in Hainan Province in the southernmost region of China. Therefore, more samples from diverse distribution regions should be used to confirm the intraspecific polyploidy phenomenon.

Genetic diversity of C. hainanica

The genetic diversity of a species reflects its evolutionary and adaptive potential. The more genetic variation a species has, the more adaptive it is Du et al. (2012). Therefore, the genetic diversity of species is necessary to be studied to define their biological properties (Zhang et al., 2019). Our study shows that C. hainanica populations have high levels of genetic diversity with a high average expected heterozygosity (0.706), although the expected heterozygosity is higher than the results of previous research for other Camellia species (Tang et al., 2006; Lin et al., 2013; Li et al., 2019), and the higher heterozygosity may be due to a recent polyploidy event. The high degree of population genetic variation for polyploid species increases the genetic diversity of polyploidy (Huang et al., 2013). Similar to the results of population genetic analyses by Cui et al. (2021) observed heterozygosity (H_o = 0.451) estimates were also lower than the expected heterozygosity (H_e = 0.706), indicating significant heterozygosity deficits in all populations. Meanwhile, we found that the average inbreeding coefficient was positive (G_is = 0.333) for all C. hainanica populations, and the level of inbreeding was higher, which may have resulted from geographical isolation between islands and continents.

Genetic structure and genetic differentiation of C. hainanica

Differences in population genetic structure reflect genetic diversity, which conveys the adaptation potential of a species to its changing environment (Melo et al., 2014; Chen et al., 2021; Niu et al., 2019). In our study, F_st underestimated genetic differentiation between provenances, while lower genetic differentiation occurred among the six provenances tested. Meanwhile, the analysis of molecular variance (AMOVA) revealed that the genetic variation within the population accounted for 85% of the total genetic variation, which is consistent with the low F_ST values (Fst<0.100) among provenances. The results are similar to those of previous studies of the genus Camellia, and island isolation may play a key role in this genetic differentiation (Lin et al., 2013). In addition, the low degree of variation may be due to the relatively small geographic regions in our survey being from similar climatic provinces, which might be another factor resulting in low differentiation. Genetic structure analyses using both PCA and structure indicated that C. hainanica plants are genetically differentiated and showed that C. hainanica plants could be clustered into two clusters (Fig. 4). The two clusters were located in the northeast with the coastal area and southwest being a certain distance from the ocean, which matches the geographic provenance of the individual. As indicated by Huang et al. (2018), C. hainanica is insect pollinated and gene flow frequency decreases with increasing distance between plants. As a result, plants in close proximity are grouped together.

Conclusions

The study of the genetic diversity, genetic differentiation and genetic structure of C. hainanica will be helpful for exploiting its germplasm resources. Population genetic characteristics of C. hainanica have been discovered using flow cytometry and SSR markers, and the results focus on (1) complex ploidy differentiation in C. hainanica; (2) low genetic diversity and genetic differentiation among populations are low; and (3) the analysis of genetic structures in different populations, showing that C. hainanica can be divided into two clusters. Finally, to maintain the genetic diversity of C. hainanica and improve its potential application value, we must implement a protection strategy and then protect populations with low genetic diversity from evolutionary threats.

Supplemental Information

Allelic configuration of each sample at 12 SSR loci

DOI: 10.7717/peerj.14756/supp-1

Download

The PCA analysis for hainanica populations

Red circles represent samples from Tunchang, Wuzhishan, Qiongzhong and green circles represent samples from Qionghai, Chengmai, Dingan.

DOI: 10.7717/peerj.14756/supp-2

Download

[1] Belaj A, Dominguez-García M, Atienza SG, Urdíroz NM, Rosa RDela, Satovic Z, Martín A, Kilian A, Trujillo I, Valpuesta V. 2012. Developing a core collection of olive (Olea europaea L.) based on molecular markers (DArTs, SSRs, SNPs) and agronomic traits. Tree Genetics & Genomes 8(2):365-378

[2] Bruvo R, Michiels NK, Schulenburg H. 2004. A simple method for the calculation of microsatellite genotype distances irrespective of ploidy level. Molecular Ecology 13(7):2101-2106

[3] Chen L, Pan T, Qian H, Zhang M, Yang G, Wang X. 2021. Genetic diversity and population structure revealed by SSR markers on endemic species Osmanthus serrulatus Rehder from Southwestern Sichuan Basin, China. Forests 12(10):1365

[4] Clark LV, Jasieniuk M. 2011. POLYSAT: an R package for polyploid microsatellite analysis. Molecular Ecology Resources 11(3):562-566

[5] Cui X, Li C, Qin S, Huang Z, Gan B, Jiang Z, Huang X, Yang X, Li Q, Xiang X. 2021. High-throughput sequencing-based microsatellite genotyping for polyploids to resolve allele dosage uncertainty and improve analyses of genetic diversity, structure and differentiation: a case study of the hexaploid Camellia oleifera. Molecular Ecology Resources 22(1):199-211

[6] Du Q, Gong C, Wang Q, Zhou D, Yang H, Pan W, Li B, Zhang D. 2016. Genetic architecture of growth traits in Populus revealed by integrated quantitative trait locus (QTL) analysis and association studies. New Phytologist 209(3):1067-1082

[7] Du Q, Wang B, Wei Z, Zhang D, Li B. 2012. Genetic diversity and population structure of Chinese White poplar (Populus tomentosa) revealed by SSR markers. Journal of Heredity 103(6):853-862

[8] Dufresne F, Stift M, Vergilino R, Mable BK. 2014. Recent progress and challenges in population genetics of polyploid organisms: an overview of current state-of-the-art molecular and statistical tools. Molecular Ecology 23(1):40-69

[9] Evanno G, Regnaut S, Goudet J. 2005. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Molecular Ecology 14(8):2611-2620

[10] Feng Y, Yang Z, Wang J, Luo Q, Li H. 2014. Development and characterization of SSR markers from Pinus massoniana and their transferability to P. elliottii, P. caribaea and P. yunnanensis. Genetics and Molecular Research 13(1):1508-1513

[11] Gong W, Song Q, Ji K, Gong S, Wang L, Chen L, Zhang J, Yuan D. 2020. Full-length transcriptome from camellia oleifera seed provides insight into the transcript variants involved in oil biosynthesis. Journal of Agricultural and Food Chemistry 68(49):14670-14683

[12] Han Z, Han Q, Xia Y, Geng X, Du K, Yang J, Kang X. 2020. Construction of a breeding parent population of Populus tomentosa based on SSR genetic distance analysis. Scientific Reports 10(1):1-11

[13] Hang Y, Yu J, Zhou S, Cheng J, Huang H, Lin E, Tong Z. 2019. Development and application of SSR loci from functional genes involved in wood formation in Cunninghamia lanceolata. Journal of Agricultural Biotechnology 27(1):34-42

[14] Huang H, Tong Y, Zhang QJ, Gao L. 2013. Genome size variation among and within Camellia species by using flow cytometric analysis. PLOS ONE 8(5):e64981

[15] Huang X, Chen J, Yang X, Duan S, Long C, Ge G, Rong J. 2018. Low genetic differentiation among altitudes in wild Camellia oleifera, a subtropical evergreen hexaploid plant. Tree Genetics & Genomes 14(2):1-12

[16] Li X, Wang J, Fan Z, Li J, Yin H. 2019. Genetic diversity in the endangered Camellia nitidissima assessed using transcriptome-based SSR markers. Trees 34(2):543-552

[17] Lin L, Hu ZY, Ni S, Li JY, Qiu YX. 2013. Genetic diversity of Camellia japonica (Theaceae), a species endangered to East Asia, detected by inter-simple sequence repeat (ISSR) Biochemical Systematics and Ecology 50:199-206

[18] Lin P, Wang K, Wang Y, Hu Z, Yan C, Huang H, Ma X, Cao Y, Long W, Liu W, Li X, Fan Z, Li J, Ye N, Ren H, Yao X, Yin H. 2022. The genome of oil-Camellia and population genomics analysis provide insights into seed oil domestication. Genome Biology 23(1):1-21

[19] Mafakheri M, Kordrostami M, Rahimi M, Matthews PD. 2020. Evaluating genetic diversity and structure of a wild hop (Humulus lupulus L.) germplasm using morphological and molecular characteristics. Euphytica 216(4):1-19

[20] Meirmans PG, Liu S. 2018b. Analysis of molecular variance (AMOVA) for autopolyploids. Frontiers in Ecology and Evolution 6:66

[21] Meirmans PG, Liu S, Van Tienderen PH. 2018a. The analysis of polyploid genetic data. Journal of Heredity 109(3):283-296

[22] Meirmans PG, Van Tienderen PH. 2004. Genotype and genodive: two programs for the analysis of genetic diversity of asexual organisms. Molecular Ecology Notes 4(4):792-794

[23] Melo ATDO, Coelho ASG, Pereira MF, Blanco AJV, Franceschinelli EV. 2014. High genetic diversity and strong spatial genetic structure in Cabralea canjerana (Vell.) Mart. (Meliaceae): implications to Brazilian Atlantic Forest tree conservation. Natureza & Conservação 12(2):129-133

[24] Ming TL. 2000. Monograph of the genus Camellia. Kunming: Yunnan Science and Technology Press.

[25] Niu S, Song Q, Koiwa H, Qiao D, Zhao D, Chen Z, Liu X, Wen X. 2019. Genetic diversity, linkage disequilibrium, and population structure analysis of the tea plant (Camellia sinensis) from an origin center, Guizhou plateau, using genome-wide SNPs developed by genotyping-by-sequencing. BMC Plant Biology 19(1):1-12

[26] Pazouki L, Shanjani PS, Fields PD, Martins K, Suhhorutšenko M, Viinalass H, Niinemets Ü. 2016. Large within-population genetic diversity of the widespread conifer Pinus sylvestris at its soil fertility limit characterized by nuclear and chloroplast microsatellite markers. European Journal of Forest Research 135(1):161-177

[27] Pritchard JK, Stephens M, Donnelly P. 2000. Inference of population structure using multilocus genotype data. Genetics 155(2):945-959

[28] Rasul KS, Grundler FM, Tahir AR. 2022. Genetic diversity and population structure assessment of Iraqi tomato accessions using fruit characteristics and molecular markers. Horticulture, Environment, and Biotechnology 63(4):523-538

[29] Rosenberg NA. 2003. DISTRUCT: a program for the graphical display of population structure. Molecular Ecology Notes 4(1):137-138

[30] Schuelke M. 2000. An economic method for the fluorescent labeling of PCR fragments. Nature Biotechnology 18(2):233-234

[31] Stift M, Kolar F, Meirmans PG. 2019. STRUCTURE is more robust than other clustering methods in simulated mixed-ploidy populations. Heredity 123(4):429-441

[32] Tang S, Bin X, Wang L, Zhong Y. 2006. Genetic diversity and population structure of yellow camellia (Camellia nitidissima) in China as revealed by RAPD and AFLP markers. Biochemical Genetics 44:449-461

[33] Meirmans, Tienderen V. 2013. The effects of inheritance in tetraploids on genetic diversity and population divergence. Heredity 110(2):131-137

[34] Toro M, Caballero A. 2005. Characterization and conservation of genetic diversity in subdivided populations. Philosophical Transactions of the Royal Society B: Biological Sciences 360(1459):1367-1378

[35] Van Oosterhout C, Hutchinson WF, Wills D, Shipley P. 2004. MICRO-CHECKER: software for identifying and correcting genotyping errors in microsatellite data. Molecular Ecology Notes 4(3):535-538

[36] Ye TW, Li YM, Zhang J, Gong QY, Yuan DY, Xiao SX. 2020. Optimization of chromosome mounting technique and karyotype analysis of Camellia oleifera. Journal of Nanjing Forestry University 44(5):93-99