High variability of perezone content in rhizomes of Acourtia cordata wild plants, environmental factors related, and proteomic analysis

View article
Plant Biology

Introduction

The plant species Acourtia cordata (Cerv.) B. L. Turner (Asteraceae) is commonly known as the “Buzzard herb” and has been used in traditional Mexican medicine as an infusion possessing anthelmintic, laxative, and diuretic properties (Flores y Troncoso & de Asís, 1982). The genus Acourtia comprises approximately 80 species, most of which are endemic to Mexico (Villaseñor, 2016). The rhizomes of several species of this genus produce many compounds of terpenic origin, including the sesquiterpene p-benzoquinone perezone, which is highly soluble in n-hexane and can be easily extracted and purified with this solvent. The perezone was the first substance isolated and purified from the New World (in 1852), and its chemistry has been widely studied. It is known to possess quinone-like properties; it oxidizes easily and reduces spontaneously (Joseph-Nathan, González & Rodríguez, 1972; Joseph-Nathan & Santillan, 1989). Recently, the importance of perezone and more than 100 related compounds has been reviewed, as well as their biological activities, which have been highlighted (Escobedo-González et al., 2022). Interestingly, they report several plant species that not belonging to the genus Acourtia, produce perezone and/or related compounds.

In addition, several pharmacological activities have been reported for perezone and its derivatives: hypoglycemic activity, inhibition of platelet aggregation, induction of contractile response in intestinal smooth muscle, and protection against the deleterious effects of ischemia and reperfusion (Alarcon-Aguilar et al., 1997; Peña et al., 2001; Garcia et al., 1995; Tellez et al., 1999). The antifeeding effects of perezone derivatives against larvae of agriculturally important insect pests and their phytotoxic activity have also been reported (Burgueño-Tapia et al., 2008).

Some reports on the biotechnological production of perezone have shown that callus, hairy roots, and in vitro cultured plantlets of some Acourtia species produce and accumulate lower amounts of perezone than the rhizomes of native plants (Arellano et al., 1996; Gómez-Serrano, Cristiani-Urbina & Villegas-Garrido, 2010, 2012). This biosynthesis and accumulation of secondary metabolites (SMs) can vary in a plant species due to genotypic differences among individuals growing in the same area. The variation may also be due to different environmental conditions in the places where each plant grows. Such environmental factors include altitude, latitude, precipitation, temperature, humidity, soil, sun radiation, etc. (Liu et al., 2015; Moreira et al., 2018; Filippi et al., 2021). The quantitative and qualitative SM variation in plant species occurs among individuals, tissue types, ontogenetic stages, plant age and size (Koricheva & Barton, 2012; Lattanzio et al., 2009; Andrew et al., 2007). Therefore, the population age and structure can contribute to SM variation (Moore & Foley, 2005). Edaphic parameters also contribute to the expression of different biochemical characteristics in plants (Hamilton et al., 2001). Phosphorous (P) and nitrogen (N) are the two most limiting elements in plants since they alter diverse physiological and biochemical processes (Pradhan et al., 2017; Ramakrishna & Ravishankar, 2011). N and P can increase the accumulation of carbohydrates for plant growth and photosynthetic rates (Bassi, Menossi & Mattiello, 2018). They can also modify the biosynthesis of carbon-based SMs, such as phenolic acids, flavonoids, tannins, and terpenes (Ormeño & Fernandez, 2012).

Proteomics is an important tool to describe diverse processes in the plant kingdom, such as morphogenetic, physiological, and biochemical changes during the development of tissues and organs (Di Michele et al., 2006; Tian et al., 2009; Mohamad et al., 2011; Takáč, Pechan & Šamaj, 2011; Martínez-Márquez et al., 2013; Martínez-Esteso et al., 2014). In addition, proteomics can facilitate the analysis of proteins related to different biotic and abiotic stresses (Sergeant & Renaut, 2010; Fernandez-Garcia et al., 2011; Trupiano et al., 2012; Aghaei & Komatsu, 2013; Ghosh & Xu, 2014; Fang et al., 2015; Garcia de la Garma et al., 2015; Chmielewska et al., 2016; Martinez-Esteso et al., 2016). Furthermore, proteomics enables the study of proteins engaged in biosynthetic pathways that lead to the biosynthesis of SMs in medicinal plants (Senthil et al., 2011; Martinez-Esteso et al., 2011; Champagne et al., 2012; Bhattacharyya et al., 2012; Ma et al., 2013; Sud, Chauhan & Tandon, 2014; Bryant et al., 2015; Martínez-Esteso et al., 2015).

Due to liquid chromatography–mass spectrometry and bioinformatics clear progress in plant proteomics has been seen (Baginsky, 2008). Based on this technology, many types of studies have been conducted for the analysis of complex mixtures of proteins obtained from tissues of model and non-model plant species (Bayer et al., 2006; Armengaud et al., 2014). Although there are many proteomic studies with different approaches in model plants, there are only a few in non-model plants, mostly in medicinal plants (Oldham et al., 2010).

The plants of A. cordata are a natural source of several bioactive compounds responsible for different pharmacological effects as perezone. Nevertheless, there are no reports to date on the probable effect or relation of the edaphic factors impacting production of sesquiterpenes such as perezone. Also, there are no reports on the identification, location, and characterization of proteins involved in perezone biosynthesis.

In a preliminary study carried out in the Alarcon locality, we found a high variability of perezone content between the rhizomes of 10 individual wild plants of A. hebeclada synonymous with A. cordata, ranging from 0.00–0.29 mg/g dry weight (García-Méndez et al., 2016). Hence, the aim of the present study was to analyze the relationship between the high variability of perezone content in the rhizomes of A. cordata wild plants and rhizospheric soil parameters and plant macromorphological characteristics, as well as provide information on the proteins differentially expressed in rhizomes of A. cordata with different concentrations of perezone. In this sense, we hypothesized that soil components, macromorphological characteristics, and differences in protein expression could be related to differences in perezone content between high and low producers in Acourtia cordata wild plants.

Materials and Methods

Plant material

This study used the rhizomes of 60 wild plants of A. cordata from: Chamilpa, Alarcon, and Felipe Neri. These are located inside the ecologically protected area known as “Chichinautzin”, in the state of Morelos located at 2,309 m.a.s.l. and 18°45′0″N, 99°4′0″W (Vega Guzmán, López-García & Manzo Delgado, 2008). All the plant material collected in the field was carried out according to the protocols of the Universidad Autónoma del Estado de Morelos/Corredor Biológico Chichinautzin. Each rhizome sample was immediately placed in a plastic bag and stored on ice for transport to the laboratory (Fig. S1). Once in the laboratory, each sample was divided in two parts: one part was used for perezone quantification and the other part was carefully washed and stored at −80 °C. A plant specimen from each locality (voucher numbers 34664, 34665, and 34666) was deposited at the HUMO-Herbarium of the Biodiversity and Conservation Research Center of the Autonomous University of the State of Morelos, Mexico (Fig. S2). The macromorphological characteristics of each sampled plant (basal diameter, number of branches, total individual height, and shrub cover) were measured.

Perezone quantification

Gas chromatography coupled to mass spectrometry

Three sections of each rhizome sample of A. cordata were used for perezone quantification. The perezone was extracted three times via maceration of 2.0 g (dry weight) of grinded rhizome tissue with 20.0 ml of n-hexane over 3 days. The extracts were then obtained by solvent evaporation in an extraction chamber at room temperature until dry. The extracts were then dissolved in methanol and analyzed by gas chromatography coupled to mass spectrometry. We used an Agilent Technology 6890 gas chromatograph coupled to a MSD 5973 quadrupole mass detector (HP Agilent, Santa Clara, CA, USA) to quantify the perezone using a standard of pure perezone as the reference.

The compounds were separated on an HP-SMS capillary column 30 mm long with a 0.25-mm internal diameter and 0.25-μm film. The carrier gas was helium administered at a linear injection rate of 1.0 ml/min at constant flow. The inlet temperature of the injector was 250 °C, and the temperature of the column was initially 40 °C and increased at 10 °C/min until it reached 250 °C where it was held for 20 min. The mass spectrometer was operated in positive detection mode with an ionization energy of 70 eV. The acquisition was made in selective ion monitoring (SIM), and chromatographic peaks were identified and quantified using target ions appeared (Sánchez-Ramos et al., 2018).

Rhizospheric soil analysis

To characterize the soil, we collected 60 samples over three localities (20/locality) and collected individual rhizospheric soil samples from the site where each plant grew. For this analysis we took 200 g of dry rhizospheric soil and followed the procedures described by Valencia-Cuevas et al. (2020) quantifying pH and the content of organic matter (O.M.), ammonium (NH4), nitrate (NO3), phosphate (PO4), total nitrogen (Nt) and total phosphorous (Pt). For details see the authors (Bremner, 1965; Valencia-Cuevas et al., 2020; Bray & Kurtz, 1945).

Data analyses

We used a general linear model (GLM) approach to examine whether the soil parameters (pH, O.M., NH4, NO3, PO4, Nt, Pt) and the measured macromorphological characteristics (basal diameter, number of branches, total individual height, and shrub cover) influence the perezone production in A. cordata. The collection sites were considered as random factors, and pH, O.M., NH4, NO3, PO4, Nt, Pt, basal diameter, number of branches, total individual height, and shrub cover as our factors (Table 1). The software used for statistical analysis was STATISTICA 8.0 (Weiß, 2007).

Table 1:
General lineal model analysis testing the influence of different soil parameters and macromorphological characters on perezone contents in Acourtia cordata in three localities at the “Chichinautzin”, Morelos, Mexico.
SS F1,46 P % Variation
Soil parameter
pH 2.17 1.20 n.s.
O.M. 17.55 9.75 ** 21.2
NH4 1.64 0.91 n.s.
NO3 1.08 0.60 n.s.
PO4 0.39 0.22 n.s.
Nt 6.83 3.79 * 8.3
Pt 28.71 15.96 *** 34.7
Macromorphological character
Basal diameter 0.47 0.26 n.s.
Number of branches 0.42 0.23 n.s.
Total individual height 0.60 0.33 n.s.
Shrub cover 0.01 0.00 n.s.
Locality
Study site 2.34 0.65 n.s.
DOI: 10.7717/peerj.16136/table-1

Notes:

not significant differences.
p < 0.05.
p < 0.01.
p < 0.005.

Proteomic analysis

Root tissue cleaning

Before protein extraction, each rhizome sample was cleaned according to Wang et al. (2003) with some modifications (Martinez-Esteso et al., 2011). For protein extraction, 4.0 g of tissue was prepared in a mortar and pestle with liquid nitrogen. The tissue was then transferred to 50.0 ml centrifuge tubes, suspended in 10.0 ml of n-hexane, agitated on vortex per 30 s, and centrifuged at 10,000×g per 5 min at 4 °C. The supernatant was then discarded, and the pellet was cleaned again 3–4 times until the supernatant was colorless. Next, the pellet was re-suspended in 10.0 ml of a mixture of ethyl acetate:ethanol (1:2) for 1 min and centrifuged 5 min at 4 °C. The wash was repeated followed by discarding the supernatant and re-suspending the sample after the addition of 10.0 ml pure acetone. This procedure includes resuspension, centrifugation, and the discarding of the supernatant. This was repeated during the following steps changing only the resuspension solution to 10% trichloroacetic acid (TCA) in cold acetone, 10% TCA in cold water, and finally, 80% acetone. At each step, the sample was centrifuged at 10,000 × g and 4 °C. Finally, the pellet was dried at room temperature and then stored at −80 °C for future usage.

Protein extraction

For protein extraction and further protein analysis, all individual rhizome samples were grouped according to their perezone yield using a statistical mean (k-means) test for grouping. Three groups were generated: high, medium, and low producers of perezone (Fig. S3). To obtain protein samples with clear differences for the proteomic analysis, only two groups were analyzed in this study: groups with high and low perezone production.

The cleaned dry powder of tissue (about 250 mg) was homogenized in 2.0 ml microtubes using 0.7 ml extraction buffer containing 0.7 M sucrose, 0.5 M Tris-HCl pH 8.0, 0.1 M HCl, 50 mM EDTA, 1% DTT, 1% polyvinylpolypyrrolidone (PVPP) and a cocktail of protease inhibitors (4-(2-aminoethyl) benzenesulfonyl fluoride, E-64, bestatin, leupeptin, aprotinin and sodium EDTA (Sigma-Aldrich, St. Louis, MO, USA). The homogenate was added with 0.7 ml Tris-saturated phenol pH 8.0 (Sigma, St. Louis, MO, USA). The mixture was vortexed thoroughly for 30 s and incubated with orbital shaking on ice for 1 h. The phenol phase was separated by centrifuging at 10,000 × g for 20 min at 4 °C. The upper phenol phase was recovered and pipetted to new 2.0-ml microtubes. The remaining aqueous phase was re-extracted with 0.7 ml Tris-saturated phenol pH 8.0 and 0.7 ml of extraction buffer. Proteins were precipitated from the pooled phenol phases by adding five volumes cold 0.1 M ammonium acetate in methanol, incubating at −20 °C overnight, and collecting by centrifuge at 10,000 × g for 10 min at 4 °C. The protein pellet was washed twice with 0.1 M ammonium acetate in methanol and twice with chilled 80% acetone. The pellet was resuspended in 6 M urea. Finally, the protein concentration was determined by RCDC protein assay (BIO-RAD, Madrid, Spain) based on the modified Lowry protein assay (Raghupathi & Diwan, 1994). Protein extracts were also visualized on SDS-PAGE (Fig. S4).

Liquid protein digestion and protein analysis by ultra-high-performance liquid chromatography coupled with quadrupole time-of-flight

One hundred micrograms of protein extracts were reduced with 5 µl of 0.2 M DTT. The reaction was kept at 37 °C for 60 min. Samples were alkylated with 20 µl of 0.2 M iodoacetamide (IAM) at room temperature for 60 min in the dark. The urea concentration during digestion was reduced 10 times with 25 mM ammonium bicarbonate (AMBIC) which is compatible with trypsin activity. Modified sequencing-grade trypsin was added at a 30:1 protein/enzyme ratio, and samples were digested at 37 °C overnight. Finally, we added trypsin 60:1 protein/enzyme ratio and allowed digestion to proceed for an additional 4–5 h at 37 °C. The tryptic digestion was stopped by acidifying the sample with formic acid. Once digested, each sample was evaporated and resuspended in 0.5% TCA in 5% acetonitrile (ACN) to desalt it. The peptides were passed through a C18 column (Pierce® C18 Spin Columns; Thermo Scientific, Waltham, MA, USA) previously prepared to be activated with 50% ACN buffer and calibrated with a solution of 0.5% TCA in 5% ACN. Once the peptides were inside the column, they were washed with activation buffer and eluted with 70% ACN and 0.1% formic acid; the solvent was then evaporated. Peptides were resuspended in 3% acetonitrile and 0.1% formic acid prior to LC-MS/MS analysis.

The desalted peptide digests were analyzed on an Agilent 6550 iFunnel Q-TOF mass spectrometer (Agilent Technologies, Santa Clara, CA, USA) coupled to an Agilent 1290 UHPLC system. Peptide samples were loaded onto an Agilent AdvanceBio Peptide mapping column (2.1 mm × 250 mm, 2.7 μm particle size, operating at 50 °C) via an Infinity Autosampler (Agilent Technologies, Santa Clara, CA, USA) with Buffer A (water, 0.1% formic acid) flowing at 0.4 ml/min. Peptides were eluted into the mass spectrometer using a 140 min linear gradient of 3–40% ACN in 0.1% formic acid. Peptides were introduced to the mass spectrometer from the LC using a Jet Stream source (Agilent Technologies, Santa Clara, CA, USA) operating in positive-ion mode (3,500 V) and high-sensitivity mode. Source parameters employed gas temp (250 °C), drying gas (14.0 l/min), nebulizer (35 psi), sheath gas temp (250 °C), sheath gas flow (11.0 l/min), VCap (3,500 V), fragmentor (360 V), and OCT 1 RF Vpp (750 V). The data were acquired as described Torregrosa-Crespo et al. (2020) with the same Workstation Software and LC/MS Data Acquisition, operating in the same Auto MS/MS mode. For more details, see the authors.

Each MS/MS spectrum was preprocessed with the extraction tool Spectrum Mill Proteomics Workbench (Agilent, Santa Clara, CA, USA) to obtain a peak list and to improve the spectral quality by merging MS/MS spectra with the same precursor (Δm/z < 1.4 Da and chromatographic Δt < 15 s). The reduced dataset was searched against a subset of the NCBInr protein database composed of sequences from the Asteraceae family, and contaminant proteins without taxonomical restrictions was searched in identity mode with the MS/MS search tool of Spectrum Mill Proteomics Workbench. The settings were as follows: trypsin, up to two missed cleavages, carbamidomethylation of cysteines as fixed modifications, oxidation of methionine as a variable, and mass tolerance of 20 ppm for precursor and 50 ppm for product ions. Peptide hits were validated in the peptide mode (FDR < 1.2%) and in protein mode according to the score settings recommended by the manufacturer. At the peptide level, the identification threshold was set at score ≥ 6 and % SPI ≥ 60 (the percentage of the extracted spectrum that is explained by the database search result). In Spectrum Mill, highly confidence identifications were done at the protein level and were considered when two or more peptides were matched. Their summed score is >20; however, the identification relies on highly conserved peptides because the database does not contain any sequence of Acourtia cordata; thus, we used all proteins identified through the surrogate-identified peptides for further bioinformatic analysis.

Quantitative label-free LC-MS analysis

LC-MS analysis without labels used Progenesis QI for proteomics software version 4.1 (NonLinear Dynamics, Newcastle upon Tyne, UK) as recommended by the manufacturer (www.nonlinear.com). The software processed the raw data in two steps as described by Meleady et al. (2012). First, each sample run was subjected to an alignment that involved aligning the data according to the LC retention time of each sample. This allows for any deviation in retention time that gives a tight retention time for all runs in the analysis. The execution of the sample that produced most of the characteristics (i.e., peptide ions) was used as the reference run, to which the retention time of all other runs was adjusted; the maximum intensities were then normalized. The Progenesis peptide quantification algorithm calculates the abundance of peptides as the sum of the areas of the peaks within their isotopic limits. Each abundance value is then transformed into a normalized abundance value by applying a global scale factor. The abundance of the proteins was automatically calculated by the Hi-3 as described by Silva et al. (2006) implemented in the Progenesis QI for proteomics.

Categorization and functional annotation

Proteins with differential abundances were considered upregulated when log2 was 1.0 or greater and downregulated when log2 was −1.0 or less. Differentially expressed proteins were classified by their Gene Ontology (GO) using Blast2GO v5 (https://www.blast2go.com/) (Meleady et al., 2012). Three different GO vocabularies were assigned including biological process, molecular function, and cellular component. A sequence file of FASTA sequences was generated from the group of proteins previously identified and quantified using the NCBI website. Blast2GO was charged with the FASTA file; first, the sequence description was introduced by a BLASTp searching vs. NCBInr (e-value cut-off 1 × 1010), then the terms GO, IC and Interpro were mapped, followed by sequence annotation (1 × 106 E-value Hit-Filter, cut off 0 Hsp-Hit covering, 55 annotation limit and five GO weight). The analysis of the metabolic pathways in which the identified proteins were classified was performed with the Kyoto Encyclopedia of Genes and Genomes (KEGG) database (http://www.genome.jp/kegg/) (Kanehisa et al., 2017). Finally, we analyzed protein-protein interaction for identified proteins using Cytoscape 3.6.1 software (Shannon et al., 2003). A protein-protein interaction network was obtained from the STRING database; we used the high score corresponding homologous proteins from the Arabidopsis thaliana (http://string-db.org/) (Szklarczyk et al., 2017). The interaction network from STRING was visualized in Cytoscape (http://www.cytoscape.org/). Network clustering was carried out as described by Safari-Alighiarloo et al. (2016) with the ClusterONE algorithm, identifying modules with a minimum density of >0.2 and a degree of >0.07. A p-value of < 0.05 for a cluster was considered to be a module (Tables S7 and S8).

Results

Perezone quantification in the rhizomes of Acourtia cordata wild plants

Our results showed significant variation in the production/accumulation of perezone between plant individuals of the same locality (Table S1). The content ranged from 0.329 ± 0.004 to 7.757 ± 0.170 mg g−1 of root (Figs. 1A1C). However, the perezone content did not vary significantly among the three localities studied. We found high producers of perezone that generate between 7.757 ± 0.170 and 4.474 ± 2.9405 × 10−5 mg g−1 of rhizome, medium producers of perezone that generate between 4.169 ± 0.486 to 2.881 ± 0.060 mg g−1, and low producers of perezone that generate between 2.21 ± 0.069 and 0.329 ± 0.004 mg g−1 of rhizome (Fig. S3). The identification and quantification of perezone used gas chromatography coupled with mass spectrometry with a retention time between 17.62 and 17.75 min; fragmentation patterns were seen at m/z 166, 191, 205, and 248 (Figs. S5S7).

Perezone content in rhizomes of each Acourtia cordata plant in the three localities: Chamilpa (A), Alarcón (B) and Felipe Neri (C).

Figure 1: Perezone content in rhizomes of each Acourtia cordata plant in the three localities: Chamilpa (A), Alarcón (B) and Felipe Neri (C).

Influence soil parameters and macromorphological characters on perezone contents production

The general linear model (GLM) analysis showed a significant influence of the O.M., Nt, and Pt soil parameters on the perezone contents production (Table 1). GLM analysis showed that Pt, O.M. and Nt had a significantly influenced on perezone content, explaining 34.7%, 21.2% and 8.3% of their variation, respectively. In contrast, there was no relationship between perezone contents and plant morphological characters and collection site (Table 1).

Identification and functional classification of differentially expressed proteins

The data obtained by mass spectrometry (MS) from each group of A. cordata plants were compared with the NCBI database for the Asteraceae family using Spectrum Mill Proteomics Workbench search engine. This approach identified 2,772 proteins (Table S2). We identified 616 differentially expressed proteins when comparing the high production group against the low perezone production group, from which 125 (20.3%) proteins correspond to over-expressed proteins and 491 (79.7%) to repressed proteins. To understand such differences, we performed a further analysis.

Blast2Go software (Conesa & Götz, 2008) was used for the automatic assignment of proteins description and could annotate the sequences compared by homology with the database of NCBI. The proteins over-expressed and repressed were analyzed and classified according to their gene ontology (GO) as follows: biological process (GOBP), molecular function (GOMF), and cellular component (GOCC). We note that some proteins were identified and classified in more than one GO category, so the total number of proteins in each graphic may be higher than the initial number (Tables S3 and S4).

Proteomic comparison between high and low perezone producers

We performed enrichment analysis of up-regulated and down-regulated proteins. Figures 2A, 2B, 3A, and 3B show the top significantly enriched GO terms and pathways. The most enriched GOBP categories in up-regulated proteins were the secondary metabolic process showing regulation of macromolecule biosynthetic process, DNA duplex unwinding, regulation of proteolysis, and DNA geometric change. These GO terms represent 88% of the GOBP category. The GOCC categories include the chloroplast envelope and plastid envelope; these GO terms represent 4.8% of the GOCC category. Finally, the GOMF category contained enzyme binding, protein binding, oxidoreductase activity, helicase activity, DNA helicase activity, and pyrophosphatase activity; these GO terms represent 60% of the GOMF category (Fig. 2A). In contrast, the enriched GOBP categories in down-regulated proteins were signal transduction, cell communication, MAPK cascade, and regulation of transferase activity; these GO terms represent 11% of the GOBP category. The GOMF categories were transferase activity transferring phosphorus-containing groups, protein serine/threonine kinase activity, and oxidoreductase activity; these GO terms represent 8.5% of the GOMF category (Fig. 2B).

(A) Representation of the set of over-expressed proteins grouped within the GO terms: molecular function, cellular component and biological process. (B) Representation of the set of under-expressed proteins grouped within the GO terms of molecular function and biological process.

Figure 2: (A) Representation of the set of over-expressed proteins grouped within the GO terms: molecular function, cellular component and biological process. (B) Representation of the set of under-expressed proteins grouped within the GO terms of molecular function and biological process.

Enrichment analysis of KEGG pathways of differentially expressed proteins between high and low perezone producers.

Figure 3: Enrichment analysis of KEGG pathways of differentially expressed proteins between high and low perezone producers.

(A) Comparison of the group of over expressed proteins between high and low perezone producers. (B) Comparison of the group of subexpressed proteins between high and low perezone producers.

Over-expressed and repressed proteins of both high and low perezone producers were also analyzed by the KEGG. This work used an enrichment analysis based on the Fisher exact test (p-value < 0.05) (Table S5 and S6). The enriched up-regulated metabolic pathways via KEGG were compared by comparing high and low perezone producers. These included carbapenem biosynthesis, terpenoid backbone biosynthesis, valine, leucine, and isoleucine degradation, biosynthesis of antibiotics, and benzoate degradation (Fig. 3A). The main down-regulated metabolic pathways in KEGG include pyruvate metabolism, purine metabolism, glycolysis/gluconeogenesis, thiamine metabolism, glyoxylate and dicarboxylate metabolism, oxidative phosphorylation, steroid hormone biosynthesis, galactose metabolism, drug metabolism, pyrimidine metabolism, arginine biosynthesis, pentose phosphate pathway, amino sugar and nucleotide sugar metabolism, biosynthesis of antibiotics, starch and sucrose metabolism, and nitrogen metabolism (Fig. 3B). The importance of this analysis is discussed below.

Protein-protein interaction analysis

We further searched for predicted interactions for the differentially expressed proteins identified by UHPLC-QTOF proteomics in the STRING protein-protein interaction database. We constructed a protein-protein interaction network in Cytoscape-Cluster ONE. The protein network analysis showed that five clusters were created in the up-regulated proteins group (Table S7). The network predicted an interaction among AT1G11660, FPS1, HSC70-1, P5CS2, AT5G47720, MFP2, AIM1, GAPC2, PKT4, PKT3, AT4G13010, ACAT2, AOR, and KAT5 (cluster 1). This group of proteins is involved in processes such as stress response, defense response, biosynthesis of SM, and fatty acid metabolism. An interaction was also observed among PHYA, FUS5, COP8, COP13, FUS12, COP9, and FUS6 (cluster 2). These proteins are involved in processes such as response to abiotic stimulus as well as an interaction among ESP3, At1g80070, MAC3A, MAC3B, CDC5, AT5G46840, MOS4, and AT3G19810 (cluster 3); these proteins are involved in spliceosome process. Finally, there was an interaction between UPF3, DCP2, LBA1, DCP1, and AT2G39260 with AT2G40770, AT1G30680, and SRS2, resulting in clusters 4 and 5, respectively; these proteins are involved in plant development and adaptation.

Nine clusters were created in the down-regulated proteins group (Table S8). The first cluster grouped proteins like LOS1, AT2G45030, RPS11, RPS5A, AT1G67430, RPS5B, AT1G07210, AT3G09630, and AT2G09990. These proteins are involved in the ribosome pathway and organic nitrogen compound biosynthetic processes. The interaction among AT5G51570, VHA-A3, AT3G42050, AT3G28715, VHA-A, and TUF (cluster 2) are vacuolar. An interaction was also observed among proteins such as NUP155, NUP1, NTF2B, EMB3142, and RAN3 (cluster 3) that are specifically involved in nucleocytoplasmic transport. EMB1467, NDHF, MATK, and NAD7 (cluster 4) are proteins that form part of ATP synthesis coupled with electron transport, cellular respiration, photorespiration, response to oxidative stress, as well as ATP synthesis-coupled electron transport. Among the interaction of AT3G53980, AGP30, and AT1G17860 (cluster 5) are proteins involved in processes such as systemic acquired resistance, regulation of root development, regulation of seed dormancy process, and endopeptidase inhibitor activity. The interaction among MDH, GDH2, PGMP, ALDH11A3, PKP-ALPHA, NADP-ME4, HXK2, NADP-ME1, PPC3, CSY2, PMDH1, TPI, GLN1-1, and IDH-V (cluster 6) are proteins involved in processes such as malate metabolic process, response to cold, cellular amino acid metabolic process, response to cadmium ion, response to salt stress, fatty acid biosynthetic process, hexokinase-dependent signaling, programmed cell death, leaf development, and photosynthesis. The interaction between LD, FLK, and AT2G47820 (cluster 7) results in this group of proteins being involved in cell differentiation, flower development, vegetative to reproductive phase transition of meristem, and positive regulation of flower development.

The interactions among AT5G46940, UGD3, and RGP5 (cluster 8) are proteins involved in processes such as ionotropic glutamate receptor activity, carbohydrate metabolic process, plant-type cell wall, biogenesis, and response to salt stress. Finally, the interactions among ABCF4, MSR2, and AT2G46290 (cluster 9) are proteins involved in transport, mannan metabolic process, and formation of cytoplasmic translation initiation complex.

Discussion

Influence of edaphic factors on the production of perezone

The perezone content could be considered constant in all three localities as can be deduced from the statistical analysis (Table 1), probably because the three studied localities are in the same natural area with the same geological history with similar climatic and altitudinal conditions (Vega Guzmán, López-García & Manzo Delgado, 2008). They are also associated with vegetation dominated by Quercus-Pinus forests (Fig. S1) (Jaimes-Viera et al., 2018). Another possibility is that the individuals associated with the three localities studied correspond to the same A. cordata population, although localities are several kilometers apart. Therefore, future research could be necessary to study the structure and genetic diversity to clarify whether the individuals of the three studied sites are part of the same population.

The high variation observed in perezone content among individual A. cordata wild plants at the same locality (Figs. 1A1C) can be attributed to possible genotypic differences; nevertheless, environmental factors may also have an important role. Therefore, the genotypic characteristics of each individual plant as well as the variability of the environment could determine the intra-population variation observed in perezone content because it is known that the genotypic variation in wild plants could be a significant contribution to qualitative and quantitative variability of secondary metabolites; in this sense, the interaction with environmental factors strongly influences secondary metabolism in wild plants (Moore et al., 2014; Soltis & Kliebenstein, 2015; Moreira et al., 2019).

The high perezone content among plants of the same locality positively correlates with organic matter, ammonium, total nitrogen, and total phosphorus. In this sense, it could be associated with the role that nitrogen and phosphorus play in processes such as photosynthesis and electron transport as well as the availability of precursor molecules for terpene biosynthesis in species that emit and accumulate terpenes. Ormeño & Fernandez (2012) mentioned that: “Nitrogen could promote terpenoid emissions by increasing electron transport rate and leaf photosynthesis, which provide ATP requirements and carbon substrate availability for isoprene synthesis. Phosphorus is expected to influence terpenoid production since terpenoid precursors (IPP: isopentenyl diphosphate and DMAPP: dimethylallyl pyrophosphate) contain high-energy phosphate bonds and phosphorus is a key component of ATP and NADPH, which are required for photosynthesis and terpenoid synthesis (Ormeño & Fernandez, 2012).

We found a positive and significant relationship between ammonium and perezone content that could be important. The assimilation of ammonium by the plant roots releases hydrogen ions (H+) into the soil, which in turn changes the soil pH leading to an increase in the acidity of the rhizospheric soil (Bolan, Hedley & White, 1991) that could be related to higher perezone content. In addition, it seems that these plants prefer ammonium as a source of nitrogen relative to nitrate perhaps because the assimilation of ammonia is less energetically expensive than that of nitrate (Hachiya & Sakakibara, 2017). Some authors state that the accumulation of SM is strongly affected by environmental factors; thus, plants regulate the type and quantity of SM according to environmental variations (Liu et al., 2015).

Comparison of the proteome of perezone-producing rhizomes

A. cordata is a medicinal plant that has been used as a source of perezone and other sesquiterpenes with several applications of pharmacological importance. However, it is a species without a sequenced and annotated genome. It is a “non-model plant” or an “orphan organism”, and proteomic research into non-model organisms like this is particularly challenging. However, we could still analyze the proteome of the perezone-producing rhizomes thanks to advances in proteomics. Other orphan organisms have been analyzed for secondary metabolism via proteomic studies: Picrorhiza kurroa, Podophyllum hexandrum, Panax ginseng, Catharanthus roseus, Withania somnifera, Euphorbia kansui, and Silybum marianum (Senthil et al., 2011; Champagne et al., 2012; Bhattacharyya et al., 2012; Gharechahi et al., 2013; Ma et al., 2013; Sud, Chauhan & Tandon, 2014; Zongo et al., 2013; Zhao et al., 2014; Romero-Sandoval, Kolano & Alvarado-Vázquez, 2017).

Previously, we commented that there is a positive relationship between soil nutrients and the perezone content. We explained the possible relationship between these factors and the compound of interest. However, the proteomics approach allowed us to present the results that broadened this perspective. In this sense, we mention that the nutrients in the soil may be found in a way not available for plants, even if the nutrients are in high concentrations. Since the functional annotation GO and KEGG analysis showed metabolic processes and pathways mainly related to the internal lack of nutrients, this hypothesis prompted ideas and possibilities to explain our observed results. Future work will continue to explore the A. cordata proteome under different types of stress and relate it to the production of perezone.

In this study, GO annotation analysis revealed that some up-regulated proteins were enriched in processes related to regulating nitrogenous compounds such as DNA and RNA biosynthesis. In addition, the secondary metabolism and the regulation of proteolysis processes were enriched together with the activity of proteins associated with ubiquitination: ubiquitin protein ligase and ubiquitin-like protein ligase play an essential role in the response of plants to nutritional homeostasis (Yates & Sadanandom, 2013; Rojas-Triana et al., 2013). These results indicate that such pathways are mainly related to the response to high nitrogen concentrations in the soil, their low availability and absorption for roots, and the biosynthesis of perezone.

The production of many secondary metabolites agrees with the carbon nutrient balance (CNB) hypothesis prediction (Coviella, 2002); however, many other results do not agree with such predictions (Manninen et al., 2002; Foyer, Ferrario-Méry & Noctor, 2001; Bryant, Chapin & Klein, 1983). Since the CNB hypothesis predicts that the amounts of secondary metabolites in plants crucially depend on carbon and nitrogen availability in their environment, the limited nutrient content, together with unlimited light, leads to growth inhibition and stimulation of carbon-based defense metabolites biosynthesis including tannins and terpenes (Bryant, Chapin & Klein, 1983). Due to these environmental factors, we found over-expressed enzymes in the group of high perezone production that participates in the metabolic pathway of terpenes. These enzymes are Acetyl-CoA C-acetyltransferase, geranyl-diphosphate synthase, and farnesyl diphosphate synthase. All three participate in the biosynthesis of terpenes via the mevalonate pathway (MVA) that occurs in the cytosol (Kumari et al., 2013). Farnesyl diphosphate synthase is a key enzyme in the biosynthesis of sesquiterpenes. It catalyzes the consecutive condensations of dimethylallyl diphosphate or geranyl diphosphate with isopentenyl pyrophosphate to produce farnesyl diphosphate (Zhao et al., 2015; Guo, Li & Peng, 2015). This molecule is a precursor of sesquiterpenes and other terpenes such as sterols, brassinosteroids, farnesol, nerolidol, germacrene, and valencene (Lombard & Moreira, 2011); it is likely participating in the biosynthesis of perezone. The sesquiterpenes are mostly volatile compounds with various functions in plants. They are released after damage due to the aerial or underground part of the plant by herbivores or pathogens (Dudareva, Pichersky & Gershenzon, 2004). They are also used as allelopathic agents and have other functions (Kochevenko et al., 2012; Assaeed et al., 2020).

We also found upregulated enzymes such as 2-oxoisovalerate dehydrogenase E2 component, malonate-semialdehyde dehydrogenase, and methylmalonate-semialdehyde dehydrogenase that participate in the valine, leucine, and isoleucine degradation pathway. In plants, branched-chain amino acids are an alternative energy source when the carbohydrate source is limited. These amino acids rapidly degrade under stress scenarios and development (Zhao et al., 2014). Moreover, amino acid catabolism is crucial in respiration in limited light conditions or prolonged darkness (Kochevenko et al., 2012; Araújo et al., 2011; Peng et al., 2015). These data suggest that the degradation of amino acids generates molecules of acetyl-CoA that can be used for the biosynthesis of terpenes. This process could be occurring in A. cordata plants with a high perezone yield: an over-expression pathway in terpene biosynthesis was observed along with the pathway of degradation of valine, leucine, and isoleucine.

We also observed some downregulated proteins such as glutamine synthetase participating in nitrogen metabolism. These were mainly seen in nitrate and ammonium assimilation and is also important to the photorespiration process (Lea & Miflin, 1974; Naliwajski & Skłodowska, 2018). We found a serine hydroxymethyltransferase implicated in glyoxylate and dicarboxylate metabolism; in plants, one of the main activities of this enzyme is to catalyze the reversible serine-to-glycine conversion. The one-carbon units mainly result from the activity of this enzyme (Walton & Woolhouse, 1986). Glyoxylate and dicarboxylate metabolism and photorespiration play a vital role in stress conditions, and enzymes such as glutamine synthetase and serine hydroxymethyltransferase are upregulated (Tang et al., 2019); in our study, however, they were downregulated according to other research results published previously. The reason for these results is still unknown, and some studies suggest that it could be related to post-transcriptional regulation (Xu et al., 2018).

In general, we found proteins that were under-expressed in the metabolism of glycolysis and gluconeogenesis, such as pyruvate decarboxylase, triosephosphate isomerase, fructose-bisphosphate aldolase, pyruvate dehydrogenase, phosphoenolpyruvate carboxykinase (GTP), phosphoglucomutase, and pyruvate kinase. One way to generate energy in plants is via the degradation of starch and sucrose. Starch is broken down to biosynthesize glucose molecules, which enter the glycolytic pathway (Fernie, Willmitzer & Trethewey, 2002; Bahaji et al., 2014). In summary, the downregulated expression of proteins related to the pentose phosphate pathway, glycolysis, as well as the starch and sucrose metabolism may indicate a lack of carbon skeletons necessary for nitrogen assimilation. These results agree and confirm our assumptions about the lack of internal nutrients and the plant allocation resources to synthesize molecules related to secondary metabolism.

Finally, the differentially expressed proteins were subjected to STRING analysis and visualized using Cytoscape software. The analysis of these proteins indicated that AT1G11660, P5CS2 and FPS1 are directly related and involved in heat stress response, proline biosynthesis, and terpenoid biosynthesis. Proline biosynthesis and accumulation have been reported to be involved in the adaptation of plants to environmental stresses (Hare & Cress, 1997) in addition to acting as an osmo-protective molecule (Meena et al., 2019). Proline is a multifunctional amino acid, and its biosynthesis has been linked to the production of secondary metabolites. In our analysis, the relationship between these proteins is likely linked to high concentrations of perezone and its biosynthesis due to nutritional stress.

In contrast, the biosynthesis of SM including volatile terpenes is related to heat stress. In that sense, Gupta et al. (2014) reported several volatile compounds differentially produced by Hymenaea courbaril under heat stress: isoprene, 2-methyl butanenitrile, β-ocimene, and sesquiterpenes (Gupta et al., 2014). Pazouki et al. (2016) demonstrate that heat stress results in a major enhancement of terpenoid emissions in Solanum lycopersicum (Bryant, Chapin & Klein, 1983).

The edaphic analysis showed a positive correlation between the concentrations of perezone and the levels of nitrogen, phosphorus, ammonia and organic matter. This relationship is weak and probably not completely linked to perezone biosynthesis because proteins were found overexpressed in the pathway of amino acid degradation. This suggests that the nutrients were not entirely in their available form to Acourtia cordata plants. Primary and secondary metabolic pathways did not take the nutrients from the soil to obtain the precursor molecules for the biosynthesis of sesquiterpenes, such as perezone. Indeed, the plants obtained these precursors from alternative routes, such as the degradation of amino acids and not from the photosynthetic pathway. In addition, the degradation of amino acids such as valine, leucine, and isoleucine, is related to low or no light; there could also be low rates of photosynthesis. This last idea agrees with our results because Acourtia cordata plants grow under the shade of the trees of the Quercus-Pinus forest.

Conclusions

This study correlates edaphic factors such as pH, O.M., Nt, NH4, and Pt with the contents of perezone in rhizomes of wild plants of A. cordata; it also includes the first proteomic report for A. cordata and the first to show a high variation of perezone production in wild plants. Also, it suggests that the genotypic variation could also be playing an important role in the production of perezone.

Moreover, all identified proteins gave a general view of the proteomic profile, specifically of the rhizome of this plant species. Some of these proteins were classified according to their gene ontology in biological processes, cellular components, and molecular function. The KEGG analysis classified proteins in several pathways of basal metabolism, including, glycolysis, gluconeogenesis, metabolism of amino acids/lipids, biosynthesis of SM, or proteins involved in the biosynthetic pathways of terpenes, phenylpropanoids, flavonoids, etc.

This work predicted probable interactions between some of the over-expressed proteins. It showed a possible expression of proteins involved in the biosynthesis of compounds, such as sesquiterpenes, together with the expression of related proteins during stress. This work will lead to other research, including further experiments using cloned A. cordata plants, to control the response of the genotype on varying pH, nitrogen, and phosphorous fertilizing concentrations, to elucidate the role of these environmental factors on the production/accumulation of perezone. RNA sequencing of high vs. low perezone producers could also be another approach.

Supplemental Information

Rhizomes of Acourtia cordata wild plants growing in Quercus-Pinus forest, were collected at three localities.

DOI: 10.7717/peerj.16136/supp-1

Voucher specimen of Acourtia cordata plants of the three localities.

DOI: 10.7717/peerj.16136/supp-2

Grouping of Acourtia cordata plants according with the perezone content in their rhizomes.

DOI: 10.7717/peerj.16136/supp-3

Protein profiles of perezone producing roots of A. cordata, Molecular Weight Marker (MWM), High 1, 2 and 3 (A1, A2 and A3), Medium 1, 2 and 3 (M1, M2 and M3), Down 1, 2 and 3 (B1, B2 and B3).

DOI: 10.7717/peerj.16136/supp-4

Analysis of GC-MS of perezone in the hexane extract of A.cordata roots.

Identification of the compound in the group of high producers with a RT = 17.75 min and m/z = 166, 191, 205 and 248.

DOI: 10.7717/peerj.16136/supp-5

Analysis of GC-MS of perezone in the hexane extract of A. cordata roots.

Identification of the compound in the group of low producers with a RT = 17.61 min y m/z = 166, 191, 205 and 248.

DOI: 10.7717/peerj.16136/supp-6

Figure S7.

Analysis of GC-MS of standard perezone. Identification of the compound. RT = 17.61 min y m/z = 166, 191, 205 and 248

DOI: 10.7717/peerj.16136/supp-7

Perezone quantification of A. cordata rhizomes and data for graphics 1A, 1B and 1C.

Plants from 1 to 20 belong to Chamilpa site, Plants from 21 to 40 belong to the Alarcon site, and Plants from 41 to 60 belong to the Felipe Neri site. Replicas 1, 2 and 3 stand for: R1, R2 and R3. Data for the Graphics 1A, 1B and 1C are after the quantification data.

DOI: 10.7717/peerj.16136/supp-8

Total identified proteins in A. cordata rhizomes.

DOI: 10.7717/peerj.16136/supp-9

Gene Ontology (GO) enrichment analyses for overexpressed proteins of A. cordata.

DOI: 10.7717/peerj.16136/supp-10

Gene Ontology (GO) enrichment analyses for repressed proteins of A. cordata.

DOI: 10.7717/peerj.16136/supp-11

KEGG enrichment analysis of upregulated proteins of A. cordata.

DOI: 10.7717/peerj.16136/supp-12

KEGG enrichment analysis of downregulated proteins of A. cordata.

DOI: 10.7717/peerj.16136/supp-13

List of proteins involved in protein-protein interaction network for upregulated proteins.

DOI: 10.7717/peerj.16136/supp-14

List of proteins involved in protein-protein interaction network for downregulated proteins.

DOI: 10.7717/peerj.16136/supp-15
1 Citation   Views   Downloads