Recent mobility of plastid encoded group II introns and twintrons in five strains of the unicellular red alga Porphyridium

View article
PeerJ

Main article text

 

Introduction

Materials and Methods

Porphyridium purpureum strains and plastid genomes

Mapping, polymorphism detection and analysis

Group II intron and IEP identification

Intron structure and evolution

Results and Discussion

Intron-encoded proteins

Group IIB intron secondary structure

Red algal twintrons

Conclusions

Supplemental Information

Nucleotide alignment of P. purpureum plastid introns

Boundaries used to determine homology are indicated in red (DI stem, DIV stem, DV and DVI stem, respectively). The IEP coding sequences are in yellow. Additional group II introns with degenerate IEPs (i.e., psbN-psbT, int.a rpoB, int mntA, int.b rpoC2) added to analysis are included. The mat1f-encoding group II intron illustrated here represents mat1fc; the nearly identical mat1fa and mat1fb are omitted.

DOI: 10.7717/peerj.1017/supp-1

Draft P. purpureum intron structure (intergenic region between atpB-atpE, mat1a IEP)

Only DIV, DV, and DVI were identified.

DOI: 10.7717/peerj.1017/supp-2

Alignment of P. purpureum intron-encoded protein domains

The four identified domains are separated by an artificial five amino acid gap. The unboxed 5′ sequence comprises the reverse transcriptase (RT) domain. The maturase (X) domain is boxed in black, the DNA-binding (D) domain in red and the endonuclease (En) domain in blue. The D and En domains are partial or absent in four IEPs (mat1a, mat1b, mat1c and mat1e). Asterisks are placed above the YADD domain.

DOI: 10.7717/peerj.1017/supp-3

Phylogeny of CL2B group II intron-encoded proteins

The nine plastidial IEP sequences from P. purpureum were added to selected sequences from the bacterial group II intron database, together with different eukaryote taxa such as Rhodophyta, Cryptophyta, Viridiplantae, Euglenozoa, and stramenopiles from the CL1 and CL2 group. The unrooted tree is annotated with the IEP classes (ML, bootstrap >70%).

DOI: 10.7717/peerj.1017/supp-4

Draft P. purpureum intron structure (int rpoC1, mat1g IEP)

The alternate secondary structure for domain III is depicted in the floating inset.

DOI: 10.7717/peerj.1017/supp-5

Draft P. purpureum intron structure (int tsf, mat1i IEP)

The alternate secondary structure for domain III is depicted in the floating inset.

DOI: 10.7717/peerj.1017/supp-6

Draft P. purpureum intron structure (int ycf46, mat1h IEP)

The alternate secondary structure for domain III is depicted in the floating inset.

DOI: 10.7717/peerj.1017/supp-7

Draft P. purpureum intron structure (int.a rpoC2, mat1e IEP)

The alternate secondary structure for domain III is depicted in the floating inset.

DOI: 10.7717/peerj.1017/supp-8

Draft P. purpureum intron structure (int gltB, mat1d IEP)

The alternate secondary structure for domain III is depicted in the floating inset.

DOI: 10.7717/peerj.1017/supp-9

Draft P. purpureum intron structure (int atpB, mat1f IEP)

The alternate secondary structure for domain III is depicted in the floating inset.

DOI: 10.7717/peerj.1017/supp-10

Draft P. purpureum intron structure (int.b atpI, ORF remnant and outer twintron)

The alternate secondary structure for domain III is depicted in the floating inset.

DOI: 10.7717/peerj.1017/supp-11

Draft P. purpureum intron structure (int.b rpoC2, IEP remnant and outer twintron)

DOI: 10.7717/peerj.1017/supp-12

Draft P. purpureum intron structure (int.c infC, mat1c IEP)

DOI: 10.7717/peerj.1017/supp-13

Draft P. purpureum intron structure (intergene psbN-psbT, IEP remnant)

DOI: 10.7717/peerj.1017/supp-14

Draft P. purpureum intron structure (int.a rpoB, IEP remnant)

DOI: 10.7717/peerj.1017/supp-15

Draft P. purpureum intron structure (int mntA, IEP remnant)

DOI: 10.7717/peerj.1017/supp-16

P. purpureum group II intron/IEP alignment

Alignment of 14 P. purpureum intron/intergenic regions containing an IEP/IEP remnant and four Rhodomonas salina introns. Secondary structures from each domain (DI–DVI) are marked and represented by different colors. The dnaK intron (containing mat1b) does not retain a group IIB intron structure. A partial structure was determined for the atpB-atpEintergenic region (containing mat1a). All the IEPs or IEP remnants are located in domain IV, including the R. salina introns (previously described as the only case of group II intron IEPs located outside of DIV). Twintron insertion sites are indicated with asterisks. The mat1f-encoding structure illustrated here is that encoding mat1fc (int.atpB); the nearly identical mat1fa- and mat1fb-encoding group II introns are omitted.

DOI: 10.7717/peerj.1017/supp-17

Nucleotide alignment of the exon and intron binding sites

The P. purpureum EBS and IBS pairings are unique to each intron/IEP. The complementarity between both is generally preserved; if not, the mutation is located in the 5′ region. EBS1 and/or EBS2 were not identified for the mat1a, mat1b, and mat1c introns. “Ghost” refers to remnant IEPs.

DOI: 10.7717/peerj.1017/supp-18

Modified Rhodomonas salina group II intron secondary structure (groEL gene, strain CCMP 1178)

The domains II, III and IV were modified on the original structure designed by Khan et al. (2007).

DOI: 10.7717/peerj.1017/supp-19

Modified Rhodomonas salina group II intron secondary structure (intron 1, groEL gene, strain CCMP 2045)

The domains II, III and IV were modified on the original structure designed by Khan et al. (2007).

DOI: 10.7717/peerj.1017/supp-20

Modified Rhodomonas salina group II intron secondary structure (intron 2, groEL gene, strain CCMP 2045)

The domains III and IV were modified on the original structure designed by Khan et al. (2007).

DOI: 10.7717/peerj.1017/supp-21

Modified Rhodomonas salina group II intron secondary structure (psbN gene, strain CCMP 1319)

The domains I, II, III, IV and VI were modified on the original structure designed by Maier et al. (1995).

DOI: 10.7717/peerj.1017/supp-22

Modified Rhodomonas salinagroup II intron secondary structure (groEL gene, strain Maier)

The domains I, II, III, IV and VI were modified on the original structure designed by Maier et al. (1995).

DOI: 10.7717/peerj.1017/supp-23

Domain IV primary binding site

The binding sites of the maturases were determined by comparing sequence alignments. The stem-loop structure from a purine-rich internal loop is framed in white, whereas the start- codon is framed in black.

DOI: 10.7717/peerj.1017/supp-24

Group II introns used in analysis

Sequences used to guide secondary structure homology search and included in phylogenetic analyses of P. purpureum group II introns.

DOI: 10.7717/peerj.1017/supp-25

Query sequences used for structural homology identification

Query sequences used to identify the DI, DIV, DV and DVI domains via BLASTn (Altschul et al., 1990).

DOI: 10.7717/peerj.1017/supp-26

Additional Information and Declarations

Competing Interests

The authors declare there are no competing interests.

Author Contributions

Marie-Mathilde Perrineau conceived and designed the experiments, performed the experiments, analyzed the data, wrote the paper, prepared figures and/or tables, reviewed drafts of the paper.

Dana C. Price performed the experiments, analyzed the data, wrote the paper, prepared figures and/or tables, reviewed drafts of the paper.

Georg Mohr analyzed the data, contributed reagents/materials/analysis tools, reviewed drafts of the paper.

Debashish Bhattacharya conceived and designed the experiments, performed the experiments, analyzed the data, contributed reagents/materials/analysis tools, wrote the paper, reviewed drafts of the paper.

DNA Deposition

The following information was supplied regarding the deposition of DNA sequences:

The group II intron/IEP sequences described here are accessible via GenBank accession numbers KJ826367 to KJ826395.

Funding

The work was funded by a grant from the National Science Foundation (1004213) and from the United States Department of Energy (DE-EE0003373/001) awarded to Debashish Bhattacharya. Research by Georg Mohr is supported by NIH grant GM37949 and Welch Foundation grant F-1607 to Alan M. Lambowitz. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

19 Citations 3,347 Views 1,038 Downloads

Your institution may have Open Access funds available for qualifying authors. See if you qualify

Publish for free

Comment on Articles or Preprints and we'll waive your author fee
Learn more

Five new journals in Chemistry

Free to publish • Peer-reviewed • From PeerJ
Find out more