Tandem-repeat protein domains across the tree of life

View article
PeerJ

Main article text

 

Introduction

Materials and Methods

ANK, TPR and ARM-containing protein data acquisition and analysis

16S rRNA phylogenetic tree and independence analysis

Results

The incidence of tandem-repeat containing proteins decreases from Eukaryotes > Bacteria > Archaea

The variation in intraproteomic abundance of each repeat domain is correlated in bacteria

ARM and TPR domains are not enriched in host-associated bacteria

TPR domains are not enriched in pathogenic bacteria

Discussion

TPR-containing proteins in bacteria

ARM-containing proteins in bacteria

Repeat domains in archaea

Evolution of tandem repeats

Supplemental Information

Figure S1

Bar graph of the percent of bacterial species analyzed (y axis) with the specified percent of ARM-containing proteins (x axis). The number above the bars on the graph lists the number of species with the specified percent of ARM-containing proteins.

DOI: 10.7717/peerj.732/supp-1

Figure S2

Bar graph of the percent of bacterial species analyzed (y axis) with the specified percent of TRP-containing proteins (x axis). The number above the bars on the graph lists the number of species with the specified percent of TPR-containing proteins.

DOI: 10.7717/peerj.732/supp-2

Lifestyle analysis of bacterial species with 1.5% of their proteome composed of TPR-containing proteins

An average of the percent of TPR-containing proteins for all organisms of the same species was used for these analyses. Bar graph of the average percent of the proteome composed of TPR-containing proteins in species of free-living (FL), facultative host-associated (FHA) and obligate intracellular (O) bacteria with 1.5% of their proteome composed of TPR-containing proteins. Error bars represent standard error. (all comparisons, P > .05).

DOI: 10.7717/peerj.732/supp-3

ARM family domains analysis within the ARM Superfamily

The ARM repeats of one bacterial and archaeal species within each phylogenetic family present in the data set (Tables S6 and S7), as well as 10 species of eukarya (each representing one phylogenetic family) were analyzed for their ARM domain composition. An average of the percent of ARM domains within each ARM family category was graphed. Error bars represent standard error.

DOI: 10.7717/peerj.732/supp-4

Table S1

Raw data extracted from SUPERFAMILY for domain analysis

DOI: 10.7717/peerj.732/supp-5

Table S2

Lifestyle analysis of bacterial species with ≥0.2% ARM-containing proteins.

DOI: 10.7717/peerj.732/supp-6

Table S3

Lifestyle analysis of bacterial species with ≥1.5% TPR-containing proteins.

DOI: 10.7717/peerj.732/supp-7

Table S4

Bacterial species with greater than 2.0% of their proteome composed of TPR-containing proteins.

DOI: 10.7717/peerj.732/supp-8

Table S5

Bacterial species known to contain TPR domain containing virulence factors.

DOI: 10.7717/peerj.732/supp-9

Table S6

Distribution of ARM domain families across bacterial classes.

DOI: 10.7717/peerj.732/supp-10

Table S7

Distribution of ARM domain families across archaeal classes.

DOI: 10.7717/peerj.732/supp-11

Additional Information and Declarations

Competing Interests

The authors declare there are no competing interests.

Author Contributions

Kristin K. Jernigan conceived and designed the experiments, performed the experiments, analyzed the data, wrote the paper, prepared figures and/or tables, reviewed drafts of the paper.

Seth R. Bordenstein conceived and designed the experiments, analyzed the data, wrote the paper, reviewed drafts of the paper.

Funding

This research was funded by NIH awards F32 GM 100778 and 5T32HD007043-34 to Kristin K. Jernigan, and R01GM085163 to Seth R. Bordenstein. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

54 Citations 4,655 Views 1,458 Downloads