SSREnricher: a computational approach for large-scale identification of polymorphic microsatellites based on comparative transcriptome analysis

View article
Bioinformatics and Genomics

Main article text

 

Introduction

Materials and Methods

Architectural structure

  1. SSR mining: MISA is integrated into SSREnricher and is used to mine SSR sequence information in transcripts, using the following settings: SSR motifs and the number of repeats shown respectively, monomer-10, dimer-6, trimer-5, tetramer-5, pentamer-5, and hexamer-5. The maximal number of interrupting basepairs in a compound microsatellite is set to 100.

  2. Sequence clustering: CD-HIT-EST is used to cluster transcripts with ≥90% similarity to obtain clustering and positive or negative strand information of sequences.

  3. Sequence modification: The negative-strand sequences are converted into reverse complementary sequences.

  4. Enrichment containing polymorphic SSR sequences: The repeat units possessing only one nucleotide are filtered out. Repeat units less than 50 bp from the beginning or end of the sequence are also removed.

  5. False-positive removal: The software screens for false-positive based on the enrichment results. The clusters that contain sequences with multiple SSRs loci with the same motif, while others miss some SSRs loci, are removed.

  6. Results output and multiple sequence alignment: Multi-sequence alignment is performed on the obtained clusters to observe the position of the SSRs and generates the result files.

Identification of polymorphic SSRs using SSREnricher

Validation experiments

Results

Discussion

Conclusions

Supplemental Information

Polymorphic SSRs found in grass carp and rice.

DOI: 10.7717/peerj.9372/supp-1

The primers of grass carp and rice.

DOI: 10.7717/peerj.9372/supp-2

The code of SSREnricher.

DOI: 10.7717/peerj.9372/supp-3

The face of SSREnricher.

DOI: 10.7717/peerj.9372/supp-4

The pictures of some of the identified polymorphic SSR which were genotyped by capillary electrophoresis.

A, SSR Ci-SSR28, the primer of which was labeled with ROX; B, SSR Ci-SSR26, the primer of which was labeled with HEX; C, SSR Os-SSR29, the primer of which was labeled with FAM.

DOI: 10.7717/peerj.9372/supp-5

Additional Information and Declarations

Competing Interests

The authors declare that they have no competing interests.

Author Contributions

Wei Luo conceived and designed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the paper, and approved the final draft.

Qing Wu conceived and designed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the paper, and approved the final draft.

Lan Yang performed the experiments, prepared figures and/or tables, and approved the final draft.

Pengyu Chen performed the experiments, prepared figures and/or tables, and approved the final draft.

Siqi Yang performed the experiments, prepared figures and/or tables, and approved the final draft.

Tianzhu Wang performed the experiments, prepared figures and/or tables, and approved the final draft.

Yan Wang performed the experiments, prepared figures and/or tables, and approved the final draft.

Zongjun Du conceived and designed the experiments, authored or reviewed drafts of the paper, and approved the final draft.

Data Availability

The following information was supplied regarding data availability:

Data is available at GitHub: https://github.com/byemaxx/SSREnricher.

Funding

This study was funded by the National Natural Science Foundation of China (No. 41706171) and the 13th five-year aquaculture-breeding project of Sichuan province (No. 2016NYZ0047), China. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

5 Citations 1,689 Views 517 Downloads