A Python script to merge Sanger sequences

View article
Bioinformatics and Genomics

Main article text

 

Introduction

Methods

Running environments and input files

Running script

  1. Make a new directory and copy the sequencing result files of the above section (Running environments and input files) to the newly made directory. Also, copy the script “Merge_Sanger_v2.py” (Supplemental Information S2) to the directory.

  2. Open a new terminal and “cd” to the directory. Run the following command. Notice that a space between every two items is needed and the backward slash is used for multiline command input.

    • python3.7 Merge_Sanger_v2.py 001F_sequence_to_be_merged.seq \

    • 002F_sequence_to_be_merged.seq 003F_sequence_to_be_merged.seq \

    • 004R_sequence_to_be_merged.seq 005R_sequence_to_be_merged.seq \

    • 006R_sequence_to_be_merged.seq

    The Sanger sequencing result files must be input from lower to higher according to the first three numbers of the file name. It is convenient to input the first three numbers and then use the tab key to fill the rest of the file name automatically. Our script is able to merge dozens of (up to 1,000) Sanger sequencing result files at a time, which is far enough for usual needs in labs.

  3. After running finishes, a folder named “merged_sequence”, which contains the merged sequence file with the name “merged.seq”, appears.

Manual validation of the merged Sanger sequences

Comparison with CAP3 and Fragment Merger for merging Sanger sequences

Algorithm

Results

Discussion

Conclusions

Supplemental Information

Sanger sequencing file 000F.

DOI: 10.7717/peerj.11354/supp-1

Sanger sequencing file 001F.

DOI: 10.7717/peerj.11354/supp-2

Sanger sequencing file 002F.

DOI: 10.7717/peerj.11354/supp-3

Sanger sequencing file 003F.

DOI: 10.7717/peerj.11354/supp-4

Sanger sequencing file 004R.

DOI: 10.7717/peerj.11354/supp-5

Sanger sequencing file 005R.

DOI: 10.7717/peerj.11354/supp-6

Sanger sequencing file 006R.

DOI: 10.7717/peerj.11354/supp-7

The Python script of this study.

DOI: 10.7717/peerj.11354/supp-8

Manual validation of the merged Sanger sequences.

DOI: 10.7717/peerj.11354/supp-9

Screen output of the script run.

DOI: 10.7717/peerj.11354/supp-10

Comparison with Fragment Merger for Sanger sequence merging.

DOI: 10.7717/peerj.11354/supp-11

Readme for the script of this study.

DOI: 10.7717/peerj.11354/supp-12

Additional Information and Declarations

Competing Interests

The authors declare that they have no competing interests.

Author Contributions

Cen Chen conceived and designed the experiments, performed the experiments, analyzed the data, prepared figures and/or tables, and approved the final draft.

Bingguo Lu conceived and designed the experiments, performed the experiments, analyzed the data, prepared figures and/or tables, and approved the final draft.

Xiaofang Huang performed the experiments, authored or reviewed drafts of the paper, and approved the final draft.

Chuyun Bi performed the experiments, authored or reviewed drafts of the paper, and approved the final draft.

Lili Zhao performed the experiments, authored or reviewed drafts of the paper, and approved the final draft.

Yunzhuo Hu performed the experiments, authored or reviewed drafts of the paper, and approved the final draft.

Xuanyang Chen performed the experiments, authored or reviewed drafts of the paper, and approved the final draft.

Shiqiang Lin conceived and designed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the paper, and approved the final draft.

Kai Huang conceived and designed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the paper, and approved the final draft.

Data Availability

The following information was supplied regarding data availability:

The example input files, the Python script, the readme file, the manual validation of the merging of the example input files, the screen output of the script run, and the comparison with Fragment Merger for Sanger sequence merging are available in the Supplemental Files.

Funding

This study is supported by the National Natural Science Foundation of China, Grant/Award Number: 91949201 (Kai Huang). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

2 Citations 3,290 Views 1,738 Downloads

Your institution may have Open Access funds available for qualifying authors. See if you qualify

Publish for free

Comment on Articles or Preprints and we'll waive your author fee
Learn more

Five new journals in Chemistry

Free to publish • Peer-reviewed • From PeerJ
Find out more