CRSeek: a Python module for facilitating complicated CRISPR design strategies

Department of Microbiology and Immunology, Drexel University College of Medicine, Philadelphia, PA, United States
Center for Molecular Virology and Translational Neuroscience, Institute for Molecular & Medicine and Infectious Disease, Drexel University College of Medicine, Philadelphia, PA, United States
School of Biomedical Engineering, Science, and Health Systems, Drexel University, Philadelphia, PA, United States
Sidney Kimmel Cancer Center, Thomas Jefferson University, Philadelphia, PA, United States
DOI
10.7287/peerj.preprints.27094v1
Subject Areas
Bioinformatics, Genomics, Synthetic Biology, Data Mining and Machine Learning
Keywords
CRISPR, Python, machine learning
Copyright
© 2018 Dampier et al.
Licence
This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Preprints) and either DOI or URL of the article must be cited.
Cite this article
Dampier W, Chung C, Sullivan NT, Atkins AJ, Nonnemacher MR, Wigdahl B. 2018. CRSeek: a Python module for facilitating complicated CRISPR design strategies. PeerJ Preprints 6:e27094v1

Abstract

With the popularization of the CRISPR-Cas gene editing system there has been an explosion of new techniques made possible by this versatile technology. However, the computational field has lagged behind with a current lack of computational tools for developing complicated CRISPR-Cas gene editing strategies. We present crseek, a Python package that provides a consistent application programming interface (API) for multiple cleavage prediction algorithms. Four popular cleavage prediction algorithms were implemented and further adapted to work on draft-quality genomes. Furthermore, since crseek mirrors the popular scikit-learn API, the package can be easily integrated as an upstream processing module for facilitating further CRISPR-Cas machine learning research. The package is fully integrated with the biopython package facilitating simple import, export, and manipulation of sequences before and after gene editing. This manuscript presents four common gene editing tasks that would be difficult with current tools but are easily performed with the crseek package. We believe this package will help bioinformaticians rapidly design complex CRISPR-Cas gene editing strategies and will be a useful addition to the field.

Author Comment

This is a submission to PeerJ for review.