Loss of inner kinetochore genes is associated with the transition to an unconventional point centromere in budding yeast

View article
Bioinformatics and Genomics

Main article text

 

Introduction

Materials and Methods

Gene presence/absence screening

Multiple sequence alignments of CENPA/CSE4 gene

Calculation of dyad density

Results

Patterns of kinetochore gene loss

N-terminus divergence of CENPA/CSE4 gene in Naumovozyma

Dyad density at point centromeres

Discussion

Conclusions

Supplemental Information

Matrix of kinetochore gene presence or absence across 20 yeast species.

The gene presence/absence matrix across 20 yeast species has been downloaded from YGOB database for kinetochore genes. Genes missing in both copies for Post-WGD species are colored red, missing in Pre-WGD species are colored orange, genes with both copies present in Post-WGD are colored olive green, genes missing one of the copies are colored light blue and genes present in pre-WGD species are colored light green.

DOI: 10.7717/peerj.10085/supp-1

Intergenic sequences in regions orthologous to the lost gene.

Intergenic sequences in the regions orthologous to the lost gene are provided as a text file. These sequences were searched for the presence of ORF’s using NCBI’s ORF finder program.

DOI: 10.7717/peerj.10085/supp-2

Multiple sequence alignments and results of selection tests.

The multiple sequence alignment of open reading frame nucleotide sequences and amino acid sequences generated using MUSCLE, M-coffee, Clustal Omega and Guidance with PRANK are provided along with the commands used for the command line tools. Results of the selection tests from BUSTED, FEL, MEME and RELAX from the HYPHY package are provided as json files that can be visualized using the online hyphy vision website.

DOI: 10.7717/peerj.10085/supp-3

Validation of CENPA/CSE4 ORF in Naumovozyma species.

To rule out the possibility of erroneous annotation of shorter ORF’s in non-naumovozyma species I extracted the genomic sequence found between genes flanking CENPA (CENH3 and CSE4 homolog) and searched them for ORF’s using the NCBI ORF finder program with default settings. Screenshots of the results of the ORF finder program are also provided.

DOI: 10.7717/peerj.10085/supp-4

Amino acid sequence conservation scores for CENPA/CSE4 gene in yeast species.

The amino acid sequence of CENPA/CSE4 gene in 20 yeast species was aligned using MUSCLE multiple sequence aligner with default settings. The resultant alignment was used as input to the al2co program to calculate the conservation scores. First column in the file provides the position along the CENPA/CSE4 protein and the subsequent columns are the various metrics used to calculate conservation. All measures of sequence conservation show that the c-terminal region is conserved and n-terminal region is highly diverged between species.

DOI: 10.7717/peerj.10085/supp-5

Sequences of point centromeres in budding yeast species and their positions.

The multi-fasta file of centomere sequences of budding yeast species extracted from YGOB. Genes flanking each of the centromeres is also captured in the YGOB screenshots.

DOI: 10.7717/peerj.10085/supp-6

Palindromes identified in the budding yeast centromeres.

The palindromes program from the emboss package was used to identify inverted repeats along the length of budding yeast centromeres. Output files in the .pal format are provided.

DOI: 10.7717/peerj.10085/supp-7

GC content and dyad density of centromere sequences from budding yeast species.

The percent GC content and dyad density for each budding yeast centromere are provided. Number of A, T, G and C bases as well as the number of dyad covered bases is also provided. This table is used to generate Figs. 4A and 4B.

DOI: 10.7717/peerj.10085/supp-8

Additional Information and Declarations

Competing Interests

The author declares that he has no competing interests.

Author Contributions

Nagarjun Vijay conceived and designed the experiments, performed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the paper, and approved the final draft.

Data Availability

The following information was supplied regarding data availability:

The raw data are available in the Supplemental Files.

Funding

Nagarjun Vijay was awarded the Innovative Young Biotechnologist Award 2018 from the Department of Biotechnology and Early Career Research Award from the Department of Science and Technology (both Government of India). The computational analyses were performed on the Har Gobind Khorana Computational Biology cluster established and maintained by combining funds from IISER Bhopal under Grant # INST/BIO/2017/019, Innovative Young Biotechnologist Award 2018 from Department of Biotechnology and Early Career Research Award from Department of Science and Technology. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

1 Citation 1,426 Views 289 Downloads

Your institution may have Open Access funds available for qualifying authors. See if you qualify

Publish for free

Comment on Articles or Preprints and we'll waive your author fee
Learn more

Five new journals in Chemistry

Free to publish • Peer-reviewed • From PeerJ
Find out more