MGST2 and WNT2 are candidate genes for comitant strabismus susceptibility in Japanese patients

Jingjing Zhang; Toshihiko Matsuo

doi:10.7717/peerj.3935

MGST2 and WNT2 are candidate genes for comitant strabismus susceptibility in Japanese patients

Department of Ophthalmology, Okayama University Graduate School of Medicine, Dentistry, and Pharmaceutical Sciences, Okayama City, Okayama, Japan

DOI: 10.7717/peerj.3935

Published: 2017-10-17
Accepted: 2017-09-26
Received: 2017-06-02

Academic Editor: Steven Buyske

Subject Areas: Genetics, Genomics, Ophthalmology, Medical Genetics
Keywords: Comitant strabismus, Linkage analysis, Exotropia, Esotropia, Case-control association study, Transmission disequilibrium test allowing for errors (TDTae), Chromosomal susceptibility locus, Japanese families, Transmission disequilibrium test (TDT), Candidate gene

Copyright: © 2017 Zhang and Matsuo
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.

Cite this article: Zhang J, Matsuo T. 2017. MGST2 and WNT2 are candidate genes for comitant strabismus susceptibility in Japanese patients. PeerJ 5:e3935 https://doi.org/10.7717/peerj.3935

Abstract

Background/Aim

Strabismus is a common condition with misalignment between two eyes that may lead to decrease of visual acuity, lack of binocularity, and diplopia. It is caused by heterogeneous environmental and genetic risk factors. Our previous research has identified new chromosomal susceptibility loci in 4q28.3 and 7q31.2 regions for comitant strabismus in Japanese families. We conducted a verification study by linkage analysis to narrow the chromosomal loci down to a single gene.

Methods

From Japanese and U.S. databases, 24 rsSNPs and 233 rsSNPs were chosen from the 4q28.3 and 7q31.2 region, respectively, and were typed in 108 affected subjects and 96 unaffected subjects of 58 families with primary and non-syndromic comitant strabismus. Three major analytical methods were used: transmission disequilibrium test (TDT), TDT allowing for errors (TDTae), and linkage analysis under dominant and recessive inheritance.

Results

The SNPs with significant P values in TDT and TDTae were located solely at the gene, microsomal glutathione S-transferase 2 (MGST2), on chromosome 4q28.3 locus. In contrast, significant SNPs were dispersed in a few genes, containing wingless-type MMTV integration site family member 2 (WNT2), on chromosome 7q31.2 locus. The distribution of significant SNPs on the 7q31.2 locus showed that only the ST7 to WNT2 region in the same big haplotype block contained significant SNPs for all three methods of linkage analysis.

Conclusions

This study suggests that MGST2 and WNT2 are potential candidates for comitant strabismus in Japanese population.

Introduction

Strabismus refers to misalignment between two eyes that point in different directions, and is classified into comitant (concomitant) strabismus and incomitant (noncomitant or paralytic) strabismus. Structural anomalies of extraocular muscles, such as anomalous insertion, hypoplasia and aplasia, have long been recognized as congenital causes for hereditary incomitant strabismus (Matsuo et al., 1988; Uchiyama et al., 2010; Okano et al., 1990; Matsuo et al., 2009a). More recently, genomic mutations or polymorphisms have been identified in families with hereditary incomitant strabismus, including congenital fibrosis of extraocular muscles (CFEOM), Duane syndrome (Engle, 2007; Graeber, Hunter & Engle, 2013), or congenital (idiopathic) superior oblique muscle palsy (Jiang et al., 2004; Jiang et al., 2005; Imai et al., 2008; Ohkubo et al., 2012). Congenital progressive external ophthalmoplegia is also well recognized as a mitochondrial disease with hereditary background. Acquired incomitant strabismus is caused by vascular, traumatic or compression paralysis of ocular motility cranial nerves. It may also be developed as a consequence of muscle diseases, or resulted from hyper- or hypothyroidism, myasthenia gravis and other rare conditions.

Primary and non-syndromic comitant strabismus is a multifactorial disorder which has both genetic and environmental background with their undefined contribution (Michaelides & Moore, 2004; Maconachie, Gottlob & McLean, 2013; Ye et al., 2014). Genetic influence is evidenced by family history (Abrahamsson, Magnusson & Sjostrand, 1999; Matsuo, Yamane & Ohtsuki, 2001; Taira et al., 2003) and phenotypic concordance between monozygotic twins (Podgor, Remaley & Chew, 1996; Matsuo et al., 2002; Sanfilippo et al., 2012). Environmental influence is supported by the association with premature birth and perinatal hypoxia as it occurs with a higher incidence in cerebral palsy (Cotter et al., 2011; Jacobson & Dutton, 2000). At present, no gene has been identified to be responsible for the development of comitant strabismus. American and British researchers reported 7p22.1 as a chromosomal susceptibility locus for esotropia in Caucasian families (Parikh et al., 2003; Rice et al., 2009). Our previous research has identified the susceptibility loci in 4q28.3 and 7q31.2 regions for comitant strabismus that comprised both esotropia and exotropia in Japanese families (Fujiwara et al., 2003; Shaaban et al., 2009a; Shaaban et al., 2009b). Other chromosomal loci have also been reported to be associated with comitant strabismus in other ethnicity (Khan et al., 2011; Bosten et al., 2014).

Given this background, we conducted single nucleotide polymorphism (SNP) analyses to narrow the chromosomal loci down to a single gene in Japanese families. As an analytical method, we previously tried to use association study that examines the relationship between several polymorphic markers and the strabismus phenotype in the chromosomal regions (Matsuo, 2015). In this study, we used three different methods for linkage analysis: transmission disequilibrium test (TDT) (Spielman & Ewens, 1996), TDT allowing for errors (TDTae) (Gordon et al., 2004), and linkage analysis under dominant and recessive inheritance (Lathrop et al., 1984). We hypothesized that the results by different analytical methods of linkage analysis might localize a specific gene that would be responsible for comitant strabismus.

Materials and Methods

Subjects

This study involved 108 affected subjects and 96 unaffected subjects in 58 Japanese families with primary and non-syndromic comitant strabismus including both esotropia and exotropia, which mostly overlapped with subjects in the previous study for chromosomal loci identification (Shaaban et al., 2009a). The previous study used 55 families with at least four members in each family. Part of the genomic DNA samples that were used in the previous study were no longer available due to the DNA shortage. Thus, the present study involved new affected subjects and unaffected subjects in new families as well as available subjects of the previous study. The features of 58 families are summarized in Table 1. The study followed the tenets of the Declaration of Helsinki, and was approved by the Ethics Committee of Okayama University Graduate School of Medicine, Dentistry, and Pharmaceutical Sciences (Approval No. Genome 215).

Table 1:

Clinical features of affected subjects with primary and non-syndromic strabismus and unaffected subjects in families.

	Affected (%)	Unaffected (%)
The number of individuals	108 (52.9%)	96 (47.1%)
Male	45 (53.6%)	39 (46.4%)
Female	63 (52.5%)	57 (47.5%)
The number of families	55 (94.8%)	3 (5.2%)
Exotropia	22
Esotropia	25
Mixed phenotypes (exotropia and esotropia)	8
The number of individuals	108 (52.9%)	96 (47.1%)
Exotropia	52
Intermittent exotropia	44
Constant exotropia	8
Esotropia	56
Infantile esotropia	33
Accommodative or partially accommodative esotropia	14
Microtropia (Microesotropia)	3
Unclassified strabismus	6

DOI: 10.7717/peerj.3935/table-1

SNP selection and typing

Tag SNPs in the two chromosomal regions were first picked from the JSNP database for Japanese (Hirakawa et al., 2002). Finally, 24 rsSNPs were chosen at the 4q28.3 region and 233 rsSNPs were chosen at the 7q31.2 region from the dbSNP database of the US National Center for Biotechnology Information (NCBI). Genomic DNA that was isolated from peripheral blood leukocytes was amplified by multiplex polymerase chain reaction (PCR).

The MassARRAY system is a high-throughput matrix-assisted laser desorption ionization and time-of-flight mass spectrometry (MALDI-TOF MS) for detection of nucleic acids. After PCR-based multiplex reaction and clean-up to remove unincorporated dNTPs, the third primers were introduced into the reactions which correspond to DNA template immediately at front of polymorphic sites. Single nucleotide base extension reaction was performed with mass-modified nucleotides. Then SNPs were identified on the platform of MassARRAY Analyzer 4 (96 well) iPlex SNP Genotyping (Sequenom, San Diego, CA, USA). Overall call rates were 87%. We then proceeded to quality controls of SNPs and samples.

Hardy-Weinberg equilibrium and principal component analysis

We first performed Hardy-Weinberg equilibrium (HWE) testing for data quality control. Principal component analysis (PCA) was performed by the genome-wide complex trait analysis (GCTA) program (Yang et al., 2011) to calculate eigenvectors which were then put in the model as covariates to identify if there be a population substructure among families.

Family-based association study: TDT and TDTae

Transmission disequilibrium test (TDT) is a test for association in the presence of linkage for a case-parent trio. Thus, two parents had to be present in a pedigree with one affected subject. As it is customary to only show paternal (or maternal) in a pedigree drawing, we assigned all information unknown except for their sex in genetic analysis, but we did not expect parent-specific effects so that we treated the maternal and paternal genotypes symmetrically. Both parents with homozygous condition were not informative, as there was no genetic variation at the locus in the progeny. Only subjects with at least one heterozygous parent were informative. This situation led to reduction of effective sample size.

Plink program version 1.9 (Purcell et al., 2007; Purcell & Chang, 2015) was run to detect genotypes which violated the Mendelian rules in TDT analysis. The Plink program can handle errors by rendering the offending genotypes “unknown”. Therefore, it was run on the original data without deletion of any SNPs or families which went through the initial cleaning. TDT is a form of linkage analysis which is only powerful in the presence of genetic association (Ott, 1989). The errors were handled by Plink to eliminate the offending data in TDT analysis as mentioned above.

In contrast, transmission disequilibrium test allowing for errors (TDTae) is an implementation of TDT which allows errors to be present in estimating their rates in the course of analysis by TDTae program (Gordon et al., 2004). In the process of running the TDTae program, errors were estimated in the background of any one of a number of error models. We considered the two most reasonable and economical error models, which require few parameters: DSB (Douglas-Skol-Boehnke) allows for genotype errors (Douglas, Skol & Boehnke, 2002) and GHLO (Gordon-Heath-Liu-Ott) allows for allele errors (Gordon et al., 2001; Yang et al., 2008). DSB is analogous to error models proposed many years ago and has only been implemented in the last decade. Under the two error models, we run TDTae program for dominant (d), recessive (r) and multiplicative (m) inheritance.

Furthermore, we defined linkage disequilibrium (LD) blocks, using Haploview 4.2 (Gabriel et al., 2002), on chromosome 4q28.3 and 7q31.2. Haplotype analysis was performed based on haplotype blocks to figure out if there were haplotypes that would be associated with the strabismus phenotype.

Linkage analysis in large pedigrees

Linkage analysis estimates recombination fractions between a putative disease locus and marker loci (Lathrop et al., 1984), and the results were output as LOD (logarithm of odds) scores (Ott, 1999; Terwilliger & Ott, 1994; Terwilliger & Ott, 2016). The Pseudomarker program (Gertz et al., 2014; Goring & Terwilliger, 2000; Hiekkalinna et al., 2011) estimates allele frequencies by maximum likelihood, separately under linkage and no linkage, which makes the results virtually independent of allele frequencies. Since the Pseudomarker program requires error-free data, we had to remove SNPs as necessary as possible to obtain a pure and error-free dataset (Lathrop et al., 1984). In addition, this program can also take linkage disequilibrium (LD) between a SNP and the disease into account, thus resulting in gain of additional power. Linkage analysis generally requires absence of errors. Mendelian inconsistencies would be allowed in linkage analysis with a suitable choice of penetrance for SNPs, but the procedure is cumbersome and rarely done.

Results

Hardy-Weinberg equilibrium and principal component analysis

After quality control of SNPs and individuals, 19 SNPs with monomorphic or undetected types or at a low call rate were excluded, and one SNP was merged to another in the database of NCBI. Finally, all individuals and 237 SNPs remained. The flow chart of analytical process is shown in Fig. 1.

Figure 1: Flow chart of analytical process.
TDT, transmission disequilibrium test; TDTae, TDT allowing for errors.
Download full-size image
DOI: 10.7717/peerj.3935/fig-1

Twenty one out of 23 SNPs on chromosome 4q28.3, and 208 out of 214 SNPs on chromosome 7q31.2 were in Hardy-Weinberg equilibrium (P > 0.05, chi-square test). The results of PCA analysis showed no population substructure among families or between affected and unaffected individuals, and also detected no outliers (Fig. 2).

Figure 2: Scatter plot of eigenvectors in principal component analysis (PCA).
The horizontal axis is first principal component and the vertical axis is the second principal component. Each circle represents a subject. Open circles represent unaffected subjects and closed circles represent affected subjects. The same color indicates subjects in the same family. Neither the subjects in the same family nor the affected or unaffected subjects as a whole show clusters. There are no outliers.
Download full-size image
DOI: 10.7717/peerj.3935/fig-2

TDT and TDTae

A total of 261 Mendelian errors were detected. Errors could be due to faulty genotyping, clerical mistakes, pedigree mismatch (e.g., adopted child), or a new mutation which the program treated as an inherited variant. The dataset with Mendelian errors was applied to TDT directly. Mendelian errors were reduced when some families with large error rates were deleted. After five SNPs (rs4148690, rs3757807, rs3807975, rs3779551, and rs213987, all on chromosome 7) and four families (37, 8, 10, and 43) with the highest error rates were deleted, 186 individuals in 54 families and 232 SNPs with 71 errors remained, which were used as the analysis set for TDTae.

The best results of TDT (P < 0.08) are shown in Table 2. And the TDTae results with significant P values (P < 0.05) are shown in Table 3. Figures 3 and 4 show the results of TDT and TDTae, with haplotype analysis and LD blocks adjusted for families on chromosome 4q28.3 and on chromosome 7q31.2, respectively. In the 4q28.3 locus, all significant SNPs were located in the intron of MGST2. In the 7q31.2 locus, several SNPs with significant P values (P < 0.05) were distributed dispersedly in TES, ST7, WNT2, CAV1, and CFTR. Haplotype analysis in the 4q28.3 locus did not get a positive result. In the 7q31.2 locus, haplotypes with significant P values are located in TES, CAV1, or WNT2.

Table 2:

Results of transmission disequilibrium test (TDT).

Chromosome	dbSNP	Chromosomal location	Gene	Minor allele	Major allele	Transmitted alleles (no.)	Untransmitted alleles (no.)	Odds ratio	χ2	P value
7	rs3807961	115855791	TES intron	T	C	19	8	2.375	4.481	0.03426
7	rs1000966	115862902	TES intron	A	G	6	19	0.3158	6.76	0.009322
7	rs3807967	115863441	TES intron	G	A	16	4	4	7.2	0.00729
7	rs3807977	115881740	TES intron	A	T	14	6	2.333	3.2	0.07364
7	rs3757730	115882323	TES intron	G	A	11	21	0.5238	3.125	0.0771
7	rs3807979	115893362	TES intron	G	A	19	8	2.375	4.481	0.03426
7	rs3030648	115897792	TES3′-UTR	T	A	10	27	0.3704	7.811	0.005193
7	rs2074025	116550456	CAPZA2 intron	C	T	6	1	6	3.571	0.05878
7	rs3808193	116868696	ST7 intron	A	G	4	0	Infinity	4	0.0455
7	rs2024233	116917427	WNT23′-UTR	G	A	10	24	0.4167	5.765	0.01635
7	rs733154	116918911	WNT2 intron	G	A	5	13	0.3846	3.556	0.05935
7	rs2896218	116919978	WNT2 intron	G	A	7	17	0.4118	4.167	0.04123
7	rs3824028	116933900	WNT2 intron	G	A	0	3	0	3	0.08326
7	rs1049337	116200587	CAV13′-UTR	T	C	4	0	Infinity	4	0.0455
7	rs2299442	117131336	CFTR intron	A	T	0	4	0	4	0.0455
4	rs1000222	139666831	MGST2 intron	C	T	8	18	0.4444	3.846	0.04986
4	rs3836606	139701270	MGST2 intron	DEL	G	2	12	0.1667	7.143	0.007526

DOI: 10.7717/peerj.3935/table-2

Notes:

P values are nominal and not corrected for testing multiple SNPs.

Table 3:

Results of transmission disequilibrium test allowing for errors (TDTae).

Chromosome	dbSNP	Chromosomal location	Gene	^aR2						^cMinimum of corrected P value
				Dominant model		Recessive model		Multiplicative model
				^bDSB	GHLO	DSB	GHLO	DSB	GHLO
7	rs3807959	115853100	TES intron			4.4081	4.4081	12.5929	12.5929	0.008324
7	rs3807967	115863441	TES intron					0.1522	0.1522	0.0416597
7	rs3807986	116177825	CAV1 intron				2.0739			0.0471031
7	rs3801993	116190382	CAV1 intron			8.2562	8.2562			0.0378839
7	rs3801994	116190469	CAV1 intron			8.3368	8.3368			0.0366007
7	rs41735	116435416	MET intron			0.13	0.13	0.1164	0.1164	0.019151
7	rs2023748	116436022	MET intron			0.1482	0.1482	0.1228	0.1228	0.034239
7	rs41737	116436097	MET			0.2547	0.2547			0.0453541
7	rs2301649	116538634	CAPZA2 intron			0.3377				0.033912
7	rs2074025	116550456	CAPZA2 intron					0.0169	0.0169	0.016231
7	rs38861	116816284	ST7 intron	3.3527	3.3527			6.3843	6.3843	0.018479
7	rs3735646	116915684		3.3	3.3					0.0439441
7	rs2024233	116917427	WNT2 3′-UTR	6.8558	6.8558			6.8744	6.8744	0.009829
7	rs3779547	116930962	WNT2 intron			0.1892	0.1892			0.0491057
7	rs3779546	116934200	WNT2 intron			0.1779	0.1779			0.031462
7	rs2285544	116944283	WNT2 intron	3.698	3.698					0.023326
7	rs4148721	117267954	CFTR intron					19.973	19.973	0.0465618
4	rs3836607	140579303		0.1473	0.1473			0.0476	0.0476	0.019239

DOI: 10.7717/peerj.3935/table-3

Notes:

aR1 = R2 in dominant mode of inheritance; R1 = 1 in recessive mode of inheritance; R1² = R2 in multiplicative mode of inheritance.

bDouglas Skol Boehnke (DSB) error model; Gordon Heath Liu Ott (GHLO) error model.

cThe minimum P value (corrected for multiple testing) of dominant, multiplicative, and recessive mode of inheritance.

The corrected P value is given by 1 –(1 –p)^k−1 (Gordon et al., 2004).

R1 = Pr(aff | + d)/Pr(aff | + +) and R2 = Pr(aff |dd)/Pr(aff | + +) are genotypic relative risks for a di-allelic trait locus with low-risk (wild-type) allele + and high-risk (disease) allele d. If both R1 and R2 are less than 1, the genotypic relative risk value of the other allele would be calculated by R1′ = R1/R2 and R2′ = 1/R2. A few strange results are omitted from this table (e.g., R > 10,000, or R = 0).

Figure 3: TDT (transmission disequilibrium test) and TDTae (TDT allowing for errors) plots, and linkage disequilibrium (LD) blocks on the 4q28.3 locus.
(A) Physical position of SNPs in the genome (4:133152903-139701270, release 108), and annotation track is displayed by Integrative Genomics Viewer (IGV) 2.3. (B) The χ² and −log10(P) value in TDT, and (C) −log10(P) value in TDTae. The regions with significant markers (P < 0.05) are highlighted. (D) LD plot, showing LD patterns among the SNPs in TDT analysis. The LD between the SNPs is measured as D′ value and shown (×100) in the diamond at the intersection of the diagonals from each SNP. D′ < 1 and LOD < 2 is shown as white, D′ = 1 and LOD < 2 is shown as blue, D′ < 1 and LOD ≥ 2 is shown as shades of pink or red, and D′ = 1 and LOD ≥ 2 is shown as bright red. Haplotype blocks in high LD are outlined in bold black line.
Download full-size image
DOI: 10.7717/peerj.3935/fig-3

Figure 4: TDT (transmission disequilibrium test) and TDTae (TDT allowing for errors) plots, and linkage disequilibrium (LD) blocks on the 7q31.2 locus.
(A) Physical position of SNPs in the genome (7:116210883-117667022, release 108) and annotation track is displayed by Integrative Genomics Viewer (IGV) 2.3. (B) The χ² and −log10(P) value in TDT, and (C) −log10(P) value in TDTae. The regions with significant markers (P < 0.05) are highlighted. (D) The haplotype block structure from case-parent trios. There are nine haplotype blocks at the locus. The SNP numbers on the top of haplotypes correspond to those in the diagram of ped files. The haplotype frequencies are shown on the right of each haplotype. (E) LD plot, showing LD patterns among the SNPs in TDT analysis. The LD between the SNPs is measured as D′ value and shown (×100) in the diamond at the intersection of the diagonals from each SNP. D′ < 1 and LOD < 2 is shown as white, D′ = 1 and LOD < 2 is shown as blue, D′ < 1 and LOD ≥ 2 is shown as shades of pink or red, and D′ = 1 and LOD ≥ 2 is shown as bright red. Haplotype blocks in high LD are outlined in bold black line.
Download full-size image
DOI: 10.7717/peerj.3935/fig-4

Linkage analysis

Four families and 46 SNPs with Mendelian errors were deleted. Furthermore, linkage analysis requires valid data of complete family trios, so the program extracted only 90 individuals in 21 families which remained in the final analysis set. The dominant mode was less plausible because few parents in the present families were affected in themselves, or rather, the trait under consideration was frequently consistent with recessive inheritance. Table 4 shows all significant SNPs under dominant inheritance and recessive inheritance. The linkage analysis employed the error-free dataset which were only available by reducing the number of families, individuals, and SNPs. Based on this methodological limitation of linkage analysis, the LOD scores in the present analysis with the small number of families were as small as just over 1.

Table 4:

Results of linkage analysis.

Chromosome	dbSNP	Chromosomal location	Gene	LOD score	Linkage P value	Model
7	rs3757729	115861846	TES intron	1.263692	0.007935	Recessive
7	rs1004109	115862296	TES intron	1.051778	0.013885	Recessive
7	rs3807967	115863441	TES intron	1.051121	0.013909	Dominant
7	rs3757730	115882323	TES intron	1.283524	0.007534	Recessive
7	rs3823977	115893407	TES intron	1.051778	0.013885	Recessive
7	rs3807983	115898991	TES3′-near	1.051777	0.013885	Recessive
7	rs3840660	116917245	WNT23′-UTR	1.43437	0.005093	Recessive
7	rs3779550	116927360	WNT2 intron	1.064359	0.013427	Recessive
7	rs213976	117238379	CFTR intron	1.129688	0.011289	Recessive
7	rs213977	117238445	CFTR intron	1.720078	0.00245	Recessive
7	rs4148714	117238453	CFTR intron	1.725551	0.002416	Recessive
7	rs2246450	117240668	CFTR intron	1.505123	0.004244	Recessive
7	rs2299445	117246315	CFTR intron	1.424525	0.005224	Recessive
7	rs2237726	117256374	CFTR intron	1.725551	0.002416	Recessive
7	rs2254742	117264126	CFTR intron	1.721261	0.002442	Recessive
7	rs214167	117286524	CFTR intron	1.204099	0.009277	Recessive
7	rs4148724	117305151	CFTR intron	1.806151	0.001969	Recessive

DOI: 10.7717/peerj.3935/table-4

Notes:

LOD: logarithm of odds

P values are nominal and not corrected for testing multiple SNPs.

Expression quantitative trait locus (eQTL)

The significant SNPs in the 4q28.3 locus were related to MGST2 transcription in the search for expression quantitative trait locus (eQTL) in the Human Genetic Variation Database (HGVD) which displays the Japanese genetic variations and the association between the variations and transcription levels of genes (Higasa et al., 2016).

Discussion

In our preceding study, we tried to use a method that did not depend on kinship, such as association study adjusted by family, in order to examine the relationship between several polymorphic markers and the strabismus phenotype in the chromosomal regions (Matsuo, 2015). This strategy was based on the fact that some of the family trios were not complete and that there were many Mendelian errors which might be attributed to adoption. The preceding results showed that significant SNPs were in MGST2 and WNT2 on chromosomal 4q28.3 locus and 7q31.2 locus, respectively. However, the false discovery rate (FDR) was too high to reduce the power in conducting multiple comparisons among SNPs (Benjamini & Hochberg, 1995), and therefore, we turned in the present study to focus on methods for linkage analysis.

When a few families or SNPs seem to contribute to the majority of errors, it is best to delete these families or SNPs firstly and then to carry out TDTae. In contrast, TDT by Plink would handle errors by ignoring the offending genotypes. The TDTae program generally furnishes much smaller P values than the TDT since the error model is in a parametric manner. There is some agreement between the outputs from Plink and TDTae, although not very strong. In the present analyses, the two error models in the TDTae furnished similar results, suggesting that the results would not be unduly dependent on the assumptions regarding errors.

The P values shown in the linkage studies were nominal and not corrected for the testing of multiple SNPs. A correction for multiple comparisons was somewhat difficult to make since these SNPs are presumably highly correlated with each other. The Pseudomarker program requires strict error-free data with no Mendelian errors. Therefore, some potential candidates of SNPs might be deleted in the process of preparing error-free data which were based merely on a single faulty genotyping. Furthermore, it is worth noting that genotyping errors would cause inflation in the recombination fraction between the disease and marker loci, leading to the consequence that recombination fractions may appear larger than they truly are Lincoln & Lander (1992).

In the present study, we clearly demonstrated that MGST2 is a candidate for the chromosomal 4q28.3 locus. As for the 7q31.2 locus, in contrast, the results of different kinds of statistical analyses could not narrow the locus to a single gene. Under the circumstances, the distribution of significant SNPs in the locus showed that only the ST7 to WNT2 region contained significant SNPs for all three methods of linkage analysis (Fig. 5). In the 7q31.2 locus, ST7 is indeed in the same big haplotype block with WNT2.

Figure 5: Summary of SNP −log10(P) values for each method of linkage analysis on chromosome 7q31.2.
(A) Physical position of SNPs in the genome. (B) The plots represent −log10(P) values of sigificant P values obtained from different analyses.
Download full-size image
DOI: 10.7717/peerj.3935/fig-5

Primary and non-syndromic comitant strabismus contains several different clinical entities: esotropia includes infantile esotropia, accommodative esotropia, partially accommodative esotropia, late-onset (acute-onset) esotropia, and microtropia (microesotropia) while exotropia includes intermittent exotropia, constant exotropia and congenital (infantile) exotropia (Matsuo et al., 2003; Matsuo et al., 2005; Matsuo & Matsuo, 2005; Matsuo & Matsuo, 2007; Matsuo et al., 2007). Furthermore, patients with the same clinical entity or clinical diagnosis show varying degrees of manifestations, not only in horizontal and vertical deviations but also in the state of binocular vision. Under the circumstances, one way to define comitant strabismus is as a disease with abnormal binocular vision, namely abnormalities in simultaneous perception, fusion and stereopsis.

In our ongoing research, different clinical entities of primary and non-syndromic comitant strabismus were analyzed altogether in chromosomal mapping and SNP typing (Fujiwara et al., 2003; Shaaban et al., 2009a; Shaaban et al., 2009b; Matsuo, 2015). In other words, the presence or the absence of a phenotype “strabismus” was used as a single phenotypic descriptor in the genetic statistical analysis. This approach was justified by the fact that in our previous study the same chromosomal susceptibility loci were replicated in stratified groups of the families either with esotropia or with exotropia (Shaaban et al., 2009a).

In the Japanese population, exotropia is more prevalent than esotropia (Matsuo & Matsuo, 2005; Matsuo & Matsuo, 2007; Matsuo et al., 2009b; Matsuo et al., 2010), in contrast with the Caucasian population which shows higher prevalence of esotropia. A common genetic mechanism is assumed to give rise to exotropia and esotropia since both entities of comitant strabismus share abnormal binocular vision as a phenotype. In addition, there are indeed families which show mixed phenotypes of exotropia and esotropia, as observed in this study: one member shows exotropia and another member shows esotropia in a family. Abnormal activities of unknown genes in the central nervous system might be responsible for the abnormal binocular vision in patients with comitant strabismus.

The large number of Mendelian errors seems to be the limitation of this study. The original data sets with Mendelian errors were used in TDT analyses (Spielman & Ewens, 1996; Gordon et al., 2004). In contrast, the error-free data sets were used in TDTae and linkage analyses under dominant and recessive inheritance (Lathrop et al., 1984). The common use of the original data sets should have underlain the more consistent results whereas sharing of the same data sets would not necessarily mean that the data applied actually in analysis are the same. Or rather, the difference is merely the methods to handle the errors prior to software application or in the process by ignoring a single cell or deleting the whole series. In the present study, permutation tests were done to check the robustness of the results. Therefore, the results should be affected mainly by the methods, and would not be decided by the number of Mendelian errors.

Both MGST2 and WNT2 are known to be expressed in the brain (Jakobsson, Mancini & Ford-Hutchinson, 1996; Cadigan & Nusse, 1997) and likely to be involved in the development of comitant strabismus. Different analytical methods shed light on the data from different angles, therefore it is useful to apply more than one type of analysis. Strict Mendelian application as in this study, might not be appropriate in multifactorial disorders such as comitant strabismus, but would certainly provide a step to get guidance for detecting genetic risks of the disease. Since the proof of a responsible gene in a multifactorial disorder is difficult to be obtained in animal experiments, a different approach in patients, such as whole exome sequencing, would provide support for the present results of SNP typing. Further functional studies are necessary to clarify the mechanisms of the two genes on the susceptibility of comitant strabismus. Finally, it should be noted that there is a limitation in applying the eQTL to the present study since the eQTL data have been obtained in analyses of blood cells (Higasa et al., 2016).

Conclusions

This study with different analytical methods for genetic statistics provides evidence that MGST2 and WNT2 are potential candidate genes for comitant strabismus in Japanese population.

Supplemental Information

Raw data

The terms are sample ID, groups and varient call information and pedigree information.

DOI: 10.7717/peerj.3935/supp-1

Download

[1] Abrahamsson M, Magnusson G, Sjostrand J. 1999. Inheritance of strabismus and the gain of using heredity to determine populations at risk of developing strabismus. Acta Ophthalmologica Scandinavica 77:653-657

[2] Benjamini Y, Hochberg Y. 1995. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of Royal Statistical Society Series B (Methodological) 57:289-300

[3] Bosten JM, Hogg RE, Bargary G, Goodbourn PT, Lawrance-Owen AJ, Mollon JD. 2014. Suggestive association with ocular phoria at chromosome 6p22. Investigative Ophthalmology and Visual Science 55:345-352

[4] Cadigan KM, Nusse R. 1997. Wnt signaling: a common theme in animal development. Genes and Development 11:3286-3305

[5] Cotter SA, Varma R, Tarczy-Hornoch K, McKean-Cowdin R, Lin J, Wen G, Wei J, Borchert M, Azen SP, Torres M, Tielsch JM, Friedman DS, Repka MX, Katz J, Ibironke J, Giordano L. 2011. Risk factors associated with childhood strabismus: the multi-ethnic pediatric eye disease and Baltimore pediatric eye disease studies. Ophthalmology 118:2251-2261

[6] Douglas JA, Skol AD, Boehnke M. 2002. Probability of detection of genotyping errors and mutations as inheritance inconsistencies in nuclear-family data. American Journal of Human Genetics 70:487-495

[7] Engle EC. 2007. Genetic basis of congenital strabismus. Archives of Ophthalmology 125:189-195

[8] Fujiwara H, Matsuo T, Sato M, Yamane T, Kitada M, Hasebe S, Ohtsuki H. 2003. Genome-wide search for strabismus susceptibility loci. Acta Medica Okayama 57:109-116

[9] Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, Blumenstiel B, Higgins J, DeFelice M, Lochner A, Faggart M, Liu-Cordero SN, Rotimi C, Adeyemo A, Cooper R, Ward R, Lander ES, Daly MJ, Altshuler D. 2002. The structure of haplotype blocks in the human genome. Science 296:2225-2229

[10] Gertz EM, Hiekkalinna T, Digabel SL, Audet C, Terwilliger JD, Schaffer AA. 2014. PSEUDOMARKER 2.0: efficient computation of likelihoods using NOMAD. BMC Bioinformatics 15:47

[11] Gordon D, Haynes C, Johnnidis C, Patel SB, Bowcock AM, Ott J. 2004. A transmission disequilibrium test for general pedigrees that is robust to the presence of random genotyping errors and any number of untyped parents. European Journal of Human Genetics 12:752-761

[12] Gordon D, Heath SC, Liu X, Ott J. 2001. A transmission/disequilibrium test that allows for genotyping errors in the analysis of single-nucleotide polymorphism data. American Journal of Human Genetics 69:371-380

[13] Goring HH, Terwilliger JD. 2000. Linkage analysis in the presence of errors IV: joint pseudomarker analysis of linkage and/or linkage disequilibrium on a mixture of pedigrees and singletons when the mode of inheritance cannot be accurately specified. American Journal of Human Genetics 66:1310-1327

[14] Graeber CP, Hunter DG, Engle EC. 2013. The genetic basis of incomitant strabismus: consolidation of the current knowledge of the genetic foundations of disease. Seminars in Ophthalmology 28:427-437

[15] Hiekkalinna T, Schaffer AA, Lambert B, Norrgrann P, Goring HH, Terwilliger JD. 2011. PSEUDOMARKER: a powerful program for joint linkage and/or linkage disequilibrium analysis on mixtures of singletons and related individuals. Human Heredity 71:256-266

[16] Higasa K, Miyake N, Yoshimura J, Okamura K, Niihori T, Saitsu H, Doi K, Shimizu M, Nakabayashi K, Aoki Y, Tsurusaki Y, Morishita S, Kawaguchi T, Migita O, Nakayama K, Nakashima M, Mitsui J, Narahara M, Hayashi K, Funayama R, Yamaguchi D, Ishiura H, Ko WY, Hata K, Nagashima T, Yamada R, Matsubara Y, Umezawa A, Tsuji S, Matsumoto N, Matsuda F. 2016. Human genetic variation database, a reference database of genetic variations in the Japanese population. Journal of Human Genetics 61:547-553

[17] Hirakawa M, Tanaka T, Hashimoto Y, Kuroda M, Takagi T, Nakamura Y. 2002. JSNP: a database of common gene variations in the Japanese population. Nucleic Acids Research 30:158-162

[18] Imai S, Matsuo T, Itoshima E, Ohtsuki H. 2008. Clinical features, ARIX and PHOX2B nucleotide changes in three families with congenital superior oblique muscle palsy. Acta Medica Okayama 62:45-53

[19] Jacobson LK, Dutton GN. 2000. Periventricular leukomalacia: an important cause of visual and ocular motility dysfunction in children. Survey of Ophthalmology 45:1-13

[20] Jakobsson PJ, Mancini JA, Ford-Hutchinson AW. 1996. Identification and characterization of a novel human microsomal glutathione S-transferase with leukotriene C4 synthase activity and significant sequence identity to 5-lipoxygenase-activating protein and leukotriene C4 synthase. Journal of Biological Chemistry 271:22203-22210

[21] Jiang Y, Matsuo T, Fujiwara H, Hasebe S, Ohtsuki H, Yasuda T. 2004. ARIX gene polymorphisms in patients with congenital superior oblique muscle palsy. British Journal of Ophthalmology 88:263-267

[22] Jiang Y, Matsuo T, Fujiwara H, Hasebe S, Ohtsuki H, Yasuda T. 2005. ARIX and PHOX2B polymorphisms in patients with congenital superior oblique muscle palsy. Acta Medica Okayama 59:55-62

[23] Khan AO, Shinwari J, Abu Dhaim N, Khalil D, Al Sharif L, Al Tassan N. 2011. Potential linkage of different phenotypic forms of childhood strabismus to a recessive susceptibility locus (16p13.12-p12.3) Molecular Vision 17:971-976

[24] Lathrop GM, Lalouel JM, Julier C, Ott J. 1984. Strategies for multilocus linkage analysis in humans. Proceedings of the National Academy of Sciences of the United States of America 81:3443-3446

[25] Lincoln SE, Lander ES. 1992. Systematic detection of errors in genetic linkage data. Genomics 14:604-610

[26] Maconachie GD, Gottlob I, McLean RJ. 2013. Risk factors and genetics in common comitant strabismus: a systematic review of the literature. JAMA Ophthalmology 131:1179-1186

[27] Matsuo T. 2015. Candidate genes for strabismus susceptibility chromosomal loci. In: Ozkan SB, ed. Advances in strabismus. Proceedings of the XIIth meeting of the international strabismological association in Kyoto, JAPAN, December 1–4, 2014. Ankara: Rotatip Publisher. 225-229

[28] Matsuo T, Hayashi M, Fujiwara H, Yamane T, Ohtsuki H. 2002. Concordance of strabismic phenotypes in monozygotic versus multizygotic twins and other multiple births. Japanese Journal of Ophthalmology 46:59-64

[29] Matsuo T, Kawaishi Y, Kuroda R, Ohtsuki H, Watanabe Y. 2003. Long-term visual outcome in primary microtropia. Japanese Journal of Ophthalmology 47:507-511

[30] Matsuo T, Matsuo C. 2005. The prevalence of strabismus and amblyopia in Japanese elementary school children. Ophthalmic Epidemiology 12:31-36

[31] Matsuo T, Matsuo C. 2007. Comparison of prevalence rates of strabismus and amblyopia in Japanese elementary school children between the years 2003 and 2005. Acta Medica Okayama 61:329-334

[32] Matsuo T, Matsuo C, Kio K, Ichiba N, Matsuoka H. 2009b. Is refraction with a hand-held autorefractometer useful in addition to visual acuity testing and questionnaires in preschool vision screening at 3.5 years in Japan? Acta Medica Okayama 63:195-202

[33] Matsuo T, Matsuo C, Matsuoka H, Kio K. 2007. The detection of strabismus and amblyopia at 1.5- and 3-year-old children by preschool vision-screening program in Japan. Acta Medica Okayama 61:9-16

[34] Matsuo T, Ohtsuki H, Sogabe Y, Konishi H, Takenawa K, Watanabe Y. 1988. Vertical abnormal retinal correspondence in three patients with congenital absence of the superior oblique muscle. American Journal of Ophthalmology 106:341-345

[35] Matsuo T, Watanabe T, Furuse T, Hasebe S, Ohtsuki H. 2009a. Case report and literature review of inferior rectus muscle aplasia in 16 Japanese patients. Strabismus 17:66-74

[36] Matsuo T, Yabuki A, Hasebe K, Shira YH, Imai S, Ohtsuki H. 2010. Postural stability changes during the prism adaptation test in patients with intermittent and constant exotropia. Investigative Ophthalmology and Visual Science 51:6341-6347

[37] Matsuo T, Yamane T, Fujiwara H, Ohtsuki H, Watanabe Y. 2005. Predictive factors for long-term outcome of stereoacuity in Japanese patients with pure accommodative esotropia. Strabismus 13:79-84

[38] Matsuo T, Yamane T, Ohtsuki H. 2001. Heredity versus abnormalities in pregnancy and delivery as risk factors for different types of comitant strabismus. Journal of Pediatric Ophthalmology and Strabismus 38:78-82

[39] Michaelides M, Moore AT. 2004. The genetics of strabismus. Journal of Medical Genetics 41:641-646

[40] Ohkubo SI, Matsuo T, Hasebe K, Shira YH, Itoshima E, Ohtsuki H. 2012. Phenotype-phenotype and genotype-phenotype correlations in patients with idiopathic superior oblique muscle palsy. Journal of Human Genetics 57:122-129

[41] Okano M, Matsuo T, Konishi H, Hasebe S, Tadokoro Y, Ohtsuki H. 1990. Anomalous posterior insertion of medial rectus muscle simulating congenital oculomotor palsy. Japanese Journal of Ophthalmology 34:275-279

[42] Ott J. 1989. Statistical properties of the haplotype relative risk. Genetic Epidemiology 6:127-130

[43] Ott J. 1999. Analysis of human genetic linkage. Baltimore: The Johns Hopkins University Press.

[44] Parikh V, Shugart YY, Doheny KF, Zhang J, Li L, Williams J, Hayden D, Craig B, Capo H, Chamblee D, Chen C, Collins M, Dankner S, Fiergang D, Guyton D, Hunter D, Hutcheon M, Keys M, Morrison N, Munoz M, Parks M, Plotsky D, Protzko E, Repka MX, Sarubbi M, Schnall B, Siatkowski RM, Traboulsi E, Waeltermann J, Nathans J. 2003. A strabismus susceptibility locus on chromosome 7p. Proceedings of the National Academy of Sciences of the United States of America 100:12283-12288

[45] Podgor MJ, Remaley NA, Chew E. 1996. Associations between siblings for esotropia and exotropia. Archives of Ophthalmology 114:739-744

[46] Purcell S, Chang C. 2015. PLINK 1.9. https://www.cog-genomics.org/plink2 (accessed on 2015) software

[47] Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, Sham PC. 2007. PLINK: a tool set for whole-genome association and population-based linkage analyses. American Journal of Human Genetics 81:559-575

[48] Rice A, Nsengimana J, Simmons IG, Toomes C, Hoole J, Willoughby CE, Cassidy F, Williams GA, George ND, Sheridan E, Young TL, Hunter TI, Barrett BT, Elliott DB, Bishop DT, Inglehearn CF. 2009. Replication of the recessive STBMS1 locus but with dominant inheritance. Investigative Ophthalmology and Visual Science 50:3210-3217

[49] Sanfilippo PG, Hammond CJ, Staffieri SE, Kearns LS, Melissa Liew SH, Barbour JM, Hewitt AW, Ge D, Snieder H, Mackinnon JR, Brown SA, Lorenz B, Spector TD, Martin NG, Wilmer JB, Mackey DA. 2012. Heritability of strabismus: genetic influence is specific to eso-deviation and independent of refractive error. Twin Research and Human Genetics 15:624-630

[50] Shaaban S, Matsuo T, Fujiwara H, Itoshima E, Furuse T, Hasebe S, Zhang Q, Ott J, Ohtsuki H. 2009a. Chromosomes 4q28.3 and 7q31.2 as new susceptibility loci for comitant strabismus. Investigative Ophthalmology and Visual Science 50:654-661

[51] Shaaban S, Matsuo T, Strauch K, Ohtsuki H. 2009b. Investigation of parent-of-origin effect in comitant strabismus using MOD score analysis. Molecular Vision 15:1351-1358

[52] Spielman RS, Ewens WJ. 1996. The TDT and other family-based tests for linkage disequilibrium and association. American Journal of Human Genetics 59:983-989

[53] Taira Y, Matsuo T, Yamane T, Hasebe S, Ohtsuki H. 2003. Clinical features of comitant strabismus related to family history of strabismus or abnormalities in pregnancy and delivery. Japanese Journal of Ophthalmology 47:208-213

[54] Terwilliger J, Ott J. 1994. Handbook of human genetic linkage. Baltimore: Johns Hopkins University Press.

[55] Terwilliger DJ, Ott J. 2016. Handbook of human genetic linkage. (accessed on 2016) software

[56] Uchiyama E, Matsuo T, Imai S, Itoshima E. 2010. Paretic side/normal side ratios of cross-sectional areas of the superior oblique muscle vary largely in idiopathic superior oblique palsy. American Journal of Ophthalmology 149:508-512

[57] Yang J, Lee SH, Goddard ME, Visscher PM. 2011. GCTA: a tool for genome-wide complex trait analysis. American Journal of Human Genetics 88:76-82

[58] Yang Y, Wise CA, Gordon D, Finch SJ. 2008. A family-based likelihood ratio test for general pedigree structures that allows for genotyping error and missing data. Human Heredity 66:99-110

[59] Ye XC, Pegado V, Patel MS, Wasserman WW. 2014. Strabismus genetics across a spectrum of eye misalignment disorders. Clinical Genetics 86:103-111

Introduction

Materials and Methods

Subjects

SNP selection and typing

Hardy-Weinberg equilibrium and principal component analysis

Family-based association study: TDT and TDTae

Linkage analysis in large pedigrees

Results

Hardy-Weinberg equilibrium and principal component analysis

Figure 1: Flow chart of analytical process.

Figure 2: Scatter plot of eigenvectors in principal component analysis (PCA).

TDT and TDTae

Figure 3: TDT (transmission disequilibrium test) and TDTae (TDT allowing for errors) plots, and linkage disequilibrium (LD) blocks on the 4q28.3 locus.

Figure 4: TDT (transmission disequilibrium test) and TDTae (TDT allowing for errors) plots, and linkage disequilibrium (LD) blocks on the 7q31.2 locus.

Linkage analysis

Expression quantitative trait locus (eQTL)

Discussion

Figure 5: Summary of SNP −log10(P) values for each method of linkage analysis on chromosome 7q31.2.

Conclusions

Supplemental Information

Raw data