Haplotype-sharing analysis using Mantel statistics for combined genetic effects

Beckmann, Lars; Fischer, Christine; Obreiter, Markus; Rabes, Michael; Chang-Claude, Jenny

doi:10.1186/1471-2156-6-S1-S70

Volume 6 Supplement 1

Genetic Analysis Workshop 14: Microsatellite and single-nucleotide polymorphism

Proceedings
Open access
Published: 30 December 2005

Haplotype-sharing analysis using Mantel statistics for combined genetic effects

Lars Beckmann¹,
Christine Fischer²,
Markus Obreiter¹,
Michael Rabes¹ &
…
Jenny Chang-Claude¹

BMC Genetics volume 6, Article number: S70 (2005) Cite this article

3327 Accesses
12 Citations
Metrics details

Abstract

We applied a new approach based on Mantel statistics to analyze the Genetic Analysis Workshop 14 simulated data with prior knowledge of the answers. The method was developed in order to improve the power of a haplotype sharing analysis for gene mapping in complex disease. The new statistic correlates genetic similarity and phenotypic similarity across pairs of haplotypes from case-control studies. The genetic similarity is measured as the shared length between haplotype pairs around a genetic marker. The phenotypic similarity is measured as the mean corrected cross-product based on the respective phenotypes. Cases with phenotype P1 and unrelated controls were drawn from the population of Danacaa. Power to detect main effects was compared to the X²-test for association based on 3-marker haplotypes and a global permutation test for haplotype association to test for main effects. Power to detect gene × gene interaction was compared to unconditional logistic regression. The results suggest that the Mantel statistics might be more powerful than alternative tests.

Background

Recently we proposed a flexible approach to gene mapping of complex diseases, whereby we combine Mantel statistics for space-time clustering with genetic information obtained from haplotypes [1]. It has been shown that haplotype sharing methods are well suited for mapping such genes [2–5]. Mantel statistics were introduced in 1967 to correlate temporal and spatial distributions of cancer, notably childhood leukemia, in a generalized regression approach [6]. The Mantel statistic M is the sum of the cross product of the spatial similarity X_ijmultiplied by the temporal similarity Y_ijacross all pairs of cases i and j:

The idea behind this approach is that in the presence of space-time clustering the values of spatial similarity X_ijcorrespond to the values of temporal similarity Y_ijfor correlated cases i and j.

Methods

Mantel statistics using haplotypes

Here we apply the general approach of Mantel's statistics for space-time clustering (Equation 1) to correlate genetic and phenotypic similarity, and to test for gene × gene interaction. The first statistic has the form:

where x denotes a genetic marker, and i and j are haplotypes. L_ij(x) denotes the genetic similarity between the haplotypes i and j at x, and is defined as the number of intervals surrounding x that are flanked by markers with the same alleles, i.e., that are identical by state (IBS). The phenotypic similarity for two haplotype copies i and j derived from individuals s_iand s_jis defined as the mean corrected product Y_sisj= (y_si- μ)(y_si- μ), where y_siand y_sjare the phenotypes of s_iand s_j, and μ denotes the expectation of the phenotype. Here, we chose μ as the sample mean, i.e., μ = 0.5. Concordant pairs of affected and concordant pairs of unaffected individuals have the weights Y_sisj= 0.25, while discordant pairs have the weights Y_sisj= -0.25. Alternative measures of phenotypic similarity were discussed in the framework of sib-pair analysis, e.g., the Haseman-Elston method [7] and the weighted pair-wise correlation statistics [8], as well as in family-based association analysis [9]. The summation is over all pairwise comparisons of haplotypes for i ≠ j, where the haplotypes are derived from case-control studies.

The second statistic is constructed to test for the combined effect of two loci:

The information of the first locus x is incorporated as the shared length L_ij(x). At the second locus only genotype information is used. The variable z_siis coded in a dominant way, i.e., z_siis 1, if the individual s_icarries at least one mutant allele, and 0 otherwise. The measure of genotypic similarity Z_sisjis then 1, if z_si= z_sj, and 0 otherwise.

The summands of the Mantel statistic are highly correlated, and any statistical procedure to test for significance has to take into account the interrelationship of the data. Here, we use a Monte Carlo permutation approach to test for significance, as proposed by Mantel [6]. For M₀(x) the phenotype y_siis permuted over the individuals. The definition of Z is such that M₁(x) is the sum over all comparisons of haplotypes from individuals who have the same genotype coding z at the second locus. To derive the null hypothesis of no statistical interaction, the phenotype y_siand the genotype coding z_siat the second locus for individual s_iare permuted jointly over the individuals, and thus the comparisons of haplotypes derived from discordant individuals are incorporated under the null hypothesis.

Statistical tests for comparison

Main effects

We used two alternative tests for power comparison.

1. We applied the X²-test for association to 3-marker haplotypes. The region of interest was covered by overlapping sliding windows. The haplotypes consisted of 3 consecutive genetic markers. The test was based on a 2xk X²-table, with k- 1 degrees of freedom, where k denotes the number of haplotypes that occurred in either the case or the control sample. A p-value was assigned to the marker in the center of the window. Note that no tests were performed for the marginal markers.

2. The haplotype assignment software PHASE [10, 11] performs a global permutation test for significant differences in haplotype frequencies in case and control groups. PHASE tests the null hypothesis that the case and control haplotypes are a random sample from a single set of haplotype frequencies, versus the alternative that cases are more similar to other cases than to controls. Here, this test was based on 100 permutations due to computational burden.

Gene × gene interaction

We compared the test statistic M₁(x) using haplotypes to unconditional logistic regression based on the genotypes at 2 genetic markers [12]. The respective genotypes were coded for both the recessive and the dominant model.

Datasets and genetic data

The case-control study samples for two different samples sizes were drawn from the population Danacaa to limit the analysis to individuals defined by phenotype P1.

In this dataset, two major genes, D1 and D2, interacted in an epistatic model. Mode of inheritance is dominant for both D1 and D2.

Table 1 shows the samples that were used to test for main effects. Major gene D1 is located on chromosome 1. We chose flanking single-nucleotide polymorphisms (SNPs) of the disease locus between C01R0045 and C01R0055 from the initial set of markers (sample A), and additional SNPs and microsatellites from packages 28 and 29 (samples B to D). For major gene D2, which is located at the very end of chromosome 3, we analyzed 6 flanking SNPs C03R0276–0281 (samples E and F). To test for gene × gene interaction, information from both disease loci D1 and D2 were used to define the measures L and Z for M₁(x). For samples A-D, the markers in Table 1 were used to define the variable L at gene D1, and the SNPs C03R0276–C03R0281 at gene D2 to define the variable Z. For samples E and F, the markers in Table 1 were used to define the variable L at gene D2, and the SNPs C01R0050–C01R0053 at gene D1 to define the variable Z.

Table 1 Study samples used in the analysis

Full size table

Software

Haplotype pairs assigned to the unrelated individuals were estimated by the use of the PHASE program [10, 11]. PHASE lists the most likely pairs of haplotypes for each individual, together with their posterior probability. The most likely (best) estimate of haplotype pairs was chosen for our analysis. SAS 8.02 (SAS Institute Inc., Cary, NC, USA) was used to test for normality and for logistic regression. All other calculations were performed with software developed within our group. Software for the proposed Mantel statistics is available upon request.

Results

Main effects

Table 2 shows the results for the analysis of main effects of genetic markers close to D1 and D2. For D1, the Mantel statistic M₀(x) yielded point-wise significant results at the marker position C01R052 (p = 0.042), which is the marker closest to D1 for the small sample B. For the large sample D, which included additional SNPs, M₀(x) yielded the most significant result at SNP C01R0045 (p = 0.014).

Table 2 Results of the Mantel statistic (x) and the haplotype-based X²_hap – test for main effects

Full size table

M₀(x) did not yield significant results for the markers flanking D2 with small sample size. The most significant SNP in the large sample was C03R0280 (p = 0.002). The X²_hap-test for association, however, did not produce significant results with either the small or the large samples. The permutation test yielded one globally significant p-value of 0.03 in the large sample D.

Gene × gene interaction

Table 3 shows the results for M₁(x). The genetic similarity was defined by the same marker sets as in Table 1. M₁(x) yielded significant results for all samples except sample A. The most significant results were at the closest markers for D2 (samples E and F), but not for D1. Logistic regression did not reveal significant results for interaction between SNPs surrounding gene D1 and SNPs flanking D2 for the different samples (results not shown).

Table 3 Results of the Mantel statistic M₁(x) to test for gene × gene interaction

Full size table

Conclusion

We successfully employed a new approach to map disease predisposing genes in case-control studies based on Mantel statistics that correlate genetic and phenotypic similarity. Two types of gene effects involved in complex diseases were considered: main effects and joint effects.

1. The Mantel statistic M₀(x) identified the major gene D2 on chromosome 3 given adequate sample size, whereas the alternative methods failed. Major gene D1 on chromosome 1 was simulated without linkage disequilibrium (LD). LD is necessary for haplotype association methods, therefore M₀(x)-as expected-did not map D1 correctly.

We acknowledge that the comparison against the X² association test for 3 marker haplotypes is somewhat unfair, but we know of no other standard association test examining longer haplotypes that is not confronted with problems of huge degrees of freedom and sparse data. Additionally, other more sophisticated haplotype-based methods cannot yet be regarded as standard.

2. The Mantel statistic M₁(x) accounted for the joint effects of 2 putative disease loci. Taking the combined effects into account, the results were significant for the major genes D1 and D2 and showed lower p-values than the results obtained when considering main effects only.

These results show that main effects might not be detectable if gene × gene interaction is present and not considered in the analysis. Our proposed method M₁(x) revealed significant statistical interaction between the genes analyzed in contrast to the results obtained in the logistic regression model.

The proposed Mantel statistics employ haplotypes from case-control data and might not be robust to population stratification. In our analysis, we used samples drawn from the Danacaa population and affection status defined by phenotype P1 to reduce heterogeneity in the data. Population stratification is therefore not a major concern in this analysis. We did not adjust the p-values for multiple comparisons in this candidate analysis.

Multiple testing is a serious problem especially if all possible gene × gene interactions increase the multiplicity. We solved the problem in the mean time by implementing a step-down algorithm to take into account multiple testing [13, 14].

Comprehensive power comparisons are currently being carried out to reveal under which conditions our approach is more powerful than alternative methods.

Abbreviations

GAW14:: Genetic Analysis Workshop 14
IBS:: Identical by state
SNP:: Single-nucleotide polymorphism

References

Beckmann L, Thomas D, Fischer C, Chang-Claude J: Haplotype sharing analysis using Mantel statistics. Hum Hered. 2005, 59: 67-78. 10.1159/000085221.
Article CAS PubMed Google Scholar
Beckmann L, Fischer C, Deck KG, Nolte IM, te Meerman G, Chang-Claude J: Exploring haplotype sharing methods in general and isolated populations to detect gene(s) of a complex genetic trait. Genet Epidemiol. 2001, 21 (Suppl 1): S554-S559.
PubMed Google Scholar
Fischer C, Beckmann L, Majoram P, te Meerman G, Chang-Claude J: Haplotype sharing analysis with SNPs in candidate genes: the Genetic Analysis Workshop 12 example. Genet Epidemiol. 2003, 24: 68-73. 10.1002/gepi.10207.
Article PubMed Google Scholar
Qian D, Thomas DC: Genome scan of complex traits by haplotype sharing correlation. Genet Epidemiol. 2001, 21 (Suppl 1): S582-S587.
PubMed Google Scholar
Boon M, Nolte IM, Bruinenberg M, Spijker GT, Terpstra P, Raelson J, De Keyser J, Zwanikken CP, Hulsbeek M, Hofstra RM, Buys CH, te Meerman GJ: Mapping of a susceptibility gene for multiple sclerosis to the 51 kb interval between G511525 and D6S1666 using a new method of haplotype sharing analysis. Neurogenetics. 2001, 3: 221-230.
CAS PubMed Google Scholar
Mantel N: The detection of disease clustering and a generalized regression approach. Cancer Res. 1967, 27: 209-220.
CAS PubMed Google Scholar
Wang K, Huang J: A score-statistic approach for the mapping of quantitative-trait loci with sibships of arbitrary size. Am J Hum Genet. 2002, 70: 412-424. 10.1086/338659.
Article PubMed Central CAS PubMed Google Scholar
Commenges D, Beurton-Aimar M: Multipoint linkage analysis using the weighted-pairwise correlation statistic. Genet Epidemiol. 1999, 17 (Suppl 1): S515-S519.
Article PubMed Google Scholar
Lunetta KL, Faraone SV, Biederman J, Laird NM: Family-based tests of association and linkage that use unaffected sibs, covariates, and interactions. Am J Hum Genet. 2000, 66: 605-614. 10.1086/302782.
Article PubMed Central CAS PubMed Google Scholar
Stephens M, Smith NJ, Donnelly P: A new statistical method for haplotype reconstruction from population data. Am J Hum Genet. 2001, 68: 978-989. 10.1086/319501.
Article PubMed Central CAS PubMed Google Scholar
Stephens M, Donnelly P: A comparison of bayesian methods for haplotype reconstruction from population genotype data. Am J Hum Genet. 2003, 73: 1162-1169. 10.1086/379378.
Article PubMed Central CAS PubMed Google Scholar
Gauderman WJ: Sample size requirements for association studies of gene-gene interaction. Am J Epidemiol. 2002, 155: 478-484. 10.1093/aje/155.5.478.
Article PubMed Google Scholar
Ge YC, Dudoit S, Speed TP: Resampling-based multiple testing for microarray data analysis. Test. 2003, 12: 1-77. 10.1007/BF02595811.
Article Google Scholar
Obreiter M, Fischer C, Chang-Claude J, Beckman L: SDMinP: a program to control the family wise error rate using step-down minP adjusted P-values. Bioinformatics. 2005, 21: 3183-3184. 10.1093/bioinformatics/bti480.
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This work was supported by a Deutsche Forschungsgemeinschaft grant (CH117/3-1) (LB, MR, MO). We thank Kati Smit for technical assistance.

Author information

Authors and Affiliations

German Cancer Research Center DKFZ, Heidelberg, Germany
Lars Beckmann, Markus Obreiter, Michael Rabes & Jenny Chang-Claude
Institute of Human Genetics, University of Heidelberg, Heidelberg, Germany
Christine Fischer

Authors

Lars Beckmann
View author publications
You can also search for this author in PubMed Google Scholar
Christine Fischer
View author publications
You can also search for this author in PubMed Google Scholar
Markus Obreiter
View author publications
You can also search for this author in PubMed Google Scholar
Michael Rabes
View author publications
You can also search for this author in PubMed Google Scholar
Jenny Chang-Claude
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lars Beckmann.

Additional information

Authors' contributions

LB participated in planning, interpreting data, carrying out the statistical analysis, and drafting the manuscript. CF participated in planning, interpreting data, writing the manuscript. MO and MR participated in computation and statistical analysis. JC-C participated in planning, interpreting data, and writing the manuscript. All authors read and approved the final manuscript.

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Beckmann, L., Fischer, C., Obreiter, M. et al. Haplotype-sharing analysis using Mantel statistics for combined genetic effects. BMC Genet 6 (Suppl 1), S70 (2005). https://doi.org/10.1186/1471-2156-6-S1-S70

Download citation

Published: 30 December 2005
DOI: https://doi.org/10.1186/1471-2156-6-S1-S70

Genetic Analysis Workshop 14: Microsatellite and single-nucleotide polymorphism

Haplotype-sharing analysis using Mantel statistics for combined genetic effects

Abstract

Background

Methods

Mantel statistics using haplotypes

Statistical tests for comparison

Main effects

Gene × gene interaction

Datasets and genetic data

Software

Results

Main effects

Gene × gene interaction

Conclusion

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Authors' contributions

Rights and permissions

About this article

Cite this article

Keywords

BMC Genomic Data

Contact us

Genetic Analysis Workshop 14: Microsatellite and single-nucleotide polymorphism

Haplotype-sharing analysis using Mantel statistics for combined genetic effects

Abstract

Background

Methods

Mantel statistics using haplotypes

Statistical tests for comparison

Main effects

Gene × gene interaction

Datasets and genetic data

Software

Results

Main effects

Gene × gene interaction

Conclusion

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Authors' contributions

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Genomic Data

Contact us