Association Analysis of 6,736 U.K. Subjects Provides Replication and C(期刊论文)

Association Analysis of 6,736 U.K. Subjects Provides Replication and C

http://www.100md.com 《糖尿病学杂志》

     1 Diabetes Research Laboratories, Oxford Centre for Diabetes, Endocrinology and Metabolism, Churchill Hospital, Oxford, U.K

    2 Wellcome Trust Centre for Human Genetics, Headington, Oxford, U.K

    3 Department of Diabetes Research and Vascular Medicine, Peninsula Medical School, Exeter, U.K

    4 Department of Diabetes and Metabolic Medicine, Barts and the London, Queen Mary School of Medicine and Dentistry, University of London, London, U.K

    5 Department of Medicine, School of Medicine, Newcastle University, Newcastle upon Tyne, U.K

    SNP, single nucleotide polymorphism

    ABSTRACT

    Recent data suggest that common variation in the transcription factor 7-like 2 (TCF7L2) gene is associated with type 2 diabetes. Evaluation of such associations in independent samples provides necessary replication and a robust assessment of effect size. Using four TCF7L2 single nucleotide polymorphisms (SNPs; including the two most associated in the previous study), we conducted a case-control study in 2,158 type 2 diabetic subjects and 2,574 control subjects and a family-based association analysis in 388 parent-offspring trios all from the U.K. All SNPs showed powerful associations with diabetes in the case-control analysis, with strongest effects at rs7903146 (allele-wise relative risk 1.36 [95% CI 1.24–1.48], P = 1.3 x 10–11). Data were consistent with a multiplicative model. The family-based analyses provided independent evidence for association at all loci (e.g., rs4506565, 62% transmission, P = 7 x 10–5) with no parent-of-origin effects. The frequency of diabetes-associated TCF7L2 genotypes was greater in cases ascertained for positive family history and early onset (rs4606565, P = 0.02); the population-attributable risk, estimated from the least-selected cases, is 16%. The overall evidence for association for these variants (P = 4.4 x 10–14 combining case-control and family-based analyses for rs4506565) exceeds genome-wide significance criteria and clearly establishes TCF7L2 as a type 2 diabetes susceptibility gene of substantial importance.

    By common consent, the history of genetic association studies in type 2 diabetes has been a checkered one (1), and the number of variants so far shown to be reproducibly associated with type 2 diabetes is limited (2,3). The variants with the most convincing claims, such as the P12A variant in PPARG and the E23K variant in KCNJ11 (4,5), lie in genes whose products were already established as players in pathways implicated in glucose and lipid homeostasis. The modest effect sizes associated with these variants restrict their value for the assignment of risk in the clinical setting. The objective of genome-wide strategies to diabetes susceptibility variant identification, whether based around linkage or association, is to characterize novel etiological pathways and to pinpoint variants with more substantial impacts on diabetes risk.

    Recently, Grant et al. (6) reported that variation within the transcription factor 7-like 2 (TCF7L2) gene was strongly associated with type 2 diabetes in Icelandic subjects. These associations had been detected in the course of positional cloning efforts in a region of chromosome 10q previously linked to diabetes in Icelandic pedigrees (7) and were confirmed in case-control analyses of Danish and U.S. cohorts (6). Despite combined significance values for the associated variants well beyond those required for attesting genome-wide significance, the variants studied do not appear to account for the chromosome 10q linkage signal. Neither is it obvious that the single nucleotide polymorphisms (SNPs) examined are themselves etiological; in the Icelandic sample, for example, the strongest associations were observed at a nearby microsatellite (6).

    Notwithstanding these compelling findings, past concerns over the performance of association studies in complex traits mandate independent replication (1,8). Given the large numbers of errors and biases that can compromise any individual study, replication ensures that the original findings are robust and provides a more accurate estimate of the likely effect size (9,10).

    In the present study, we have typed four SNPs in TCF7L2 in 6,736 subjects from well-characterized case, control, and family sample sets from the U.K. (Table 1). The samples typed for the main case-control analysis (all previously described) comprised 1) 572 probands, all ascertained for positive family history, from the Diabetes U.K. Warren 2 sibpair collection (Warren 2 sibpair probands [W2SPs]) (11); 2) 1,586 further type 2 diabetic cases from the Medical Research Council/Diabetes U.K. Case resource, ascertained for type 2 diabetes diagnosed before age 65 (Warren 2 cases [W2Cs]) (12); 3) 550 U.K. control subjects (HRC+), 480 from the Human Random Control resource, with an additional 70 control samples from the same source (European Collection of Cell Cultures, Salisbury, U.K.); and 4) 2,024 U.K. control subjects from the British Birth Cohort of 1958 (58BC). All cases were ascertained using similar criteria for a diagnosis of diabetes (based on usage of oral agents and/or insulin and/or biochemical evidence of hyperglycemia), with subtypes other than type 2 diabetes excluded using clinical, genetic, and/or immunological criteria (all are GAD antibody negative). Glucose tolerance status is not known for any of the control subjects. All subjects in the case-control analysis are of known British/Irish European origin.

    For family-based association analyses, we typed 1,170 members of 390 complete parent-offspring trios (the Warren 2 trio families), each ascertained for a European proband with type 2 diabetes (13). Due to Mendelian errors in two of these, data are reported on 388 trios. Of the 388 Warren 2 trio probands (W2TPs), 350 have a three-generation history of exclusively British/Irish origin (13,14). For analysis of the effects of TCF7L2 variation on the evidence for linkage to diabetes on chromosome 10q, we obtained genotypes from all 1,406 members of the 573 Warren 2 sibpair families (11). For further details concerning all subjects, see the online appendix (available at http://diabetes.diabetesjournals.org).

    We genotyped the two variants displaying the strongest SNP association signal in the Icelandic study (rs12255372 and rs7903146), plus two other SNPs (rs4506565 and rs12243326) selected using Phase II HapMap data as the best-correlated proxies available. Genotyping was performed at KBiosciences (Hoddesdon, U.K.) using a fluorescence-based competitive allele-specific assay (KASPar) (details available from the authors). Call rates for all SNPs exceeded 95% overall (with no SNP in any sample below 91.7%). Genotype data performed well against stringent quality-control criteria, including a discrepancy rate on duplicate genotyping of 2/2,416 (0.04% error), two instances of Mendelian inconsistency in 963 families, and no evidence of departure from Hardy-Weinberg equilibrium (all P > 0.05) in control subjects.

    Genotype frequency distributions for the four variants in case and control groups studied are shown in Table 2. In our samples, mutual r2 values exceeded 0.70 for all pairwise combinations (see online appendix Table 1). As expected, SNPs rs4506565 and rs7903146 formed one pair of highly correlated variants, as did rs12255372 and rs12243326. To maximize the power of the main type 2 diabetic case-control comparison, case (W2SPs and W2Cs, but not W2TPs, to ensure independence of the case-control and family-based analyses) and control (those from the British Birth Cohort of 1958 and the Human Random Control resources) subjects were pooled after first confirming homogeneity of genotype frequencies between subgroups on an SNP-by-SNP basis using standard contingency table methods (2).

    Allele and genotype frequency comparisons were conducted using standard contingency table analyses in Stata version 8 (Stata, College Station, TX) and StatXact 6 (Cytel, Cambridge, MA). All SNPs were significantly associated with type 2 diabetes. SNP rs7903146 showed the strongest single-point associations in the case-control analysis, with an allele-wise relative risk (RR) of 1.36 (95% CI 1.24–1.48, P = 1.3 x 10–11) for allele T. Heterozygous and homozygous carriers for allele T have genotype RRs of 1.35 (1.19–1.53, P = 3.1 x 10–6) and 1.90 (1.54–2.33, P = 3.6 x 10–10), respectively, relative to AA homozygotes (Table 3). These data are consistent with a multiplicative (or, equally, an additive) genetic model, in keeping with the original observations by Grant et al. (6). The population-attributable risk, as estimated using the least-selected cases (W2C) alone, was 16%.

    The associations described above do not include the W2TP samples (trio probands). Family-based association analysis within the full set of 388 trios (using TDTphase [15]) provided strong independent evidence for association in the form of substantial overtransmission of the high-risk alleles for all four variants (Table 4). In this analysis, the most extreme signal (62% transmission of the susceptibility allele, P = 7.7 x 10–5) was seen at rs4506565 rather than rs7903146. There was no evidence of parent-of-origin effects at any variant. In both the case-control and family-based studies, haplotype-based analyses did not generate additional evidence for association with disease (data not shown). Using Fisher’s method to combine significance values from the case-control and family-based association analyses (16), the combined P value for rs4506565 is 4.4 x 10–14.

    When we considered all three case groups (but restricting the W2TP group to the 350 with British/Irish origin), case-control effects sizes were seen to be appreciably greater for those cases ascertained for familiality and/or early onset (i.e., the W2SPs plus the British/Irish W2TPs [n = 922]) than for the less-selected (W2C) cases (n = 1,586). For example, at rs4506565, the allele-wise RR for the familial/early-onset cases was 1.50 (95% CI 1.34–1.68, P = 2.4 x 10–12) compared with 1.28 (1.17–1.41, P = 2.8 x 10–7) for the W2Cs. There were modest differences in genotype frequencies between the two groups of case subjects (P = 0.02). However, we found no significant relationship between TCF7L2 variants and age at diagnosis in any of the individual case samples (P > 0.05).

    The genome-wide linkage scan undertaken in the Warren 2 sibpair families had identified chromosome 10q as one of the strongest linkage signals in the U.K. (11). Though TCF7L2 maps 25 Mb (29 cM) from the peak of this linkage signal and 19 Mb (19 cM) outside the 1–logarithm of odds support interval, we used the Genotype-IBD Sharing Test (17) to establish whether the typed variants in TCF7L2 were in any way contributing to the linkage signal. Using rs7903146 genotypes exclusively from siblings with type 2 diabetes, we generated family-based weighted variables under dominant, recessive, and additive models. We found no evidence that TCF7L2 variants contributed to the previously described linkage, whether we tested at the marker nearest TCF7L2 itself (D10S597) or at the peak of linkage (D10S1765) (all P > 0.6).

    Given increasing evidence of overlap between the genes implicated in type 2 diabetes susceptibility and those causative for related Mendelian subtypes of diabetes (18), we sought to establish whether TCF7L2 mutations were involved in the pathogenesis of permanent diabetes arising during infancy (19). Accordingly, we studied 48 subjects (27 male) from an international cohort with permanent diabetes, all diagnosed before 6 months of age (20), and in whom mutations in the major neonatal diabetes genes (KCNJ11, IPF-1, and GCK) had been excluded (19). All 14 exons of TCF7L2 were amplified using M13-tailed primers (details available from the authors) and sequenced using an ABI 3730 DNA sequencer according to the manufacturer’s instructions (Applied Biosystems, Foster City, CA). Four novel, rare, heterozygous variants were identified. Numbered with respect to adenine thymine guanine translation-initiation site these were c.A879G p.Pro294Pro (exon 8; in a single Asian subject), c.C1429A p.Thr488Pro (exon 14; in three subjects of Vietnamese, Chilean, and Swedish origin), c.A1579C p.M526M (exon 14; one Syrian subject), and c.T1735C p.Ser579Pro (exon 14; one Brazilian subject). In each case, the rare allele was also observed in an unaffected parent, effectively excluding an etiological role.

    Several conclusions can be drawn from these data. First, the independent replication provided by this study confirms TCF7L2 as a genuine type 2 diabetes susceptibility gene, with the magnitude of the association exceeding even the strictest criteria for genome-wide significance. Second, the strong association seen within the family-based association analysis discounts the, admittedly, remote possibility that the original findings had arisen as a result of latent population substructure. Third, though estimates of effect size obtained in the present study, using the least-selected cases, are somewhat lower than those observed in the original study, the "winner’s curse" effect at this gene is less pronounced than that seen for other complex trait susceptibility genes (21). The high population-attributable risk and the almost twofold difference in RR between homozygote groups clearly establish TCF7L2 as the most important player yet identified in susceptibility to multifactorial type 2 diabetes. However, it is worth noting that all studies to date have been conducted in populations of Northern European origin and that unbiased estimates of population effect size will require analysis of more representative cohorts. Fourth, demonstration that the inherited component of individual type 2 diabetes susceptibility extends to an influence by common susceptibility variants with substantial effect sizes (here, a control minor allele frequency of 30% and allele-wise RR of 1.3) augurs well for ongoing efforts to map additional etiological variants through genome-wide association methods and encourages the belief that such endeavors will generate information of clinical value (22). Fifth, the observation that the frequency of the diabetes-associated TCF7L2 genotypes is greater among cases ascertained for familiality and early onset (boosting the allele-wise RR to almost 1.5), reaffirms the value of such case-enrichment strategies in disease-gene identification (23). Sixth, the lack of evidence that TCF7L2 variation contributes to linkage signals in our data is not surprising, given the considerable distance between the gene and our nearest linkage peak. As with the Pro12Ala variant at PPARG (4), the large population risk associated with TCF7L2 variants does not translate into a detectable linkage signal; the sibling RR attributable to the typed TCF7L2 variants is only 1.02 and the increase in the mean allele-sharing statistic among affected sibpairs limited to a modest 50.4% (24). Seventh, despite indications that the known functions of TCF7L2 point toward loss of -cell function (rather than insulin action) as the anticipated consequence of gene disruption, we find no evidence that mutations within the coding regions of this gene are responsible for the -cell phenotype of permanent neonatal or infancy-onset diabetes (19).

    The present study does not contribute to efforts to specify which TCF7L2 variants are etiological; it is likely that the SNPs typed here are not causal (6). Until such time as these are identified, measures of effect size derived from partially correlated variants may substantially underestimate the total contribution of TCF7L2 variation to type 2 diabetes susceptibility. Nevertheless, our data clearly establish TCF7L2 as a gene of particular importance to the development of type 2 diabetes. Unraveling of the mechanisms whereby changes in the function or regulation of this transcription factor lead to loss of -cell performance and/or insulin sensitivity is likely to provide crucial new insights into disease pathogenesis.

    ACKNOWLEDGMENTS

    We thank Diabetes U.K. for funding this research and David Altshuler, Leif Groop, and colleagues who kindly shared data from their ongoing association analyses of TCF7L2. We acknowledge use of DNA from the British 1958 Birth Cohort collection, funded by the U.K. Medical Research Council Grant G0000934 and Wellcome Trust Grant 068545/Z/02, and the support of Diabetes U.K. and the Medical Research Council for collection of the various case resources. A.T.H. is a Wellcome Trust Research Leave Fellow and M.N.W. a Vandervell Foundation Research Fellow.

    FOOTNOTES

    C.J.G. and E.Z. contributed equally to this work.

    Additional information for this article can be found in an online appendix at http://diabetes.diabetesjournals.org.

    The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

    REFERENCES

    Hattersley AT, McCarthy MI: A question of standards: what makes a good genetic association study Lancet 366:1315–1323, 2005

    O’Rahilly S, Barroso I, Wareham NJ: Genetic factors in type 2 diabetes: the end of the beginning Science 307:370–373, 2005

    Parikh H, Groop L: Candidate genes for type 2 diabetes (Review). Rev Endocr Metab Disord 5:151–176, 2004

    Altshuler D, Hirschhorn JN, Klannemark M, Lindgren CM, Vohl MC, Nemesh J, Lane CR, Schaffner SF, Bolk S, Brewer C, Tuomi T, Gaudet D, Hudson TJ, Daly M, Groop L, Lander ES: The common PPARgamma Pro12Ala polymorphism is associated with decreased risk of type 2 diabetes. Nat Genet 26:76–80, 2000

    Gloyn AL, Weedon MN, Owen KR, Turner MJ, Knight BA, Hitman GA, Walker M, Levy JC, Sampson MJ, Halford S, McCarthy MI, Hattersley AT, Frayling TM: Large-scale association studies of variants in genes encoding the pancreatic -cell KATP channel subunits Kir6.2 KCNJ11 and SUR1 (ABCC8) confirm that the (KCNJ11) E23K variant is associated with type 2 diabetes. Diabetes 52:568–572, 2003

    Grant SF, Thorleifsson G, Reynisdottir I, Benediktsson R, Manolescu A, Sainz J, Helgason A, Stefansson H, Emilsson V, Helgadottir A, Styrkarsdottir U, Magnusson KP, Walters GB, Palsdottir E, Jonsdottir T, Gudmundsdottir T, Gylfason A, Saemundsdottir J, Wilensky RL, Reilly MP, Rader DJ, Bagger Y, Christiansen C, Gudnason V, Sigurdsson G, Thorsteinsdottir U, Gulcher JR, Kong A, Stefansson K: Variant of transcription factor 7-like 2 (TCF7L2) gene confers risk of type 2 diabetes. Nat Genet 38:320–323, 2006

    Reynisdottir I, Thorleifsson G, Benediktsson R, Sigurdsson G, Emilsson V, Einarsdottir AS, Hjorleifdottir EE, Orlygsdottir GT, Bjornsdottir GT, Saemundsdottir J, Halldorsson S, Hrafnkelsdottir S, Sigurjonsdottir SB, Steinsdottir S, Martin M, Kochan JP, Rhees BK, Grant SFA, Frigge ML, Kong A, Gudnason V, Stefansson K, Gulcher JR: Localization of a susceptibility gene for type 2 diabetes to chromosome 5q34–q35.2. Am J Hum Genet 73:323–335, 2003

    Patterson M, Cardon LR: Replication publication (Editorial). PLoS Biol 3:e327, 2005

    Page GP, George V, Page PZ, Allison DB: "Are we there yet" Deciding when one has demonstrated specific genetic causation in complex diseases and quantitative traits (Review). Am J Hum Genet 73:711–719, 2003

    Hirschhorn JN, Lohmueller K, Byrne E, Hirschhorn K: A comprehensive review of genetic association studies. Genet Med 4:45–61, 2002

    Wiltshire S, Hattersley AT, Hitman GA, Walker M, Levy JC, Sampson M, O’Rahilly S, Frayling TM, Bell JI, Lathrop GM, Bennett A, Dhillon R, Fletcher C, Groves CJ, Jones E, Prestwich P, Simecek N, Rao PV, Wishart M, Foxon R, Bottazzo GF, Howell S, Smedley D, Cardon LR, Menzel S, McCarthy MI: A genomewide scan for loci predisposing to type 2 diabetes in a U.K. population (the Diabetes UK Warren 2 Repository): analysis of 573 pedigrees provides independent replication of a susceptibility locus on chromosome 1q. Am J Hum Genet 69:553–569, 2001

    Weedon MN, Owen KR, Shields B, Hitman G, Walker M, McCarthy MI, Love-Gregory LD, Permutt MA, Hattersley AT, Frayling TM: Common variants of the hepatocyte nuclear factor-4 P2 promoter are associated with type 2 diabetes in the U.K. population. Diabetes 53:3002–3006, 2004

    Frayling T, Walker M, McCarthy M, Evans J, Allen L, Lynn S, Ayres S, Millauer B, Turner C, Turner R, Sampson M, Hitman G, Ellard S, Hattersley A: Parent-offspring trios: a resource to facilitate the identification of type 2 diabetes genes. Diabetes 48:2475–2479, 1999

    Minton JA, Hattersley AT, Owen K, McCarthy MI, Walker M, Latif F, Barrett T, Frayling TM: Association studies of genetic variation in the WFS1 gene and type 2 diabetes in U.K. populations. Diabetes 51:1287–1290, 2002

    Dudbridge F: Pedigree disequilibrium tests for multilocus haplotypes. Genet Epidemiol 25:115–121, 2003

    Fisher R: Statistical Methods and Scientific Interference. 3rd ed. New York, Macmillan, 1973

    Li C, Scott LJ, Boehnke M: Assessing whether an allele can account in part for a linkage signal: the Genotype-IBD Sharing Test (GIST). Am J Hum Genet 74:418–431, 2004

    McCarthy MI: Progress in defining the molecular basis of type 2 diabetes mellitus through susceptibility-gene identification (Review). Hum Mol Genet 13:R33–R41, 2004

    Gloyn AL, Pearson ER, Antcliff JF, Proks P, Bruining GJ, Slingerland AS, Howard N, Srinivasan S, Silva JM, Molnes J, Edghill EL, Frayling TM, Temple IK, Mackay D, Shield JP, Sumnik Z, van Rhijn A, Wales JK, Clark P, Gorman S, Aisenberg J, Ellard S, Njolstad PR, Ashcroft FM, Hattersley AT: Activating mutations in the gene encoding the ATP-sensitive potassium-channel subunit Kir6.2 and permanent neonatal diabetes. N Engl J Med 350:1838–1849, 2004

    Edghill EL, Dix RJ, Flanagan SE, Bingley PJ, Hattersley AT, Ellard S, Gillespie KM: HLA genotyping supports a nonautoimmune aetiology in patients diagnosed with diabetes under the age of 6 months. Diabetes 55:1895–1898, 2006

    Lohmueller KE, Pearce CL, Pike M, Lander ES, Hirschhorn JN: Meta-analysis of genetic association studies supports a contribution of common variants to susceptibility to common disease. Nat Genet 33:177–182, 2003

    Yang Q, Khoury MJ, Friedman JM, Little J, Flanders WD: How many genes underlie the occurrence of common complex diseases in the population Int J Epidemiol 34:1129–1137, 2005

    Risch N, Teng J: The relative power of family-based and case-control designs for linkage disequilibrium studies of complex human diseases. I. DNA pooling. Genome Res 8:1273–1288, 1998

    Risch N, Merikangas K: The future of genetic studies of complex human diseases. Science 273:1516–1517, 1996(Christopher J. Groves, Eleftheria Zeggin)

http://www.100md.com/html/DirDu/2007/04/04/41/07/83.htm