Recent work has shown that much of the missing heritability of complex traits can be resolved by estimates of heritability explained by all genotyped SNPs. However, it is currently unknown how much heritability is missing due to poor tagging or additional causal variants at known GWAS loci. Here, we use variance components to quantify the heritability explained by all SNPs at known GWAS loci in nine diseases from WTCCC1 and WTCCC2. After accounting for expectation, we observed all SNPs at known GWAS loci to explain 1.29 x more heritability than GWAS-associated SNPs on average (P=3.3 x 10⁻⁵). For some diseases, this increase was individually significant: 2.07 x for Multiple Sclerosis (MS) (P=6.5 x 10⁻⁹) and 1.48 x for Crohn's Disease (CD) (P = 1.3 x 10⁻³); all analyses of autoimmune diseases excluded the well-studied MHC region. Additionally, we found that GWAS loci from other related traits also explained significant heritability. The union of all autoimmune disease loci explained 7.15 x more MS heritability than known MS SNPs (P < 1.0 x 10⁻¹⁶ and 2.20 x more CD heritability than known CD SNPs (P = 6.1 x 10⁻⁹), with an analogous increase for all autoimmune diseases analyzed. We also observed significant increases in an analysis of > 20,000 Rheumatoid Arthritis (RA) samples typed on ImmunoChip, with 2.37 x more heritability from all SNPs at GWAS loci (P = 2.3 x 10⁻⁶) and 5.33 x more heritability from all autoimmune disease loci (P < 1 x 10⁻¹⁶ compared to known RA SNPs (including those identified in this cohort). Our methods adjust for LD between SNPs, which can bias standard estimates of heritability from SNPs even if all causal variants are typed. By comparing adjusted estimates, we hypothesize that the genome-wide distribution of causal variants is enriched for low-frequency alleles, but that causal variants at known GWAS loci are skewed towards common alleles. These findings have important ramifications for fine-mapping study design and our understanding of complex disease architecture.
The extent to which variants in the protein-coding sequence of genes contribute to risk of rheumatoid arthritis (RA) is unknown. In this study, we addressed this issue by deep exon sequencing and large-scale genotyping of 25 biological candidate genes located within RA risk loci discovered by genome-wide association studies (GWASs). First, we assessed the contribution of rare coding variants in the 25 genes to the risk of RA in a pooled sequencing study of 500 RA cases and 650 controls of European ancestry. We observed an accumulation of rare nonsynonymous variants exclusive to RA cases in IL2RA and IL2RB (burden test: p = 0.007 and p = 0.018, respectively). Next, we assessed the aggregate contribution of low-frequency and common coding variants to the risk of RA by dense genotyping of the 25 gene loci in 10,609 RA cases and 35,605 controls. We observed a strong enrichment of coding variants with a nominal signal of association with RA (p < 0.05) after adjusting for the best signal of association at the loci (p(enrichment) = 6.4 × 10(-4)). For one locus containing CD2, we found that a missense variant, rs699738 (c.798C>A [p.His266Gln]), and a noncoding variant, rs624988, reside on distinct haplotypes and independently contribute to the risk of RA (p = 4.6 × 10(-6)). Overall, our results indicate that variants (distributed across the allele-frequency spectrum) within the protein-coding portion of a subset of biological candidate genes identified by GWASs contribute to the risk of RA. Further, we have demonstrated that very large sample sizes will be required for comprehensively identifying the independent alleles contributing to the missing heritability of RA.
Defining and characterizing pathologies of the immune system requires precise and accurate quantification of abundances and functions of cellular subsets via cytometric studies. At this time, data analysis relies on manual gating, which is a major source of variability in large-scale studies. We devised an automated, user-guided method, X-Cyt, which specializes in rapidly and robustly identifying targeted populations of interest in large data sets. We first applied X-Cyt to quantify CD4(+) effector and central memory T cells in 236 samples, demonstrating high concordance with manual analysis (r = 0.91 and 0.95, respectively) and superior performance to other available methods. We then quantified the rare mucosal associated invariant T cell population in 35 samples, achieving manual concordance of 0.98. Finally we characterized the population dynamics of invariant natural killer T (iNKT) cells, a particularly rare peripheral lymphocyte, in 110 individuals by assaying 19 markers. We demonstrated that although iNKT cell numbers and marker expression are highly variable in the population, iNKT abundance correlates with sex and age, and the expression of phenotypic and functional markers correlates closely with CD4 expression.
If trait-associated variants alter regulatory regions, then they should fall within chromatin marks in relevant cell types. However, it is unclear which of the many marks are most useful in defining cell types associated with disease and fine mapping variants. We hypothesized that informative marks are phenotypically cell type specific; that is, SNPs associated with the same trait likely overlap marks in the same cell type. We examined 15 chromatin marks and found that those highlighting active gene regulation were phenotypically cell type specific. Trimethylation of histone H3 at lysine 4 (H3K4me3) was the most phenotypically cell type specific (P < 1 × 10(-6)), driven by colocalization of variants and marks rather than gene proximity (P < 0.001). H3K4me3 peaks overlapped with 37 SNPs for plasma low-density lipoprotein concentration in the liver (P < 7 × 10(-5)), 31 SNPs for rheumatoid arthritis within CD4(+) regulatory T cells (P = 1 × 10(-4)), 67 SNPs for type 2 diabetes in pancreatic islet cells (P = 0.003) and the liver (P = 0.003), and 14 SNPs for neuropsychiatric disease in neuronal tissues (P = 0.007). We show how cell type-specific H3K4me3 peaks can inform the fine mapping of associated SNPs to identify causal variation.
While studies to associate genomic variants to complex traits have gradually become increasingly productive, the molecular mechanisms that underlie these associations are rarely understood. Because only a small fraction of trait-associated variants can be linked to coding sequences, investigators have speculated that many of the underlying causal alleles influence non-coding gene regulatory sites. Recent studies have successfully identified examples of mechanisms for non-coding alleles at individual loci. Now, genome-wide chromatin assays have resulted in maps of dozens of genomic annotations of the non-coding genome across multiple different tissues, cell types and cell lines. This gives a tremendous opportunity to integrate these annotations with complex trait signals to globally interpret associated variants, and prioritize likely causal alleles. Here, we review the examples of mechanisms by which non-coding, common alleles result in phenotypes. We discuss the efforts to integrate common trait-associated variants with genomic annotations. Finally, we highlight some caveats of these approaches and outline future directions for improvement.
Susceptibility to primary biliary cirrhosis (PBC) is strongly associated with human leukocyte antigen (HLA)-region polymorphisms. To determine if associations can be explained by classical HLA determinants, we studied Italian, 676 cases and 1440 controls, genotyped with dense single-nucleotide polymorphisms (SNPs) for which classical HLA alleles and amino acids were imputed. Although previous genome-wide association studies and our results show stronger SNP associations near DQB1, we demonstrate that the HLA signals can be attributed to classical DRB1 and DPB1 genes. Strong support for the predominant role of DRB1 is provided by our conditional analyses. We also demonstrate an independent association of DPB1. Specific HLA-DRB1 genes (*08, *11 and *14) account for most of the DRB1 association signal. Consistent with previous studies, DRB1*08 (P=1.59 × 10(-11)) was the strongest predisposing allele, whereas DRB1*11 (P=1.42 × 10(-10)) was protective. Additionally, DRB1*14 and the DPB1 association (DPB1*03:01; P=9.18 × 10(-7)) were predisposing risk alleles. No signal was observed in the HLA class 1 or class 3 regions. These findings better define the association of PBC with HLA and specifically support the role of classical HLA-DRB1 and DPB1 genes and alleles in susceptibility to PBC.
Palmer ND, McDonough CW, Hicks PJ, Roh BH, Wing MR, An SS, Hester JM, Cooke JN, Bostrom MA, Rudock ME, Talbert ME, Lewis JP, Lewis JP, Lewis JP, Ferrara A, Lu L, Ziegler JT, Sale MM, Divers J, Shriner D, Adeyemo A, Rotimi CN, Ng MCY, Langefeld CD, Freedman BI, Bowden DW, Voight BF, Scott LJ, Steinthorsdottir V, Morris AP, Dina C, Welch RP, Zeggini E, Huth C, Aulchenko YS, Thorleifsson G, McCulloch LJ, Ferreira T, Grallert H, Amin N, Wu G, Willer CJ, Raychaudhuri S, McCarroll SA, Langenberg C, Hofmann OM, Dupuis J, Qi L, Segrè AV, van Hoek M, Navarro P, Ardlie K, Balkau B, Benediktsson R, Bennett AJ, Blagieva R, Boerwinkle E, Bonnycastle LL, Bengtsson Boström K, Bravenboer B, Bumpstead S, Burtt NP, Charpentier G, Chines PS, Cornelis M, Couper DJ, Crawford G, Doney ASF, Elliott KS, Elliott AL, Erdos MR, Fox CS, Franklin CS, Ganser M, Gieger C, Grarup N, Green T, Griffin S, Groves CJ, Guiducci C, Hadjadj S, Hassanali N, Herder C, Isomaa B, Jackson AU, Johnson PRV, Jørgensen T, Kao WHL, Klopp N, Kong A, Kraft P, Kuusisto J, Lauritzen T, Li M, Lieverse A, Lindgren CM, Lyssenko V, Marre M, Meitinger T, Midthjell K, Morken MA, Narisu N, Nilsson P, Owen KR, Payne F, Perry JRB, Petersen A-K, Platou C, Proença C, Prokopenko I, Rathmann W, Rayner WN, Robertson NR, Rocheleau G, Roden M, Sampson MJ, Saxena R, Shields BM, Shrader P, Sigurdsson G, Sparsø T, Strassburger K, Stringham HM, Sun Q, Swift AJ, Thorand B, Tichet J, Tuomi T, van Dam RM, van Haeften TW, van Herpt T, van Vliet-Ostaptchouk JV, Walters BG, Weedon MN, Wijmenga C, Witteman J, Bergman RN, Cauchi S, Collins FS, Gloyn AL, Gyllensten U, Hansen T, Hide WA, Hitman GA, Hofman A, Hunter DJ, Hveem K, Laakso M, Mohlke KL, Morris AD, Palmer CNA, Pramstaller PP, Rudan I, Sijbrands E, Stein LD, Tuomilehto J, Uitterlinden A, Walker M, Wareham NJ, Watanabe RM, Abecasis GR, Boehm BO, Campbell H, Daly MJ, Hattersley AT, Hu FB, Meigs JB, Pankow JS, Pedersen O, Wichmann H-E, Barroso I, Florez JC, Frayling TM, Groop L, Sladek R, Thorsteinsdottir U, Wilson JF, Illig T, Froguel P, van Duijn CM, Stefansson K, Altshuler D, Boehnke M, McCarthy MI, Soranzo N, Wheeler E, Glazer NL, Bouatia-Naji N, Mägi R, Randall J, Johnson T, Elliott P, Rybin D, Henneman P, Dehghan A, Hottenga JJ, Song K, Goel A, Egan JM, Lajunen T, Doney A, Kanoni S, Cavalcanti-Proença C, Kumari M, Timpson NJ, Zabena C, Ingelsson E, An P, O'Connell J, Luan J'an, Elliott A, McCarroll SA, Roccasecca RM, Pattou F, Sethupathy P, Ariyurek Y, Barter P, Beilby JP, Ben-Shlomo Y, Bergmann S, Bochud M, Bonnefond A, Borch-Johnsen K, Böttcher Y, Brunner E, Bumpstead SJ, Chen Y-DI, Chines P, Clarke R, Coin LJM, Cooper MN, Crisponi L, Day INM, de Geus EJC, Delplanque J, Fedson AC, Fischer-Rosinsky A, Forouhi NG, Frants R, Franzosi MG, Galan P, Goodarzi MO, Graessler J, Grundy S, Gwilliam R, Hallmans G, Hammond N, Han X, Hartikainen A-L, Hayward C, Heath SC, Hercberg S, Hicks AA, Hillman DR, Hingorani AD, Hui J, Hung J, Jula A, Kaakinen M, Kaprio J, Kesaniemi AY, Kivimaki M, Knight B, Koskinen S, Kovacs P, Kyvik KO, Lathrop MG, Lawlor DA, Le Bacquer O, Lecoeur C, Li Y, Mahley R, Mangino M, Manning AK, Martínez-Larrad MT, McAteer JB, McPherson R, Meisinger C, Melzer D, Meyre D, Mitchell BD, Mukherjee S, Naitza S, Neville MJ, Oostra BA, Orrù M, Pakyz R, Paolisso G, Pattaro C, Pearson D, Peden JF, Pedersen NL, Perola M, Pfeiffer AFH, Pichler I, Polasek O, Posthuma D, Potter SC, Pouta A, Province MA, Psaty BM, Rayner NW, Rice K, Ripatti S, Rivadeneira F, Rolandsson O, Sandbaek A, Sandhu M, Sanna S, Sayer AA, Scheet P, Seedorf U, Sharp SJ, Shields B, Sijbrands EJG, Silveira A, Simpson L, Singleton A, Smith NL, Sovio U, Swift A, Syddall H, Syvänen A-C, Tanaka T, Tönjes A, Uitterlinden AG, van Dijk KW, Varma D, Visvikis-Siest S, Vitart V, Vogelzangs N, Waeber G, Wagner PJ, Walley A, Ward KL, Watkins H, Wild SH, Willemsen G, Witteman JCM, Yarnell JWG, Zelenika D, Zethelius B, Zhai G, Zhao JH, Zillikens CM, Borecki IB, Loos RJF, Meneton P, Magnusson PKE, Nathan DM, Williams GH, Silander K, Salomaa V, Smith GD, Bornstein SR, Schwarz P, Spranger J, Karpe F, Shuldiner AR, Cooper C, Dedoussis GV, Serrano-Ríos M, Lind L, Palmer LJ, Franks PW, Ebrahim S, Marmot M, Kao LWH, Pramstaller PP, Wright AF, Stumvoll M, Hamsten A, Buchanan TA, Valle TT, Rotter JI, Siscovick DS, Penninx BWJH, Boomsma DI, Deloukas P, Spector TD, Ferrucci L, Cao A, Scuteri A, Schlessinger D, Uda M, Ruokonen A, Jarvelin M-R, Waterworth DM, Vollenweider P, Peltonen L, Mooser V, Sladek R. A genome-wide association search for type 2 diabetes genes in African Americans [Internet]. PLoS One 2012;7(1):e29202. Publisher's VersionAbstract
African Americans are disproportionately affected by type 2 diabetes (T2DM) yet few studies have examined T2DM using genome-wide association approaches in this ethnicity. The aim of this study was to identify genes associated with T2DM in the African American population. We performed a Genome Wide Association Study (GWAS) using the Affymetrix 6.0 array in 965 African-American cases with T2DM and end-stage renal disease (T2DM-ESRD) and 1029 population-based controls. The most significant SNPs (n = 550 independent loci) were genotyped in a replication cohort and 122 SNPs (n = 98 independent loci) were further tested through genotyping three additional validation cohorts followed by meta-analysis in all five cohorts totaling 3,132 cases and 3,317 controls. Twelve SNPs had evidence of association in the GWAS (P<0.0071), were directionally consistent in the Replication cohort and were associated with T2DM in subjects without nephropathy (P<0.05). Meta-analysis in all cases and controls revealed a single SNP reaching genome-wide significance (P<2.5×10(-8)). SNP rs7560163 (P = 7.0×10(-9), OR (95% CI) = 0.75 (0.67-0.84)) is located intergenically between RND3 and RBM43. Four additional loci (rs7542900, rs4659485, rs2722769 and rs7107217) were associated with T2DM (P<0.05) and reached more nominal levels of significance (P<2.5×10(-5)) in the overall analysis and may represent novel loci that contribute to T2DM. We have identified novel T2DM-susceptibility variants in the African-American population. Notably, T2DM risk was associated with the major allele and implies an interesting genetic architecture in this population. These results suggest that multiple loci underlie T2DM susceptibility in the African-American population and that these loci are distinct from those identified in other ethnic populations.
PURPOSE: To investigate whether the 2 subtypes of advanced age-related macular degeneration (AMD), choroidal neovascularization (CNV), and geographic atrophy (GA) segregate separately in families and to identify which genetic variants are associated with these 2 subtypes. DESIGN: Sibling correlation study and genome-wide association study (GWAS). PARTICIPANTS: For the sibling correlation study, 209 sibling pairs with advanced AMD were included. For the GWAS, 2594 participants with advanced AMD subtypes and 4134 controls were included. Replication cohorts included 5383 advanced AMD participants and 15 240 controls. METHODS: Participants had the AMD grade assigned based on fundus photography, examination, or both. To determine heritability of advanced AMD subtypes, a sibling correlation study was performed. For the GWAS, genome-wide genotyping was conducted and 6 036 699 single nucleotide polymorphisms (SNPs) were imputed. Then, the SNPs were analyzed with a generalized linear model controlling for genotyping platform and genetic ancestry. The most significant associations were evaluated in independent cohorts. MAIN OUTCOME MEASURES: Concordance of advanced AMD subtypes in sibling pairs and associations between SNPs with GA and CNV advanced AMD subtypes. RESULTS: The difference between the observed and expected proportion of siblings concordant for the same subtype of advanced AMD was different to a statistically significant degree (P = 4.2 × 10(-5)), meaning that in siblings of probands with CNV or GA, the same advanced subtype is more likely to develop. In the analysis comparing participants with CNV to those with GA, a statistically significant association was observed at the ARMS2/HTRA1 locus (rs10490924; odds ratio [OR], 1.47; P = 4.3 × 10(-9)), which was confirmed in the replication samples (OR, 1.38; P = 7.4 × 10(-14) for combined discovery and replication analysis). CONCLUSIONS: Whether CNV versus GA develops in a patient with AMD is determined in part by genetic variation. In this large GWAS meta-analysis and replication analysis, the ARMS2/HTRA1 locus confers increased risk for both advanced AMD subtypes, but imparts greater risk for CNV than for GA. This locus explains a small proportion of the excess sibling correlation for advanced AMD subtype. Other loci were detected with suggestive associations that differ for advanced AMD subtypes and deserve follow-up in additional studies.
Jostins L, Ripke S, Weersma RK, Duerr RH, McGovern DP, Hui KY, Lee JC, Schumm PL, Sharma Y, Anderson CA, Essers J, Mitrovic M, Ning K, Cleynen I, Theatre E, Spain SL, Raychaudhuri S, Goyette P, Wei Z, Abraham C, Achkar J-P, Ahmad T, Amininejad L, Ananthakrishnan AN, Andersen V, Andrews JM, Baidoo L, Balschun T, Bampton PA, Bitton A, Boucher G, Brand S, Büning C, Cohain A, Cichon S, D'Amato M, De Jong D, Devaney KL, Dubinsky M, Edwards C, Ellinghaus D, Ferguson LR, Franchimont D, Fransen K, Gearry R, Georges M, Gieger C, Glas J, Haritunians T, Hart A, Hawkey C, Hedl M, Hu X, Karlsen TH, Kupcinskas L, Kugathasan S, Latiano A, Laukens D, Lawrance IC, Lees CW, Louis E, Mahy G, Mansfield J, Morgan AR, Mowat C, Newman W, Palmieri O, Ponsioen CY, Potocnik U, Prescott NJ, Regueiro M, Rotter JI, Russell RK, Sanderson JD, Sans M, Satsangi J, Schreiber S, Simms LA, Sventoraityte J, Targan SR, Taylor KD, Tremelling M, Verspaget HW, De Vos M, Wijmenga C, Wilson DC, Winkelmann J, Xavier RJ, Zeissig S, Zhang B, Zhang CK, Zhao H, Zhao H, Silverberg MS, Annese V, Hakonarson H, Brant SR, Radford-Smith G, Mathew CG, Rioux JD, Schadt EE, Daly MJ, Franke A, Parkes M, Vermeire S, Barrett JC, Cho JH. Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease [Internet]. Nature 2012;491(7422):119-24. Publisher's VersionAbstract
Crohn's disease and ulcerative colitis, the two common forms of inflammatory bowel disease (IBD), affect over 2.5 million people of European ancestry, with rising prevalence in other populations. Genome-wide association studies and subsequent meta-analyses of these two diseases as separate phenotypes have implicated previously unsuspected mechanisms, such as autophagy, in their pathogenesis and showed that some IBD loci are shared with other inflammatory diseases. Here we expand on the knowledge of relevant pathways by undertaking a meta-analysis of Crohn's disease and ulcerative colitis genome-wide association scans, followed by extensive validation of significant findings, with a combined total of more than 75,000 cases and controls. We identify 71 new associations, for a total of 163 IBD loci, that meet genome-wide significance thresholds. Most loci contribute to both phenotypes, and both directional (consistently favouring one allele over the course of human history) and balancing (favouring the retention of both alleles within populations) selection effects are evident. Many IBD loci are also implicated in other immune-mediated disorders, most notably with ankylosing spondylitis and psoriasis. We also observe considerable overlap between susceptibility loci for IBD and mycobacterial infection. Gene co-expression network analysis emphasizes this relationship, with pathways shared between host responses to mycobacteria and those predisposing to IBD.
Dastani Z, Hivert M-F, Timpson N, Perry JRB, Yuan X, Scott RA, Henneman P, Heid IM, Kizer JR, Lyytikäinen L-P, Fuchsberger C, Tanaka T, Morris AP, Small K, Isaacs A, Beekman M, Coassin S, Lohman K, Qi L, Kanoni S, Pankow JS, Uh H-W, Wu Y, Bidulescu A, Rasmussen-Torvik LJ, Greenwood CMT, Ladouceur M, Grimsby J, Manning AK, Liu C-T, Kooner J, Mooser VE, Vollenweider P, Kapur KA, Chambers J, Wareham NJ, Langenberg C, Frants R, Willems-Vandijk K, Oostra BA, Willems SM, Lamina C, Winkler TW, Psaty BM, Tracy RP, Brody J, Chen I, Viikari J, Kähönen M, Pramstaller PP, Evans DM, St Pourcain B, Sattar N, Wood AR, Bandinelli S, Carlson OD, Egan JM, Böhringer S, van Heemst D, Kedenko L, Kristiansson K, Nuotio M-L, Loo B-M, Harris T, Garcia M, Kanaya A, Haun M, Klopp N, Wichmann H-E, Deloukas P, Katsareli E, Couper DJ, Duncan BB, Kloppenburg M, Adair LS, Borja JB, Borja JB, Borja JB, Borja JB, Borja JB, Wilson JG, Musani S, Guo X, Johnson T, Semple R, Teslovich TM, Allison MA, Redline S, Buxbaum SG, Mohlke KL, Meulenbelt I, Ballantyne CM, Dedoussis GV, Hu FB, Liu Y, Paulweber B, Spector TD, Slagboom EP, Ferrucci L, Jula A, Perola M, Raitakari O, Florez JC, Salomaa V, Eriksson JG, Frayling TM, Hicks AA, Lehtimäki T, Smith GD, Siscovick DS, Kronenberg F, van Duijn C, Loos RJF, Waterworth DM, Meigs JB, Dupuis J, Richards BJ, Voight BF, Scott LJ, Steinthorsdottir V, Dina C, Welch RP, Zeggini E, Huth C, Aulchenko YS, Thorleifsson G, McCulloch LJ, Ferreira T, Grallert H, Amin N, Wu G, Willer CJ, Raychaudhuri S, McCarroll SA, Hofmann OM, Segrè AV, van Hoek M, Navarro P, Ardlie K, Balkau B, Benediktsson R, Bennett AJ, Blagieva R, Boerwinkle E, Bonnycastle LL, Bengtsson Boström K, Bravenboer B, Bumpstead S, Burtt NP, Charpentier G, Chines PS, Cornelis M, Crawford G, Doney ASF, Elliott KS, Elliott AL, Erdos MR, Fox CS, Franklin CS, Ganser M, Gieger C, Grarup N, Green T, Griffin S, Groves CJ, Guiducci C, Hadjadj S, Hassanali N, Herder C, Isomaa B, Jackson AU, Johnson PRV, Jørgensen T, Kao WHL, Kong A, Kraft P, Kuusisto J, Lauritzen T, Li M, Lieverse A, Lindgren CM, Lyssenko V, Marre M, Meitinger T, Midthjell K, Morken MA, Narisu N, Nilsson P, Owen KR, Payne F, Petersen A-K, Platou C, Proença C, Prokopenko I, Rathmann W, Rayner WN, Robertson NR, Rocheleau G, Roden M, Sampson MJ, Saxena R, Shields BM, Shrader P, Sigurdsson G, Sparsø T, Strassburger K, Stringham HM, Sun Q, Swift AJ, Thorand B, Tichet J, Tuomi T, van Dam RM, van Haeften TW, van Herpt T, van Vliet-Ostaptchouk JV, Walters BG, Weedon MN, Wijmenga C, Witteman J, Bergman RN, Cauchi S, Collins FS, Gloyn AL, Gyllensten U, Hansen T, Hide WA, Hitman GA, Hofman A, Hunter DJ, Hveem K, Laakso M, Morris AD, Palmer CNA, Rudan I, Sijbrands E, Stein LD, Tuomilehto J, Uitterlinden A, Walker M, Watanabe RM, Abecasis GR, Boehm BO, Campbell H, Daly MJ, Hattersley AT, Pedersen O, Barroso I, Groop L, Sladek R, Thorsteinsdottir U, Wilson JF, Illig T, Froguel P, van Duijn CM, Stefansson K, Altshuler D, Boehnke M, McCarthy MI, Soranzo N, Wheeler E, Glazer NL, Bouatia-Naji N, Mägi R, Randall J, Elliott P, Rybin D, Dehghan A, Hottenga JJ, Song K, Goel A, Lajunen T, Doney A, Cavalcanti-Proença C, Kumari M, Timpson NJ, Zabena C, Ingelsson E, An P, O'Connell J, Luan J'an, Elliott A, McCarroll SA, Roccasecca RM, Pattou F, Sethupathy P, Ariyurek Y, Barter P, Beilby JP, Ben-Shlomo Y, Bergmann S, Bochud M, Bonnefond A, Borch-Johnsen K, Böttcher Y, Brunner E, Bumpstead SJ, Chen Y-DI, Chines P, Clarke R, Coin LJM, Cooper MN, Crisponi L, Day INM, de Geus EJC, Delplanque J, Fedson AC, Fischer-Rosinsky A, Forouhi NG, Franzosi MG, Galan P, Goodarzi MO, Graessler J, Grundy S, Gwilliam R, Hallmans G, Hammond N, Han X, Hartikainen A-L, Hayward C, Heath SC, Hercberg S, Hillman DR, Hingorani AD, Hui J, Hung J, Kaakinen M, Kaprio J, Kesaniemi AY, Kivimaki M, Knight B, Koskinen S, Kovacs P, Kyvik KO, Lathrop MG, Lawlor DA, Le Bacquer O, Lecoeur C, Li Y, Mahley R, Mangino M, Martínez-Larrad MT, McAteer JB, McPherson R, Meisinger C, Melzer D, Meyre D, Mitchell BD, Mukherjee S, Naitza S, Neville MJ, Orrù M, Pakyz R, Paolisso G, Pattaro C, Pearson D, Peden JF, Pedersen NL, Pfeiffer AFH, Pichler I, Polasek O, Posthuma D, Potter SC, Pouta A, Province MA, Rayner NW, Rice K, Ripatti S, Rivadeneira F, Rolandsson O, Sandbaek A, Sandhu M, Sanna S, Sayer AA, Scheet P, Seedorf U, Sharp SJ, Shields B, Sigurðsson G, Sijbrands EJG, Silveira A, Simpson L, Singleton A, Smith NL, Sovio U, Swift A, Syddall H, Syvänen A-C, Tönjes A, Uitterlinden AG, van Dijk KW, Varma D, Visvikis-Siest S, Vitart V, Vogelzangs N, Waeber G, Wagner PJ, Walley A, Ward KL, Watkins H, Wild SH, Willemsen G, Witteman JCM, Yarnell JWG, Zelenika D, Zethelius B, Zhai G, Zhao JH, Zillikens CM, Zillikens CM, Zillikens CM, Consortium GPB, Borecki IB, Meneton P, Magnusson PKE, Nathan DM, Williams GH, Silander K, Bornstein SR, Schwarz P, Spranger J, Karpe F, Shuldiner AR, Cooper C, Serrano-Ríos M, Lind L, Palmer LJ, Hu FB, Franks PW, Ebrahim S, Marmot M, Kao LWH, Pramstaller PP, Wright AF, Stumvoll M, Hamsten A, Hamsten A, Buchanan TA, Valle TT, Rotter JI, Penninx BWJH, Boomsma DI, Cao A, Scuteri A, Schlessinger D, Uda M, Ruokonen A, Jarvelin M-R, Peltonen L, Mooser V, Sladek R, Sladek R, Sladek R, Musunuru K, Smith AV, Edmondson AC, Stylianou IM, Koseki M, Pirruccello JP, Chasman DI, Johansen CT, Fouchier SW, Peloso GM, Barbalic M, Ricketts SL, Bis JC, Feitosa MF, Orho-Melander M, Melander O, Li X, Li M, Cho YS, Go MJ, Kim YJ, Lee J-Y, Park T, Kim K, Sim X, Ong RT-H, Croteau-Chonka DC, Lange LA, Smith JD, Ziegler A, Zhang W, Zee RYL, Whitfield JB, Thompson JR, Surakka I, Spector TD, Smit JH, Sinisalo J, Scott J, Saharinen J, Sabatti C, Rose LM, Roberts R, Rieder M, Parker AN, Pare G, O'Donnell CJ, Nieminen MS, Nickerson DA, Montgomery GW, McArdle W, Masson D, Martin NG, Marroni F, Lucas G, Luben R, Lokki M-L, Lettre G, Launer LJ, Lakatta EG, Laaksonen R, Kyvik KO, König IR, Khaw K-T, Kaplan LM, Johansson Å, Janssens CAJW, Igl W, Hovingh KG, Hengstenberg C, Havulinna AS, Hastie ND, Harris TB, Haritunians T, Hall AS, Groop LC, Gonzalez E, Freimer NB, Erdmann J, Ejebe KG, Döring A, Dominiczak AF, Demissie S, Deloukas P, de Faire U, Crawford G, Chen Y-DI, Caulfield MJ, Boekholdt MS, Assimes TL, Quertermous T, Seielstad M, Wong TY, Tai E-S, Feranil AB, Kuzawa CW, Taylor HA, Gabriel SB, Holm H, Gudnason V, Krauss RM, Ordovas JM, Munroe PB, Kooner JS, Tall AR, Hegele RA, Kastelein JJP, Schadt EE, Strachan DP, Reilly MP, Samani NJ, Schunkert H, Cupples AL, Sandhu MS, Ridker PM, Rader DJ, Kathiresan S. Novel loci for adiponectin levels and their influence on type 2 diabetes and metabolic traits: a multi-ethnic meta-analysis of 45,891 individuals [Internet]. PLoS Genet 2012;8(3):e1002607. Publisher's VersionAbstract
Circulating levels of adiponectin, a hormone produced predominantly by adipocytes, are highly heritable and are inversely associated with type 2 diabetes mellitus (T2D) and other metabolic traits. We conducted a meta-analysis of genome-wide association studies in 39,883 individuals of European ancestry to identify genes associated with metabolic disease. We identified 8 novel loci associated with adiponectin levels and confirmed 2 previously reported loci (P = 4.5×10(-8)-1.2×10(-43)). Using a novel method to combine data across ethnicities (N = 4,232 African Americans, N = 1,776 Asians, and N = 29,347 Europeans), we identified two additional novel loci. Expression analyses of 436 human adipocyte samples revealed that mRNA levels of 18 genes at candidate regions were associated with adiponectin concentrations after accounting for multiple testing (p<3×10(-4)). We next developed a multi-SNP genotypic risk score to test the association of adiponectin decreasing risk alleles on metabolic traits and diseases using consortia-level meta-analytic data. This risk score was associated with increased risk of T2D (p = 4.3×10(-3), n = 22,044), increased triglycerides (p = 2.6×10(-14), n = 93,440), increased waist-to-hip ratio (p = 1.8×10(-5), n = 77,167), increased glucose two hours post oral glucose tolerance testing (p = 4.4×10(-3), n = 15,234), increased fasting insulin (p = 0.015, n = 48,238), but with lower in HDL-cholesterol concentrations (p = 4.5×10(-13), n = 96,748) and decreased BMI (p = 1.4×10(-4), n = 121,335). These findings identify novel genetic determinants of adiponectin levels, which, taken together, influence risk of T2D and markers of insulin resistance.
Rheumatoid arthritis (RA) is partly heritable; genetic and serological markers are known to confer risk of developing pathology. But given clinical heterogeneity in RA, can we predict who will develop severe disease? Substantial heritability of erosive progression rates has now been identified, but better prognostic biomarkers remain wanting.
The genetic association of the major histocompatibility complex (MHC) to rheumatoid arthritis risk has commonly been attributed to alleles in HLA-DRB1. However, debate persists about the identity of the causal variants in HLA-DRB1 and the presence of independent effects elsewhere in the MHC. Using existing genome-wide SNP data in 5,018 individuals with seropositive rheumatoid arthritis (cases) and 14,974 unaffected controls, we imputed and tested classical alleles and amino acid polymorphisms in HLA-A, HLA-B, HLA-C, HLA-DPA1, HLA-DPB1, HLA-DQA1, HLA-DQB1 and HLA-DRB1, as well as 3,117 SNPs across the MHC. Conditional and haplotype analyses identified that three amino acid positions (11, 71 and 74) in HLA-DRβ1 and single-amino-acid polymorphisms in HLA-B (at position 9) and HLA-DPβ1 (at position 9), which are all located in peptide-binding grooves, almost completely explain the MHC association to rheumatoid arthritis risk. This study shows how imputation of functional variation from large reference panels can help fine map association signals in the MHC.
We have previously shown that rheumatoid arthritis (RA) risk alleles overlap between different ethnic groups. Here, we utilize a multiethnic approach to show that we can effectively discover RA risk alleles. Thirteen putatively associated SNPs that had not yet exceeded genome-wide significance (p < 5 × 10(-8)) in our previous RA genome-wide association study (GWAS) were analyzed in independent sample sets consisting of 4,366 cases and 17,765 controls of European, African American, and East Asian ancestry. Additionally, we conducted an overall association test across all 65,833 samples (a GWAS meta-analysis plus the replication samples). Of the 13 SNPs investigated, four were significantly below the study-wide Bonferroni corrected p value threshold (p < 0.0038) in the replication samples. Two SNPs (rs3890745 at the 1p36 locus [p = 2.3 × 10(-12)] and rs2872507 at the 17q12 locus [p = 1.7 × 10(-9)]) surpassed genome-wide significance in all 16,659 RA cases and 49,174 controls combined. We used available GWAS data to fine map these two loci in Europeans and East Asians, and we found that the same allele conferred risk in both ethnic groups. A series of bioinformatic analyses identified TNFRSF14-MMEL1 at the 1p36 locus and IKZF3-ORMDL3-GSDMB at the 17q12 locus as the genes most likely associated with RA. These findings demonstrate empirically that a multiethnic approach is an effective strategy for discovering RA risk loci, and they suggest that combining GWASs across ethnic groups represents an efficient strategy for gaining statistical power.
The genetic architectures of common, complex diseases are largely uncharacterized. We modeled the genetic architecture underlying genome-wide association study (GWAS) data for rheumatoid arthritis and developed a new method using polygenic risk-score analyses to infer the total liability-scale variance explained by associated GWAS SNPs. Using this method, we estimated that, together, thousands of SNPs from rheumatoid arthritis GWAS explain an additional 20% of disease risk (excluding known associated loci). We further tested this method on datasets for three additional diseases and obtained comparable estimates for celiac disease (43% excluding the major histocompatibility complex), myocardial infarction and coronary artery disease (48%) and type 2 diabetes (49%). Our results are consistent with simulated genetic models in which hundreds of associated loci harbor common causal variants and a smaller number of loci harbor multiple rare causal variants. These analyses suggest that GWAS will continue to be highly productive for the discovery of additional susceptibility loci for common diseases.
Yu Y, Bhangale TR, Fagerness J, Ripke S, Thorleifsson G, Tan PL, Souied EH, Richardson AJ, Merriam JE, Buitendijk GHS, Reynolds R, Raychaudhuri S, Chin KA, Sobrin L, Evangelou E, Lee PH, Lee AY, Leveziel N, Zack DJ, Campochiaro B, Campochiaro P, Smith TR, Barile GR, Guymer RH, Hogg R, Chakravarthy U, Robman LD, Gustafsson O, Sigurdsson H, Ortmann W, Behrens TW, Stefansson K, Uitterlinden AG, van Duijn CM, Vingerling JR, Klaver CCW, Allikmets R, Brantley MA, Baird PN, Katsanis N, Thorsteinsdottir U, Ioannidis JPA, Daly MJ, Graham RR, Seddon JM. Common variants near FRK/COL10A1 and VEGFA are associated with advanced age-related macular degeneration [Internet]. Hum Mol Genet 2011;20(18):3699-709. Publisher's VersionAbstract
Despite significant progress in the identification of genetic loci for age-related macular degeneration (AMD), not all of the heritability has been explained. To identify variants which contribute to the remaining genetic susceptibility, we performed the largest meta-analysis of genome-wide association studies to date for advanced AMD. We imputed 6 036 699 single-nucleotide polymorphisms with the 1000 Genomes Project reference genotypes on 2594 cases and 4134 controls with follow-up replication of top signals in 5640 cases and 52 174 controls. We identified two new common susceptibility alleles, rs1999930 on 6q21-q22.3 near FRK/COL10A1 [odds ratio (OR) 0.87; P = 1.1 × 10(-8)] and rs4711751 on 6p12 near VEGFA (OR 1.15; P = 8.7 × 10(-9)). In addition to the two novel loci, 10 previously reported loci in ARMS2/HTRA1 (rs10490924), CFH (rs1061170, and rs1410996), CFB (rs641153), C3 (rs2230199), C2 (rs9332739), CFI (rs10033900), LIPC (rs10468017), TIMP3 (rs9621532) and CETP (rs3764261) were confirmed with genome-wide significant signals in this large study. Loci in the recently reported genes ABCA1 and COL8A1 were also detected with suggestive evidence of association with advanced AMD. The novel variants identified in this study suggest that angiogenesis (VEGFA) and extracellular collagen matrix (FRK/COL10A1) pathways contribute to the development of advanced AMD.
Epidemiology and candidate gene studies indicate a shared genetic basis for celiac disease (CD) and rheumatoid arthritis (RA), but the extent of this sharing has not been systematically explored. Previous studies demonstrate that 6 of the established non-HLA CD and RA risk loci (out of 26 loci for each disease) are shared between both diseases. We hypothesized that there are additional shared risk alleles and that combining genome-wide association study (GWAS) data from each disease would increase power to identify these shared risk alleles. We performed a meta-analysis of two published GWAS on CD (4,533 cases and 10,750 controls) and RA (5,539 cases and 17,231 controls). After genotyping the top associated SNPs in 2,169 CD cases and 2,255 controls, and 2,845 RA cases and 4,944 controls, 8 additional SNPs demonstrated P<5 × 10(-8) in a combined analysis of all 50,266 samples, including four SNPs that have not been previously confirmed in either disease: rs10892279 near the DDX6 gene (P(combined) = 1.2 × 10(-12)), rs864537 near CD247 (P(combined) = 2.2 × 10(-11)), rs2298428 near UBE2L3 (P(combined) = 2.5 × 10(-10)), and rs11203203 near UBASH3A (P(combined) = 1.1 × 10(-8)). We also confirmed that 4 gene loci previously established in either CD or RA are associated with the other autoimmune disease at combined P<5 × 10(-8) (SH2B3, 8q24, STAT4, and TRAF1-C5). From the 14 shared gene loci, 7 SNPs showed a genome-wide significant effect on expression of one or more transcripts in the linkage disequilibrium (LD) block around the SNP. These associations implicate antigen presentation and T-cell activation as a shared mechanism of disease pathogenesis and underscore the utility of cross-disease meta-analysis for identification of genetic risk factors with pleiotropic effects between two clinically distinct diseases.
Genome-wide association studies (GWAS) have defined over 150 genomic regions unequivocally containing variation predisposing to immune-mediated disease. Inferring disease biology from these observations, however, hinges on our ability to discover the molecular processes being perturbed by these risk variants. It has previously been observed that different genes harboring causal mutations for the same Mendelian disease often physically interact. We sought to evaluate the degree to which this is true of genes within strongly associated loci in complex disease. Using sets of loci defined in rheumatoid arthritis (RA) and Crohn's disease (CD) GWAS, we build protein-protein interaction (PPI) networks for genes within associated loci and find abundant physical interactions between protein products of associated genes. We apply multiple permutation approaches to show that these networks are more densely connected than chance expectation. To confirm biological relevance, we show that the components of the networks tend to be expressed in similar tissues relevant to the phenotypes in question, suggesting the network indicates common underlying processes perturbed by risk loci. Furthermore, we show that the RA and CD networks have predictive power by demonstrating that proteins in these networks, not encoded in the confirmed list of disease associated loci, are significantly enriched for association to the phenotypes in question in extended GWAS analysis. Finally, we test our method in 3 non-immune traits to assess its applicability to complex traits in general. We find that genes in loci associated to height and lipid levels assemble into significantly connected networks but did not detect excess connectivity among Type 2 Diabetes (T2D) loci beyond chance. Taken together, our results constitute evidence that, for many of the complex diseases studied here, common genetic associations implicate regions encoding proteins that physically interact in a preferential manner, in line with observations in Mendelian disease.
UNLABELLED: Primary sclerosing cholangitis (PSC) is a chronic cholestatic liver disease characterized by inflammation and fibrosis of the bile ducts. Both environmental and genetic factors contribute to its pathogenesis. To further clarify its genetic background, we investigated susceptibility loci recently identified for ulcerative colitis (UC) in a large cohort of 1,186 PSC patients and 1,748 controls. Single nucleotide polymorphisms (SNPs) tagging 13 UC susceptibility loci were initially genotyped in 854 PSC patients and 1,491 controls from Benelux (331 cases, 735 controls), Germany (265 cases, 368 controls), and Scandinavia (258 cases, 388 controls). Subsequently, a joint analysis was performed with an independent second Scandinavian cohort (332 cases, 257 controls). SNPs at chromosomes 2p16 (P-value 4.12 × 10(-4) ), 4q27 (P-value 4.10 × 10(-5) ), and 9q34 (P-value 8.41 × 10(-4) ) were associated with PSC in the joint analysis after correcting for multiple testing. In PSC patients without inflammatory bowel disease (IBD), SNPs at 4q27 and 9q34 were nominally associated (P < 0.05). We applied additional in silico analyses to identify likely candidate genes at PSC susceptibility loci. To identify nonrandom, evidence-based links we used GRAIL (Gene Relationships Across Implicated Loci) analysis showing interconnectivity between genes in six out of in total nine PSC-associated regions. Expression quantitative trait analysis from 1,469 Dutch and UK individuals demonstrated that five out of nine SNPs had an effect on cis-gene expression. These analyses prioritized IL2, CARD9, and REL as novel candidates. CONCLUSION: We have identified three UC susceptibility loci to be associated with PSC, harboring the putative candidate genes REL, IL2, and CARD9. These results add to the scarce knowledge on the genetic background of PSC and imply an important role for both innate and adaptive immunological factors.
A common allele at the TAGAP gene locus demonstrates a suggestive, but not conclusive association with risk of rheumatoid arthritis (RA). To fine map the locus, we conducted comprehensive imputation of CEU HapMap single-nucleotide polymorphisms (SNPs) in a genome-wide association study (GWAS) of 5,500 RA cases and 22,621 controls (all of European ancestry). After controlling for population stratification with principal components analysis, the strongest signal of association was to an imputed SNP, rs212389 (P=3.9 × 10(-8), odds ratio=0.87). This SNP remained highly significant upon conditioning on the previous RA risk variant (rs394581, P=2.2 × 10(-5)) or on a SNP previously associated with celiac disease and type I diabetes (rs1738074, P=1.7 × 10(-4)). Our study has refined the TAGAP signal of association to a single haplotype in RA, and in doing so provides conclusive statistical evidence that the TAGAP locus is associated with RA risk. Our study also underscores the utility of comprehensive imputation in large GWAS data sets to fine map disease risk alleles.
Discovering and following up on genetic associations with complex phenotypes require large patient cohorts. This is particularly true for patient cohorts of diverse ancestry and clinically relevant subsets of disease. The ability to mine the electronic health records (EHRs) of patients followed as part of routine clinical care provides a potential opportunity to efficiently identify affected cases and unaffected controls for appropriate-sized genetic studies. Here, we demonstrate proof-of-concept that it is possible to use EHR data linked with biospecimens to establish a multi-ethnic case-control cohort for genetic research of a complex disease, rheumatoid arthritis (RA). In 1,515 EHR-derived RA cases and 1,480 controls matched for both genetic ancestry and disease-specific autoantibodies (anti-citrullinated protein antibodies [ACPA]), we demonstrate that the odds ratios and aggregate genetic risk score (GRS) of known RA risk alleles measured in individuals of European ancestry within our EHR cohort are nearly identical to those derived from a genome-wide association study (GWAS) of 5,539 autoantibody-positive RA cases and 20,169 controls. We extend this approach to other ethnic groups and identify a large overlap in the GRS among individuals of European, African, East Asian, and Hispanic ancestry. We also demonstrate that the distribution of a GRS based on 28 non-HLA risk alleles in ACPA+ cases partially overlaps with ACPA- subgroup of RA cases. Our study demonstrates that the genetic basis of rheumatoid arthritis risk is similar among cases of diverse ancestry divided into subsets based on ACPA status and emphasizes the utility of linking EHR clinical data with biospecimens for genetic studies.