Genome-wide association studies, which typically report regression coefficients summarizing the associations of many genetic variants with various traits, are potentially a powerful source of data for Mendelian randomization investigations. We demonstrate how such coefficients from multiple variants can be combined in a Mendelian randomization analysis to estimate the causal effect of a risk factor on an outcome. The bias and efficiency of estimates based on summarized data are compared to those based on individual-level data in simulation studies. We investigate the impact of gene–gene interactions, linkage disequilibrium, and ‘weak instruments’ on these estimates. Both an inverse-variance weighted average of variant-specific associations and a likelihood-based approach for summarized data give similar estimates and precision to the two-stage least squares method for individual-level data, even when there are gene–gene interactions. However, these summarized data methods overstate precision when variants are in linkage disequilibrium. If the P-value in a linear regression of the risk factor for each variant is less than , then weak instrument bias will be small. We use these methods to estimate the causal association of low-density lipoprotein cholesterol (LDL-C) on coronary artery disease using published data on five genetic variants. A 30% reduction in LDL-C is estimated to reduce coronary artery disease risk by 67% (95% CI: 54% to 76%). We conclude that Mendelian randomization investigations using summarized data from uncorrelated variants are similarly efficient to those using individual-level data, although the necessary assumptions cannot be so fully assessed.
SummaryMany common variants have been associated with hematological traits, but identification of causal genes and pathways has proven challenging. We performed a genome-wide association analysis in the UK Biobank and INTERVAL studies, testing 29.5 million genetic variants for association with 36 red cell, white cell, and platelet properties in 173,480 European-ancestry participants. This effort yielded hundreds of low frequency (<5%) and rare (<1%) variants with a strong impact on blood cell phenotypes. Our data highlight general properties of the allelic architecture of complex traits, including the proportion of the heritable component of each blood trait explained by the polygenic signal across different genome regulatory domains. Finally, through Mendelian randomization, we provide evidence of shared genetic pathways linking blood cell indices with complex pathologies, including autoimmune diseases, schizophrenia, and coronary heart disease and evidence suggesting previously reported population associations between blood cell indices and cardiovascular disease may be non-causal.
The genetic architecture of common traits, including the number, frequency, and effect sizes of inherited variants that contribute to individual risk, has been long debated. Genome-wide association studies have identified scores of common variants associated with type 2 diabetes, but in aggregate, these explain only a fraction of heritability. To test the hypothesis that lower-frequency variants explain much of the remainder, the GoT2D and T2D-GENES consortia performed whole genome sequencing in 2,657 Europeans with and without diabetes, and exome sequencing in a total of 12,940 subjects from five ancestral groups. To increase statistical power, we expanded sample size via genotyping and imputation in a further 111,548 subjects. Variants associated with type 2 diabetes after sequencing were overwhelmingly common and most fell within regions previously identified by genome-wide association studies. Comprehensive enumeration of sequence variation is necessary to identify functional alleles that provide important clues to disease pathophysiology, but large-scale sequencing does not support a major role for lower-frequency variants in predisposition to type 2 diabetes.
British Heart Foundation, Medical Research Council, Cambridge National Institute for Health Research Biomedical Research Centre, and Gates Cambridge.
Summary: PhenoScanner is a curated database of publicly available results from large-scale genetic association studies. This tool aims to facilitate ‘phenome scans’, the cross-referencing of genetic variants with many phenotypes, to help aid understanding of disease pathways and biology. The database currently contains over 350 million association results and over 10 million unique genetic variants, mostly single nucleotide polymorphisms. It is accompanied by a web-based tool that queries the database for associations with user-specified variants, providing results according to the same effect and non-effect alleles for each input variant. The tool provides the option of searching for trait associations with proxies of the input variants, calculated using the European samples from 1000 Genomes and Hapmap. Availability and Implementation: PhenoScanner is available at www.phenoscanner.medschl.cam.ac.uk. Contact: jrs95@medschl.cam.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.
Objective To assess the association between leucocyte telomere length and risk of cardiovascular disease.Design Systematic review and meta-analysis.
The Emerging Risk Factors Collaboration IMPORTANCE The prevalence of cardiometabolic multimorbidity is increasing. OBJECTIVE To estimate reductions in life expectancy associated with cardiometabolic multimorbidity. DESIGN, SETTING, AND PARTICIPANTS Age-and sex-adjusted mortality rates and hazard ratios (HRs) were calculated using individual participant data from the Emerging Risk Factors Collaboration (689 300 participants; 91 cohorts; years of baseline surveys: 1960-2007; latest mortality follow-up: April 2013; 128 843 deaths). The HRs from the Emerging Risk Factors Collaboration were compared with those from the UK Biobank (499 808 participants; years of baseline surveys: 2006-2010; latest mortality follow-up: November 2013; 7995 deaths).Cumulative survival was estimated by applying calculated age-specific HRs for mortality to contemporary US age-specific death rates.EXPOSURES A history of 2 or more of the following: diabetes mellitus, stroke, myocardial infarction (MI). MAIN OUTCOMES AND MEASURESAll-cause mortality and estimated reductions in life expectancy. RESULTSIn participants in the Emerging Risk Factors Collaboration without a history of diabetes, stroke, or MI at baseline (reference group), the all-cause mortality rate adjusted to the age of 60 years was 6.8 per 1000 person-years. Mortality rates per 1000 person-years were 15.6 in participants with a history of diabetes, 16.1 in those with stroke, 16.8 in those with MI, 32.0 in those with both diabetes and MI, 32.5 in those with both diabetes and stroke, 32.8 in those with both stroke and MI, and 59.5 in those with diabetes, stroke, and MI. Compared with the reference group, the HRs for all-cause mortality were 1.9 (95% CI, 1.8-2.0) in participants with a history of diabetes, 2.1 (95% CI, 2.0-2.2) in those with stroke, 2.0 (95% CI, 1.9-2.2) in those with MI, 3.7 (95% CI, 3.3-4.1) in those with both diabetes and MI, 3.8 (95% CI, 3.5-4.2) in those with both diabetes and stroke, 3.5 (95% CI, 3.1-4.0) in those with both stroke and MI, and 6.9 (95% CI, 5.7-8.3) in those with diabetes, stroke, and MI. The HRs from the Emerging Risk Factors Collaboration were similar to those from the more recently recruited UK Biobank. The HRs were little changed after further adjustment for markers of established intermediate pathways (eg, levels of lipids and blood pressure) and lifestyle factors (eg, smoking, diet). At the age of 60 years, a history of any 2 of these conditions was associated with 12 years of reduced life expectancy and a history of all 3 of these conditions was associated with 15 years of reduced life expectancy. CONCLUSIONS AND RELEVANCEMortality associated with a history of diabetes, stroke, or MI was similar for each condition. Because any combination of these conditions was associated with multiplicative mortality risk, life expectancy was substantially lower in people with multimorbidity.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.