Human gut microbiome composition is shaped by multiple factors but the relative contribution of host genetics remains elusive. Here we examine genotype and microbiome data from 1,046 healthy individuals with several distinct ancestral origins who share a relatively common environment, and demonstrate that the gut microbiome is not significantly associated with genetic ancestry, and that host genetics have a minor role in determining microbiome composition. We show that, by contrast, there are significant similarities in the compositions of the microbiomes of genetically unrelated individuals who share a household, and that over 20% of the inter-person microbiome variability is associated with factors related to diet, drugs and anthropometric measurements. We further demonstrate that microbiome data significantly improve the prediction accuracy for many human traits, such as glucose and obesity measures, compared to models that use only host genetic and environmental data. These results suggest that microbiome alterations aimed at improving clinical outcomes may be carried out across diverse genetic backgrounds.
AbstractTo study the effect of host genetics on gut microbiome composition, the MiBioGen consortium curated and analyzed whole-genome genotypes and 16S fecal microbiome data from 18,473 individuals (25 cohorts). Microbial composition showed high variability across cohorts: we detected only 9 out of 410 genera in more than 95% of the samples. A genome-wide association study (GWAS) of host genetic variation in relation to microbial taxa identified 30 loci affecting microbome taxa at a genome-wide significant (P<5×10-8) threshold. Just one locus, the lactase (LCT) gene region, reached study-wide significance (GWAS signal P=8.6×10−21); it showed an age-dependent association with Bifidobacterium abundance. Other associations were suggestive (1.94×10−10<P<5×10−8) but enriched for taxa showing high heritability and for genes expressed in the intestine and brain. A phenome-wide association study and Mendelian randomization analyses identified enrichment of microbiome trait loci SNPs in the metabolic, nutrition and environment domains and indicated food preferences and diseases as mediators of genetic effects.
Human gut microbiome composition is shaped by multiple host intrinsic and extrinsic factors, but the relative contribution of host genetic compared to environmental factors remains elusive. Here, we genotyped a cohort of 696 healthy individuals from several distinct ancestral origins and a relatively common environment, and demonstrate that there is no statistically significant association between microbiome composition and ethnicity, single nucleotide polymorphisms (SNPs), or overall genetic similarity, and that only 5 of 211 (2.4%) previously reported microbiome-SNP associations replicate in our cohort. In contrast, we find similarities in the microbiome composition of genetically unrelated individuals who share a household. We define the term biome-explainability as the variance of a host phenotype explained by the microbiome after accounting for the contribution of human genetics. Consistent with our finding that microbiome and host genetics are largely independent, we find significant biome-explainability levels of 16-33% for body mass index (BMI), fasting glucose, high-density lipoprotein (HDL) cholesterol, waist circumference, waist-hip ratio (WHR), and lactose consumption. We further show that several human phenotypes can be predicted substantially more accurately when adding microbiome data to host genetics data, and that the contribution of both data sources to prediction accuracy is largely additive. Overall, our results suggest that human microbiome composition is dominated by environmental factors rather than by host genetics.
The non-coding genome is substantially larger than the protein-coding genome, but the lack of appropriate methodologies for identifying functional variants limits genetic association studies. Here, we developed analytical tools to identify rare variants in pre-miRNAs, miRNA recognition elements in 3′UTRs, and miRNA-target networks. Region-based burden analysis of >23,000 variants in 6,139 amyotrophic lateral sclerosis (ALS) whole-genomes and 70,403 non-ALS controls identified Interleukin-18 Receptor Accessory Protein (IL18RAP) 3′UTR variants significantly enriched in non-ALS genomes, replicate in an independent cohort and associate with a five-fold reduced risk of developing ALS. IL18RAP 3′UTR variants modify NF-κB signaling, provide survival advantage for cultured ALS motor neurons and ALS patients, and reveal direct genetic evidence and therapeutic targets for neuro-inflammation. This systematic analysis of the non-coding genome and specifically miRNA-networks will increase the power of genetic association studies and uncover mechanisms of neurodegeneration.
Recent studies indicate that the gut microbiome is partially heritable, motivating the need to investigate microbiome-host genome associations via microbial genome-wide association studies (mGWAS). Existing mGWAS demonstrate that microbiome-host genotypes associations are typically weak and are spread across multiple variants, similar to associations often observed in genome-wide association studies (GWAS) of complex traits. Here we reconsider mGWAS by viewing them through the lens of GWAS, and demonstrate that there are striking similarities between the challenges and pitfalls faced by the two study designs. We further advocate the mGWAS community to adopt three key lessons learned over the history of GWAS: (a) Adopting uniform data and reporting formats to facilitate replication and meta-analysis efforts; (b) enforcing stringent statistical criteria to reduce the number of false positive findings; and (c) considering the microbiome and the host genome as distinct entities, rather than studying different taxa and single nucleotide polymorphism (SNPs) separately. Finally, we anticipate that mGWAS sample sizes will have to increase by orders of magnitude to reproducibly associate the host genome with the gut microbiome.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.