Multi-omics is function module integrating multiple omics datasets. It is constructed based on association analysis, such genome-wide association study (GWAS), eQTL, transciptome-wide association study (TWAS) and fine-mapping methods of causal variations, such as colocation analysis and SMR (Summary databased Mendelian randomization analysis).
Genome-wide association study is an approach for identifying the genes that underlie common diseases and related quantitative traits. This strategy combines a comprehensive and unbiased survey of the genome with the power to detect common alleles with modest phenotypic effects. In CottonMD, we collected 9 traits of 4180 cotton accessions, including Fiber elongation rate (FE), Fiber length (FL), Fiber strength (FS), Fiber uniformity (FU), Flowering day (FD), Short fiber rate (FR), Leaf pubescence amount (LPA), Micronaire value (MV), Verticillium wilt disease index (DI), Effective boll number (EBN), Fiber upper half mean length (FUHML), First fruit branch position (FFBP), First fruit spur height (FFSH), Flowering period (FP), Fruit spur branch number (FSBN), Lint percentage (LP), Lint weight (LW), Plant height (PH), Seed cotton weight (SCW), Whole growth period (WGP)[1-11] and performed GWAS combining their genotypes using GEMMA. The most significant SNP in every 500kb-window was retained. User can browse GWAS results and search significant SNPs and genes of 6 fiber-associated traits.
Expression quantitative trait loci (eQTL) are the loci in the genome that contribute to the expression levels of messenger RNAs in an organism. They link the static information of DNA sequence variation with the dynamic information of gene expression variation in an organism. In CottonMD, we collected RNA-seq data of fibres at 15 DPA (day post anthesis)[12] from 251 Upland cotton accessions and performed eQTL mapping combining their genotypes using GEMMA. User can browse the links between SNPs and genes by inputting the interested region or genes.
Transcriptome-wide association studies (TWASs) have been widely used to integrate gene expression and genetic data to identify gene–trait associations. In CottonMD, we collected RNA-seq data of fibres at 15 DPA (day post anthesis)[12] and 6 fiber-associated traits from 251 Upland cotton accessions and performed TWAS using mixed-linear module (MLM). User can browse the links between phenotypes and genes by inputting the interested genes.
Colocalization analyses assess the degree to which independent signals of association, including eQTL and GWAS signals, share the same causal variant. In CottonMD, user can make full use of RNA-seq data in CottonMD to perform colocalization analyses by uploading user’s phenotype datasets. Colocalization analyses in CottonMD was performed using COLOC R package.
SMR (Summary data–based Mendelian randomization analysis) is an method based on multi-omics, which integrates summary-level data from GWAS with
data from expression quantitative trait locus (eQTL) studies to identify genes whose expression levels are associated with a complex trait because of pleiotropy.