We collected RNA sequencing data from a total of 11 previous studies, including different tissues, or under different treatment conditions. The tissues mainly come from root, stem, leaf, callus, and various tissues in flowers. Environmental stresses mainly include relatively low temperature stress, ethephon, sodidum thiosulfate and 5'-azacytidine.
The quality of the RNA sequencing reads was examined by FastQC (v0.11.9) (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/). Barcode adaptors and low-quality reads (read quality < 80 for paired-end reads) were removed by Trimmomatic (v0.38) [1]. Then, the filtered reads were aligned to the O. fragrans "Liuyejingui" reference genome using Hisat2 (v2.1.0) [2] with default parameters. Bam files containing aligned reads were inputted into StringTie (v1.3.3b) [3] to measure the expression level of genes. Gene-level raw count data files were generated using featureCounts (v1.6.4) [4]. The raw count data were imported into Bioconductor package DESeq2 [5] in the R language to identify the differentially expressed genes. Genes had a log2-converted fold change ≥ 1 or ≤ -1 with an FDR (False Discovery Rate) ≤ 0.05 were considered as DEGs.