- Open Access
Transcriptome analysis reveals SALL4 as a prognostic key gene in gastric adenocarcinoma
Journal of the Egyptian National Cancer Institute volume 34, Article number: 11 (2022)
Stomach adenocarcinoma (STAD) dominates 80–90% of gastric cancer (GC). Over the years, it has been realized that the identification of the genes responsible for gastric carcinogenesis is essential to understand the biomarker discovery.
This study aims to identify candidate genes for biomarker discovery in STAD. RNA-Seq was performed on three paired tumor-normal and one unpaired tumor samples from four GC patients and investigated for differentially expressed genes (DEGs) using DESeq2. Gene set enrichment analysis were performed. The DEGs were compared with two STAD microarray datasets available on Gene Expression Omnibus (GEO) database. Survival study (OS) were performed using KM-Plotter on the common genes between all the datasets.
Totally, 148 DEGs were identified, wherein 55 genes were upregulated and 93 genes were downregulated with |log2foldchange| > 1 and Benjamini-Hochberg (BH) Adjusted P value < 0.01. Cell adhesion molecule (CAM) Pathway was found to be the most significant among the upregulated genes. Gastric acid secretion and mineral absorption pathways were the most significant pathways among the downregulated genes. Comparison with two GEO datasets followed by OS analysis revealed two upregulating genes, APOC1 and SALL4 with prognostic significance.
Upregulation of APOC1 is associated with marginal overall survival (OS) and SALL4 over-expression was associated with the poor OS using KM-Plotter during 5 years data period. Our study suggests that SALL4 could be a promising biomarker candidate in STAD.
Gastric cancer symptoms are misunderstood often with the other stomach complications, which leads to diagnosis at an advanced stage and poor treatment due to cancer heterogeneity . STAD dominates the major type of GC; the second deadliest cancer type worldwide and associated with a poor survival rate . Eighty to 90% of GC cases belong to STAD are primarily associated with intestinal metaplasia; however, surgical resection is still a promising curative treatment [3, 4]. However, the identification of biomarkers to predict the outcome of the particular treatment is another challenging task and equally crucial.
Biomarker discovery enables the understanding of the disease diagnosis, prognostic events, and selection of the treatment strategies. CEA, CA19-9, and CA72-4 belong to carcinoembryonic antigens and are currently used biomarkers in clinical setup despite having low specificity for evaluating diagnosis and the prognosis of GC [5, 6].
Recent advances in transcriptome array and whole transcriptome sequencing have generated a tremendous amount of data and are being deposited in the publicly available databases. Experimental data could be compared with such publicly available data to increases the authenticity of the study . Comparison of tumor gene expression profiles with the normal tissues is crucial for investigating differentially expressed genes (DEGs), different pathways which enable researchers to understand mechanisms of cancer development, progression, and response to the targeted therapies .
Identification of such biomarkers is crucially important to monitor the patient health after treatment and the after-effects during the recovery period could be avoided. This study aims to identify the potential pathways in GC and the genes involved in STAD using transcriptomics approach.
Sample collection and total RNA isolation
Paired tumor-normal samples from four GC patients were obtained after surgery and the samples were immediately transferred into RNAlater (Thermo Fischer Scientific, USA) solution and stored at – 80 °C temperature. The participants did not have previous GC history. High quality total RNA was extracted from adjacent normal and Tumor Tissue samples using the PureLink RNA mini kit (Ambion, Inc.) as per the manufacture’s protocol. Agilent RNA 6000 Nanochips in 2100 Bioanalyzer (Agilent, Inc) was used to check the quality of isolated total RNA and quantitation was done by Qubit using the Quant-iT RNA assay kit broad range and NanoDrop spectrophotometer (Thermo Ficher Scientific, USA). A summary of the clinical data of the patients is given in the Supplementary Table 1.
Library preparation, sequencing, and data processing
The sequencing library was prepared using Illumina TruSeq Stranded Total RNA Library preparation kit (Illumina Inc., USA) from the samples bearing RNA Integrity Number (RIN) ≥ 6. Three paired tumor-normal and one unpaired tumor sample from four GC patients possessed RIN ≥ 6 (Supplementary Table 2) were further processed for RNA-Seq. Briefly, 1 μg of total RNA was taken for library preparation and rRNAs were removed before fragmentation and adapter ligation. cDNA library containing first and second-strand cDNA was synthesized from rRNA-depleted fragmented total RNA, both ends of cDNA were repaired and adapters were ligated, and final libraries were enriched using limited cycle PCR. The yield of cDNA libraries was quantified using Qubit dsDNA HS assay kit (Invitrogen, USA), size distribution and quality of the cDNA libraries were assessed using a High Sensitivity chip in Bioanalyzer (Agilent Technologies, USA), respectively. Quantitative Real-Time PCR was used to quantify the final library. The clusters were generated on a cBot cluster generation system (Illumina) and paired-end 2 × 100 bp sequencing was performed in Illumina HiSeq-2500 (Illumina Inc., USA).
Data processing and analysis
The raw data were checked with FASTQC tool . The low-quality bases and adapters were removed using Trimmomatic v0.38 . The average number of reads after trimming was reduced to 16 million reads from 18 million reads (Supplementary Table 3). The raw reads were aligned to the human reference genome (GRCh38) using STAR aligner . The generated BAM files were sorted by coordinates using Samtools . The transcript counts were estimated using featureCount tool  with GRCh38 version-based gene annotations. The low expressed genes which did not have more than 15 counts per million (CPM) reads in all the samples were removed from subsequent analysis. The null hypothesis for the experiment was that there was no difference in expression between the paired normal samples (log2foldchange is equal to Zero). DESeq2 , a R/Bioconductor  package which uses the Wald test for hypothesis testing was used to produce the gene list after ranked by P value and Adjusted P value for multiple testing using the Benjamini-Hochberg method. The log-transformed normalized counts were used to calculate the principal component analysis (PCA) to assess normalized expression pattern before differential expression test between tumor and adjacent normal tissues. |Log2foldchange| > 1 (rejection of null hypothesis) and adjusted P value < 0.01 were set as the criteria to get the significantly upregulated and downregulated genes. “ggplot2” packages were used to generate the volcano plot.
Functional enrichment analysis
EnrichR (https://maayanlab.cloud/Enrichr/) was used to study the Gene Ontology (GO) for the Biological process (BP), Molecular function (MF) and Cellular component (CP). Similarly, KEGG Pathway and Disease-Gene Association (DisGeNET) analysis using EnrichR. EnrichR uses Fisher-exact test to calculate the P value and adjusted P value using Benjamini-Hochberg method for correction for multiple hypotheses testing [16,17,18]. The Adjusted P value < 0.05 was considered statistically significant for both the GO analysis and the pathway enrichment analysis to identify the significant GO terms and pathways.
GEO2R analysis and comparison with RNA-Seq data
For comparative study with gene expression data from other STAD patients, microarray datasets GSE19826 and GSE79973 from Gene Expression Omnibus (GEO) were used (Table 1). The differential gene expression was analyzed using GEO2R (https://www.ncbi.nlm.nih.gov/geo/geo2r/). We have identified the DEGs from the two datasets using the criteria of |log2folchange| > 1 to get upregulated and downregulated genes and adjusted P value < 0.01 as statistically significant. The Venn Diagram tool (http://bioinformatics.psb.ugent.be/webtools/Venn/) was used to find out the common genes from the DEGs identified from the RNA-Seq experiment and two GEO Datasets.
Overall Survival (OS) analysis
The “Kaplan-Meier plotter” (https://kmplot.com) tool was used to analyze the overall survival of the key overlapping genes. KM-Plotter utilizes Cox proportional hazards regression analysis and calculates the log-rank P value . KM-Plotter uses background databases derived from manually curated clinical data as well as the gene expression data from GEO datasets. The analysis was restricted to 60 months OS at cancer stage III. Log-rank P value < 0.05 was set to be statistically significant.
Identification of the DEGs from RNA-Seq data
Principal component analysis (PCA) was performed to understand the variation in the normalized expression pattern between the tumor and the adjacent normal samples. PC1 and PC2 were observed to be 52% and 31% variance, respectively (Supplementary Figure 1). Total 148 genes showed highly significant differential expression with adjusted P value < 0.01 wherein 55 genes were upregulated (log2foldchange > 1) and 93 genes were downregulated (log2foldchange < − 1) (Fig. 1). The upregulated and downregulated genes with adjusted P value and log2foldchange are provided in Supplementary Table 4A and B.
Functional enrichment of the DEGs
The Gene Set Enrichment Analysis for upregulated and downregulated genes was performed for Gene Ontology terms for biological process, molecular function, and cellular component (Table 2). Pathway analysis of the upregulated gene set identified the Cell Adhesion molecule (CAM) pathway as the most significant pathway with adjusted P value of 0.004517. Among the downregulated genes, the Gastric acid secretion pathway and Mineral absorption pathway were found to be the most significant pathways with adjusted P values 0.001302 and 0.002315 (Table 3).
In the Disease-Gene Association analysis, the upregulated genes were found to be associated with stomach neoplasm (adjusted P value: 0.0001783), malignant neoplasm of stomach (adjusted P value: 0.002834), and tumor progression (adjusted P value: 0.004874). On the other hand, hypokelmia, characterized by less K+ absorptions are found to be significant (adjusted P value: 0.04754) among the downregulated genes (Table 4).
Identification of DEGs from GEO datasets and comparation with RNA-Seq data
In GSE19826, we found 145 upregulated and 103 downregulated genes. Similarly, in GSE79973, we found 210 genes as upregulated and 343 genes as downregulated. The DEGs obtained from two datasets from the GEO database using GEO2R as well as the DEGs from our study were used to find out the most common genes among the upregulated and downregulated genes. APOC1, SALL4 were the commonly upregulated genes and PSAPL1, CLIC6, TRIM50 were the commonly downregulated genes in all the three datasets. Comparison of the datasets for upregulated and downregulated genes is represented as Venn diagram (Fig. 2A, B). The genes common among the three datasets are also provided in detail in the Supplementary Table 5A-B.
The KM plotter was used to investigate the genes for overall survival and the final analysis was run based on 305 patients’ data for the APOC1 gene and 197 patients’ data for SALL4, PSAPL1, CLIC6, and TRIM50 genes. It was found that high expression of APOC1 is associated with marginal better OS with log-rank P value 0.03 and hazard ratio 0.70. SALL4 upregulation is significantly associated with very poor OS with log-rank P value 0.000021 and hazard ratio 3.19 (Fig. 3A, B). The median OS associated with APOC1 was found 35.5 months in the high expression cohort and 27.4 months in the low expression cohort. The median OS associated with SALL4 was 44.07 months in the low expression cohort and 13.04 month in the high expression cohort. The OS associated with PSAPL1, CLIC6, and TRIM50 genes were found to be insignificant with log-rank P values of 0.061, 0.064, and 0.25, respectively (Fig. 3C–E).
In gastric adenocarcinoma, we have analyzed patients’ tumor samples along with adjacent normal collected after radical resection and 148 highly significant (adjusted P value < 0.01) differentially expressed genes were identified using RNA-Seq data. We have analyzed for functional enrichment of the upregulated and the downregulated genes to understand the underlying pathways and GO for the underlying biological functions. The resulting gene set from the RNA-Seq experiment was compared with the gene expression data of two microarray datasets: GSE19826 (12 tumor and 15 adjacent normals) and GSE79973 (10 tumor and 10 adjacent normals).
Pathway analysis of upregulated genes has identified CAM pathway as significant (adjusted P value: 0.004517). CLDN7, SELL, CLDN4, HLA-DOA, and CLDN1 genes were enriched in the CAM pathway. CLDN1, CLDN4, and CLDN7 belong to the Claudin family and involve in tight junction and are often upregulated in neoplastic tissue. Among the family of claudins, CLDN4 and CLDN7 are more often to be upregulated in stomach cancer as well as other malignancies . Moreover, several other reports have confirmed their upregulation in cancer of various sites . Among the downregulated genes, the most significantly enriched pathways were the gastric acid secretion (adjusted P value: 0.001302) and mineral absorption pathways (Adjusted P value: 0.002315). ATP4A, ATP4B, KCNE2, KCNJ16, and SLC26A7 genes were enriched in the gastric acid secretion pathway. These genes are commonly downregulated genes in gastric adenocarcinoma as reported by many previous studies [24, 25]. ATP4A and ATP4B downregulation is often associated with DNA methylation and ATP4B could also be a potential biomarker of gastric adenocarcinoma . MT1A, TRPV6, MT1M, and MT1G genes were enriched in the mineral absorption pathway.
It is always essential to know or predict the association of disease with the gene or a gene set. DisGeNET is such a database that has a collection of genes as well as their associated disease derived manually from popular databases like Comparative Toxicogenomic Database (Human Subset), UniProt, ClinVar, ClinGen OMIM, and Orphanet as well as extensive text mining data . EnrichR offers the facility to analyze the gene sets against DisGeNET on the web to quickly understand the associated disease with such genes. Most interestingly, the upregulated genes enriched in two stomach related diseases: stomach neoplasm with adjusted P value of 0.001783 and malignant neoplasm of the stomach with adjusted P value of 0.002834. Moreover, the few upregulated genes were also enriched in the tumor progression with adjusted P value of 0.004874. SPP1, MSLN, HOTAIR, CLDN1, CLDN3, CLDN7, ALOX5, and MMP7 genes were found to be common in all these three disease terms. The significant disease term enriched in the downregulated genes was hypokalemia with adjusted P value of 0.04758. However, hypokalemia, characterized by low K+ level, association with GC is not reported much worldwide. Extensive studies might be required to find out the relation of Hypokalemia with GC development. The pathways and the disease terms along with the genes involved are provided in the Table 2.
Comparison of the gene set resulted from the RNA-Seq data with the GEO datasets has allowed us to find out the most common genes. This comparison carries the overall authenticity of differentially expressed genes in gastric adenocarcinoma. We found the genes APOC1 and SALL4 common in all the datasets used for the comparative study which has an impact on OS in GC patients. The expression levels of the common genes are provided in Supplementary Figure 2. Upregulation of both the APOC1 and SALL41 has an opposite effect on the OS. Upregulation of APOC1 is associated with marginal better OS, while upregulation of SALL4 has extremely poor OS. The relative expression of APOC1 which codes for apolipoprotein C1 was detected upregulated in the serum of a large number of GC patients and was identified as a potential biomarker candidate . SALL4 encodes for zinc finger transcription factor is reported to have a role in GC development and found to be overexpressed in GC cases . It was elucidated that upregulation of SALL4 involves GC by activating the Wnt/β-catenin signaling pathway. SALL4 was also found to be associated with poor survival in TCGA (The Cancer Genome Atlas) STAD data .
In summary, although we found significantly large numbers of DEGs, the bioinformatics analysis has enabled us to find out the promising factors (genes) that have a probable impact on gastric carcinogenesis. We have identified important pathways that were altered in GC. The Disease-Gene association study provides the authenticity of the upregulated gen es with the associated disease. At last, comparative study of RNA-Seq with the other STAD patient microarray data has enabled us to find out the most common genes among them and the survival study has converged our focus on the SALL4 gene. However, in-depth study with patient’s clinical data in larger cohorts and a higher sequencing depth is required to explore more hidden biological information in STAD that may enable the researcher to discover new biomarkers.
In conclusion, we found the genes associated with CAM pathway, gastric acid secretion, and mineral absorption pathways altered their expression in STAD samples. APOC1 and SALL4 genes were upregulated in STAD tissue and might affect the disease prognosis. The KM-Plotter analysis revealed that the upregulation of SALL4 is associated with overall poor survival in GC patients and may act as a promising prognostic biomarker.
Availability of data and materials
The aligned reads in BAM format were submitted European Nucleotide Archive (ENA) at EMBL-EBI under the study accession number PRJEB45410 (URL: https://www.ebi.ac.uk/ena/browser/view/PRJEB45410).
Complementary deoxyribonucleic acid
Ribosomal ribonucleic acid
Polymerase chain reaction
Gene Expression Omnibus
Differentially expressed gene
- KM Plotter:
Sexton RE, Al Hallak MN, Diab M, Azmi AS. Gastric cancer: a comprehensive review of current and future treatment strategies. Cancer Metastasis Rev. 2020;39(4):1179–203. https://doi.org/10.1182/blood-2010-10-314682.
Joshi SS, Badgwell BD. Current treatment and recent progress in gastric cancer. CA Cancer J Clin. 2021;71(3):264–79. https://doi.org/10.3322/caac.21657.
Hu B, El Hajj N, Sittler S, Lammert N, Barnes R, Meloni-Ehrig A. Gastric cancer: Classification, histology and application of molecular pathology. J Gastrointest Oncol. 2012;3(3):251–61. https://doi.org/10.3978/j.issn.2078-6891.2012.021.
Jiang K, Li L, Xie Y, Xie D, Xiao Q. High ADAMTS18 expression is associated with poor prognosis in stomach adenocarcinoma. Oncol Lett. 2020;20(5):211. https://doi.org/10.3892/ol.2020.12074.
Matsuoka T, Yashiro M. Biomarkers of gastric cancer: current topics and future perspective. World J Gastroenterol. 2018;24(26):2818–32. https://doi.org/10.3748/wjg.v24.i26.2818.
Ye DM, Xu G, Ma W, Li Y, Luo W, Xiao Y, et al. Significant function and research progress of biomarkers in gastric cancer. Oncol Lett. 2020;19(1):17–29. https://doi.org/10.3892/ol.2019.11078.
Rong L, Huang W, Tian S, Chi X, Zhao P, Liu F. COL1A2 is a novel biomarker to improve clinical prediction in human gastric cancer: integrating bioinformatics and meta-analysis. Pathol Oncol Res. 2018;24(1):129–34. https://doi.org/10.1007/s12253-017-0223-5.
Sorokin M, Poddubskaya E, Baranova M, Glusker A, Kogoniya L, Markarova E, et al. RNA sequencing profiles and diagnostic signatures linked with response to ramucirumab in gastric cancer. Cold Spring Harb Mol Case Stud. 2020;6(2):a004945. https://doi.org/10.1101/mcs.a004945.
Andrews S.. FastQC: a quality control tool for high throughput sequence data. 2010. Available online at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc. Accessed 30 April 2021.
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20. https://doi.org/10.1093/bioinformatics/btu170.
Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29(1):15–21. https://doi.org/10.1093/bioinformatics/bts635.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. 1000 Genome project data processing subgroup. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25(16):2078–9. https://doi.org/10.1093/bioinformatics/btp352.
Liao Y, Smyth GK, Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2014;30(7):923–30. https://doi.org/10.1093/bioinformatics/btt656.
Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15(12):550. https://doi.org/10.1186/s13059-014-0550-8.
Gentleman RC, Carey VJ, Bates DM, Bolstad B, Dettling M, Dudoit S, et al. Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 2004;5(10):R80. https://doi.org/10.1186/gb-2004-5-10-r80.
Chen EY, Tan CM, Kou Y, Duan Q, Wang Z, Meirelles GV, et al. Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool. BMC Bioinformatics. 2013;14:128. https://doi.org/10.1186/1471-2105-14-128.
Kuleshov MV, Jones MR, Rouillard AD, Fernandez NF, Duan Q, Wang Z, et al. Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res. 2016;44(W1):W90–7. https://doi.org/10.1093/nar/gkw377.
Xie Z, Bailey A, Kuleshov MV, Clarke DJB, Evangelista JE, Jenkins SL, et al. Gene Set Knowledge Discovery with Enrichr. Curr Protoc. 2021;1(3):e90. https://doi.org/10.1002/cpz1.90.
Wang Q, Wen YG, Li DP, Xia J, Zhou CZ, Yan DW, et al. Upregulated INHBA expression is associated with poor survival in gastric cancer. Med Oncol. 2012;29(1):77–83. https://doi.org/10.1007/s12032-010-9766-y.
He J, Jin Y, Chen Y, Yao HB, Xia YJ, Ma YY, et al. Downregulation of ALDOB is associated with poor prognosis of patients with gastric cancer. Onco Targets Ther. 2016;9:6099–109. https://doi.org/10.2147/OTT.S110203.
Nagy Á, Munkácsy G, Győrffy B. Pancancer survival analysis of cancer hallmark genes. Sci Rep. 2021;11(1):6047. https://doi.org/10.1038/s41598-021-84787-5.
Hewitt KJ, Agarwal R, Morin PJ. The claudin gene family: expression in normal and neoplastic tissues. BMC Cancer. 2006;6:186. https://doi.org/10.1186/1471-2407-6-186.
Morin PJ. Claudin proteins in human cancer: promising new targets for diagnosis and therapy. Cancer Res. 2005;65(21):9603–6. https://doi.org/10.1158/0008-5472.can-05-2782.
Sun C, Yuan Q, Wu D, Meng X, Wang B. Identification of core genes and outcome in gastric cancer using bioinformatics analysis. Oncotarget. 2017;8(41):70271–80. https://doi.org/10.18632/oncotarget.20082.
Li L, Zhu Z, Zhao Y, Zhang Q, Wu X, Miao B, Cao J, Fei S. FN1, SPARC, and SERPINE1 are highly expressed and significantly related to a poor prognosis of gastric adenocarcinoma revealed by microarray and bioinformatics. Sci Rep. 2019;9(1):7827. https://doi.org/10.1038/s41598-019-43924-x.
Raja UM, Gopal G, Rajkumar T. Intragenic DNA methylation concomitant with repression of ATP4B and ATP4A gene expression in gastric cancer is a potential serum biomarker. Asian Pac J Cancer Prev. 2012;13(11):5563–8. https://doi.org/10.7314/apjcp.2012.13.11.5563.
Piñero J, Bravo À, Queralt-Rosinach N, Gutiérrez-Sacristán A, Deu-Pons J, Centeno E, García-García J, Sanz F, Furlong LI. DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants. Nucleic Acids Res. 2017;45(D1):D833–D839. https://doi.org/10.1093/nar/gkw943.
Yi J, Ren L, Wu J, Li W, Zheng X, Du G, Wang J. Apolipoprotein C1 (APOC1) as a novel diagnostic and prognostic biomarker for gastric cancer. Ann Transl Med. 2019;7(16):380. https://doi.org/10.21037/atm.2019.07.59.
Yuan X, Zhang X, Zhang W, Liang W, Zhang P, Shi H, Zhang B, Shao M, Yan Y, Qian H, Xu W. SALL4 promotes gastric cancer progression through activating CD44 expression. Oncogenesis. 2016;5(11):e268. https://doi.org/10.1038/oncsis.2016.69.
Yang Y, Wang X, Liu Y, Hu Y, Li Z, Li Z, Bu Z, Wu X, Zhang L, Ji J. Up-Regulation of SALL4 Is Associated with Survival and Progression via Putative WNT Pathway in Gastric Cancer. Front Cell Dev Biol. 2021;9:600344. https://doi.org/10.3389/fcell.2021.600344.
The authors thanks all the patients involved in the study. The authors are thankful to the Data collectors and Lab Technicians: David K. Zorinsanga, Nubawihi Khenglawt, Jonathan Lalramhluna, Baby Lalrintluangi, R. Lalengkimi, and T. Lalhriatpuii for collecting the samples, clinical information, and documentation.
The work was supported by the Gastric Cancer Project (DBT-NER/Health/46/2015 and BT/551/NE/U-Excel/2014) sponsored by the Department of Biotechnology (DBT), New Delhi, Government of India. The role of the funder includes the grant for purchasing consumables and equipment as well as supporting manpower for data collection and analysis required to conduct the study.
Ethics approval and consent to participate
The study was approved by the Institutional Ethics Committee (No. B.12018/1/13-CH(A)/IEC), Civil Hospital Aizawl, Mizoram. Consent to participate in the study was received in standard consent forms.
Consent for publication
Consent was obtained for publication of the clinical data.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Summary of Clinical information of patients included in the study. Supplementary Table 2. Concentration, Purity, and Integrity of Total RNA extracted from Adjacent Normal (D) and Tumor (T) tissues. Supplementary Table 3. Summary of RNA-Seq Data before and after trimming. Supplementary Table 4. A: Upregulated genes in tumor tissue with log2 Fold Change greater than 1 and Adjusted p-value less than 0.01. B: Downregulated genes in tumor tissue with log2 Fold Change less than -1 and Adjusted p-value less than 0.01. Supplementary Table 5. Comparison of the DEGs from GEO Datasets of GC as well as from DEGs from RNA-Seq Data. 5A: Upregulated Genes. 5B: Down Regulated Genes.
Principal Component Analysis of the samples performed in DESeq. 2.52% and 31% variance were observed on PC1 and PC2, respectively. Figure S2. Heatmap showing the expression level of the common DEGs in the tissue samples. The plot is divided in each clustering level.
About this article
Cite this article
Sarma, R.J., Subbarayan, S., Zohmingthanga, J. et al. Transcriptome analysis reveals SALL4 as a prognostic key gene in gastric adenocarcinoma. J Egypt Natl Canc Inst 34, 11 (2022). https://doi.org/10.1186/s43046-022-00108-5
- Stomach adenocarcinoma
- Differentially expressed genes
- Poor survival