Microarray-based identification of genes associated with cancer progression and prognosis in hepatocellular carcinoma
Journal of Experimental & Clinical Cancer Research volume 35, Article number: 127 (2016)
Hepatocellular carcinoma (HCC) is the third leading cause of cancer-related deaths. The average survival and 5-year survival rates of HCC patients still remains poor. Thus, there is an urgent need to better understand the mechanisms of cancer progression in HCC and to identify useful biomarkers to predict prognosis.
Public data portals including Oncomine, The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) profiles were used to retrieve the HCC-related microarrays and to identify potential genes contributed to cancer progression. Bioinformatics analyses including pathway enrichment, protein/gene interaction and text mining were used to explain the potential roles of the identified genes in HCC. Quantitative real-time polymerase chain reaction analysis and Western blotting were used to measure the expression of the targets. The data were analysed by SPSS 20.0 software.
We identified 80 genes that were significantly dysregulated in HCC according to four independent microarrays covering 386 cases of HCC and 327 normal liver tissues. Twenty genes were consistently and stably dysregulated in the four microarrays by at least 2-fold and detection of gene expression by RT-qPCR and western blotting showed consistent expression profiles in 11 HCC tissues compared with corresponding paracancerous tissues. Eleven of these 20 genes were associated with disease-free survival (DFS) or overall survival (OS) in a cohort of 157 HCC patients, and eight genes were associated with tumour pathologic PT, tumour stage or vital status. Potential roles of those 20 genes in regulation of HCC progression were predicted, primarily in association with metastasis. INTS8 was specifically correlated with most clinical characteristics including DFS, OS, stage, metastasis, invasiveness, diagnosis, and age.
The significantly dysregulated genes identified in this study were associated with cancer progression and prognosis in HCC, and might be potential therapeutic targets for HCC treatment or potential biomarkers for diagnosis and prognosis.
Hepatocellular carcinoma (HCC) is the third leading cause of cancer-related deaths . There are 750,000 new cases of HCC and nearly 700,000 deaths each year, making this a particularly lethal form of cancer . Over the past decade major progress has been made in our understanding of the risk factors and molecular pathways driving liver carcinogenesis, and these advances have led to substantial opportunities for HCC prevention, surveillance, early diagnosis, prediction of prognosis, and therapy . However, the average survival of HCC patients is normally between 6 and 20 months , and long-term prognosis is poor with reported 5-year survival rates ranging from 17 to 53 % . Thus, there is an urgent need to better understand the mechanism of cancer progression and development in HCC and to identify useful biomarkers for diagnosis and prognosis.
High-throughput profiling technologies such as microarrays and, more recently, next-generation sequencing have become invaluable tools for biomedical research, and large amounts of data generated by those tools, including mRNA expression, DNA methylation, and microRNA expression, are collected in public archives such as the major public projects The Cancer Genome Atlas (TCGA)  and the International Cancer Genome Consortium , and the most prominent primary data archives, ArrayExpress , Gene Expression Omnibus (GEO) , Oncomine  and the databases of the International Nucleotide Sequence Database Collaboration . The wide range of those databases, the various ways in which publicly archived gene expression data are being used in support of new studies, and reuse of these public data can be very powerful . In particular, reusing of the data has the potential to predict treatment response and disease progression and was advantageous to develop precision therapies . For example, based on data retrieved from Oncomine, TCGA, and GEO, Liu et al. identified several genes associated with ovarian cancer progression  and drug resistance . In a similar manner, we identified that upregulation of E2F transcription factor 3 is associated with poor prognosis in HCC . In the present study, using data of mRNA expression, DNA methylation, and clinical data retrieved from Oncomine, GEO, and the TCGA cohort, we identified a group of genes associated with cancer progression and prognosis in HCC.
All patients who underwent curative hepatectomy for primary HCC at the First Affiliated Hospital of Guangxi Medical University between March 2015 and September 2015 were eligible for inclusion in this study. Total of 11 HCCs and the matched paracancerous tissues were collected during surgery and stored in a liquid nitrogen tank until use for mRNA isolation and protein extraction. The study was endorsed by the Ethics Committee of Guangxi Medical University and was performed according to the Declaration of Helsinki, 2013 edition. All patients received an explanation of the aims of the study and signed informed consent.
mRNA isolation and quantitative real-time polymerase chain reaction (RT-qPCR) analysis
Total RNA from 11 HCC and their matched paracancerous tissues was isolated using a miRNeasy Mini Kit (Qiagen, Hilden, Germany). RNA was quantified by spectrophotometry on a NanoDrop 2000 (Thermo Scientific, DE, USA). A total of 2 μg RNA was subjected to cDNA synthesis using the miScript II RT Kit (Qiagen, Hilden, Germany). RT-qPCR was performed with the QuantiFast SYBR Green PCR Kit (Qiagen, Hilden, Germany). Data were collected with the StepOnePlus Real-Time PCR System (ABI, CA, USA) according to the manufacturer’s instructions. The gene expression was compared in each HCC sample and the matched paracancerous tissue, and then the homogeneity of variance in all samples was analysed using the t-test. The RT-qPCR gene-specific primers were as follows: TBCE: forward primer, 5′-AGGCCAACAGATGTTCTCCAG-3′, reverse primer, 5′-CAGGGGGTTTCTTAGGCAGG-3′; INTS8: forward primer, 5′-AACTGAGAGTTCTACTGCTGGA-3′, reverse primer, 5′-GCTGCGCCCAAATCATAGC-3′; VIPR1: forward primer, 5′-TGCTGGGACACCATCAACTC-3′, reverse primer, 5′-TTGTCCGGAAAGAAGGCGAA-3′; CLEC4M: forward primer, 5′-TACTTCATGTCTAACTCCCAGCG-3′, reverse primer, 5′-GCTCCTCAGCAGTTTTGATTACG-3′; MARCO: forward primer, 5′-GGGGACACAGGACTTCAAGG-3′, reverse primer, 5′-CCCTGTTCTCCCTTCACACC-3′; DNASE1L3: forward primer, 5′-AGCCCTTTGTGGTCTGGTTC-3′, reverse primer, 5′-CGTCCGTGTAGACCTCAACC-3′; CRHBP: forward primer, 5′-AAATCCTCAGCAGGTTGCGA-3′, reverse primer, 5′-AAGGCGTCATCTTGGAAGGG-3′; FCN2: forward primer, 5′-CTGCAAGGACCTGCTAGACC-3′, reverse primer, 5′-TGTCATTCCCCAGCCAGAAC-3′; GAPDH (used as the control): forward primer, 5′-GAAGGTGAAGGTCGGAGT-3′, reverse primer, 5′-GAAGATGGTGATGGGATTT-3′.
Protein extraction and western blotting
Total protein was extracted from HCC and paracancerous tissues with RIPA lysis buffer (Solarbio, Beijing, China) and proteinconcentration was determined using an Enhanced BCA Protein Quantification Kit (KeyGEN BioTECH, Jiangsu, China). Then the samples were separated by Novex NuPAGE SDS-PAGE Gel System (Thermo Fisher Scientific, MA, USA) and were transferred to the PVDF membrane using the Bio-Rad Criterion System (Bio-Rad, CA, USA). Membranes were blocked with 8 % non-fat dry milk in PBS containing 0.1 % Tween-20 (0.1 % TBST, pH7.4) for 1 h. Membranes were incubated with antibodies specific for human INTS8 (rabbit polyclonal antibody, 1:750 dilutions; Proteintech, Hubei, China) and GAPDH (rabbit polyclonal antibody, 1:1,000 dilution; Boster, Hubei, China) overnight at 4 °C. After 3 washings with 0.1 % TBST for 5 min, horseradish peroxidase-conjugated goat anti-rabbit secondary antibodies (1:5,000 dilution; Bioss, Beijing, China) were applied, followed by washings with 0.1 % TBST for 5 min each at room temperature (RT). The bound immunocomplexes were detected using ECL+ reagent (GE Healthcare Bio-Sciences, NJ, USA) with a FluorChem M system (Proteinsimple, CA, USA).
Gene expression profiles
The genes significantly dysregulated in HCC were identified based on the 4 microarrays, Chen Liver microarray (104 HCCs vs. 76 liver tissues), Roessler Liver microarray (22 HCCs vs. 21 liver tissues), Roessler Liver 2 microarray (225 HCCs vs. 220 liver tissues) and Wurmbach Liver microarray (35 HCCs vs. 10 liver tissues), which are all deposited in Oncomine database (https://www.oncomine.org/resource/login.html) . The 4 microarrays together covering total of 386 cases of HCCs and 327 cases of normal liver tissues. The rank for a gene is the median rank for that gene across each of the analyses. DNA methylation, mRNA expression, and clinical data of 379 HCC patients in a TCGA cohort were retrieved from cBioPortal for Cancer Genomics (http://cbioportal.org) [16, 17], but only 157 samples with matched gene expression data, prognosis data and most of the other clinical data were used to analyze the clinical importance of the target genes. mRNA expression data associated with HCC metastasis were retrieved from microarray GDS3091  and GDS274 , which were deposited in the GEO profiles databases (http://www.ncbi.nlm.nih.gov/geoprofiles/) .
Enrichment of the biological process and cellular component of a group of genes was determined using the DAVID online tool (http://david.abcc.ncifcrf.gov/) [20, 21]. Protein/gene-protein/gene interaction analysis was performed using the GeneMANIA online tool (http://www.genemania.org/) [22, 23]. Function prediction based on text mining was performed using the Coremine Medical online database (http://www.coremine.com/medical/) .
The data were analysed by SPSS 20.0 software. The mRNA expression of a gene is presented as the mean ± SD. Homogeneity of variance was analysed using the t-test. Expression values of a gene were dichotomised into high and low expression using the median as a cutoff for analysis of clinical importance in a TCGA cohort, as described in a previous study . The probability of survival and its significance was calculated using the Kaplan-Meier method and log-rank test, respectively. A Cox proportional hazard model was performed for multivariate analysis of prognosis. The correlation between gene expression and clinicopathologic characteristics was evaluated by Pearson’s χ2 test (two-sided). The correlation between DNA methylation and gene expression was analysed using bivariate correlations. P values < 0.05 were considered to indicate statistically significant differences.
Retrieval of significantly dysregulated genes in HCC
Four independent microarrays deposited in the Oncomine database were selected to identify genes associated with cancer development and progression in HCC. These microarrays were Chen Liver Statistics covering 104 cases of HCC and 76 cases of liver tissue, Roessler Liver Statistics covering 22 cases of HCC and 21 cases of liver tissue, Roessler Liver 2 Statistics covering 225 cases of HCC and 220 cases of liver tissue, and Wurmbach Liver Statistics covering 35 cases of HCC and 10 cases of liver tissues. Based on analysis of these four independent microarrays, 40 genes that were significantly upregulated (P < 1.36E-10) and 40 genes that were significantly downregulated (P < 1.31E-10) in HCC were retrieved (Fig. 1). Analysis of the 80 genes by the DAVID online tool indicated that cell cycle was the top biological process, covering 17 genes, and microtubule cytoskeleton was the top cellular component, covering 14 genes (Additional file 1: Table S1).
Among the 80 genes that were dysregulated in HCCs according to four independent microarrays covering a total of 386 cases of HCC and 327 cases of normal liver tissues, nine genes (CAP2, PTTG1, TOP2A, GMNN, GPC3, UBE2C, UBAP2L, TBCE, and INTS8) were consistently and stably upregulated and 18 genes (CXCL14, VIPR1, CLEC4M, MARCO, CLEC1B, NAT2, FCN2, EGR1, DNASE1L3, MT1F, CRHBP, LCAT, PAMR1, ACSM3, MT1G, MT1X, SRPX, and MT1H) were consistently and stably downregulated in HCC, by least 2-fold (Fig. 1; Table 1). Among the above 27 genes, seven genes—CAP2, GMNN, PTTG1, TBCE, TOP2A, UBE2C, and FCN2—encode proteins associated with cell cycle and microtubule cytoskeleton (Additional file 1: Table S1). Protein/gene-protein/gene interaction analysis was performed to further explain the interrelationships of these genes in HCC. As shown in Additional file 2: Figure S2, the 27 proteins/genes directly/indirectly interacted with each other via co-localisation, genetic interactions, shared common pathways, and protein domains, and, in particular, co-expression, and 10 of them—VIPR1, DNASE1L3, SRPX, MT1H, CXCL14, CLEC4M, CRHBP, GPC3, NAT2, and MARCO—interacted with at least 14 other genes, more than half of all the genes in the interaction network (Additional file 2: Figure S2). Moreover, these genes were also those that were dysregulated at least 4-fold in HCC (Table 1).
Measurement of gene expression at mRNA and protein level
Among the 27 genes, the associations of seven with HCC are relatively well studied and described in published papers. However, the relationship of the remaining 20 genes with HCC was poorly understood, and these genes were selected for further analyses (Table 1). The expression of eight genes that were randomly selected from the 20 genes was measured by RT-qPCR in 11 tissues of HCC patients compared with matched paracancerous tissues. As shown in Fig. 2a, the expression of TBCE and INTS8 was increased, whereas that of VIPR1, CLEC4M, MARCO, DNASE1L3, CRHBP, and FCN2 was decreased in HCC tissues, although the changes in TBCE and VIPR1 expression were not statistically significant. Compared with the average expression in paracancerous tissues, the expression of INTS8 in HCC was upregulated with 2.06-fold and the expression of CLEC4M, MARCO, DNASE1L3, CRHBP, and FCN2 was downregulated with 3.83-, 5.70-, 5.63-, 3.87-, and 8.94-fold, respectively. All results of gene expression determined by RT-qPCR were completely consistent with their expression identified by the four independent microarrays (Fig. 1; Table 1). Furthermore, a significant increase at the protein level of INTS8 was observed in HCC tissues compared with corresponding paracancerous tissues (Fig. 2b), which was consistent with its expression at the mRNA level.
Analysis of clinical importance
The clinical importance in HCC of the 20 selected genes (Table 1) was evaluated on the basis of TCGA clinical data. A total of 379 HCC patient samples with clinical data in a cohort of TCGA were retrieved. Among these, 157 samples with mRNA expression values were selected for analysis of the relationship between genes and clinical characteristics. The expression values of a gene were categorised as high or low according to the median value in accordance with a previous study .
A total of 11 genes were associated with DFS and/or OS (Table 2); among those, low expression of ACSM3 and CXCL14 was associated with poor DFS, and low expression of CRHBP, DNASE1L3, FCN2, MT1X, and VIPR1 was associated with poor OS (Fig. 3, Table 2). Four genes were associated with both DFS and OS: high expression of INTS8 in HCC patients, and low expression of LCAT, MARCO, and PAMR1, was associated with poor DFS and OS (Fig. 4, Table 2). To elucidate whether any of the above genes was an independent factor for predicting patient survival, we performed multivariate analyses of tumour stage, tumour pathologic PT, tumour residual, tumour status, vital status, age, gender, and the 11 genes by a Cox proportional hazards model (Table 3). We found that stage (P = 0.050), tumour status (P = 0.001), DNASE1L3 expression (P = 0.042), and INTS8 expression (P = 0.023) were independent risk prognostic factors for OS in HCC patients, although no gene was found to be an independent prognostic factor for DFS (data not shown).
Six genes were associated with tumour pathologic PT and tumour stage (Table 4); among these, high expression of INTS8 and UBAP2L, and low expression of ACSM3, FCN2, LCAT, and MT1G, was significantly associated with metastatic tumour and late stage (P ≤ 0.05). In particular, UBAP2L was markedly and highly expressed in T2 tumours (72.5 % vs. 27.5 %) and LCAT was lowly expressed in T2 tumours (30.0 % vs. 70.0 %) and highly expressed in T1 tumours (72.6 % vs. 27.4 %). In addition, LCAT was highly expressed in stage I tumours (71.2 % vs. 28.8 %).
Ten genes were associated with age and gender. As shown in Table 4, we found that six genes—CXCL14, GMNN, INTS8, MT1F, MT1G, and SPRX—were expressed at low levels in HCC patients aged ≥ 65 years. Expression of five genes was related to the gender of HCC patients. Except for FCN2, which is lowly expressed in male HCC patients, the other four genes, CLEC1B, CRHBP, MT1G, and TBCE, were all lowly expressed in female HCC patients. In addition, PAMR1 and MT1X were closely related to the vital status; both showed low expression in 60.3 % (38/63) of HCC patients with dead status, compared with high expression in 57.4 % (54/94) of patients with alive status (P = 0.022).
Potential roles of the genes in HCC progression
The potential roles of the 20 genes in HCC were predicted on the basis of Coremine Medical mining. As shown in Fig. 5, the associations of the genes with diagnosis, prognosis, drug resistance, recurrence, metastasis, and invasiveness of HCC was comprehensively analysed. The results indicated that, with the exception of PAMR1, the other 19 genes were all associated with at least one factor contributing to cancer progression, and many of the genes, for example GMNN, CXCL14, MT1G, MT1X, SPRX, and VIPR1, were closely associated with almost all of the factors included in this analysis. Most of the genes were extensively associated with several factors. For example, 15 genes (including INTS8, LCAT, MARCO, and DANSE1L3) were associated with diagnosis, 14 genes (including INTS8, MARCO, CRHBP, and VIPR1) were associated with metastasis, and 13 genes (including LCAT, MARCO, FCN2, and CXCL14) were associated with prognosis.
Based on the gene expression in two independent GEO microarrays corresponding to HCC metastasis, the association of the genes CLEC4M, CRHBP, MARCO, MT1X, SRPX, UBAP2L, and VIPR1 with metastasis was further analysed; unfortunately, data for the other genes were unavailable. The expression of CRHBP, LCAT, and SPRX was significantly dysregulated in nine HCCs with venous metastasis compared with 11 HCC without (Fig. 6a). Genes VIPR1, LCAT, BAP2L, CLEC4M, CRHBP, and SRPX were significantly dysregulated in 32 HCCs with portal vein tumour thrombus metastasis and 33 HCCs with intrahepatic spread metastasis compared with 22 HCCs with no metastasis (Fig. 6b&c). In particular, LCAT was highly expressed in HCC patients with venous metastasis and patients with portal vein tumour thrombus metastasis, and SRPX was lowly expressed in HCC patients with venous metastasis and patients with intrahepatic spread metastasis (Fig. 6).
Correlation of DNA methylation with mRNA expression of the target genes
DNA methylation and mRNA expression data from 379 HCC patients in a TCGA cohort were retrieved and the correlations between them were analysed using bivariate correlations. Among the 20 genes that are poorly studied in HCC (Table 1), DNA methylation data of CLEC1B and SRPX were not available. DNA methylation was negatively correlated with the mRNA expression for eight genes, ACSM3, INTS8, LCAT, MT1X, CRHBP, MARCO, PAMR1, and VIPR1. In particular, high methylation of the first four genes was significantly correlated with lower mRNA expression (Fig. 7), indicating that the expression of these genes in HCC might be regulated by DNA methylation.
Cancer is frequently considered to be a disease of the cell cycle because alterations in different families of cell cycle regulators cooperate in tumour development. Molecular analysis of human tumours has shown that cell cycle regulators are frequently mutated in human neoplasms, underscoring the importance of maintaining cell cycle commitment in the prevention of human cancer . Abnormal expression of cell cycle controllers, particularly G1/S-phase transition, is often implicated in the pathogenesis of most human cancers, including HCC. For example, vaccinia-related kinase 1 promotes HCC by controlling the levels of cell cycle regulators associated with G1/S transition . In this study, 80 genes that were significantly dysregulated in HCC were identified based on four independent microarrays covering a total of 386 cases of hepatocellular carcinoma and 327 cases of normal liver tissues (Fig. 1), and biological process annotation of these genes revealed that 17 of these genes were implicated in cell cycle functions (Additional file 1: Table S1). These results suggested that these genes might contribute to cancer progression and development in HCC at least in part through regulation of the cell cycle.
Twenty-seven genes were further identified to be consistently dysregulated in all four microarrays by at least 2-fold (Table 1). The expression of eight of these genes (TBCE, INTS8, VIPR1, CLEC4M, MARCO, DNASE1L3, CRHBP, and FCN2) was confirmed in 11 tissues of HCC patients compared with matched paracancerous tissues by RT-qPCR (Fig. 2a). Seven of the 27 genes (UBE2C, PTTG1, CAP2, TOP2A, GPC3, EGR1, and NAT2) have been well studied in HCC (Table 1). For example, GPC3 plays critical roles in cell proliferation and invasion through the induction of apoptosis  and is a biomarker for diagnosis  and recurrence . Protein/gene-protein/gene interaction analyses indicated that these 27 proteins/genes strongly interacted with each other, and 10 of them interacted with at least half of all the genes (Additional file 2: Figure S2). Moreover, six of these genes were related to the cell cycle in HCC (Additional file 1: Table S1). Together, these results indicate that the genes identified in this study might play crucial roles in HCC progression, probably functioning as a group.
Biomarkers not only have prognostic implications, but are also helpful for measurement of treatment responses and surveillance for tumour recurrence and for guiding clinical decisions . Thus, prognostic biomarkers for HCC patients are necessary and crucial, and there is an ongoing search for predictive biomarkers. In this study, a group of genes associated with DFS and OS (Table 2) were identified in 157 HCC patients. Among these genes, low expression of ACSM3 and CXCL14 was associated with poor DFS, low expression of CRHBP, DNASE1L3, FCN2, MT1X, and VIPR1 was associated with poor OS (Fig. 3, Table 2), high expression of INTS8 was associated with poor DFS and OS, and low expression of LCAT, MARCO, and PAMR1 was associated with poor DFS and OS (Fig. 4, Table 2). Furthermore, DNASE1L3 and INTS8 were identified as independent risk prognostic factors for OS (Table 3). There are few reports of the association of these genes with prognosis in HCC or in other cancers. Previous studies indicate that downregulation of CXCL14 is associated with prognosis in gastric cancer patients , MT1X may aid in the prognostic discrimination of oral squamous cell carcinoma cases , and MARCO expression is associated with breast cancer survival and risk of recurrence .
Twenty genes that have been less studied in HCC (Table 1) were further evaluated to predict their potential roles in HCC progression. Coremine medical mining suggested that most of those genes were associated with diagnosis, prognosis, drug resistance, recurrence, metastasis, and invasiveness. In particular, 13, 14, and 15 genes were potentially associated with prognosis, metastasis, and diagnosis in HCC, respectively (Fig. 5). The association of these genes with prognosis appears to have clinical importance, as 11 genes were shown to be associated with DFS or/and OS (Table 2, Fig. 3 & 4). The role of these genes in metastasis was further confirmed by gene expression analysis, which showed that five genes were significantly dysregulated in HCC with venous metastasis, portal vein tumour thrombus metastasis, or intrahepatic spread metastasis, compared with the appropriate controls. Specifically, LCAT was highly expressed in HCC patients with venous metastasis and patients with portal vein tumour thrombus metastasis, and SRPX was lowly expressed in HCC patients with venous metastasis and patients with intrahepatic spread metastasis (Fig. 6), suggesting that these two genes might be closely related to HCC metastasis. There are few studies on LCAT and SRPX in cancer metastasis, with only one reported that SRPX is upregulated in gastric cancer cells after depletion of TWIST, which promoted the epithelial-mesenchymal transition that occurs during the initial steps of tumour metastasis .
INTS8 encodes a subunit of the integrator complex that is involved in the cleavage of small nuclear RNAs, and its association with cancer is poorly understood. Limited studies indicate that INTS8 contains mutations in peripheral T cell lymphoma compared with non-malignant samples from 12 patients , and a combination of INTS8 with SULF1, ATP6V1C1, and GPR172A can be used to discriminate gastric carcinomas from adjacent noncancerous tissues . In this study, we found that, potentially regulated by demethylation (Fig. 7), INTS8 was significantly and consistently upregulated at least 2.115-fold in HCC according to four independent microarrays (Fig. 1; Table 1) and that INTS8 mRNA was upregulated 2.06-fold on average in 11 tissues of HCC patients compared with corresponding paracancerous tissues, with a similar expression profile at the protein level (Fig. 2). Based on the clinical importance analysis of 157 HCC patients in a TCGA cohort, we found that high expression of INTS8 was associated with poor DFS and OS (Fig. 4, Table 2), and was an independent risk prognostic factor for OS (Table 3). Moreover, high expression of INTS8 was associated with metastatic tumours and late stage (Table 4), and with younger HCC patients (<65 years old) (Table 4). In addition, text mining indicated that INTS8 was closely related with metastasis, invasiveness, and diagnosis (Fig. 5). The above results strongly indicate that this gene is indeed upregulated in HCC, where it might play crucial roles in HCC cancer progression and development, and is a potential biomarker for diagnosis and, in particular, prognosis.
In summary, by means of data retrieved from six independent microarrays, RT-qPCR and western blotting detection in 11 pairs of tissues, clinical importance analyses in a cohort of 157 patients, and bioinformatics analyses including biological process annotation, protein interaction and text mining, we have identified a group of genes that are significantly dysregulated in HCC and might be associated with cancer progression, development, and, in particular, prognosis. These genes could be potential therapeutic targets for HCC treatment, and might be useful biomarkers for diagnosis and prognosis.
Yang JD, Roberts LR. Hepatocellular carcinoma: A global view. Nat Rev Gastroenterol Hepatol. 2010;7:448–58.
Miki D, Ochi H, Hayes CN, Aikata H, Chayama K. Hepatocellular carcinoma: towards personalized medicine. Cancer Sci. 2012;103:846–50.
Byam J, Renz J, Millis JM. Liver transplantation for hepatocellular carcinoma. Hepatobiliary Surg Nutri. 2013;2:22–30.
Singhal A, Jayaraman M, Dhanasekaran DN, Kohli V. Molecular and serum markers in hepatocellular carcinoma: predictive tools for prognosis and recurrence. Crit Rev Oncol Hematol. 2012;82:116–40.
Cancer Genome Atlas Research N, Weinstein JN, Collisson EA, Mills GB, Shaw KR, Ozenberger BA, et al. The Cancer Genome Atlas Pan-Cancer analysis project. Nat Genet. 2013;45:1113–20.
International Cancer Genome C, Hudson TJ, Anderson W, Artez A, Barker AD, Bell C, et al. International network of cancer genome projects. Nature. 2010;464:993–8.
Rustici G, Kolesnikov N, Brandizi M, Burdett T, Dylag M, Emam I, et al. ArrayExpress update--trends in database growth and links to data analysis tools. Nucleic Acids Res. 2013;41:D987–90.
Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, et al. NCBI GEO: archive for functional genomics data sets--update. Nucleic Acids Res. 2013;41:D991–5.
Rhodes DR, Yu J, Shanker K, Deshpande N, Varambally R, Ghosh D, et al. ONCOMINE: a cancer microarray database and integrated data-mining platform. Neoplasia. 2004;6:1–6.
Coordinators NR. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2015;43:D6–17.
Rung J, Brazma A. Reuse of public genome-wide gene expression data. Nat Rev Genet. 2013;14:89–99.
Kannan L, Ramos M, Re A, El-Hachem N, Safikhani Z, Gendoo DM, et al. Public data and open source tools for multi-assay genomic investigation of disease. Briefings in bioinformatics. 2016;17:603–15.
Liu X, Gao Y, Zhao B, Li X, Lu Y, Zhang J, et al. Discovery of microarray-identified genes associated with ovarian cancer progression. Int J Oncol. 2015;46:2467–78.
Liu X, Gao Y, Lu Y, Zhang J, Li L, Yin F. Downregulation of NEK11 is associated with drug resistance in ovarian cancer. Int J Oncol. 2014;45:1266–74.
Zeng X, Yin F, Liu X, Xu J, Xu Y, Huang J, et al. Upregulation of E2F transcription factor 3 is associated with poor prognosis in hepatocellular carcinoma. Oncol Rep. 2014;31:1139–46.
Gao J, Aksoy BA, Dogrusoz U, Dresdner G, Gross B, Sumer SO, et al. Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci Signal. 2013;6:11.
Cerami E, Gao J, Dogrusoz U, Gross BE, Sumer SO, Aksoy BA, et al. The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Can Dis. 2012;2:401–4.
Budhu A, Forgues M, Ye QH, Jia HL, He P, Zanetti KA, et al. Prediction of venous metastases, recurrence, and prognosis in hepatocellular carcinoma based on a unique immune response signature of the liver microenvironment. Cancer Cell. 2006;10:99–111.
Ye QH, Qin LX, Forgues M, He P, Kim JW, Peng AC, et al. Predicting hepatitis B virus-positive metastatic hepatocellular carcinomas using gene expression profiling and supervised machine learning. Nat Med. 2003;9:416–23.
da Huang W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4:44–57.
da Huang W, Sherman BT, Lempicki RA. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 2009;37:1–13.
Mostafavi S, Ray D, Warde-Farley D, Grouios C, Morris Q. GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function. Genome Biol. 2008;9 Suppl 1:S4.
Zuberi K, Franz M, Rodriguez H, Montojo J, Lopes CT, Bader GD, et al. GeneMANIA prediction server 2013 update. Nucleic Acids Res. 2013;41:W115–22.
de Leeuw N, Dijkhuizen T, Hehir-Kwa JY, Carter NP, Feuk L, Firth HV, et al. Diagnostic interpretation of array data using public databases and internet sources. Hum Mutat. 2012;33:930–40.
Hedditch EL, Gao B, Russell AJ, Lu Y, Emmanuel C, Beesley J, et al. ABCA transporter gene expression and poor outcome in epithelial ovarian cancer. J Natl’ Cancer Institute. 2014; 106
D’Andrilli G, Kumar C, Scambia G, Giordano A. Cell cycle genes in ovarian cancer: steps toward earlier diagnosis and novel therapies. Clin Cancer Res. 2004;10:8132–41.
Lee N, Kwon JH, Kim YB, Kim SH, Park SJ, Xu W, et al. Vaccinia-related kinase 1 promotes hepatocellular carcinoma by controlling the levels of cell cycle regulators associated with G1/S transition. Oncotarget. 2015;6:30130–48.
Pan Z, Chen C, Long H, Lei C, Tang G, Li L, et al. Overexpression of GPC3 inhibits hepatocellular carcinoma cell proliferation and invasion through induction of apoptosis. Mol Med Rep. 2013;7:969–74.
Yu JP, Xu XG, Ma RJ, Qin SN, Wang CR, Wang XB, et al. Development of a Clinical Chemiluminescent Immunoassay for Serum GPC3 and Simultaneous Measurements Alone With AFP and CK19 in Diagnosis of Hepatocellular Carcinoma. J Clin Lab Anal. 2015;29:85–93.
Wang Y, Shen Z, Zhu Z, Han R, Huai M. Clinical values of AFP, GPC3 mRNA in peripheral blood for prediction of hepatocellular carcinoma recurrence following OLT: AFP, GPC3 mRNA for prediction of HCC. Hepat Mon. 2011;11:195–9.
Wong KF, Xu Z, Chen J, Lee NP, Luk JM. Circulating markers for prognosis of hepatocellular carcinoma. Expert Opinion Med Diagnos. 2013;7:319–29.
Hu C, Lin F, Zhu G, Xue X, Ding Y, Zhao Z, et al. Abnormal hypermethylation of promoter region downregulates chemokine CXC ligand 14 expression in gastric cancer. Int J Oncol. 2013;43:1487–94.
Brazao-Silva MT, Rodrigues MF, Eisenberg AL, Dias FL, de Castro LM, Nunes FD, Faria PR, Cardoso SV, Loyola AM, de Sousa SCOM. Metallothionein gene expression is altered in oral cancer and may predict metastasis and patient outcomes. Histopathology. 2015;67:358–67.
Bergamaschi A, Tagliabue E, Sorlie T, Naume B, Triulzi T, Orlandi R, Russnes HG, Nesland JM, Tammi R, Auvinen P, Kosma V-M, Ménard S, Børresen-Dale A-L. Extracellular matrix signature identifies breast cancer subgroups with different clinical outcome. J Pathol. 2008;214:357–67.
Feng MY, Wang K, Shi QT, Yu XW, Geng JS. Gene expression profiling in TWIST-depleted gastric cancer cells. Anat Rec. 2009;292:262–70.
Simpson HM, Khan RZ, Song C, Sharma D, Sadashivaiah K, Furusawa A, Liu X, Nagaraj S, Sengamalay N, Sadzewicz L, Luke J. Concurrent Mutations in ATM and Genes Associated with Common gamma Chain Signaling in Peripheral T Cell Lymphoma. PLoS One. 2015;10:e0141906.
Cheng L, Zhang Q, Yang S, Yang Y, Zhang W, Gao H, Deng X, Zhang Q. A 4-gene panel as a marker at chromosome 8q in Asian gastric cancer patients. Genomics. 2013;102:323–30.
Kim HE, Kim DG, Lee KJ, Son JG, Song MY, Park YM, et al. Frequent amplification of CENPF, GMNN and CDK13 genes in hepatocellular carcinomas. PLoS One. 2012;7:e43223.
Ieta K, Ojima E, Tanaka F, Nakamura Y, Haraguchi N, Mimori K, et al. Identification of overexpressed genes in hepatocellular carcinoma, with special reference to ubiquitin-conjugating enzyme E2C gene expression. Int J Cancer. 2007;121:33–8.
Fujii T, Nomoto S, Koshikawa K, Yatabe Y, Teshigawara O, Mori T, et al. Overexpression of pituitary tumor transforming gene 1 in HCC is associated with angiogenesis and poor prognosis. Hepatology. 2006;43:1267–75.
Su MC, Hsu HC, Liu YJ, Jeng YM. Overexpression of pituitary tumor-transforming gene-1 in hepatocellular carcinoma. Hepatogastroenterology. 2006;53:262–5.
Liang M, Chen X, Liu W, Li S, Li C, Jiang L, et al. Role of the pituitary tumor transforming gene 1 in the progression of hepatocellular carcinoma. Cancer Biol Ther. 2011;11:337–45.
Shibata R, Mori T, Du W, Chuma M, Gotoh M, Shimazu M, et al. Overexpression of cyclase-associated protein 2 in multistage hepatocarcinogenesis. Clin Cancer Res. 2006;12:5363–8.
Sakamoto M, Mori T, Masugi Y, Effendi K, Rie I, Du W. Candidate molecular markers for histological diagnosis of early hepatocellular carcinoma. Intervirology. 2008;51 Suppl 1:42–5.
Wong N, Yeo W, Wong WL, Wong NL, Chan KY, Mo FK, et al. TOP2A overexpression in hepatocellular carcinoma correlates with early age onset, shorter patients survival and chemoresistance. Int J Cancer. 2009;124:644–52.
Ho DW, Kai AK, Ng IO. TCGA whole-transcriptome sequencing data reveals significantly dysregulated genes and signaling pathways in hepatocellular carcinoma. Frontiers Med. 2015;9:322–30.
Lu DD, Chen YC, Zhang XR, Cao XR, Jiang HY, Yao L. The relationship between metallothionein-1F (MT1F) gene and hepatocellular carcinoma. Yale J Biol Med. 2003;76:55–62.
Tahara D, Nakanishi T, Akazawa S, Yamaguchi Y, Yamamoto H, Akashi M, et al. Lecithin-cholesterol acyltransferase and lipid transfer protein activities in liver disease. Metabolism. 1993;42:19–23.
Gui T, Dong X, Li R, Li Y, Wang Z. Identification of hepatocellular carcinoma-related genes with a machine learning and network analysis. J Computational Biol. 2015;22:63–71.
Lin ZY, Chuang WL. Genes responsible for the characteristics of primary cultured invasive phenotype hepatocellular carcinoma cells. Biomed Pharmacother. 2012;66:454–8.
Udali S, Guarini P, Ruzzenente A, Ferrarini A, Guglielmi A, Lotto V, et al. DNA methylation and gene expression profiles show novel regulatory pathways in hepatocellular carcinoma. Clin Epigen. 2015;7:43.
Zhou X, Zhu HQ, Lu J. Regulation of gene expression in HBV- and HCV-related hepatocellular carcinoma: integrated GWRS and GWGS analyses. Int J Clin Experimental Med. 2014;7:4038–50.
Hoang TV, Toan NL, le Song H, Ouf EA, Bock CT, Kremsner PG, et al. Ficolin-2 levels and FCN2 haplotypes influence hepatitis B infection outcome in Vietnamese patients. PLoS One. 2011;6:e28113.
Sun H, Chua MS, Yang D, Tsalenko A, Peter BJ, So S. Antibody Arrays Identify Potential Diagnostic Markers of Hepatocellular Carcinoma. Biomark Insights. 2008;3:1–18.
Gu X, Wang H, Wang A, Dou T, Qi P, Ji Q, et al. An intronic polymorphism rs2237062 in the CXCL14 gene influences HBV-related HCC progression in Chinese population. Mol Biol Rep. 2012;39:797–803.
Wang W, Huang P, Zhang L, Wei J, Xie Q, Sun Q, et al. Antitumor efficacy of C-X-C motif chemokine ligand 14 in hepatocellular carcinoma in vitro and in vivo. Cancer Sci. 2013;104:1523–31.
Kanda M, Nomoto S, Okamura Y, Nishikawa Y, Sugimoto H, Kanazumi N, et al. Detection of metallothionein 1G as a methylated tumor suppressor gene in human hepatocellular carcinoma using a novel method of double combination array analysis. Int J Oncol. 2009;35:477–83.
Ji XF, Fan YC, Gao S, Yang Y, Zhang JJ, Wang K. MT1M and MT1G promoter methylation as biomarkers for hepatocellular carcinoma. World JGastroenterol. 2014;20:4723–9.
Lu D, Han C, Wu T. Microsomal prostaglandin E synthase-1 promotes hepatocarcinogenesis through activation of a novel EGR1/beta-catenin signaling axis. Oncogene. 2012;31:842–57.
Agundez JA, Olivera M, Ladero JM, Rodriguez-Lescure A, Ledesma MC, Diaz-Rubio M, et al. Increased risk for hepatocellular carcinoma in NAT2-slow acetylators and CYP2D6-rapid metabolizers. Pharmacogenetics. 1996;6:501–12.
Farker K, Schotte U, Scheele J, Hoffmann A. Impact of N-acetyltransferase polymorphism (NAT2) in hepatocellular carcinoma (HCC)--an investigation in a department of surgical medicine. Exp Toxicol Pathol. 2003;54:387–91.
The present study was supported by the National Natural Science Foundation of China (grant nos. 81360448), the Natural Science Foundation of Guangxi (nos. 2014GXNSFAA118139), Key Laboratory of High-Incidence-Tumor Prevention and Treatment (Guangxi Medical University), Ministry of Education (nos. GK2015-ZZ03 and GK2014-ZZ03) and Guangxi Outstanding Teachers Training Project for Colleges.
Availability of data and materials
FY and XL performed most analysis. FY wrote the manuscript. LS provided the clinical samples. TL performed the RT-qPCR and western blotting. TP helped collect samples and revise manuscript. YN and SL performed mRNA and protein isolation. XZ and XQ designed the study and helped draft manuscript. All authors reviewed the manuscript. All authors read and approved the final manuscript.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
The study was endorsed by the Ethics Committee of Guangxi Medical University and was performed according to the Declaration of Helsinki, 2013 edition. All patients received an explanation of the aims of the study and signed informed consent. We are free to use ovarian cancer data in TCGA by meeting its freedom-to-publish criteria: A marker paper has been published on that tumour type.
Fuqiang Yin, Lipei Shu and Xia Liu are co-first authors.
Biological process and cellular component annotation of the 80 genes associated with HCC development and progression by DAVID online tool. (PDF 167 kb)
Protein/gene-protein/gene interaction network of the 27 genes that were stably and consistently dysregulated in 386 cases of hepatocellular carcinoma compared with 327 cases of normal liver tissue according to the four independent microarrays retrieved from the Oncomine database. (PDF 306 kb)
About this article
Cite this article
Yin, F., Shu, L., Liu, X. et al. Microarray-based identification of genes associated with cancer progression and prognosis in hepatocellular carcinoma. J Exp Clin Cancer Res 35, 127 (2016). https://doi.org/10.1186/s13046-016-0403-2
- Hepatocellular carcinoma