Skip to main content

Constructing the HBV-human protein interaction network to understand the relationship between HBV and hepatocellular carcinoma



Epidemiological studies have clearly validated the association between hepatitis B virus (HBV) infection and hepatocellular carcinoma (HCC). Patients with chronic HBV infection are at increased risk of HCC, in particular those with active liver disease and cirrhosis.


We catalogued all published interactions between HBV and human proteins, identifying 250 descriptions of HBV and human protein interactions and 146 unique human proteins that interact with HBV proteins by text mining.


Integration of this data set into a reconstructed human interactome showed that cellular proteins interacting with HBV are made up of core proteins that are interconnected with many pathways. A global analysis based on functional annotation highlighted the enrichment of cellular pathways targeted by HBV.


By connecting the cellular proteins targeted by HBV, we have constructed a central network of proteins associated with hepatocellular carcinoma, which might be to regard as the basis of a detailed map for tracking new cellular interactions, and guiding future investigations.


Hepatitis B virus is one of the most common infectious diseases in the world, and 43 years after its discovery, it still has a great impact on health, particularly in developing countries. More than 350 million people worldwide are known to be chronic carriers of HBV, and each year 15 million people die of hepatitis [1].

The HBV viral genome is a relaxed-circular, partially duplex DNA of 3,200 base pairs. It has five genes encoding polymerase, pre-S1/pre-S2/S, X protein, precore/core protein, and the ID2828293 gene which is not well understood without an official gene symbol or description[2]. These proteins can also trans-activate other cellular genes, which may play a role in hepatocarcinogenesis [3].

Hepatocellular carcinoma is one of the most common fatal cancers worldwide [4]. HBV is strongly associated with HCC by its presence in the tumor cell and by the striking role of persistent HBV infection as a risk factor for the development of HCC[2]. The incidence of HCC in many countries is increasing in parallel to an increase in chronic HBV infection[1]. It is generally shown that vaccination significantly decreases the incidence of HCC. Moreover, preventing the most severe HBV disease consequences in infected people, such as cirrhosis and fibrosis, will require appropriate therapeutic agents and reduces the risk of developing HCC [5].

To make progress in understanding the mechanisms of viral pathogenesis and the relationship of HCC with HBV, it is important to sort out the interactions of HBV proteins with the vast array of human cellular proteins. While these interactions can be direct viral and host cell protein-protein interactions, many are indirect, including regulatory interactions that alter human gene expression[2]. Rapidly growing knowledge about the protein-protein interaction (PPI) networks (interactome) for hosts and pathogens is beginning to be used to create network-based models [6]. A network analysis approach to a virus-human protein interactome network revealed that host interactors tend to be enriched in proteins that are highly connected in the cellular network [7]. These "hub proteins" are thought to be essential for normal cell functioning and during pathogenesis. Therefore, clarification of the genetic picture of hepatocarcinogenesis caused by HBV infection might provide clues toward achieving a decrease in the incidence of HCC and establishing effective treatments[8].

In this study, we attempted to catalogue all published interactions between HBV and human proteins, particularly human proteins associated with hepatocellular carcinomas, for an in-depth review and understanding of these interactions. Our aim was to enhance insight into HBV replication and pathogenesis on a cellular level, in order to assist in accelerating the development of effective therapeutics.


Text mining of human proteins that interact with HBV and are associated with HCC

To facilitate the development of a database describing HBV and human protein interactions, a detailed literature search was carried out on the PubMed database to analyze binary interactions between HBV and human proteins. We used the automatic text mining pipeline method of NLP (Natural Language Processing), followed by an expert curation process, independent of the results obtained at this step. The data compilation process included publications until January 2009.

In brief, we first searched the document using relevant keywords and transformed it into XML format. We then used the Lingpipe Kit sentence tokenization tool (sentence partition) to separate the abstract text into a single sentence. Follow-up analysis used the sentence as a basic unit. The human genes mentioned in the sentences were extracted using ABNER software [9], and the gene name was normalized based on the Entrez database in order to facilitate analysis and comparison. For example, an extracted conjunction gene description such as "STAT3/5 gene" would be resolved into STAT3 gene and STAT5 gene. We built a protein-protein interaction verb dictionary [10], including terms such as repress, regulate, inhibit, interact, phosphorylate, down-regulate and up-regulate. All of the verbs and their variants were derived from the BioNLP project Using the Lingpipe Toolkit, we then detected protein interaction verbs in sentences and gathered the HBV protein and synonym names (compiled from the Entrez database). Statistical analysis of sentences in which a human protein, an interaction verb and an HBV protein were all present at the same time (co-occurrence) was used to determine HBV protein and HHBV interaction relationships.

Using the same NLP methods, we extracted literature related to hepatocellular carcinoma from PubMed and identified the interactions and relationships between HBV proteins and HHCC.

The integrated human interactome network (H-H network)

In order to make the HBV protein and human protein HHBV interaction network more complete, we integrated the HHBV and HHBV interaction relationships. The HHBV and HHBV protein interaction data were gathered from the STRING database, which includes experimental evidence of protein interactions (e.g., yeast two-hybrid), protein interaction databases (e.g., the KEGG pathway) and text mining co-occurrence. The algorithm for human protein to human protein interaction relationships was previously described [11]. NCBI official gene names were used to combine protein ACC, protein ID, gene name, symbol or alias from different genome reference databases (e.g., ENSEMBL, UNIPROT, NCBI, INTACT, HPRD, etc.) and to eliminate interaction redundancy due to the existence of different protein isoforms for a single gene. Thus, the gene name was used in the text to identify the protein. Finally, we only used non-redundant protein-protein interactions to build the human interactome data set.

The network structure of the HBV protein to human protein interaction relationships and the human protein to human protein interaction relationships was mapped using Medusa software.

Gene ontology analysis

To demonstrate the complexity of the HBV-human protein interaction network, the catalogued data were analyzed using gene ontology [12]. Gene ontology is a set of three structured controlled ontologies that describe gene products in terms of their associated cellular component (CC), biological process (BP), or molecular function (MF) in a species-independent manner. We performed gene ontology analysis using EASE software. Enrichment p-values were adjusted by the Benjamini and Hochberg multiple test correction [13].

Functional analysis using KEGG annotations

Cellular pathway data were retrieved from KEGG, the Kyoto Encyclopedia of Genes and Genomes, and were used to annotate NCBI gene functions [14]. For each viral-host protein interaction, the enrichment of a specific KEGG pathway was tested using a Fisher's exact test followed by the Benjamini and Hochberg multiple test correction to control for the false discovery rate [15].

Network visualization

HBV protein to human protein interaction relationships and human protein to human protein interaction relationships were mapped and visualized in a network structure using Medusa software [16].


Construction of an HBV-human interactome network

In order to analyze the interactions between HBV and human proteins, literature indexed in PubMed was searched using keywords [e.g., ("HBV" [title] OR "hepatitis B" [title] AND ("1980/01/01" [PDAT]: "2009/01/01" [PDAT]))]. From the entire database, 52,531 published journal abstracts were identified by NLP (Natural Language Processing) queries. Further text analysis revealed a total of 146 HBV-targeted human protein (HHBV) from 250 summary descriptions that reported putative interactions between HBV and human proteins, comprising 150 unique HBV to human protein interactions. Figure 1A summarizes the HBV protein interactions catalogued from these papers (see Additional file 1, Table S1 for a listing of all interactions).

Figure 1
figure 1

HBV and human protein interaction network. (A) Summary of the HBV-human protein (HHBV) interactions. (B) HBV and HHBV interaction network. Red square: HBV protein. Circular node: HHBV. For HBV-HHBV interactions, green lines correspond to activate; blue lines, to inhibit; and red lines, to interact (activate or inhibit unknown), all interaction keywords can be found in Additional file 1, Table S2. For HHBV-HHBV interactions, purple indicates evidence from experiments (High-throughput yeast two-hybrid experiment data was collected from public data sources); light blue, from database (Protein - protein interaction relationship was extracted from KEGG pathway database); and grass green, from literature text mining (Scattered literatures about low throughput research on protein - protein interaction were parsed with an in-house computer program), which derived from the Additional file 1, Table S4.

Based on the text in the original journal articles selected by keywords and combining similar keywords, we identified the most important functional keyword used by the authors to describe the interaction. Twenty-five unique keywords were associated with these descriptions. The most frequently used keywords in the database were "interact," 25.77%; "activate," 13.08%; "inhibit," 8.46%; "associate," 9.23%; "regulate," 8.46%, including "upregulate," 3.36%, and "downregulate," 1.54%; and "phosphorylate," 7.31% (Figure 1B, and see Additional file 1, Table S2 for a listing of all keywords). While it could not be excluded that some of these interactions are nonspecific or human errors, the catalogued interactions provide a unique collection of data collectively generated from the available scientific literature.

Analysis of the HBV-infection network showed that X protein and core protein were the most connected proteins (Figure 1A), with 122 (83.5%) and 15 (10.3%) of the total HHBV identified in the database, including many transcription factors and regulators. This highlights the potential multi-functionality of these proteins during infection (Figure 1B, Additional file 1, Table S1). Highly interacting proteins are known to be significantly more disordered than low-degree (LD) proteins [17]. Interestingly, X protein and core protein are predicted to contain one intrinsic disordered region (data not shown) according to DISOPRED2 [18]. In addition, five HHBV such as MAPK8, E2F1, and NF-kappa 1were targeted by more than one HBV protein, suggesting these proteins might play important roles in HBV replication and pathogenesis (Additional file 1, Table S3).

A human PPI network has been reconstructed from eight databases [19]. This network is composed of 44,223 non-redundant PPIs among 9,520 different proteins, corresponding to 30% of the human proteome (the remaining proteins have no known cellular partners and, therefore, cannot be included in this network). Interestingly, HHBV are clearly over-represented in this H-H network (134 (92%) of the total HHBV). Analysis of the HHBV-HHBV sub-network (all connected 146 HHBV proteins), which is composed of 1,977 non-redundant PPIs among different HHBV and more interconnected than the H-H network, indicated that HBV proteins have a strong tendency to interact with highly connected cellular proteins (Figure 1B, Additional file 1, Table S4). This also suggests that HBV preferentially targets host proteins already known to be engaged in protein-protein interactions.

Analysis of the relationship between hepatocellular carcinoma and HHBV

In order to provide a global view of human proteins involved in HCC associated with the HBV - with the aim of clarifying the relationship between HBV proteins and hepatocellular carcinoma-associated proteins (HHCC) - we also made use of NLP methods to extract literature related to HCC from PubMed.

Using the keyword search [e.g., (liver cancer "[title] OR" hepatocellular carcinoma "[title] OR" Liver Neoplasm "[title] OR" Liver Neoplasms "[title] AND (" 1980/01/01 "[PDAT] :" 2009/01/01 "[PDAT]))], we retrieved 19,050 related articles. Based on a combination of text mining procedures and expert curation, a total of 666 HHCC (number of PMID greater than or equal to 2) were identified from 6,709 summary descriptions (Additional file 1, Table S5). Among these, nine of HHCC had more than 100 PMID references (Figure 2A).

Figure 2
figure 2

Analysis of the relationship between H HCC and H HBV . (A) Gene list of top nine HHCC. (B) Overlap between HHCC and HHBV. The blue area corresponds to HHBV; the yellow area, to HHCC: and the green area, to HHBV-HHCC. (C) Gene Ontology analysis of HHBV and HHBV-HHCC.

Compared with HHBV, 76 proteins (HHBV-HHCC) among the HHBV (146) were also hepatocellular carcinoma-associated proteins (Figure 2B, Additional file 1, Table S6). Four HHBV-HHCC's had more than 100 PMID references (Figure 2A).

Gene ontology and KEGG pathway analysis

The 146 HHBV could be classified into 18 mutually dependent functional sets, resulting in 17 cellular processes in 12 cellular components according to the gene ontology analysis. Accordingly, the 76 HHBV-HHCC could be classified into 14 functional sets, resulting in 16 cellular processes in eight cellular components (Additional file 1, Table S7).

As shown in Figure 2C, most of the functional profiling showed transcriptional activity, DNA binding, kinase activity, signal transducer activity, cytokine activity and growth factor activity. These functions are very closely associated with developmental processes, cell differentiation, signal transduction, cell communication, apoptosis, regulation of gene expression, cell proliferation and cell development. These biological processes are thought to play important roles in the pathogenesis of HCC [2].

To better understand the biological functions of HHBV-HHCC, we determined the enrichment of specific pathways for all interactors. Of the 76 proteins (HHBV-HHCC), 63 (~83%) could be mapped to 9 pathways (P < 0.01) of 202 KEGG human pathway database (Additional file 1, Table S8). 6 pathways, namely apoptosis, cell cycle, p53 signaling pathway, toll-like receptor signaling pathway, MAPK signaling pathway and ErbB signaling pathway were significantly enriched (P < 0.0001).

Functional analysis of the HBV-human interaction network

Dysregulation of the balance of survival or apoptosis represents a protumorigenic principle in human hepatocarcinogenesis [20]. To provide a review of the current findings about how the balance is dysregulated by HBV in HCC, we integrated 57 HHBV-HHCC into one molecular interaction network. As shown in Figure 3, these HHBV-HHCC can constitute several signal pathways, such as JAK/STAT, MEK/ERK, PI3K/AKT, NFκB, MAPK, SAPK/JNK, and p53 signal pathways, and mediate many opposing cellular functions, including function in cell cycle and apoptosis regulation [21].

Figure 3
figure 3

Functional analysis of the HBV-human interaction network. In black, HHBV-HHCC either down-regulated or inactivated; in red, HHBV-HHCC either up-regulated or overactivated; with underline, HHBV-HHCC interact (activate or inhibit unknown); in box, non-HHBV-HHCC molecules in pathways. See text for details.

The expression of cytokines like IL2, IL6, TNF and receptors like insulin-like growth factor 1 receptor (IGF1R) are up-regulated, which can activate kinases like the Src tyrosine kinases and the downstream pathway such as MAPK, MEK/ERK. HBx activates the components of the JAK/STAT, MEK/ERK, PI3K/AKT, MAPK, SAPK/JNK signalling pathways, leading to activation of a variety of transcription factors such as STAT-3, ELK-1, NF-κB, CREB, β-catenin, c-Fos, c-Jun, c-Myc, etc. Meanwhile, some physiological proapoptotic molecules are down-regulated or inactivated, such as Fas, p53, DR5 or FADD. HBx can bind to the C-terminus of p53 sequesters in the cytoplasm and prevent it from entering the nucleus [2], failure to up-regulate genes, such as IGFBP3, p21WAF1, Bax or Fas, thereby inactivating several critical p53 dependent activities, including p53 mediated apoptosis. Moreover, the down-regulation of PTEN and the activation of PI3K/AKT-Bad pathway can inhibit TGFβ and FasL induced apoptosis and down-regulation of caspase 3 activity. However, HBx also promotes the apoptosis by regulating the expressions of Fas/FasL, Bax/Bcl-2, and c-Myc gene.

Cell cycle is regulated by activity of cyclins, cyclin dependent kinases (CDKs), and cyclin dependent kinase inhibitors (CDKIs), which generally function within several pathways [22]. HBx can repress the transcription of p21WAF1 and p16INK4A, leading to increase the rate and level of activation of the CDK2 and CDK4. HBx also inhibit the pRb tumor suppressor and increase E2F1 activity, and regulate the expression of MDM2, cyclin D1 and cyclin B1. Ultimately, HBx has been shown to stimulate cell cycle progression by accelerating transit through the G1/S and G2/M checkpoints [2].

In brief, regardless of the mechanism, the aberrant gene expression and deregulated of these pathways ultimately leads to generate a unique response, the acceleration of cell cycle progression and cell growth, increased differentiation and proliferation, repression of apoptosis, and contribute to cell survival and oncogenesis.


Developing an HBV-human interactome network by mapping the interactions of viral proteins with host proteins may give us a comprehensive view of viral infection at the protein level, and provide clues to understanding the development of end-stage complications such as cirrhosis and HCC. In this study, we used an NLP method to analyze the PubMed literature database for articles regarding HBV and human protein interactions. With an exhaustive analysis of the literature and databases, we identified 146 HHBV that are crucial for hepatitis B virus infections. These HHBV are involved in numerous functions associated with oncogenesis, and through screening and mapping the HHCC, we found that about half of the HHBV were also hepatocellular carcinoma-associated proteins such as IL6, STAT3[23], MMP9, TGFB1 [24] and TP53 [25]. This may explain why hepatitis B virus is the primary risk factor for the development of HCC. The Gene ontology analysis show that most of the functional profiling (such as transcriptional activity, DNA binding, kinase activity and signal transducer activity) and biological processes (such as cell differentiation, apoptosis, cell proliferation and cell development) are thought to play important roles in the pathogenesis of HCC.

KEGG functional annotation was used to analyze the biological functions of HHBV-HHCC. 83% of HHBV-HHCC could be mapped to 9 pathways (P < 0.01) (Additional file 1, Table S8), apoptosis, cell cycle, p53 signaling pathway, toll-like receptor signaling pathway, MAPK signaling pathway and ErbB signaling pathway were significantly enriched (P < 0.0001). Although this approach is biased because functions have not yet been attributed to all proteins, it remains a powerful way of incorporating conventional biology into systems-level data sets[26].

Toll-like receptors (TLRs) are known to play a key role in the innate immune system, particularly in the inflammatory response against invading pathogens [27]. In PBMCs of HBV-infected patients, TLR7 expression and TLR9 mRNA are down-regulated, but TLR9 shows increased protein expression [28], which may play important roles in cancer cells[29]. The ErbB receptor tyrosine kinases play important roles in both normal physiology and cancer. ErbB2 (HER-2/neu) has been identified as an important regulator of the metastatic potential of breast cancer, which is the principal cause of death [30]. The detailed relationship between HBV with ErbB receptor and toll-like receptors pathways has not been investigated. Further studies of the functional changes in these pathways in response to HBV infection will provide clear information about the oncogenesis of hepatocellular carcinoma.

We also identified focal adhesion (p < 0.001) might be as a novel pathway affected by HBV through the KEGG pathway analysis (Additional file 1, Table S8). When focal adhesion is deregulated, it can lead to perturbation of cell mobility, detachment from the ECM and tumor initiation and progression [31]. HBx can increase the migratory phenotype of hepatoma cells through the up-regulation of matrix metalloproteinases-1 (MMP1) and MMP9[32]. Moreover, HBx represses several cell adhesion molecules and cytoskeleton proteins, including E-cadherin, integrin, fibronectin, CD47, and CD44 [2]. Regulation of focal adhesion was also identified as a new function that is affected by HCV, primarily through the NS3 and NS5A proteins [26]. However, the impact of HBV protein on focal adhesion should be further assessed using a cellular adhesion assay.

Moreover, a large number of HHBV-HHCC could be significantly enriched in apoptosis, cell cycle, p53 and MAPK signaling pathway (P < 0.0001), which are very crucial in the oncogenesis of HCC [20]. Therefore, we integrated these HHBV-HHCC into one molecular interaction map, which delineate many different oncogenic pathways involved in hepatocarcinogenesis. These proteins are at the center of many different pathways (such as JAK/STAT, MEK/ERK, PI3K/AKT, NFκB, MAPK, SAPK/JNK, and p53 signal pathways) that regulate many important biological processes, including cell differentiation, apoptosis, cell proliferation, cell cycle, etc. HBx can modulate both pre-apoptotic and anti-apoptotic pathways, some physiological pro-apoptotic HHBV-HHCC molecules are down-regulated or inactivated, even more anti-apoptotic signals HHBV-HHCC molecule are up-regulated or over-activation [2]. Therefore, a significant number of the molecular events are altered, leading to the disruption of the balance between death and survival in the preneoplastic hepatocytes and the uncontrolled growth of tumour cell [20, 21].

Accordingly, hepatocellular carcinoma show stronger requirements of these intracellular pathways to survive, therefore, therapeutic strategies to selectively inhibit anti-apoptotic signals in HCC cells might have the potential to provide effective tools to treat HCC in the future [4, 20]. Interestingly, recently studies show that the multikinase inhibitor drug sorafenib can induce HCC apoptosis through inhibiting the RAF/MEK/ERK pathway [33]. Another receptor tyrosine kinase inhibitor drug, sunitinib is also a strong apoptosis inducer in different tumor cells, especially in the presence of inhibitors of the PI3K/Akt/mTOR pathway [34]. Similar situations might be found with other multikinase that are on the way towards approval for HCC therapy [34].

Therefore, the data of HHBV and the most specific annotations for each human protein can be used as a resource for researchers interested in prioritizing drug targets (Additional file 1, Table S1). For example, the damage-specific DNA binding protein 1 (DDB1) had 14 identified interactions with HBV X protein (Additional file 1, Table S1), which is a highly conserved protein implicated in DNA repair and cell cycle regulation [35]. HBx in association with DDB1 may stimulate HBV replication and induce genetic instability in hepatocytes, thereby contributing to HCC development, and making this HBV-host protein interaction as an attractive target for new therapeutic interventions [36].

In addition, it must be point out that not all of the papers that report HBV binding proteins from cell lines validate the binding of these host proteins to the corresponding HBV antigen by co-immunoprecipitation of extracts from clinical samples (infected liver and HCC tissue). At the same time, it raises a number of questions need further studies such as whether all the identified interactions really occur and have functional consequences. To identify new molecules involved in hepatocarcinogenesis, we can establish of high-throughput yeast two-hybrid (Y2H) screens and co-affinity purification methods for large scale analysis of protein-protein interaction[26], and integrate of chip-based chromatin-immunoprecipitation (ChIP-chip) with expression-microarray profiling for the identification of candidate genes directly regulated by HBV[37].

Finally, a number of HHBV-HHCC and cellular processes have been studied, but many of the molecular events involved in the pathophysiology of HCC are still unclear. One single identified HHBV-HHCC may be involved in some new multiple, independently regulated HCC-specific pathways. Hence, the HBV-human protein interaction network might be to regard as the basis of a detailed map for tracking new cellular interactions, and guiding future investigations of the molecular mechanism of oncogenesis of HBV-related HCC, even other diseases such as steatosis and fibrosis, leading to identify a series of new genes involved in these diseases.

In mammals, lethal and disease-related proteins were found enriched among some proteins that are central to multiple pathways [38, 39], and preferential attachment to these proteins may be a general hallmark of viral proteins, as has recently been suggested in an analysis of the literature [40]. An important breakthrough of the further experimental study is the identification of novel signaling components and pathways that can be targeted to develop new therapeutics.


Among the infectious diseases affecting humans, HBV is one of the most common diseases in the world, particularly in developing countries. More than 350 million people worldwide are known to be chronic carriers of HBV, and each year 15 million people die of hepatitis. HCC is one of the most common fatal cancers worldwide, and the incidence of HCC in many countries is increasing in parallel to an increase in chronic HBV infection. Because the role of HBV infection and the pathogenic mechanisms of the cancer-causing variant are not entirely clear, there is still a lack of effective treatment of HCC. For an in-depth review and understanding of these interactions, to enhance insight into HBV replication and pathogenesis on a cellular level, we catalogued all published interactions between HBV and human proteins, particularly human proteins associated with HCC. We have provided a general overview of the landscape of human proteins that interact with HBV.


  1. Kao JH, Chen DS: Global control of hepatitis B virus infection. Lancet Infect Dis. 2002, 2: 395-403. 10.1016/S1473-3099(02)00315-8.

    Article  Google Scholar 

  2. Park NH, Song IH, Chung YH: Chronic hepatitis B in hepatocarcinogenesis. Postgrad Med J. 2006, 82: 507-515. 10.1136/pgmj.2006.047431.

    Article  CAS  Google Scholar 

  3. Huang TJ, Lu CC, Tsai JC, Yao WJ, Lu X, Lai MD, Liu HS, Shiau AL: Novel autoregulatory function of hepatitis B virus M protein on surface gene expression. J Biol Chem. 2005, 280: 27742-27754. 10.1074/jbc.M502209200.

    Article  CAS  Google Scholar 

  4. Roberts LR, Gores GJ: Hepatocellular carcinoma: molecular pathways and new therapeutic targets. Semin Liver Dis. 2005, 25: 212-225. 10.1055/s-2005-871200.

    Article  CAS  Google Scholar 

  5. Barone M, Spano D, D'Apolito M, Centra M, Lasalandra C, Capasso M, Di Leo A, Volinia S, Arcelli D, Rosso N, et al: Gene expression analysis in HBV transgenic mouse liver: a model to study early events related to hepatocarcinogenesis. Mol Med. 2006, 12: 115-123. 10.2119/2006-00015.Barone.

    Article  CAS  Google Scholar 

  6. Tew KL, Li XL, Tan SH: Functional centrality: detecting lethality of proteins in protein interaction networks. Genome Inform. 2007, 19: 166-177. full_text.

    CAS  Google Scholar 

  7. Calderwood MA, Venkatesan K, Xing L, Chase MR, Vazquez A, Holthaus AM, Ewence AE, Li N, Hirozane-Kishikawa T, Hill DE, et al: Epstein-Barr virus and virus human protein interaction maps. Proc Natl Acad Sci USA. 2007, 104: 7606-7611. 10.1073/pnas.0702332104.

    Article  CAS  Google Scholar 

  8. Wang N, Zheng Y, Yu X, Lin W, Chen Y, Jiang Q: Sex-modified effect of hepatitis B virus infection on mortality from primary liver cancer. Am J Epidemiol. 2009, 169: 990-995. 10.1093/aje/kwn418.

    Article  Google Scholar 

  9. Settles B: ABNER: an open source tool for automatically tagging genes, proteins and other entity names in text. Bioinformatics. 2005, 21: 3191-3192. 10.1093/bioinformatics/bti475.

    Article  CAS  Google Scholar 

  10. Rebholz-Schuhmann D, Arregui M, Gaudan S, Kirsch H, Jimeno A: Text processing through Web services: calling Whatizit. Bioinformatics. 2008, 24: 296-298. 10.1093/bioinformatics/btm557.

    Article  CAS  Google Scholar 

  11. von Mering C, Jensen LJ, Snel B, Hooper SD, Krupp M, Foglierini M, Jouffre N, Huynen MA, Bork P: STRING: known and predicted protein-protein associations, integrated and transferred across organisms. Nucleic Acids Res. 2005, 33: D433-437. 10.1093/nar/gki005.

    Article  CAS  Google Scholar 

  12. Ashburner M, Lewis S: On ontologies for biologists: the Gene Ontology--untangling the web. Novartis Found Symp. 2002, 247: 66-80. full_text. discussion 80-63, 84-90, 244-252

    Article  CAS  Google Scholar 

  13. Hosack DA, Dennis G, Sherman BT, Lane HC, Lempicki RA: Identifying biological themes within lists of genes with EASE. Genome Biol. 2003, 4: R70-10.1186/gb-2003-4-10-r70.

    Article  Google Scholar 

  14. Kanehisa M, Araki M, Goto S, Hattori M, Hirakawa M, Itoh M, Katayama T, Kawashima S, Okuda S, Tokimatsu T, Yamanishi Y: KEGG for linking genomes to life and the environment. Nucleic Acids Res. 2008, 36: D480-484. 10.1093/nar/gkm882.

    Article  CAS  Google Scholar 

  15. Benjamini Y, Drai D, Elmer G, Kafkafi N, Golani I: Controlling the false discovery rate in behavior genetics research. Behav Brain Res. 2001, 125: 279-284. 10.1016/S0166-4328(01)00297-2.

    Article  CAS  Google Scholar 

  16. Hooper SD, Bork P: Medusa: a simple tool for interaction graph analysis. Bioinformatics. 2005, 21: 4432-4433. 10.1093/bioinformatics/bti696.

    Article  CAS  Google Scholar 

  17. Haynes C, Oldfield CJ, Ji F, Klitgord N, Cusick ME, Radivojac P, Uversky VN, Vidal M, Iakoucheva LM: Intrinsic disorder is a common feature of hub proteins from four eukaryotic interactomes. PLoS Comput Biol. 2006, 2: e100-10.1371/journal.pcbi.0020100.

    Article  Google Scholar 

  18. Ward JJ, McGuffin LJ, Bryson K, Buxton BF, Jones DT: The DISOPRED server for the prediction of protein disorder. Bioinformatics. 2004, 20: 2138-2139. 10.1093/bioinformatics/bth195.

    Article  CAS  Google Scholar 

  19. Gandhi TK, Zhong J, Mathivanan S, Karthick L, Chandrika KN, Mohan SS, Sharma S, Pinkert S, Nagaraju S, Periaswamy B, et al: Analysis of the human protein interactome and comparison with yeast, worm and fly interaction datasets. Nat Genet. 2006, 38: 285-293. 10.1038/ng1747.

    Article  CAS  Google Scholar 

  20. Fabregat I, Roncero C, Fernandez M: Survival and apoptosis: a dysregulated balance in liver cancer. Liver Int. 2007, 27: 155-162. 10.1111/j.1478-3231.2006.01409.x.

    Article  CAS  Google Scholar 

  21. Fabregat I: Dysregulation of apoptosis in hepatocellular carcinoma cells. World J Gastroenterol. 2009, 15: 513-520. 10.3748/wjg.15.513.

    Article  CAS  Google Scholar 

  22. Semczuk A, Jakowicki JA: Alterations of pRb1-cyclin D1-cdk4/6-p16(INK4A) pathway in endometrial carcinogenesis. Cancer Lett. 2004, 203: 1-12. 10.1016/j.canlet.2003.09.012.

    Article  CAS  Google Scholar 

  23. Lin L, Amin R, Gallicano GI, Glasgow E, Jogunoori W, Jessup JM, Zasloff M, Marshall JL, Shetty K, Johnson L, et al: The STAT3 inhibitor NSC 74859 is effective in hepatocellular cancers with disrupted TGF-beta signaling. Oncogene. 2009, 28: 961-972. 10.1038/onc.2008.448.

    Article  CAS  Google Scholar 

  24. Matsuzaki K: Modulation of TGF-beta signaling during progression of chronic liver diseases. Front Biosci. 2009, 14: 2923-2934. 10.2741/3423.

    Article  CAS  Google Scholar 

  25. Hussain SP, Schwank J, Staib F, Wang XW, Harris CC: TP53 mutations and hepatocellular carcinoma: insights into the etiology and pathogenesis of liver cancer. Oncogene. 2007, 26: 2166-2176. 10.1038/sj.onc.1210279.

    Article  CAS  Google Scholar 

  26. de Chassey B, Navratil V, Tafforeau L, Hiet MS, Aublin-Gex A, Agaugue S, Meiffren G, Pradezynski F, Faria BF, Chantier T, et al: Hepatitis C virus infection protein network. Mol Syst Biol. 2008, 4: 230-10.1038/msb.2008.66.

    Article  CAS  Google Scholar 

  27. Kim S, Takahashi H, Lin WW, Descargues P, Grivennikov S, Kim Y, Luo JL, Karin M: Carcinoma-produced factors activate myeloid cells through TLR2 to stimulate metastasis. Nature. 2009, 457: 102-106. 10.1038/nature07623.

    Article  CAS  Google Scholar 

  28. Xu N, Yao HP, Sun Z, Chen Z: Toll-like receptor 7 and 9 expression in peripheral blood mononuclear cells from patients with chronic hepatitis B and related hepatocellular carcinoma. Acta Pharmacol Sin. 2008, 29: 239-244. 10.1111/j.1745-7254.2008.00711.x.

    Article  CAS  Google Scholar 

  29. Paone A, Starace D, Galli R, Padula F, De Cesaris P, Filippini A, Ziparo E, Riccioli A: Toll-like receptor 3 triggers apoptosis of human prostate cancer cells through a PKC-alpha-dependent mechanism. Carcinogenesis. 2008, 29: 1334-1342. 10.1093/carcin/bgn149.

    Article  CAS  Google Scholar 

  30. Arora P, Cuevas BD, Russo A, Johnson GL, Trejo J: Persistent transactivation of EGFR and ErbB2/HER2 by protease-activated receptor-1 promotes breast carcinoma cell invasion. Oncogene. 2008, 27: 4434-4445. 10.1038/onc.2008.84.

    Article  CAS  Google Scholar 

  31. Xu Y, Benlimame N, Su J, He Q, Alaoui-Jamali MA: Regulation of focal adhesion turnover by ErbB signalling in invasive breast cancer cells. Br J Cancer. 2009, 100: 633-643. 10.1038/sj.bjc.6604901.

    Article  CAS  Google Scholar 

  32. Zou L, Yang R, Chai J, Pei G: Rapid xenograft tumor progression in beta-arrestin1 transgenic mice due to enhanced tumor angiogenesis. FASEB J. 2008, 22: 355-364. 10.1096/fj.07-9046com.

    Article  CAS  Google Scholar 

  33. Liu L, Cao Y, Chen C, Zhang X, McNabola A, Wilkie D, Wilhelm S, Lynch M, Carter C: Sorafenib blocks the RAF/MEK/ERK pathway, inhibits tumor angiogenesis, and induces tumor cell apoptosis in hepatocellular carcinoma model PLC/PRF/5. Cancer Res. 2006, 66: 11851-11858. 10.1158/0008-5472.CAN-06-1377.

    Article  CAS  Google Scholar 

  34. Abou-Alfa GK, Venook AP: The impact of new data in the treatment of advanced hepatocellular carcinoma. Curr Oncol Rep. 2008, 10: 199-205. 10.1007/s11912-008-0031-x.

    Article  CAS  Google Scholar 

  35. Leupin O, Bontron S, Schaeffer C, Strubin M: Hepatitis B virus X protein stimulates viral genome replication via a DDB1-dependent pathway distinct from that leading to cell death. J Virol. 2005, 79: 4238-4245. 10.1128/JVI.79.7.4238-4245.2005.

    Article  CAS  Google Scholar 

  36. Martin-Lluesma S, Schaeffer C, Robert EI, van Breugel PC, Leupin O, Hantz O, Strubin M: Hepatitis B virus X protein affects S phase progression leading to chromosome segregation defects by binding to damaged DNA binding protein 1. Hepatology. 2008, 48: 1467-1476. 10.1002/hep.22542.

    Article  CAS  Google Scholar 

  37. Sung WK, Lu Y, Lee CW, Zhang D, Ronaghi M, Lee CG: Deregulated Direct Targets of the Hepatitis B Virus (HBV) Protein, HBx, Identified through Chromatin Immunoprecipitation and Expression Microarray Profiling. J Biol Chem. 2009, 284: 21941-21954. 10.1074/jbc.M109.014563.

    Article  CAS  Google Scholar 

  38. Goh KI, Cusick ME, Valle D, Childs B, Vidal M, Barabasi AL: The human disease network. Proc Natl Acad Sci USA. 2007, 104: 8685-8690. 10.1073/pnas.0701361104.

    Article  CAS  Google Scholar 

  39. Hernandez P, Huerta-Cepas J, Montaner D, Al-Shahrour F, Valls J, Gomez L, Capella G, Dopazo J, Pujana MA: Evidence for systems-level molecular mechanisms of tumorigenesis. BMC Genomics. 2007, 8: 185-10.1186/1471-2164-8-185.

    Article  Google Scholar 

  40. Dyer MD, Murali TM, Sobral BW: The landscape of human proteins interacting with viruses and other pathogens. PLoS Pathog. 2008, 4: e32-10.1371/journal.ppat.0040032.

    Article  Google Scholar 

Download references


This study was funded by grants from the National Natural Science Foundation of China (NSFC NO. 30772055) and sponsored by Shanghai Postdoctoral Scientific Program.

Author information

Authors and Affiliations


Corresponding authors

Correspondence to Zhong-Jun Wu or Zhi-Qiang Wang.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

ZJW and YZ made substantial contributions to conception and design, acquisition of data, analysis and interpretation of data; DRH involved in drafting the manuscript; ZQW conceived of the study, and participated in its design and drafted the manuscript. All authors read and approved the final manuscript.

Zhong-Jun Wu, Yu Zhu contributed equally to this work.

Electronic supplementary material


Additional file 1: Additional Tables. Table S1. Total interactions between HBV and human proteins catalogued from related literature. The meaning of each is as follows: Pubmed_ID: PubMed article ID. HBV_gene_mention: HBV gene name appeared in the sentence. HBV_gene: the HBV gene after standardization. verb_mention: the meaning of the verb or verb noun such as heavier appeared in the sentence. verb: the verb after standardization. human_gene_mention: human gene names appeared in the sentence. human_official_gene_symbol: the human gene after standardization. human_gene_entrez_ID: standardization of the ID of the human gene. human_official_gene_description: standardization of the description of the human gene. sentence: the key sentence. PubMed_link: PubMed abstract link. Additional file 1, Table S2. Listing and Distribution of Keywords Associated with the HBV Human Protein Interaction Database. Statistical analysis of interaction verb and calculation of the proportion of each verb. Additional file 1, Table S3. Listing of human proteins interacting with more than one viral protein. Additional file 1, Table S4. Listing of HHBV-HHBV protein- protein interactions. Interacting human proteins are referenced with their cognate NCBI gene name (columns 1 and 2). These physical and direct binary protein-protein interactions were retrieved from the BIND, BioGRID, DIP, GeneRIF, HPRD, IntAct, MINT, and Reactome databases. Interaction type (6 = KEGG database,7 = text mining,8 = homology). Additional file 1, Table S5. Hepatocellular carcinoma-associated proteins (HHCC) catalogued from related literature. Additional file 1, Table S6. Listing of HHBV- HHCC. HHBV: HBV-interacting proteins. HHCC: liver cancer-related genes. HHBV- HHCC: overlap. Additional file 1, Table S7A. Distribution of cellular component Gene Ontology terms associated with HBV-human protein interactions. Additional file 1, Table S7B. Distribution of biological process Gene Ontology terms associated with HBV-human protein interactions Additional file 1, Table S7C. Distribution of molecular function Gene Ontology terms associated with HBV-human protein interactions Additional file 1, Table S8. Functional analysis of the HHBV distribution and enrichment in cellular pathways using KEGG annotations. (XLS 460 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Wu, ZJ., Zhu, Y., Huang, DR. et al. Constructing the HBV-human protein interaction network to understand the relationship between HBV and hepatocellular carcinoma. J Exp Clin Cancer Res 29, 146 (2010).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: