Overexpression of the cohesin-core subunit SMC1A contributes to colorectal cancer development
Journal of Experimental & Clinical Cancer Researchvolume 38, Article number: 108 (2019)
Cancer cells are characterized by chromosomal instability (CIN) and it is thought that errors in pathways involved in faithful chromosome segregation play a pivotal role in the genesis of CIN. Cohesin forms a large protein ring that binds DNA strands by encircling them. In addition to this central role in chromosome segregation, cohesin is also needed for DNA repair, gene transcription regulation and chromatin architecture. Though mutations in both cohesin and cohesin-regulator genes have been identified in many human cancers, the contribution of cohesin to cancer development is still under debate.
Normal mucosa, early adenoma, and carcinoma samples deriving from 16 subjects affected by colorectal cancer (CRC) were analyzed by OncoScan for scoring both chromosome gains and losses (CNVs) and loss of heterozygosity (LOH). Then the expression of SMC1A was analyzed by immunochemistry in 66 subjects affected by CRC. The effects of SMC1A overexpression and mutated SMC1A were analyzed in vivo using immunocompromised mouse models. Finally, we measured global gene expression profiles in induced-tumors by RNA-seq.
Here we showed that SMC1A cohesin core gene was present as extra-copies, mutated, and overexpressed in human colorectal carcinomas. We then demonstrated that cohesin overexpression led to the development of aggressive cancers in immunocompromised mice through gene expression dysregulation.
Collectively, these results support a role of defective cohesin in the development of human colorectal cancer.
Chromosomal instability (CIN) is characterized by chromosome translocations, allelic imbalances and aneuploidy. It is thought that CIN is an early step during tumorigenesis that allows cells to acquire additional genetic changes required for full malignant transformation. There have been extensive studies on the causes and consequences of CIN because it is a characteristic of many cancer cells.
Cohesin is an evolutionarily conserved four-subunit (SMC1A, SMC3, RAD21, and either STAG1 or STAG2) complex that encircles DNA within its ring-shaped structure. It was first identified for its role in ensuring sister chromatid cohesion, which is essential for correct chromosome segregation . The cohesin core complex interacts with several regulatory-cohesin factors contributing to its function including NIPBL, required for the loading of cohesin onto chromatin, ESCO2, essential for the establishment of physical bridges between sisters during S phase, PDS5A and PDS5B, which interact with cohesin for its establishment and maintenance and HDAC8, required for the removal of cohesin [2,3,4,5,6,7,8].
Increasing evidence suggests that cohesin also participates in many additional biological processes. Indeed, it promotes DNA repair by homologous recombination and non-homologous end joining [9,10,11], favors the recruitment of proteins involved in the activation of the intra-S and G2/M checkpoints [12, 13], controls fork replication speed [14, 15] and regulates gene transcription by mediating functional connections between promoters and their distal enhancers [16, 17].
Germline mutations in cohesin and its regulators are responsible for Cornelia de Lange syndrome (CdLS) [18,19,20,21,22,23], a rare developmental disease characterized by both chromosome aneuploidies and aberrations, precocious sister chromatid separation and sensitivity to genotoxic drugs [24,25,26,27]. Despite these considerations, CdLS patients do not show predisposition to cancer . Instead, somatic mutations in the NIPBL, RAD21, SMC1A, SMC3 and STAG2 genes have been described in many human cancers including acute myeloid leukemia, bladder, and colorectal cancer [28,29,30,31,32,33]. Other cohesin and regulatory-cohesin genes are not frequently mutated in cancer. The mechanism through which mutated cohesin promotes tumorigenesis is still controversial. STAG2 is significantly mutated in four or more human cancer types  and experimental data indicates that STAG2 gain-of-function changes cause loss of sister chromatid cohesion and CIN [31, 32, 34, 35]. However, data obtained in naturally occurring tumors suggests no correlation between cohesin mutations and CIN . In addition, STAG2-deficient tumors are often euploid [36, 37]. Therefore, the significance of these mutations in promoting tumorigenesis remains debatable and elusive. These conflicting data are likely in part attributable to the fact that cohesin plays a critical dual role in both gene transcription regulation and maintenance of genome stability. Furthermore, mutational screening has been performed only at the carcinoma stage and no study has been conducted at different stages during tumorigenesis.
Human colorectal cancer (CRC) is an important contributor to cancer morbidity and mortality worldwide. It is classified as microsatellite instability (15% of cases, associated with a better prognosis) and chromosomally unstable (85% of affected patients, with a worse prognosis) [38, 39]. In an effort to provide new insight into the role of cohesin in tumorigenesis, we analyzed the copy number variations (CNVs) by OncoScan in normal mucosa, early adenomas, and carcinomas derived from 16 subjects affected by CRC. We found extra-copies of SMC1A gene in carcinomas and subsequent sequencing revealed the presence of mutations. Furthermore, the analysis of SMC1A expression in a larger cohort of patients (n = 66) showed that it is significantly more overexpressed in carcinomas than in both early adenomas and normal mucosa. Finally, we demonstrated that cohesin overexpression produces aggressive tumors in nude mice through gene expression dysregulation. Collectively, these findings identify SMC1A as a contributing gene in CRC development and may serve as a promising therapeutic strategy.
Normal mucosa, early adenoma, and carcinoma from 16 subjects affected by CRC (Additional file 1: Table S1) were processed for identification of copy number imbalances through Molecular Inversion Probe (MIP) based OncoScan Array following protocols provided by the manufacturer (Affymetrix). DNA samples were normalized to 12 ng/μL, mixed with MIPs and incubated overnight to anneal (16–18 h). Next, each reaction was divided equally into A and B reactions and “Gap Fill” master mix was added with either AT dNTPs (A reaction) or GC dNTPs (B reaction) and incubated. Following the “Gap Fill” reaction, exonuclease was added to remove unligated probes and genomic DNA. Next, MIPs were linearized with a restriction enzyme and PCR amplified (PCR 1). Reactions were taken through a second round of amplification (PCR 2), and subsequently digested with HaeIII restriction enzyme. The digested products were hybridized to the OncoScan Array at 58 °C for 16–18 h. Arrays were stained and washed using the GeneChip Fluidics Station 450 and loaded on the GeneChip Scanner 3000 7G (Affymetrix) where fluorescence intensity was scanned to generate array images (DAT files).
Next, array fluorescence intensity data (CEL) files were generated with an Affymetrix GeneChip Command Console (AGCC, Affymetrix) and used to produce OSCHP files and QC metrics through OncoScan Console Software (Biodiscovery). The standard Affymetrix reference control file for OncoScan data was used for processing the arrays. Samples passing QC criteria (MAPD ≤0.3, ndSNPQC ≥26) were further analyzed using tumor Scan (TuScan) and BioDiscovery’s SNPFASST2 algorithm using the Nexus Express for OncoScan software version 3.0 and 7.5 (Biodiscovery). The TuScan algorithm creates segmentation to differentiate between adjacent clusters of probes and determines the copy number changes. SNP-FASST2 uses a Hidden Markov Model (HMM)-based approach to identify larger copy number segments based on a log-ratio threshold derived from all probes in a given region. Ratios are the log2 ratios of the normalized intensity of the sample over the normalized intensity of a reference with further correction for a sample-specific variation. The Median Log2 Ratio was computed for each segment detected in the analysis. The significance threshold for segmentation was set at 1.0e − 5 also requiring a minimum of three probes per segment and a maximum probe spacing of 1000 kbp between adjacent probes before breaking a segment. Segments were classified as having gains when the Log2Ratio (L2R) exceeded 0.2, losses when L2R was < − 0.2, and with high copy gains and homozygous losses being called when L2R was > 0.6 and < − 1.0, respectively. Differences in copy number changes between samples from a tumor for individual genes were counted when one sample had a change as defined above and another either did not have that change or had a different change. B-allele frequency (BAF) information was also generated for each tumor. BAF is a normalized measure of the allelic intensity ratio of two alleles (A and B), such that a BAF of 1 or 0 indicates the complete absence of one of the two alleles (e.g. AA or BB), and a BAF of 0.5 indicates the equal presence of both alleles (e.g. AB). Median BAF is reported for each segment and is the median BAF of the markers identified as heterozygous. If the number of heterozygous markers in the segment is below 10 or the percent of homozygous markers is above 85% no value is reported. The B-allele frequency values are used to determine whether a segment is in a loss of heterozygosity (LOH) or an allelic imbalance state. By default, probe sets were automatically centered to the median for all samples by the Nexus software. For individual samples where the median probe set value was not diploid, specified regions of balanced heterozygosity were manually identified by visual inspection of L2R and BAF plots and defined as diploid regions, permitting the Nexus software to reset the entire probe set to the newly defined areas . Finally, we carefully analyzed the CNVs involving the X chromosome. In detail, we used a different threshold for the CNVs of the X chromosome in males and females (Copy number = 0, for a deletion identified in a male; Copy number = 2 or greater than 2, for duplications identified in a male subject. Copy number = 1 or 0 for a deletion in a female; Copy number = 3 or greater than 3 for a duplication in a female subject).
Clinical samples and histopathology
Sixty-six subjects affected by CRC (Additional file 1: Table S2) were retrospectively selected from the files of the Unit of Surgical Pathology of the Azienda Ospedaliero-Universitaria Pisana. For each patient, specimens (normal mucosa, early adenomas and carcinomas) were surgically obtained and fixed in 10% neutral-buffered formaldehyde and embedded in paraffin. Routine hematoxylin and eosin staining were performed on microtomic sections for histopathological examination. Histological diagnoses, reviewed independently by experienced pathologists (A.S., G.F.), were formulated according to the 2010 World Health Organization (WHO) Classification (4th Edition). Approval was granted by Azienda Ospedaliero-Universitaria Pisana Ethics Committee (protocol number 3867).
Mutation analysis for SMC1A
DNA was extracted from embedded paraffin samples by the NucleoSpin Tissue kit (Macherey-Nagel) according to the manufacturer’s protocol. Primer pairs (Additional file 2: Table S3) were designed to amplify exons, exon/intron boundaries and short flanking intronic sequences. Amplified PCR products were purified and sequenced.
SMC1A immunohistochemical analysis was performed on 3-μm tissue sections. After deparaffinization via serial xylene baths, samples were rehydrated via a graded ethanol series. Then the sections were heated to 98 °C for 40 min to unmask target antigens, cooled in solution at room temperature and washed with phosphate-buffered saline (PBS) for 3 min. After treatment with peroxide block, sections were washed again in PBS and incubated with power block reagent (Biogenex Laboratories) for 30 min. Primary mouse monoclonal antibody against human SMC1A (diluted 1:400, Bethyl Laboratories) was applied overnight at 4 °C. After washing with PBS, immunoreactivity was obtained by using the “Super Sensitive Polymer-HRP Detection System” (Biogenex), following the manufacturer’s instructions. Thereafter, samples were incubated for 20 min at room temperature with Super Enhancer Reagent, followed by incubation with Poly-HRP reagent. The reaction was developed using 0.05% 3,3′-diaminobenzidine tetrahydrochloride until adequate color development was seen. A negative control was obtained by substituting primary antibody with PBS. The staining percentage was graded as 0 (0–5%), 1 (6–20%), 2 (21–60%) and 3 (61–100%), and the staining intensity was graded as 0 (negative), 1 (+, weak), 2 (++, moderate) and 3 (+++, strong), as previously described [41, 42].
SMC1A cDNA mutagenesis and cell transfection
Site-directed mutagenesis of the SMC1A cDNA (OriGene) was performed with QuikChange Site-Directed Mutagenesis Kit (Stratagene) according to the manufacturer’s instructions. Coding sequences for SMC1A wild type or mutated were inserted into the Not I site of the pcDNA3.1 plasmid (Invitrogen, ThermoFisher Scientific). The presence and orientation of the insert was confirmed by enzyme restriction digestion.
Human colorectal carcinoma HCT116 cells were maintained in culture according to American Type Culture Collection specifications. Briefly, cells were grown in McCoy’s Medium with 10% fetal calf serum and antibiotics in a humidified 5% CO2 atmosphere. HCT116 cells were transfected with the pcDNA3.1 vectors expressing SMC1A wild-type or SMC1A mutant. Stable expressing clones were obtained by selection in G418 (800 μg/ml) for 2 weeks.
Animal care and experiments
Twenty-one immunocompromised NOD-SCID female mice 5–6 weeks of age were purchased from the Animal Care Facility of the IRCCS - Ospedale Policlinico San Martino of Genova, where animals were then housed and maintained in pathogen-free conditions. All experiments were reviewed and approved by the internal Review Board (OPBA) and authorized by the Italian Ministry of Health, accordingly with the current national and European regulations and guidelines for the care and use of laboratory animals (D.L. 26/2014; 86/609/EEC Directive).
Mice were bilaterally injected subcutaneously with 2 × 106 HCT116 cells, HCT116 overexpressing SMC1A wild-type or HCT116 harboring SMC1A c.A2027G mutation, suspended in 0.1 ml of PBS. The mice were then regularly palpated to assess tumor latency. Tumor growth was recorded measuring nodule size with calipers three times per week. When the first nodule reached the volume of 300 mm3, all animals were sacrificed and tumors were excised, measured, photographed, weighed and formalin-fixed and snap-frozen in liquid nitrogen for further analyses.
For histology analysis, half of the samples were fixed in 4% neutral-buffered formalin, embedded in paraffin and cut to obtain 3- to 4-mm-thick sections. Slides were then stained with hematoxylin and eosin for histopathological examination; the other halves were snap-frozen in liquid nitrogen and kept at − 80 °C for molecular investigation.
Library preparation and RNA-sequencing (RNA-seq)
Three tumors deriving from the inoculation of HCT116 (907_1, 907_2 and 907_3), four deriving from HCT116 overexpressing SMC1A wild-type (907_4, 907_5, 907_6 and 907_7) and four deriving from HCT116 harboring SMC1A c.A2027G mutation (907_8, 907_9, 907_10 and 907_11) were separately processed for RNA-seq analyses.
Library preparation was obtained using the TruSeq Stranded mRNA Sample Prep kit (Illumina). The poly-A mRNA was fragmented for 3 min at 94 °C and every purification step was performed using 1X Agencourt AMPure XP beads. Both RNA samples and final libraries were quantified using the Qubit 2.0 Fluorometer (Invitrogen) and the quality was tested using the Agilent 2100 Bioanalyzer RNA Nano assay (Agilent). Libraries were then processed with Illumina cBot for cluster generation on the flowcell, following the manufacturer’s instructions and sequenced on single-end mode on HiSeq 2500 (Illumina). The CASAVA 1.8.2 version of the Illumina pipeline was used to process raw data for format conversion and de-multiplexing.
To avoid low-quality data, adapters were removed by Cutadapt 1 and lower quality bases were trimmed by ERNE2. For the analysis of differentially expressed genes, the quality-checked reads were processed using the TopHat version 2.0.0 package (Bowtie 2 version 2.2.0) as FASTQ files [43,44,45]. Reads were mapped to the human reference genome GRCh37/hg19. Read abundance was evaluated and normalized by using Cufflinks 3 for each gene and Cuffdiff from the Cufflinks 2.2.0 package was used to calculate the differential expression levels and evaluate the statistical significance of detected alterations. Numbers of reads per sample are shown in Additional file 3: Table S4. Only protein-coding genes were considered and gene level expression values were determined by fragments per kilobase million (FPKM) mapped. All genes with FPKM > 1 were designated as expressed and analyzed with an established p-value < 0.05.
cDNA synthesis and quantitative real-time PCR (qRT-PCR)
Total RNA was extracted by RNAeasy Mini-kit (Qiagen) and cDNA was synthesized with SuperScript™ II reverse transcriptase using oligo-dT (Invitrogen). PCR analyses were performed using Rotor Gene 3000 (Corbett). qPCR reactions were run in duplicate and normalized with respect to HPRT. Primers used for mRNA expression analysis are listed in Additional file 4: Table S5.
Pathway analysis and function
The differentially expressed genes were functionally analyzed for biological processes using Database for Annotation, Visualization and Integrated Discovery (DAVID) v6.8 (https://david.ncifcrf.gov). For each term, the p-value was calculated and a term with p < 0.05 was considered to be enriched.
LOH, CNVs and genome changed data were analyzed by Kruskal-Wallis test, a non-parametric method for comparing two or more independent samples of equal or different sample sizes. Immunohistochemistry data were analyzed by the Wilcoxon signed-rank test, a non-parametric method used for comparing related samples or matched samples. All other data were analyzed by Student’s t-test or the chi-squared test. P-values of < 0.05 were considered statistically significant.
The data discussed in this publication have been deposited in NCBI’s Gene Expression Omnibus and are accessible through GEO Series accession number GSE113678.
OncoScan analysis during CRC development highlights the presence of extra-copies of cohesin genes
In order to identify copy number variations (CNVs) and loss of heterozygosity (LOH), we performed molecular cytogenetic analysis on normal mucosa, early adenoma, and carcinoma from 16 subjects affected by CRC. Genomic DNA was interrogated using OncoScan FFPE Array. We found an increase in LOH at the carcinoma stage (p = 0.0015, Kruskal-Wallis test, Table 1, Additional file 5: Figure S1a). In addition, most of the subjects (13/16) showed a marked increase in chromosome gains or losses proceeding from normal mucosa until the carcinoma stage (p = 0.0006, Fig. 1a-c, Additional file 5: Figures S1b–S4,). Common events in carcinoma were the gain of whole chromosomes 7, 13, and X; gain of the long arm of chromosomes 8 and 20; loss of chromosome 18 and loss of the short arm of chromosomes 8 and 17 (Fig. 1c, Additional file 5: Figure S4). Of note, few events were identified in both normal mucosa and early adenomas (Fig. 1a-b, Additional files 5: Figures S2 and S3). The distribution of the events on chromosomes during cancer development is shown in Additional file 5: Figure S5a, b and c. Globally, the fraction of the genome that was altered ranged from 8.99% in normal mucosa to 68.9% in carcinoma (p < 0.0001, Table 1, Additional file 5: Figure S1c).
Next, we manually searched for genes belonging to the cohesin pathway. This allowed us to identify genomic regions containing HDAC8 (chromosome X, gain in 62.5% of subjects), RAD21 (chromosome 8, gain in 75% of subjects), STAG2 (chromosome X, gain in 62.5% of subjects) and SMC1A (chromosome X, gain in 50% of subjects). This finding indicates that cohesin genes go through extra-copies during CRC development.
SMC1A sequencing reveals high frequency of mutations in carcinoma during CRC development
Next, we focused on SMC1A gene because its mutations have been identified in CRC and in other cancers and cause CIN and chromosome aneuploidy in vitro [29, 30]. In addition, accumulating evidence has proved that SMC1A plays a role in genome stability. In fact, it is involved in G2/M checkpoint and it is phosphorylated by both ATR and ATM protein kinase [12, 13, 46,47,48]. Sequencing of SMC1A revealed mutations in 13 out of 16 samples previously analyzed by OncoScan. We identified five synonymous variants (Additional file 6: Table S6) in carcinoma samples alone and twenty-five mutations: twenty-two missense mutations, a 1-bp deletion causing a frameshift and two non-sense mutations (Table 2, Fig. 1d). It is worth noting that 5 out of 13 carcinoma samples carried from two to four SMC1A mutations (Table 2, Additional file 7: Figure S6) whereas two subjects (9 and 12) carried the same mutation (c.A950G) at the carcinoma stage (Table 2). Amino acid changes encompass all gene domains without significant differences (Fig. 1e). The effect of mutations has been predicted in silico using the Mutation Taster program (http://www.mutationtaster.org/) and PolyPhen2 (http://genetics.bwh.harvard.edu/pph2/). Most missense mutations identified in carcinoma were predicted to be damaging (have a functional effect) while only two (with MutationTaster) or three (with PolyPhen2) mutations in adenomas (Additional file 8: Table S7) were damaging.
SMC1A expression increases from normal mucosa to carcinoma during CRC development
To investigate the levels of SMC1A expression during CRC development, we measured its expression in 66 subjects (including those analyzed by OncoScan assay) by immunochemistry, and again for each patient we analyzed normal mucosa, early adenoma and carcinoma. Robust SMC1A expression was observed in most of the carcinomas, 77.27% with strong (+++), 21.21% with moderate (++) and 1.52% with weak intensity (+). In contrast, only 30.3% of adenomas had strong intensity, whereas a significant fraction (62%) showed moderate intensity (Fig. 1f, Table 3, Additional file 9: Figures S7, S8). According to the Wilcoxon signed-rank test, the increase in SMC1A expression from normal mucosa to adenoma (p = 1.44 e-12) and from adenoma to carcinoma (p = 3.82 e-07) was highly significant. Immunohistochemistry data was validated by quantitative RT-PCR (Additional file 9: Figure S9). In normal mucosa, cells displayed a round shape, and since SMC1A is a nuclear protein, the signal was confined to the nuclei which were in the periphery. During malignant transformation, we observed that nuclei increased in size and occupied most of the cytoplasm (Fig. 1g, Additional file 10: Figure S10). It is likely that this phenomenon is caused by the chromosome aneuploidies we detected by OncoScan assay. Altogether, this data indicates that the expression of SMC1A increases during colorectal tumorigenesis, from normal mucosa to carcinoma.
SMC1A overexpression reduces the latency period of cancer formation in vivo
To determine the effects of altered cohesin expression in vivo, we introduced both SMC1A wild-type gene and SMC1A c.A2027G (leading to p.E676G change, located in the coiled-coil domain near the hinge domain) mutation, which induced chromosomal instability when transiently transfected in vitro , in HCT116 cells, a near-diploid human colorectal cancer cell line with stable karyotype. Western blots were then performed to document that the vectors led to overexpression of both wild-type and mutated SMC1A proteins (Fig. 2a). The effect of SMC1A overexpression was analyzed in vivo using immunocompromised mouse models. The time-dependent analysis showed that the development of tumors peaked after 11 and 13 days (expressed as 50% of survival) with SMC1A c.A2027G mutation and SMC1A wild-type respectively, whereas the latency was longer (18 days) with HCT116 cells (p < 0.05, Fig. 2b). Furthermore, both the weight and volume of the tumors significantly increased in mice inoculated with SMC1A wild-type (p = 0.011, p = 0.0079) and SMC1A c.A2027G (p = 0.013, p = 0.0022) when compared with HCT116 cells (Fig. 2c-g, Additional file 11: Figure S11). No difference was found between SMC1A wild-type and SMC1A c.A2027G, though both the weight and volume of the latter were higher (Fig. 2f-g). Tumors derived from the three cell lines were morphologically indistinguishable from one another, and presented large necrotic areas and several mitotic figures (Fig. 2h).
Next, we measured global gene expression profiles in eleven tumors induced by parental and transduced HCT116 cells. In particular, three tumors induced by parental cells (907_1, 907_2 and 907_3), four induced by SMC1A wild-type transduced cells (907_4, 907_5, 907_6 and 907_7) and four induced by SMC1A c.A2027G transduced cells (907_8, 907_9, 907_10 and 907_11). Unsupervised sample clustering by principal component analysis (PCA) showed that tumors induced by mutant-SMC1A transduced cells differed to a greater extent than those induced by parental and wild-type SMC1A transduced cells (Additional file 12: Figure S12).
SMC1A wild-type and SMC1A c.A2027G displayed 744 (401 down and 343 up) and 742 (486 down and 256 up) dysregulated genes respectively when compared with parental HCT116-induced tumors (Fig. 3a). The transcriptional effects were small, with fold changes ranging from + 0.9 to − 0.17 and from + 0.79 to − 1.00 for SMC1A wild-type and SMC1A c.A2027G respectively (Additional file 12: Tables S8, S9). Dysregulated genes belong to many biological processes (Additional file 13: Figures S13, S14, Tables S10, S11). Furthermore, 68 dysregulated genes were shared in common between tumors induced by SMC1A wild-type and SMC1A c.A2027G (Fig. 3b). All of them displayed only minor fold changes that ranged from + 0.8 to − 0.8 when compared with the baseline of parental HCT116 (Fig. 3c). RNA-seq data was validated in eleven genes (ATG12, CEP55, CHAF1A, CLDN4, H19, HIF1A, KLF5, SAT1, STEAP4, TACSTD2 and TOMM40) by quantitative RT-PCR experiments (Additional file 14, Figures S15, S16). These genes were chosen because they showed differential expression between SMC1A wild-type and SMC1A c.A2027G tumors (Fig. 3c). In addition, since most of them are involved in cancer development their differential expression could explain the aggressiveness of SMC1A c.A2027G tumors.
CRC is the third most common human malignancy worldwide, with an estimated 135,430 new cases and 50,260 deaths in 2017 in the United States alone . CRC progresses through a series of histopathologic stages ranging from dysplastic crypts to malignant cancers. Most CRC is characterized by CIN, and clinical management of these tumors poses a serious dilemma due to its worse prognosis. Though the molecular basis for CIN is just beginning to be discovered, it has been suggested that CIN is an early event in cancer development, initiating CRC.
To characterize somatic alteration in CRC, 48 samples (16 normal mucosa, 16 adenomas and 16 carcinomas) were profiled for CNVs with OncoScan. We found the gain of chromosomes 7, 13, X, 8q and 20q and the loss of chromosomes 18, 8p and 17p in CRC samples. Interestingly, these chromosome changes were previously described in molecular characterization of CRC reports [50, 51]. However, we did not detect the loss of chromosomes 14q and 15q. In addition, we found that RAD21 was amplified in 75% of patients against only 2% in The Cancer Genome Atlas (TCGA) data of CRC . It is likely that these discrepancies are due to the number of samples analyzed (48 in this work vs 276 in ).
SMC1A gene has been postulated to participate in CRC development by promoting aneuploidy [29, 30] but the biological basis of cohesin involvement is currently unknown. Here we report that colorectal tissue acquires extra-copies of SMC1A gene during tumorigenesis. In addition, SMC1A expression is significantly more robust in carcinomas than in normal mucosa and early adenomas. Recently, the overexpression of SMC1A was identified as a predictor of poor prognosis in late-stage CRC . These findings suggest that overexpression of SMC1A plays a role in cancer pathogenesis. This notion is corroborated by the finding that human primary fibroblasts overexpressing SMC3, the molecular partner of SMC1A in the cohesin core, showed evidence of cell transformation, including anchorage-independent growth and foci of transformation .
Interestingly, in addition to SMC1A overexpression, carcinoma samples display more SMC1A mutations than adenomas and about 40% of carcinomas are characterized by carrying several SMC1A mutations in the same sample, suggesting that different clonal populations arise during cancer development. Furthermore, the finding that neutral mutations were identified only in carcinomas suggests that the mutation rate of SMC1A is higher in carcinoma than in the early stage of cancer development. Previously we showed that SMC1A mutations decrease from early adenomas to colorectal cancers . This apparent contradiction could be due to sample selection. In fact, in the present work we selected only patients whose mucosa, adenoma and carcinoma were available while in a previous work  adenoma and carcinoma samples were selected independently. We postulate that the combination of both SMC1A mutations and gene expression up-regulation may contribute to generating the additional genetic lesions required for a cell to fully undergo malignant transformation.
The ectopic expression of SMC1A has a positive impact on in vivo growth. In fact, the overexpression of SMC1A reduced the latency period of cancer formation, and both the size and volume of tumors in a subcutaneous murine xenograft model were significantly increased in presence of up-regulated SMC1A. Of note, tumors induced by SMC1A c.A2027G mutation were bigger than those induced by SMC1A overexpression alone, indicating that mutation has an additive positive effect on tumor growth.
SMC1A up-regulation resulted in a significant change in gene expression profiles. In fact, we found more than 700 dysregulated genes with small fold changes ranging from + 0.9 to − 1.00. Automated analysis showed that differentially expressed genes were virtually implicated in many metabolic pathways, arguing a role for cohesin in cellular proliferation, signalling pathways and other transformation-associated processes. We identified a subset of genes that which showed differential expression between SMC1A wild-type and SMC1A c.A2027G tumors. ATG12 and STEAP4 genes are particularly interesting. ATG12 regulates the apoptotic pathway by binding and inactivating BCL2 and MCL1  whereas STEAP4 is a metalloreductase, involved in responses to nutrients, inflammatory and oxidative stress, fatty acid metabolism, and glucose metabolism [54,55,56]. Recently, it has been found that ATG12 and STEAP4 were down- and up-regulated respectively in human CRC and predicted poor prognosis [57, 58]. Again, the dysregulation of CEP55, CLDN4, CHAF1A, H19 and STEAP4 have been found to significantly correlate with CRC tumor stage, aggressiveness, metastasis and poor prognosis [59,60,61,62,63]. Altogther, their differential expression could explain the greater aggressiveness of SMC1A c.A2027G tumors.
Furthermore, this data indicates that changes in SMC1A expression level have significant, though modest, effects on transcription throughout the genome and that tumorigenesis likely arises from the collective effects of small changes in the expression of many genes. At present, we do not know how many observed gene expression changes are primary (i.e., due to direct transcriptional actions of overexpressed cohesin) and how many are downstream consequences of cohesin dysregulation.
In our model, early events of chromosome missegregation lead to the occurrence of extra-copies of X chromosome in normal mucosa (Fig. 4). As consequence, this causes the overexpression of SMC1A gene because it maps to Xp11.22–Xp11.21, in a region that escapes X inactivation. Cohesin ensures correct chromosome segregation and its overexpression may lead to the failure of this process, leading to chromosome imbalance, with abnormal numbers of chromosomes in daughter cells. Mitotic missegregation can contribute to cancerogenesis in two different ways. Chromosome gain results in the activation of proto-oncogenes. For instance, the trisomy of chromosome 7 is associated with the overexpression of MET oncogene in renal carcinoma . Chromosome loss could lead to the removal of tumor suppressor gene resulting in tumorigenesis if the first allele is inactivated. The loss of chromosome 10 results in the inactivation of tumor suppressor gene PTEN in human glioblastoma .
In addition to its role in mediating sister chromatid cohesion, cohesin plays a pivotal role in gene transcription regulation. In fact, cohesin binds the chromatin near the transcription start site of many genes, including c-MYC, important for cell cycle regulation, differentiation and development [66,67,68]. Interestingly, genome-wide data has showed the presence of genomic sites where the co-localization of cohesin and c-MYC has a relevant role in co-regulating transcription . These findings allow us to hypothesize that cohesin overexpression results in transcription changes modulating the expression of specific biochemical tumor-promoting pathways.
In addition to X chromosome aneuploidy, we found that SMC1A was also mutated. The presence of a mutated SMC1A subunit could produce an inactive or functionally restricted cohesin complex. We showed that SMC1A mutations affect the association of SMC hinge dimers with DNA , suggesting a potential role of SMC proteins in remodeling chromatin architecture. We propose that both cohesin overexpression and SMC1A mutations accelerate the acquisition of a mutator phenotype driving tumorigenesis by triggering additional genetic changes that allow a growth advantage and CRC development (Fig. 4). This notion is supported by the observation that colorectal cancer cells exhibit up to 100-fold higher rates of missegregation than normal cells .
Collectively, our findings strongly suggest that SMC1A functions as a “caretaker” oncogene in CRC. This discovery has important clinical applications because SMC1A could serve as a potential target for the development of new therapies in CRC.
Affymetrix genechip command console
Ataxia-telangiectasia and Rad3 related
Copy number variations
Fragments per kilobase million
Histone deacetylase 8
Loss of heterozygosity
Molecular inversion probe
Principal component analysis
Quantitative real-time PCR
Structural maintenance of chromosome
The cancer genome atlas
Haering CH, Lowe J, Hochwagen A, Nasmyth K. Molecular architecture of SMC proteins and the yeast cohesin complex. Mol Cell. 2002;9(4):773–88.
Rolef Ben-Shahar T, Heeger S, Lehane C, East P, Flynn H, Skehel M, Uhlmann F. Eco1-dependent cohesin acetylation during establishment of sister chromatid cohesion. Science. 2008;321(5888):563–6.
Unal E, Heidinger-Pauli JM, Kim W, Guacci V, Onn I, Gygi SP, Koshland DE. A molecular determinant for the establishment of sister chromatid cohesion. Science. 2008;321(5888):566–9.
Zhang J, Shi X, Li Y, Kim BJ, Jia J, Huang Z, Yang T, Fu X, Jung SY, Wang Y, et al. Acetylation of Smc3 by Eco1 is required for S phase sister chromatid cohesion in both human and yeast. Mol Cell. 2008;31(1):143–51.
Panizza S, Tanaka T, Hochwagen A, Eisenhaber F, Nasmyth K. Pds5 cooperates with cohesin in maintaining sister chromatid cohesion. Curr Biol. 2000;10(24):1557–64.
Waizenegger IC, Hauf S, Meinke A, Peters JM. Two distinct pathways remove mammalian cohesin from chromosome arms in prophase and from centromeres in anaphase. Cell. 2000;103(3):399–410.
Arumugam P, Gruber S, Tanaka K, Haering CH, Mechtler K, Nasmyth K. ATP hydrolysis is required for cohesin's association with chromosomes. Curr Biol. 2003;13(22):1941–53.
Jahnke P, Xu W, Wulling M, Albrecht M, Gabriel H, Gillessen-Kaesbach G, Kaiser FJ. The Cohesin loading factor NIPBL recruits histone deacetylases to mediate local chromatin modifications. Nucleic Acids Res. 2008;36(20):6450–8.
Brough R, Bajrami I, Vatcheva R, Natrajan R, Reis-Filho JS, Lord CJ, Ashworth A. APRIN is a cell cycle specific BRCA2-interacting protein required for genome integrity and a predictor of outcome after chemotherapy in breast cancer. EMBO J. 2012;31(5):1160–76.
Sjogren C, Nasmyth K. Sister chromatid cohesion is required for postreplicative double-strand break repair in Saccharomyces cerevisiae. Curr Biol. 2001;11(12):991–5.
Gelot C, Guirouilh-Barbat J, Le Guen T, Dardillac E, Chailleux C, Canitrot Y, Lopez BS. The Cohesin complex prevents the end joining of distant DNA double-Strand ends. Mol Cell. 2016;61(1):15–26.
Watrin E, Peters JM. The cohesin complex is required for the DNA damage-induced G2/M checkpoint in mammalian cells. EMBO J. 2009;28(17):2625–35.
Musio A, Montagna C, Mariani T, Tilenni M, Focarelli ML, Brait L, Indino E, Benedetti PA, Chessa L, Albertini A, et al. SMC1 involvement in fragile site expression. Hum Mol Genet. 2005;14(4):525–33.
Cucco F, Palumbo E, Camerini S, D'Alessio B, Quarantotti V, Casella ML, Rizzo IM, Cukrov D, Delia D, Russo A, et al. Separase prevents genomic instability by controlling replication fork speed. Nucleic Acids Res. 2018;46(1):267–78.
Terret ME, Sherwood R, Rahman S, Qin J, Jallepalli PV. Cohesin acetylation speeds the replication fork. Nature. 2009;462(7270):231–4.
Huang J, Li K, Cai W, Liu X, Zhang Y, Orkin SH, Xu J, Yuan GC. Dissecting super-enhancer hierarchy based on chromatin interactions. Nat Commun. 2018;9(1):943.
Dowen JM, Bilodeau S, Orlando DA, Hubner MR, Abraham BJ, Spector DL, Young RA. Multiple structural maintenance of chromosome complexes at transcriptional regulatory elements. Stem Cell Reports. 2013;1(5):371–8.
Krantz ID, McCallum J, DeScipio C, Kaur M, Gillis LA, Yaeger D, Jukofsky L, Wasserman N, Bottani A, Morris CA, et al. Cornelia de Lange syndrome is caused by mutations in NIPBL, the human homolog of Drosophila melanogaster nipped-B. Nat Genet. 2004;36(6):631–5.
Tonkin ET, Wang TJ, Lisgo S, Bamshad MJ, Strachan T. NIPBL, encoding a homolog of fungal Scc2-type sister chromatid cohesion proteins and fly nipped-B, is mutated in Cornelia de Lange syndrome. Nat Genet. 2004;36(6):636–41.
Musio A, Selicorni A, Focarelli ML, Gervasini C, Milani D, Russo S, Vezzoni P, Larizza L. X-linked Cornelia de Lange syndrome owing to SMC1L1 mutations. Nat Genet. 2006;38(5):528–30.
Deardorff MA, Kaur M, Yaeger D, Rampuria A, Korolev S, Pie J, Gil-Rodriguez C, Arnedo M, Loeys B, Kline AD, et al. Mutations in cohesin complex members SMC3 and SMC1A cause a mild variant of cornelia de Lange syndrome with predominant mental retardation. Am J Hum Genet. 2007;80(3):485–94.
Deardorff MA, Bando M, Nakato R, Watrin E, Itoh T, Minamino M, Saitoh K, Komata M, Katou Y, Clark D, et al. HDAC8 mutations in Cornelia de Lange syndrome affect the cohesin acetylation cycle. Nature. 2012;489(7415):313–7.
Deardorff MA, Wilde JJ, Albrecht M, Dickinson E, Tennstedt S, Braunholz D, Monnich M, Yan Y, Xu W, Gil-Rodriguez MC, et al. RAD21 mutations cause a human cohesinopathy. Am J Hum Genet. 2012;90(6):1014–27.
Vrouwe MG, Elghalbzouri-Maghrani E, Meijers M, Schouten P, Godthelp BC, Bhuiyan ZA, Redeker EJ, Mannens MM, Mullenders LH, Pastink A, et al. Increased DNA damage sensitivity of Cornelia de Lange syndrome cells: evidence for impaired recombinational repair. Hum Mol Genet. 2007;16(12):1478–87.
Revenkova E, Focarelli ML, Susani L, Paulis M, Bassi MT, Mannini L, Frattini A, Delia D, Krantz I, Vezzoni P, et al. Cornelia de Lange syndrome mutations in SMC1A or SMC3 affect binding to DNA. Hum Mol Genet. 2009;18(3):418–27.
Cucco F, Musio A. Genome stability: what we have learned from cohesinopathies. Am J Med Genet C Semin Med Genet. 2016;172(2):171–8.
Cukrov D, Newman TAC, Leask M, Leeke B, Sarogni P, Patimo A, Kline AD, Krantz ID, Horsfield JA, Musio A. Antioxidant treatment ameliorates phenotypic features of SMC1A-mutated Cornelia de Lange syndrome in vitro and in vivo. Hum Mol Genet. 2018;27(17):3002–11.
Kon A, Shih LY, Minamino M, Sanada M, Shiraishi Y, Nagata Y, Yoshida K, Okuno Y, Bando M, Nakato R, et al. Recurrent mutations in multiple components of the cohesin complex in myeloid neoplasms. Nat Genet. 2013;45(10):1232–7.
Barber TD, McManus K, Yuen KW, Reis M, Parmigiani G, Shen D, Barrett I, Nouhi Y, Spencer F, Markowitz S, et al. Chromatid cohesion defects may underlie chromosome instability in human colorectal cancers. Proc Natl Acad Sci U S A. 2008;105(9):3443–8.
Cucco F, Servadio A, Gatti V, Bianchi P, Mannini L, Prodosmo A, De Vitis E, Basso G, Friuli A, Laghi L, et al. Mutant cohesin drives chromosomal instability in early colorectal adenomas. Hum Mol Genet. 2014;23(25):6773–8.
Solomon DA, Kim JS, Bondaruk J, Shariat SF, Wang ZF, Elkahloun AG, Ozawa T, Gerard J, Zhuang D, Zhang S, et al. Frequent truncating mutations of STAG2 in bladder cancer. Nat Genet. 2013;45(12):1428–30.
Solomon DA, Kim T, Diaz-Martinez LA, Fair J, Elkahloun AG, Harris BT, Toretsky JA, Rosenberg SA, Shukla N, Ladanyi M, et al. Mutational inactivation of STAG2 causes aneuploidy in human cancer. Science. 2011;333(6045):1039–43.
Lawrence MS, Stojanov P, Mermel CH, Robinson JT, Garraway LA, Golub TR, Meyerson M, Gabriel SB, Lander ES, Getz G. Discovery and saturation analysis of cancer genes across 21 tumour types. Nature. 2014;505(7484):495–501.
Guo G, Sun X, Chen C, Wu S, Huang P, Li Z, Dean M, Huang Y, Jia W, Zhou Q, et al. Whole-genome and whole-exome sequencing of bladder cancer identifies frequent alterations in genes involved in sister chromatid cohesion and segregation. Nat Genet. 2013;45(12):1459–63.
Tirode F, Surdez D, Ma X, Parker M, Le Deley MC, Bahrami A, Zhang Z, Lapouble E, Grossetete-Lalami S, Rusch M, et al. Genomic landscape of Ewing sarcoma defines an aggressive subtype with co-association of STAG2 and TP53 mutations. Cancer Discov. 2014;4(11):1342–53.
Balbas-Martinez C, Sagrera A, Carrillo-de-Santa-Pau E, Earl J, Marquez M, Vazquez M, Lapi E, Castro-Giner F, Beltran S, Bayes M, et al. Recurrent inactivation of STAG2 in bladder cancer is not associated with aneuploidy. Nat Genet. 2013;45(12):1464–9.
Taylor CF, Platt FM, Hurst CD, Thygesen HH, Knowles MA. Frequent inactivating mutations of STAG2 in bladder cancer are associated with low tumour grade and stage and inversely related to chromosomal copy number changes. Hum Mol Genet. 2014;23(8):1964–74.
Boland CR, Goel A. Microsatellite instability in colorectal cancer. Gastroenterology. 2010;138(6):2073–87 e2073.
Pino MS, Chung DC. The chromosomal instability pathway in colon cancer. Gastroenterology. 2010;138(6):2059–72.
Hardiman KM, Ulintz PJ, Kuick RD, Hovelson DH, Gates CM, Bhasi A, Rodrigues Grant A, Liu J, Cani AK, Greenson JK, et al. Intra-tumor genetic heterogeneity in rectal cancer. Lab Investig. 2016;96(1):4–15.
Zhou W, Wang Z, Shen N, Pi W, Jiang W, Huang J, Hu Y, Li X, Sun L. Knockdown of ANLN by lentivirus inhibits cell growth and migration in human breast cancer. Mol Cell Biochem. 2015;398(1–2):11–9.
Wang J, Yu S, Cui L, Wang W, Li J, Wang K, Lao X. Role of SMC1A overexpression as a predictor of poor prognosis in late stage colorectal cancer. BMC Cancer. 2015;15:90.
Trapnell C, Roberts A, Goff L, Pertea G, Kim D, Kelley DR, Pimentel H, Salzberg SL, Rinn JL, Pachter L. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and cufflinks. Nat Protoc. 2012;7(3):562–78.
Trapnell C, Hendrickson DG, Sauvageau M, Goff L, Rinn JL, Pachter L. Differential analysis of gene regulation at transcript resolution with RNA-seq. Nat Biotechnol. 2013;31(1):46–53.
Trapnell C, Pachter L, Salzberg SL. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics. 2009;25(9):1105–11.
Kim ST, Xu B, Kastan MB. Involvement of the cohesin protein, Smc1, in Atm-dependent and independent responses to DNA damage. Genes Dev. 2002;16(5):560–70.
Yazdi PT, Wang Y, Zhao S, Patel N, Lee EY, Qin J. SMC1 is a downstream effector in the ATM/NBS1 branch of the human S-phase checkpoint. Genes Dev. 2002;16(5):571–82.
Kitagawa R, Bakkenist CJ, McKinnon PJ, Kastan MB. Phosphorylation of SMC1 is a critical downstream event in the ATM-NBS1-BRCA1 pathway. Genes Dev. 2004;18(12):1423–38.
Siegel RL, Miller KD, Jemal A. Cancer statistics, 2017. CA Cancer J Clin. 2017;67(1):7–30.
Wood LD, Parsons DW, Jones S, Lin J, Sjoblom T, Leary RJ, Shen D, Boca SM, Barber T, Ptak J, et al. The genomic landscapes of human breast and colorectal cancers. Science. 2007;318(5853):1108–13.
Cancer Genome Atlas N. Comprehensive molecular characterization of human colon and rectal cancer. Nature. 2012;487(7407):330–7.
Ghiselli G, Iozzo RV. Overexpression of bamacan/SMC3 causes transformation. J Biol Chem. 2000;275(27):20235–8.
Rubinstein AD, Eisenstein M, Ber Y, Bialik S, Kimchi A. The autophagy protein Atg12 associates with antiapoptotic Bcl-2 family members to promote mitochondrial apoptosis. Mol Cell. 2011;44(5):698–709.
Ohgami RS, Campagna DR, McDonald A, Fleming MD. The Steap proteins are metalloreductases. Blood. 2006;108(4):1388–94.
Wellen KE, Fucho R, Gregor MF, Furuhashi M, Morgan C, Lindstad T, Vaillancourt E, Gorgun CZ, Saatcioglu F, Hotamisligil GS. Coordinated regulation of nutrient and inflammatory responses by STAMP2 is essential for metabolic homeostasis. Cell. 2007;129(3):537–48.
Jin Y, Wang L, Qu S, Sheng X, Kristian A, Maelandsmo GM, Pallmann N, Yuca E, Tekedereli I, Gorgulu K, et al. STAMP2 increases oxidative stress and is critical for prostate cancer. EMBO Mol Med. 2015;7(3):315–31.
Xue X, Bredell BX, Anderson ER, Martin A, Mays C, Nagao-Kitamoto H, Huang S, Gyorffy B, Greenson JK, Hardiman K, et al. Quantitative proteomics identifies STEAP4 as a critical regulator of mitochondrial dysfunction linking inflammation and colon cancer. Proc Natl Acad Sci U S A. 2017;114(45):E9608–17.
Yoo BH, Khan IA, Koomson A, Gowda P, Sasazuki T, Shirasawa S, Gujar S, Rosen KV. Oncogenic RAS-induced downregulation of ATG12 is required for survival of malignant intestinal epithelial cells. Autophagy. 2018;14(1):134–51.
Jeffery J, Sinha D, Srihari S, Kalimutho M, Khanna KK. Beyond cytokinesis: the emerging roles of CEP55 in tumorigenesis. Oncogene. 2016;35(6):683–90.
Ersoz S, Mungan S, Cobanoglu U, Turgutalp H, Ozoran Y. Prognostic importance of Claudin-1 and Claudin-4 expression in colon carcinomas. Pathol Res Pract. 2011;207(5):285–9.
Wu Z, Cui F, Yu F, Peng X, Jiang T, Chen D, Lu S, Tang H, Peng Z. Up-regulation of CHAF1A, a poor prognostic factor, facilitates cell proliferation of colon cancer. Biochem Biophys Res Commun. 2014;449(2):208–15.
Chen SW, Zhu J, Ma J, Zhang JL, Zuo S, Chen GW, Wang X, Pan YS, Liu YC, Wang PY. Overexpression of long non-coding RNA H19 is associated with unfavorable prognosis in patients with colorectal cancer and increased proliferation and migration in colon cancer cells. Oncol Lett. 2017;14(2):2446–52.
Bateman NW, Tan D, Pestell RG, Black JD, Black AR. Intestinal tumor progression is associated with altered function of KLF5. J Biol Chem. 2004;279(13):12093–101.
Zhuang Z, Park WS, Pack S, Schmidt L, Vortmeyer AO, Pak E, Pham T, Weil RJ, Candidus S, Lubensky IA, et al. Trisomy 7-harbouring non-random duplication of the mutant MET allele in hereditary papillary renal carcinomas. Nat Genet. 1998;20(1):66–9.
Wang SI, Puc J, Li J, Bruce JN, Cairns P, Sidransky D, Parsons R. Somatic mutations of PTEN in glioblastoma multiforme. Cancer Res. 1997;57(19):4183–6.
Stedman W, Kang H, Lin S, Kissil JL, Bartolomei MS, Lieberman PM. Cohesins localize with CTCF at the KSHV latency control region and at cellular c-myc and H19/Igf2 insulators. EMBO J. 2008;27(4):654–66.
Rhodes JM, Bentley FK, Print CG, Dorsett D, Misulovin Z, Dickinson EJ, Crosier KE, Crosier PS, Horsfield JA. Positive regulation of c-Myc by cohesin is direct, and evolutionarily conserved. Dev Biol. 2010;344(2):637–49.
Gimigliano A, Mannini L, Bianchi L, Puglia M, Deardorff MA, Menga S, Krantz ID, Musio A, Bini L. Proteomic profile identifies dysregulated pathways in Cornelia de Lange syndrome cells with distinct mutations in SMC1A and SMC3 genes. J Proteome Res. 2012;11(12):6111–23.
Yan J, Enge M, Whitington T, Dave K, Liu J, Sur I, Schmierer B, Jolma A, Kivioja T, Taipale M, et al. Transcription factor binding in human cells occurs in dense clusters formed around cohesin anchor sites. Cell. 2013;154(4):801–13.
Lengauer C, Kinzler KW, Vogelstein B. Genetic instability in colorectal cancers. Nature. 1997;386:623–7.
We thank Dr. Paolo Vezzoni for helpful discussion.
This work was supported by a grant from Associazione Italiana Ricerca sul Cancro (IG17374) to A.M. and (IG18517) to S.S.
Availability of data and materials
The datasets supporting the conclusions of this article are included within the article and its additional files.
Ethics approval and consent to participate
Approval was granted by Azienda Ospedaliero-Universitaria Pisana Ethics Committee (protocol #3867). All animal experiments were reviewed and approved by the internal Review Board (OPBA) and authorized by the Italian Ministry of Health, accordingly with the current national and European regulations and guidelines for the care and use of laboratory animals (D.L. 26/2014; 86/609/EEC Directive).
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Table S1. Features of CRC patients analyzed by OncoScan. Table S2. Features of CRC patients analyzed by immunohistochemistry. (PDF 167 kb)
Table S3. Nucleotide primers used for amplifying the SMC1A gene. (PDF 23 kb)
Table S4. Number of reads in tumor samples analysed by RNA-seq. (PDF 12 kb)
Table S5. Primers sequences used for validating RNA-seq data by RT-qPCR. (PDF 41 kb)
Figure S1. (a) Violin plot showing LOH during cancer progression. (b) Violin plot showing CNVs in mucosa, adenoma and carcinoma samples. (c) Violin plot showing the percentage of genome changed during tumorigenesis. Figure S2. CNVs profile in colorectal mucosa. Example of representative CNVs in subject 12 is reported. Figure S3. CNVs profile in colorectal adenoma. Example of representative CNVs in subject 12 is reported. Figure S4. CNVs profile in colorectal carcinoma. Example of representative CNVs in subject 12 is reported. The OncoScan analysis identified the gain of whole chromosomes 7, 13 and X; the partial gain of chromosomes 8 and 20; the loss of chromosome 18 and the partial loss of chromosomes 1, 2, 5, 8 and 17. Figure S5. (a) Circos plot showing the distribution of the events in mucosa samples. (b) Circos plot showing the distribution of the events in adenoma samples. (c) Circos plot showing the distribution of the events in carcinoma samples. Colored dots represent the events. The black circle line is the baseline: the dots inside represent the deletions and the outside ones show the amplifications. (PDF 14770 kb)
Table S6. SMC1A synonymous variants identified in carcinoma samples. (PDF 30 kb)
Figure S6. SMC1A mutational screening. Example of representative SMC1A sequencing is reported, showing multiple nucleotide changes in the carcinoma deriving from subject 2. (PDF 64 kb)
Table S7. Prediction of mutation effects on SMC1A protein using Mutation Tester and PolyPhen2. (PDF 17 kb)
Figure S7. Violin plot showing the distribution of SMC1A-positive cells. Figure S8. Violin plot showing the distribution of SMC1A staining intensity. Figure S9. Immunohistochemistry data was validated by RT-qPCR. *p < 0.05. (PDF 1153 kb)
Figure S10. SMC1A immunohistochemistry in mucosa, adenoma and carcinoma (PDF 1996 kb)
Figure S11. Effects of SMC1A mutation and overexpression in in vivo. (a) Representative images of tumours formed in the mice with HCT116 cells. (b) Tumours formed in the mice in which HCT116 SMC1A wild-type cells were implanted. (c) Tumours induced by HCT116 SMC1A c.A2027G cells. (PDF 1199 kb)
Figure S12. Classification of three tumors deriving from the inoculation of HCT116 (907_1, 907_2 and 907_3, red circle), four deriving from HCT116 overexpressing SMC1A wild-type (907_4, 907_5, 907_6 and 907_7, blue circle) and four deriving from HCT116 harboring SMC1A c.A2027G mutation (907_8, 907_9, 907_10 and 907_11, green circle) by gene expression. Table S8. Dysregulated genes in HCT116 SMC1A wild-type induced tumors. Table S9. Dysregulated genes in HCT116 SMC1A c.A2027G induced tumors. (PDF 851 kb)
Figure S13. GO term enrichment analysis of biological processes that were significantly overrepresented when considering differentially expressed genes in HCT116 SMC1A wild-type cells. Figure S14. GO term enrichment analysis of biological process that were significantly overrepresented when considering differentially expressed genes in HCT116 SMC1A c.A2027G cells. Table S10. Dysregulated pathways identified in tumors induced by SMC1A wild-type with p < 0.01. Table S11. Dysregulated pathways identified in tumors induced by SMC1A c.A2027G with p < 0.01. (PDF 3535 kb)
Figure S15. Dysregulated genes in induced tumors. RNA-seq data was validated for eleven genes, ATG12, CEP55, CHAF1A, CLDN4, H19, HIF1A, KLF5, SAT1, STEAP4, TACSTD2 and TOMM40, by RT-qPCR. *p < 0.05. Figure S16. Heatmap of the eleven dysregulated genes validated by RT-qPCR. (PDF 3904 kb)