Contributed equally
Lung cancer remains the leading cause of cancer-related mortality for both men and women. Tumor recurrence and metastasis is the major cause of lung cancer treatment failure and death. The microRNA-200 (miR-200) family is a powerful regulator of the epithelial-mesenchymal transition (EMT) process, which is essential in tumor metastasis. Nevertheless, miR-200 family target genes that promote metastasis in non-small cell lung cancer (NSCLC) remain largely unknown. Here, we sought to investigate whether the microRNA-200 family regulates our previously identified NSCLC prognostic marker genes associated with metastasis, as potential molecular targets. Novel miRNA targets were predicted using bioinformatics tools based on correlation analyses of miRNA and mRNA expression in 57 squamous cell lung cancer tumor samples. The predicted target genes were validated with quantitative RT-PCR assays and western blot analysis following re-expression of miR-200a, -200b and -200c in the metastatic NSCLC H1299 cell line. The results show that restoring miR-200a or miR-200c in H1299 cells induces downregulation of
Lung cancer remains the leading cause of cancer-related mortality in the world, with an overall 5-year survival rate of 15%. Approximately 85% of lung cancer cases are non-small cell lung cancer (NSCLC) (
In the past few years, microRNAs (miRNAs) have emerged as promising molecular factors with potential for clinical applications in cancer diagnosis and therapy (
The microRNA-200 (miR-200) family, represented by miR-200a, -200b, -200c, -141 and -429, is a marker and powerful regulator of the EMT process. Its functions include maintaining the epithelial phenotype of tissues through suppression of the EMT-inducing transcription factors zinc finger E-box binding homeobox 1 and 2 (ZEB1 and ZEB2) (
The goal of this study is to identify potential targets of miR-200 family essential in NSCLC metastasis and clinical outcome. Our previous studies identified prognostic biomarkers associated with metastasis in early stage NSCLC tumors not treated with chemotherapy (
A total of 130 lung squamous cell carcinoma (SCC) samples were analyzed in this study. The patient characteristics were described in a previous publication (
The mRNA expression levels of the lung cancer prognostic markers identified in our previous studies (
The miRNA targets predicted by TargetScan are based on the presence of conserved 8mer, 7mer and 6mer sites that match the seed region of each miRNA (
The binding sites between each miRNA-mRNA pair was retrieved from the
Small airway epithelial cells (SAEC) and normal human bronchial/tracheal epithelial cells (NHBE) were obtained from Lonza Walkersville Inc. (Walkersville, MD). Human non-small cell lung cancer cell line H1299 and human immortalized lung epithelial cell line BEAS-2B were purchased from the American Type Culture Collection (ATCC, Manassas, VA). SAEC cells were cultured according to the supplier’s recommendations in SABM medium supplemented with 52
Human miRIDIAN shMIMIC lentiviral miRNA particles (hsa-miR-200a: UAACACUGUCUGGUA ACGAUGU, hsa-miR-200b: UAAUACUGCCUGGUAAUG AUGA, hsa-miR-200c: UAAUACUGCCGGGUAAUGA UGGA, and control scrambled microRNA were purchased from Open Biosystems (Huntsville, AL) and used for infection of target cells in the presence of 4
Cells were lysed in 1X SDS lysis buffer (50 mM Tris-HCl, pH 6.8, 2% SDS, 10% glycerol). Total protein was quantified by the BCA method. β-mercaptoethanol was added to lysates to a final concentration 100 mM. Equal amounts of total protein were separated by 4–12% SDS-PAGE and transferred to a PVDF membrane. Membranes were blocked 1 h with 5% non-fat milk in 1X PBS containing 0.05% Tween-20. Membranes were then incubated for 1 h at room temperature with primary antibodies. After incubation with the primary antibody, membranes were washed thrice in 1X PBS with 0.05% Tween-20 for 5 min each and blocked for 7 min in blocking solution. Membranes were incubated for 1 h at room temperature with horseradish peroxidase (HRP) conjugated donkey anti-mouse IgG or donkey anti-rabbit IgG in 1X PBS with 0.05% Tween-20. Membranes were then washed five times for 5 min in PBS-Tween-20 and finally developed with HyGLO Western Blotting Substrate (Denville Scientific) according to the instructions of the manufacturer. Protein band intensity was determined using FluorChem® Q software (AlphaInnotech, Santa Clara, CA). Relative protein level was determined after normalization to tubulin and relative to negative control (miR-scr) samples. The following antibodies were used:
Total RNA was extracted using the mirVana® kit (Ambion Inc., Austin, TX) according to the manufacturer’s protocol. To ensure a good RNA quality, the quality and integrity of the total RNA was evaluated using 28S/18S ratio and a visual image of the 28S and 18S bands were evaluated on the 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA). RNA isolated using this method yielded a very good quality, with a RIN number ≥9. Concentration of the total RNA was assessed using the NanoDrop-1000 Spectrophotometer (NanoDrop Technologies, Germany).
Complementary DNA (cDNA) was generated using total RNA according to the TaqMan® MicroRNA Reverse Transcription protocol (Applied Biosystems Inc.). Quantitative RT-PCR for microRNA was performed using TaqMan MicroRNA assays (Applied Biosystems Inc.). Human U47 small nuclear RNA was used as an endogenous control. The expression levels of miRNAs were quantified using ABI 7500 quantitative real-time instrument and SDS software (Applied Biosystems Inc.). The abundance of miRNA is expressed as Ct (threshold fluorescence) which gives the number of cycles required to reach threshold fluorescence. Real-time PCR for target genes was determined using total RNA and cDNA was generated using a High-Capacity cDNA Reverse Transcription kit and TaqMan gene expression assays (Applied Biosystems Inc.). E-cadherin (CDH1) mRNA was measured using SYBR-Green Master mix and CDH1 specific primers according to manufacturer’s protocol (Applied Biosystems Inc.). All qRT-PCR reactions were performed on 7500 instrument (Applied Biosystems Inc.). In the qRT-PCR analysis of E-cadherin, the dissociation curve showed the absence of a secondary peak, indicating no presence of primer dimer. Specificity of the PCR product obtained from SYBR-Green reactions was verified by sequencing. The expression level of each gene was determined by following formulas: fold change = 2−ΔΔCt, where ΔCt (cycle threshold) = Cttarget gene - Ctendogenous control gene, and ΔΔCt = ΔCttreated sample - ΔCtcontrol sample. The expression level of the analyzed genes is reported as fold change relative to negative miR-scrambled (-src) infected samples. The human UBC gene was used as an endogenous control gene.
In this study, a predicted gene was considered a confirmed target if the mRNA level was significantly downregulated or the protein level was downregulated at least 15% relative to negative control samples. Not all of the predicted targets were analyzed at the protein level due to the lack of specificity of commercially available antibodies.
Ingenuity pathway analysis (IPA) software (Ingenuity Systems, Redwood City, CA) was used to derive curated molecular interactions reported in the scientific literature. These interactions included both physical and functional interactions, as well as interactions representing pathway relevance. In this study, in order to delineate molecular networks of genes interacting with the miR-200 family and novel molecular targets, a core analysis was employed to identify the most relevant canonical pathways, biological functions and physiological processes from the interactions reported in the IPA database. We then selected pathways that were statistically significant with a p<0.05 in adjusted Benjamini-Hochberg tests.
The statistical significance of the difference between groups was determined by un-paired t-tests at p≤0.05. The qRT-PCR expression data are presented as mean ± SEM.
To screen for potential miRNA regulators of our previously identified lung cancer prognostic gene signatures (
Several miRNAs, including the miR-200 family, were predicted to target multiple prognostic biomarkers. We focused on the miR-200 family because of its reported role in tumor metastasis. The miRNA-200 family is represented by miR-200a, miR-200b, miR-200c, miR-141 and miR-429, based on their genomic location and primary sequence. Based on sequence similarity, the miR-200 family is divided into two subclasses: one class includes miR-200b, -200c and -429, and the other class includes miR-200a and miR-141 (
The predicted targets for hsa-miR-200a include deleted in liver cancer 1 gene (
We analyzed the expression levels of miR-200a, -200b -and 200c in a metastatic human NSCLC model, H1299 cells. Normal human small airway epithelial cells (SAEC) were used as control cells. The expression level of miR-200a, -200b and -200c in H1299 cells was at the detection limit (
At the protein level,
The results of the present study show that miRNA-200a regulates
To further substantiate the regulatory effects of miR-200 on these lung cancer prognostic markers, SCC patient samples (n=57) (
The regulation of miR-200 on these predicted target genes was also evaluated in human immortalized lung epithelial cells BEAS-2B. The overexpression of miR-200b in these cells resulted in significantly downregulated mRNA level of
After the potential molecular targets of the miR-200 family were shown in the present study, we sought to explore the effect of miR-200s on EMT in the metastatic NSCLC cells. Re-expression of miR-200c induced a 1.53-fold upregulation of E-cadherin (
Molecular network interactions and significant canonical signaling pathways associated with miR-200s and their predicted molecular targets were retrieved using IPA. The molecular network map shows interactions between the miR-200s and their known target genes,
The IPA functional analysis found a total of 69 canonical pathways associated with the miR-200 network, of which 13 canonical pathways were statistically significant (adjusted p<0.05 with Benjamini-Hochberg tests;
Lung cancer is a dynamic and diverse disease and is associated with numerous somatic mutations, deletions and amplification events. Tumor recurrence and metastasis causes lethality and failure in lung cancer treatment. About 35–50% of stage I NSCLC patients will develop and die from tumor recurrence within 5 years following surgery (
miRNAs are small non-coding RNAs that regulate gene expression via degradation or translational inhibition of target mRNAs. Importantly, one miRNA can regulate the expression of multiple genes because it can bind to its mRNA targets regardless whether there is perfect seed sequence complementarity (
Deregulated expression of miR-200 family members has been observed in multiple cancer types (
On the other hand, overexpression of miR-200 was also found in cholangiocarcinoma malignant cells compared to non-malignant cells (
Despite strong evidence that miR-200s inhibit EMT and suppress cancer cell invasion, several functional overexpression studies have yielded conflicting results on the role of miR-200s in metastasis, supporting both their anti-metastatic (
The present study sought to determine whether some of our previously identified human lung cancer prognostic markers are potential molecular targets of miR-200a, -200b and -200c microRNAs. The study goal was to explore whether biomarkers associated with NSCLC poor prognosis are functionally involved in EMT and metastasis through miR-200 regulation. In order to identify new molecular targets of the miR-200 family, we used the H1299 NSCLC cell line. This cell line is
This study identified a regulation of miR-200 family on their potential novel molecular targets. The results show that
IPA functional pathway analyses found that the miR-200 molecular network involved canonical pathways of immune response, molecular mechanisms of cancer, metastasis signaling transduction, cell-cell communication, proliferation and DNA repair. These results indicate that miR-200 is essential in regulating signaling pathways responsible for many biological functions and complex molecular mechanisms (
In conclusion, this study combined computational predictions and quantitative experimental validations to demonstrate that the miR-200 family regulates multiple NSCLC prognostic marker genes. The identified regulation, direct or indirect, provides important insights of possible microRNA regulatory mechanisms in EMT and lung cancer metastasis and lays a foundation for future functional analysis. These potential molecular targets, each with significant prognostic value in NSCLC patients, are involved in the regulation of gene transcription and signal transduction pathways. The findings of the miR-200 downregulation of
We thank Rebecca Raese for her help in editing the manuscript. We thank Yuya Kudo for his assistance in figure preparation. This project was supported by the American Cancer Society (122300-IRG-09-061-04-IRG to A.V.I.) and Susan G. Komen (KG110350 to A.V.I.). Software license for Ingenuity Pathway Analysis was supported by NIH/NCRR P2016477. This project is supported by the NIH R01LM009500 (PI: N.L.G.) and NCRR P20 RR16440 ARRA Supplement (PD: N.L.G.). The findings and conclusions in this report are those of the author(s) and do not necessarily represent the views of the National Institute for Occupational Safety and Health.
Computational prediction of miRNA and target lung cancer prognostic genes and experimental results focusing on miR-200 family. (A) Overview of bioinformatic prediction of miRNA target genes and functional assays. (B) Gene expression fold change of miR-200 in primary squamous cell lung cancer tumors vs. normal lung tissues in the patient cohort from Raponi
Relative mRNA levels of the predicted targets of miR-200 in H1299. (A) Relative mRNA expression of predicted miR-200 target genes in H1299 cells infected with miR-scrambled and miR-200a, -200b and -200c. These targets were predicted with
Protein levels of the predicted molecular targets of miR-200 in H1299 cells after infection with miR-scr (scrambled) or with miR-200a, -200b or -200c. miR-scr was used as a negative control. (A) 3′-UTR sequences of the miR-200a, -200b and -200c putative binding sites of target genes is given in the 5′- to 3′-orientation. (B) Western blot analysis of protein levels in H1299 cells over-expressing miR-200a, -200b, -200c or miR-scr. One representative blot is shown. The experiments were repeated in three biological replicates. Tubulin was used as a loading control. (C) Semi-quantitative analysis of protein levels relative to negative control miR-scr. Protein level was determined as described in Materials and methods.
Relative mRNA and protein levels of the predicted molecular targets of miR-200 in BEAS-2B. (A) Relative mRNA expression of predicted miR-200 target genes in BEAS-2B cells infected with miR-scrambled and miR-200a, -200b and -200c. (B) Western blot analysis of protein levels in BEAS-2B cells overexpressing miR-200a, -200b, -200c or miR-scr. One representative blot is shown. The experiments were repeated in three biological replicates. Tubulin was used as a loading control. (C) Semi-quantitative analysis of protein levels relative to negative control miR-scr. Protein level was determined as described in Materials and methods. DLC1 protein was not detected in BEAS-2B in western blots.
Molecular network analysis of the miR-200 family and potential molecular targets with Ingenuity pathway analysis (IPA).
Proposed mechanisms of the miR-200 regulation in tumor initiation and metastasis.
The predicted target genes analyzed in this study.
| Gene symbol | Gene name | Assay ID | Function | Pathway | Remarks |
|---|---|---|---|---|---|
| AHNAK nucleoprotein (desmoyokin) | Hs00225285_m1 | Protein-protein binding | Signal transduction | NSCLC prognostic biomarker ( | |
| Alpha thalassemia/mental retardation syndrome X-linked | Hs00230877_m1 | Transcriptional regulator, chromatin remodeling | Transcription | NSCLC prognostic biomarker ( | |
| Deleted liver cancer 1 | Hs00183436_m1 | Regulation of small GTP-binding proteins | Signal transduction | Tumor suppressor gene ( | |
| E2F transcription factor-4 | Hs00608098_m1 | Transcriptional factor, cell cycle, apoptosis | Transcription | NSCLC prognostic maker ( | |
| Hemochromatosis | Hs00373474_m1 | Regulation of body iron metabolism | Iron metabolism | NSCLC prognostic biomarker ( | |
| Heterogeneous nuclear ribonucleo protein-A3 | Hs00864845_s1 | Cytoplasmic RNA binding and trafficking, protein binding | Signal transduction | NSCLC prognostic biomarker ( | |
| Thrombospondin 1 | Hs00962914_m1 | Extracellular adhesive glycoprotein | Protein interaction | NSCLC prognostic biomarker ( | |
| Ubiquitin-conjugating enzyme E2I | Hs00163336_m1 | Ubiquitin-activating protein | Protein degradation | NSCLC prognostic biomarker ( | |
| Ubiquitin-like modifier activating enzyme-6 | Hs00414964_m1 | Ubiquitin-conjugation for protein degradation | Protein degradation | NSCLC prognostic biomarker ( |
Genes regulated by miR-200a, -200b and -200c in H1299.
| Genes | |||||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
| |||||||
| Predicted | Downregulated | Predicted | Downregulated | Predicted | Downregulated | ||||
| mRNA | Protein | mRNA | Protein | mRNA | Protein | ||||
The gene is a predicted target for the corresponding miRNA;
downregulation at mRNA level;
downregulation at protein level.
Correlation between the expression of miR-200 and its regulated genes in squamous cell lung cancer patient tumors (n=57).
| Genes | |||
|---|---|---|---|
| −0.0629 | NA | −0.301 | |
| −0.313 | −0.374 | −0.496 | |
| −0.193 | −0.253 | −0.393 | |
| NA | 0.264 | NA | |
| −0.426 | −0.458 | −0.484 | |
| −0.395 | −0.379 | −0.382 |
Statistically significant at p<0.05. NA, gene not regulated by miR-200 in H1299.
Borderline significant at p=0.057.
Top 13 significant canonical pathways related to the miR-200 molecular network in Ingenuity pathway analysis.
| Canonical pathways | P-value | Molecules |
|---|---|---|
| Virus entry via endocytic pathways | 0.0002 | B2M, CAV1, TFRC |
| Allograft rejection signaling | 0.0019 | B2M, IL2 |
| OX40 signaling pathway | 0.0025 | B2M, IL2 |
| Caveolar-mediated endocytosis signaling | 0.0044 | B2M, CAV1 |
| Communication between innate and adaptive immune cells | 0.0056 | B2M, IL2 |
| Chronic myeloid leukemia signaling | 0.0071 | CTBP1, E2F4 |
| Molecular mechanisms of cancer | 0.0102 | DAXX, CDH1, E2F4 |
| DNA double-strand break repair by homologous recombination | 0.0191 | ATRX |
| Lipid antigen presentation by CD1 | 0.0257 | B2M |
| Antiproliferative role of TOB in T cell signaling | 0.0355 | IL2 |
| Colorectal cancer metastasis signaling | 0.0407 | CDH1, E2F4 |
| Role of CHK proteins in cell cycle checkpoint control | 0.0447 | E2F4 |
| Cell cycle regulation by BTG family proteins | 0.0468 | E2F4 |
Top 25 significant disease and disorder functions related to the miR-200 molecular network in Ingenuity pathway analysis.
| Disease and Disorders | P-value | Molecules |
|---|---|---|
| Genetic disorder | 0.00004 | B2M, CDH1, IL2, TFR2, ZEB2, ATRX, mir-200, CAV1, ZEB1, HFE, AGTR1 |
| Metabolic disease | 0.00004 | B2M, TFR2, HFE, AGTR1 |
| Cancer | 0.00005 | B2M, E2F4, mir-200, ATRX, ZEB1, DLC1, CTBP1, CDH1, BMI1, IL2, ZEB2, CAV1, TFRC, AGTR1 |
| Reproductive system disease | 0.00005 | B2M, CDH1, BMI1, IL2, ATRX, mir-200, CAV1, TFRC, DLC1, AGTR1 |
| Gastrointestinal disease | 0.00021 | B2M, CDH1, BMI1, IL2, mir-200, ZEB2, CAV1, TFRC, AGTR1 |
| Hepatic system disease | 0.00021 | B2M, BMI1, IL2, mir-200, DLC1, AGTR1 |
| Organismal injury and abnormalities | 0.00021 | IL2, AGTR1 |
| Infection mechanism | 0.00053 | CTBP1, E2F4, IL2, CAV1, TFRC, ZEB1 |
| Infectious disease | 0.00053 | CTBP1, B2M, IL2, CAV1, TFRC, AGTR1 |
| Hematological disease | 0.00122 | B2M, CTBP1, E2F4, IL2, ATRX, CAV1, DLC1, AGTR1, HFE |
| Dermatological diseases and conditions | 0.00138 | CAV1, ZEB1, HFE |
| Inflammatory response | 0.00138 | B2M, CDH1, IL2, ZEB1, AGTR1 |
| Ophthalmic disease | 0.00138 | ZEB1 |
| Respiratory disease | 0.00186 | CTBP1, B2M, CDH1, IL2, mir-200, CAV1, AGTR1 |
| Immunological disease | 0.00247 | B2M, DAXX, E2F4, CDH1, HNRNPA3, IL2, TFRC, DLC1, AGTR1 |
| Antimicrobial response | 0.00276 | IL2 |
| Cardiovascular disease | 0.00276 | B2M, IL2, CAV1, AGTR1 |
| Inflammatory disease | 0.00401 | B2M, DAXX, CDH1, HNRNPA3, IL2, ZEB2, TFRC, ZEB1, DLC1, AGTR1 |
| Connective tissue disorders | 0.00501 | B2M, DAXX, CDH1, HNRNPA3, IL2, TFRC, DLC1, AGTR1 |
| Neurological disease | 0.00550 | BMI1, ZEB2, CAV1, AGTR1 |
| Renal and urological disease | 0.00550 | B2M, CDH1, AGTR1 |
| Skeletal and muscular disorders | 0.00733 | B2M, DAXX, CDH1, HNRNPA3, BMI1, IL2, TFRC, DLC1 |
| Developmental disorder | 0.00961 | ATRX, ZEB2, AGTR1 |
| Nutritional disease | 0.01230 | IL2 |
| Endocrine system disorders | 0.04720 | AGTR1 |