
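Protein-based bioinformatics analysis of cervical cancer-related genes

Corresponding author: Yu Qi, yuqi@sxmu.edu.cn
DOI: 10.12201/bmr.202303.00017
Statement: Papers published on this preprint system are intended only for the exchange and sharing of the latest research results. They have not been peer reviewed and therefore should not be applied directly to guide clinical practice.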
  • Abstract: Objective: This study used bioinformatics to mine genes associated with cervical cancer (CESC) and to examine the expression characteristics and clinical significance of differentially expressed genes (DEGs) closely related to HPV E6/E7. Methods: Cervical tissue expression data and clinical information for CESC in TCGA and GTEx were obtained from UCSC as the training set, and the CESC-related expression microarray dataset GSE63514 was obtained from GEO as the validation set. DEGs between tumor and normal samples were screened with the R package limma, and a Venn diagram was drawn against the E6/E7 protein-related genes in the MigDB database. Batch survival analysis was performed with the survival package, and the results were validated by ROC analysis and by protein expression levels. Key genes were then identified through copy number variation and methylation correlation analysis. Finally, a specific co-expression network was constructed and subjected to enrichment analysis and immune infiltration analysis. Results and conclusions: A total of 101 DEGs were associated with the HPV E6/E7 proteins, and survival and ROC analysis screened out 8 key DEGs. Validation at the protein level showed that four of these genes had expression consistent with the mRNA level: CHAF1B, E2F1, MCM4, and PCNA. Copy number and methylation correlation analysis then identified three genes as significant: E2F1, MCM4, and PCNA. Pathway analysis showed that genes in the specific co-expression network were significantly enriched in DNA replication, chromosome organization, nuclear chromosome, and related terms. Immune correlation analysis showed that the key genes were significantly correlated with CD4 T cells, B cells, and neutrophils. E2F1, MCM4, and PCNA are key genes, and DNA replication and chromosome organization are molecular mechanisms, significantly associated with the occurrence and development of CESC and with the proteins encoded by HPV E6/E7.

    Key words: cervical cancer; cervical tissue; differentially expressed genes; HPV E6/E7-encoded proteins

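    Submitted: 2023-03-22

    Copyright statement: The authors independently hold the copyright to this paper; the preprint system holds only the right to preserve the paper permanently. No one may reuse it without permission.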
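The DEG screening and Venn-diagram step described in the abstract can be illustrated with a minimal sketch. The authors report using the R package limma; the Python below is not their code. The thresholds (|log2 fold change| >= 1, adjusted p < 0.05), the function names, and every gene label and number in the toy data are assumptions made up for illustration and are not results from the study.

```python
from typing import Iterable, List, NamedTuple, Set


class DiffResult(NamedTuple):
    """One row of a differential-expression table (hypothetical layout)."""
    gene: str
    log2_fc: float   # log2 fold change, tumor vs. normal
    adj_p: float     # multiple-testing-adjusted p-value


def select_degs(results: Iterable[DiffResult],
                min_abs_log2_fc: float = 1.0,   # assumed cutoff
                max_adj_p: float = 0.05) -> Set[str]:  # assumed cutoff
    """Keep genes passing both the fold-change and the adjusted-p cutoff."""
    return {
        r.gene
        for r in results
        if abs(r.log2_fc) >= min_abs_log2_fc and r.adj_p < max_adj_p
    }


def overlap_with_gene_set(degs: Set[str], gene_set: Iterable[str]) -> List[str]:
    """Return the Venn-diagram intersection of DEGs and a curated gene set."""
    return sorted(degs & set(gene_set))


# Toy inputs: placeholder gene labels and invented values, not study data.
toy_results = [
    DiffResult("G1", 2.1, 0.001),
    DiffResult("G2", 1.8, 0.003),
    DiffResult("G3", 0.2, 0.600),   # fails both cutoffs
    DiffResult("G4", -1.7, 0.020),  # down-regulated, still a DEG
    DiffResult("G5", 1.4, 0.200),   # fails the adjusted-p cutoff
]
toy_e6e7_gene_set = ["G1", "G2", "G5", "G6"]  # hypothetical curated set

degs = select_degs(toy_results)
shared = overlap_with_gene_set(degs, toy_e6e7_gene_set)
print(shared)  # ['G1', 'G2']
```

In this sketch the intersection is a plain set operation, so the order of filtering and intersecting does not affect the result; in the study the analogous overlap is what yielded the 101 E6/E7-related DEGs.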
  • No.  Submission date  ID
    1  2023-01-31  bmr.202303.00017V1

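Cite as

Cheng Lingjing, Li Hetong, Zhang Shengxiao, Liu Hongqi, Yu Qi, Zheng Chaoyue, Feng Shuang, Kong Teng, Sun Xiangfei, He Peifeng, Lü Xiaoping. Protein-based bioinformatics analysis of cervical cancer-related genes. 2023. biomedRxiv.202303.00017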
