程灵婧, 李贺同, 张升校, 刘鸿齐, 于琦, 郑超越, 冯爽, 孔腾, 孙翔飞, 贺培凤, 吕小萍. Protein-based bioinformatics analysis of cervical cancer-related genes. 2023. biomedRxiv.202303.00017
Protein-based bioinformatics analysis of cervical cancer-related genes
Corresponding author: Yu Qi, yuqi@sxmu.edu.cn
DOI: 10.12201/bmr.202303.00017
Abstract: Objective: This study aimed to identify genes associated with cervical cancer (CESC) through bioinformatics mining and to explore the expression characteristics and clinical significance of differentially expressed genes (DEGs) closely related to HPV E6/E7. Methods: Cervical tissue data and clinical information for CESC from TCGA and GTEx were obtained from UCSC and used as the training set. The CESC-related expression profile chip GSE63514 was obtained from GEO and used as the validation set. DEGs between tumor and normal samples were screened with the limma package in R, and a Venn diagram was drawn against the E6/E7 protein-related genes in the MigDB database. Batch survival analysis was performed with the survival package, and the results were validated by ROC analysis and protein expression levels. Key genes were then obtained through copy number variation and methylation correlation analysis. Finally, a specific co-expression network was constructed and subjected to enrichment analysis and immune infiltration analysis. Results/Conclusions: There were 101 DEGs associated with the HPV E6/E7 proteins, and survival and ROC analysis together screened out 8 key DEGs. After verification at the protein level, four genes were found to be consistent with their mRNA-level expression: CHAF1B, E2F1, MCM4, and PCNA. After copy number and methylation correlation analysis, three of these genes remained significant: E2F1, MCM4, and PCNA. Pathway analysis showed that the genes in the specific co-expression network were significantly enriched in DNA replication, chromosome organization, nuclear chromosome, and related terms. Immune correlation analysis showed that the key genes were significantly correlated with CD4 T cells, B cells, and neutrophils. E2F1, MCM4, and PCNA are the key genes, and DNA replication and chromosome organization are the molecular mechanisms, that are significantly associated with the occurrence and development of CESC and with the proteins encoded by HPV E6/E7.
Keywords: Cervical cancer; cervical tissue; differentially expressed genes; HPV E6/E7-encoded protein
Submitted: 2023-03-22
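The screening steps described in the abstract (overlapping the DEGs with an E6/E7-related gene set, then checking how well each candidate separates tumor from normal samples by ROC) can be illustrated with a minimal sketch. This is not the authors' code: the study reports using R with the limma and survival packages, whereas the Python below is a stand-in. The gene names, expression values, and function names are all hypothetical toy inputs chosen only to show the logic.

```python
from typing import Dict, List, Set


def venn_overlap(degs: Set[str], related_genes: Set[str]) -> List[str]:
    """Return genes present in both sets (the overlap of a two-set Venn diagram)."""
    return sorted(degs & related_genes)


def roc_auc(tumor: List[float], normal: List[float]) -> float:
    """Area under the ROC curve, computed as the fraction of tumor/normal
    pairs in which the tumor value is higher (ties count as half)."""
    wins = 0.0
    for t in tumor:
        for n in normal:
            if t > n:
                wins += 1.0
            elif t == n:
                wins += 0.5
    return wins / (len(tumor) * len(normal))


# Hypothetical toy inputs; these are NOT data from the study.
toy_degs = {"GENE_A", "GENE_B", "GENE_C"}
toy_e6e7_set = {"GENE_B", "GENE_C", "GENE_D"}
toy_expr: Dict[str, Dict[str, List[float]]] = {
    "GENE_B": {"tumor": [5.0, 6.0, 7.0], "normal": [1.0, 2.0, 3.0]},
    "GENE_C": {"tumor": [2.0, 4.0], "normal": [3.0, 4.0]},
}

# Step 1: candidate genes are those shared by the DEG list and the gene set.
candidates = venn_overlap(toy_degs, toy_e6e7_set)

# Step 2: score each candidate by how well it separates tumor from normal.
aucs = {g: roc_auc(toy_expr[g]["tumor"], toy_expr[g]["normal"]) for g in candidates}

for gene in candidates:
    print(gene, round(aucs[gene], 3))
```

The pairwise definition of AUC is used here because it needs no external library; for real expression matrices, an established statistics package would be the more practical choice.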
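Copyright notice: The authors retain sole copyright of this paper; the preprint system holds only the right to preserve the paper permanently. No one may reuse it without permission.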
Version history: No. 1; submitted 2023-01-31; ID bmr.202303.00017V1