•  
  •  
 

Journal of Scientific Information Research

Keywords

domain knowledge graph; named entity recognition; entity relationship extraction; machine learning; electricity domain text; intelligent decision support

Abstract

[Purpose/significance] This research constructed a domain knowledge graph and its scenario-oriented application framework for decision support at four levels: the data foundation layer, the key technology layer, the domain knowledge graph construction layer, and the scenario-oriented application layer. This framework aims to provide systematic support for knowledge discovery.

[Method/process] Based on the construction of a domain knowledge graph and its scenario-oriented application framework for decision support, this research focuses on the improvement of models and performance evaluation for fine-grained entity and relationship extraction at the discourse level within texts. The optimal model is selected to construct a domain knowledge graph. Taking the field of transformer equipment failure as an example, empirical studies are conducted to realize scenario-oriented applications aimed at decision support.

[Result/conclusion] The integrated pipeline knowledge extraction method, which combines the improved BERT-BiLSTM CRF model and the PURE-RE model, has demonstrated superior comprehensive performance. The knowledge graph constructed for the domain of transformer faults has been verified against the expert knowledge in the "Equipment Standard Defect Knowledge Base", and the search results have passed verification. This effectively assists maintenance personnel in equipment inspection and maintenance, thereby validating the effectiveness and practicality of the framework established in this research. With the support of domain expert knowledge verification, it can effectively empower related fields to conduct decision support.

First Page

58

Last Page

69

Submission Date

December 2024

Revision Date

February 2025

Acceptance Date

February 2025

Publication Date

October 2025

Digital Object Identifier (DOI)

ꎺ 10.19809/j.cnki.kjqbyj.2025.04.006

Reference

[1] 杨观赐, 许彪, 罗可欣, 等. 知识图谱技术综述: 构建、推理及典型应用[J]. 贵州大学学报 (自然科学版), 2025, 42 (2): 1-10.
[2] 杨欣谊, 苏新宁. 领域知识结构认知: 基于大数据环境的适用性分析[J]. 图书情报工作, 2024, 68 (23): 4-16.
[3] 刘峤, 李杨, 段宏, 等. 知识图谱构建技术综述[J]. 计算机研究与发展, 2016, 53 (3): 582-600.
[4] 王昊, 李晓敏, 布文茹, 等. 面向知识图谱生成的非遗丝织领域实体关系抽取研究[J]. 数据分析与知识发现, 2024, 8 (Z1): 179-190.
[5] 石栖, 陈文杰, 胡正银, 等. 面向知识发现的科学实验知识图谱构建研究[J]. 数据分析与知识发现, 2025, 9 (3): 1-15.
[6] 张称心, 孙家盛, 段阳. 机床夹具设计知识图谱构建及应用[J]. 机电工程, 2025, 42 (1): 106-116.
[7] 支凤稳, 陈佳琪, 孙若阳, 等. 基于知识图谱的数字记忆领域知识发现研究[J]. 情报科学, 2024, 42 (9): 123-134.
[8] 陈鹏, 邰彬, 石英, 等. 融合BERT、双向长短记忆网络和条件随机场的电力设备缺陷文本实体抽取[J]. 电网技术, 2023, 47 (10): 4367-4375.
[9] 刘烨宸, 李华昱. 领域知识图谱研究综述[J]. 计算机系统应用, 2020, 29 (6): 1-12.
[10] 姜晓恒, 吕鹏帅, 刘允, 等. 纺织品表面缺陷知识图谱构建研究[J/OL]. 计算机工程与应用, 1-12[2024-11-07]. http://kns.cnki.net/kcms/detail/11.2127.TP.20241017.1358.012.html.
[11] 沈思, 朱雨菲. 面向学术全文本多维知识元的学术图谱构建研究[J]. 情报学报, 2024, 43 (8): 960-975.
[12] 曹树金, 李睿婧. 基于专利文献摘要的创新知识图谱构建与应用[J]. 情报理论与实践, 2022, 45 (11): 21-28.
[13] ZHANG Y J, LIN H F, YANG Z H, et al. A hybrid model based on neural networks for biomedical relation extraction[J]. Journal of biomedical informatics, 2018, 81: 83-92.
[14] 郑小虎, 刘正好, 刘冰, 等. 面向平缝工艺信息融合的知识图谱构建方法[J]. 纺织学报, 2024, 45 (4): 195-203.
[15] 王杰, 谢忠局, 赵建涛, 等. 基于知识图谱和用户画像的金融产品推荐系统[J]. 计算机应用, 2022, 42 (S1): 43-47.
[16] 史慧洋, 魏靖烜, 蔡兴业, 等. 威胁情报提取与知识图谱构建技术研究[J]. 西安电子科技大学学报, 2023, 50 (4): 65-75.
[17] 谢庆, 蔡扬, 谢军, 等. 基于ALBERT的电力变压器运维知识图谱构建方法与应用研究[J]. 电工技术学报, 2023, 38 (1): 95-106.
[18] 周长玉, 李想, 焦润海, 等. 基于领域知识图谱的变压器故障联合推理[J]. 中国电机工程学报, 2025, 45 (6): 2160-2170.
[19] WANG Z R, KONONOVA O, CRUSE K, et al. Dataset of solution-based inorganic materials synthesis procedures extracted from the scientific literature[J]. Scientific data, 2022, 9 (1): 231.
[20] ZHAO J Y, COLE J M. Reconstructing Chromatic-Dispersion relations and predicting refractive indices using text mining and machine learning[J]. Journal of chemical information and modeling, 2022, 62 (11): 2670-2684.
[21] WANG W R, JIANG X, TIAN S H, et al. Automated pipeline for superalloy data by text mining[J]. npj Computational Materials, 2022, 8 (1): 9.
[22] 张鹤译, 王鑫, 韩立帆, 等. 大语言模型融合知识图谱的问答系统研究[J]. 计算机科学与探索, 2023, 17 (10): 2377-2388.
[23] 杨波, 孙晓虎, 党佳怡, 等. 面向医疗问答系统的大语言模型命名实体识别方法[J]. 计算机科学与探索, 2023, 17 (10): 2389-2402.
[24] 国家电网有限公司设备管理部. 变压器类设备典型故障及缺陷案例汇编 (2006~2010) [M]. 北京: 中国电力出版社. 2012.
[25] 国家电网有限公司设备管理部. 变压器类设备典型故障及缺陷案例汇编 (2011~2018) [M]. 北京: 中国电力出版社. 2019.
[26] ZHONG Z, CHEN D. A frustratingly easy approach for entity and relation extraction[C]//Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Online. Association for Computational Linguistics. 2021: 50-61.
[27] WANG Y, YU B, ZHANG Y, et al. TPLinker: Single-stage Joint Extraction of Entities and Relations Through Token Pair Linking[C]//Proceedings of the 28th International Conference on Computational Linguistics. Barcelona, Spain, 2020: 1572-1582.
[28] 杨竣辉, 李苏晋. 融合位置信息和实体类别信息的中文命名实体识别[J]. 计算机工程, 2025, 51 (3): 113-121.
[29] 翟羽佳, 田静文, 赵玥. 基于BERT-BiLSTM-CRF模型的算法术语抽取与创新演化路径构建研究[J]. 情报科学, 2022, 40 (4): 71-78.
[30] VITERBI A J. A personal history of the Viterbi algorithm[J]. IEEE signal processing magazine, 2006, 23 (4): 120-142.
[31] 杨涵. 基于全局指针与潜在关系嵌入的实体关系抽取方法[D]. 哈尔滨: 哈尔滨工程大学, 2023.
[32] 柴嘉琪, 谈钰濛, 向兴华, 等. 基于文献数据的知识图谱构建: 以中医药治疗糖尿病周围神经病变为例[J]. 中国实验方剂学杂志, 2024, 30 (6): 144-150.
[33] 纪鑫, 武同心, 杨智伟, 等. 基于时序知识图谱的电力设备缺陷预测[J]. 北京航空航天大学学报, 2024, 50 (10): 3131-3138.
[34] 束嘉伟, 杨挺, 耿毅男, 等. 面向电力知识图谱构建的重叠实体关系联合抽取方法[J]. 高电压技术, 2024, 50 (11): 4912-4922.
[35] PATIL N, PATIL A, PAWAR B V. Named entity recognition using conditional random fields[J]. Procedia computer science, 2020, 167: 1181-1188.
[36] WANG M, ZHOU T, WANG H H, et al. Chinese power dispatching text entity recognition based on a double-layer BiLSTM and multi-feature fusion[J]. Energy reports, 2022, 8: 980-987.
[37] ALZAIDY R, CARAGEA C, GILES C L. Bi-LSTM-CRF sequence labeling for keyphrase extraction from scholarly documents[C]//The world wide web conference. 2019: 2551-2557.
[38] ZHOU S, LIU J, ZHONG X, et al. Named entity recognition using BERT with whole world masking in cybersecurity domain[C]//2021 IEEE 6th International Conference on Big Data Analytics (ICBDA) . IEEE, 2021: 316-320.
[39] 游溢, 赵普志, 刘冬, 等. 基于优化模糊推理系统的电力变压器故障检测方法[J]. 济南大学学报 (自然科学版), 2023, 37 (1): 71-76, 83.
[40] 范慧芳, 咸日常, 王涛, 等. 改进朴素贝叶斯模型在电力变压器故障定位中的应用[J]. 高压电器, 2023, 59 (2): 190-197.
[41] 罗文萱. 基于胶囊神经网络的电力变压器故障诊断方法研究[J]. 高压电器, 2024, 60 (5): 92-98.
[42] 韩宝庭. 变压器远程故障诊断专家系统研究[D]. 武汉: 湖北工业大学, 2016.
[43] 刘辉, 张雨璇, 汪伟, 等. 电网调控运行智能问答系统构建方法研究[J]. 自动化技术与应用, 2023, 42 (5): 111-114.
[44] 王绪亮, 顾媛丽, 张鸿儒, 等. 基于知识集成流形的电力设备缺陷文本数据增强方法与应用研究[J]. 电网技术, 2024, 48 (4): 1690-1699.
[45] 张宇波, 王有元, 梁玄鸿, 等. 电力设备缺陷文本的双通道语义增强网络挖掘方法[J]. 高电压技术, 2024, 50 (5): 1923-1932.

Share

COinS