Scientific Information Research
Keywords
multi-source intelligence information; data collection; framework design; Hook mechanism
Abstract
[Purpose/significance] As the data ecosystem grows more complex, large amounts of authentic and valuable data are often hidden within massive multi-source heterogeneous data. How to efficiently collect and integrate data from multi-source intelligence information, and thereby support in-depth analysis, has long been a core issue in data science and an important foundational task in information science. [Method/process] First, this paper analyzes the multi-source intelligence information environment from multiple perspectives and identifies its main characteristics from each perspective. Second, it studies an information collection method that integrates the Hook mechanism, explores how Hook operates in the process of information transmission, and combines it with a Python collection framework. Then, it designs and constructs a framework for multi-source intelligence information collection, determines the framework's implementation mode and scope, and extends the framework to the levels of knowledge and wisdom so as to realize value conversion. Finally, the framework is applied to the collection of multi-source intelligence information. [Result/conclusion] In the face of complex multi-source intelligence information, the collection framework proposed in this study can obtain data accurately and effectively and, through metadata integration, lays a solid data foundation for subsequent knowledge mining research.
First Page
13
Recommended Citation
JIN, Jialin; WANG, Yuefen; LIU, Cheng; and ZOU, Bentao (2022) "Design and Application of Multi-Source Intelligence Information Collection Framework Integrating Hook Mechanism," Scientific Information Research: Vol. 4: Iss. 1, Article 2.
Available at: https://eng.kjqbyj.com/journal/vol4/iss1/2