Journal of Scientific Information Research
Keywords
emerging technology; topic recognition; BERTopic; new energy vehicles
Abstract
[Purpose/significance]Identifying and foreseeing emerging technologies, bring technological first-mover advantages to enterprises and governments, and grasp technological development trends in a timely manner. [Method/process]This study uses BERTopic's topic modeling method to obtain domain topic distribution, and merges paper and patent topics based on the cosine similarity of topic vectors to identify emerging topics. [Result/conclusion]Using the BERTopic topic modeling method combined with index evaluation can effectively identify emerging topics and emerging terms.Taking the field of new energy vehicles as an example to carry out empirical research, using two methods: divided verification period and data verification method, 12 of the 16 identified topics passed the verification, which verified the effectiveness of this research method.
First Page
131
Last Page
140
Submission Date
23-Apr-2024
Revision Date
13-Aug-2024
Acceptance Date
12-Sep-2024
Published Date
01-Jan-2025
Reference
[1] 新华社. 中华人民共和国国民经济和社会发展第十四个五年规划和2035年远景目标纲要[EB/OL]. (2021-03-13) [2023-04-12]. http://www.gov.cn/xinwen/2021-03/13/content_5592681.htm.
[2] REDING D F, EATON J. Science & technology trends: 2020—2040[R]. NATO Science & Technology Organization, Brussels: 2020.
[3] 乔治·戴, 保罗·休梅克. 沃顿论新兴技术管理[M]. 石莹 (译) . 北京: 华夏出版社, 2002.
[4] COZZENS S, GATCHAIR S, KANG J, et al. Emerging technologies: quantitative identification and measurement[J]. Technology Analysis & Strategic Management, 2010, 22 (03): 361-376.
[5] ROTOLO D, HICKS D, MARTIN B. What is an emerging technology?[J]. Research Policy, 2015, 44 (10): 1827-1843.
[6] 华宏鸣, 郑邵濂. 高新技术管理[M]. 上海: 复旦大学出版社, 1995.
[7] 罗建, 蔡丽君, 史敏. 新兴技术识别方法研究进展[J]. 科技情报研究, 2019, 1 (01): 95-103.
[8] 周萌, 朱相丽. 新兴技术概念辨析及其识别方法研究进展[J]. 情报理论与实践, 2019, 42 (10): 162-169.
[9] 赵洪江, 陈学华, 苏晓波. 新兴技术、新技术、高技术及高新技术概念辨析[J]. 企业技术开发, 2005, 24 (11): 42-43, 64.
[10] CHIRISTENSEN C M. The Innovator's Dilemma: When New Technologies Cause Great Firms To Fail[M]. Cambridge: Harvard Business Review Press, 1997.
[11] Nelson R R, Winter S G. An evolutionary theory of Economic change[M]. Cambridge: Harvard University Press, 1982.
[12] ZHOU Y, DONG F, KONG D J, et al. Unfolding the convergence process of scientific knowledge for the early identification of emerging technologies[J]. Technological Forecasting and Social Change, 2019, 144 (C): 205-220.
[13] WANG B C, LIU Y F, ZHOU Y, et al. Emerging nanogenerator technology in China: A review and forecast using integrating bibliometrics, patent analysis and technology roadmapping methods[J]. Nano Energy, 2018, 46: 322-330.
[14] CHANG S H. Technical trends of artificial intelligence in standard-essential patents[J]. Data Technologies and Applications, 2021, 55 (01): 97-117.
[15] CHO Y, HAN Y J, HWANG J, et al. Identifying Technology Opportunities for Electric Motors of Railway Vehicles with Patent Analysis[J]. Sustainability, 2021, 13 (05): 1-13.
[16] JOUNG J, KIM K. Monitoring emerging technologies for technology planning using technical keyword based analysis from patent data[J]. Technological Forecasting and Social Change, 2017, 114 (C): 281-292.
[17] 李欣, 谢前前, 黄鲁成, 等. 基于SAO结构语义挖掘的新兴技术演化轨迹研究[J]. 科学学与科学技术管理, 2018, 39 (01): 17-31.
[18] TRAPPEY A J C, CHEN P P J, Trappey C V, et al. A machine learning approach for solar power technology review and patent evolution analysis[J]. Applied Sciences, 2019, 9 (07): 1-25.
[19] 周云泽, 闵超. 基于LDA模型与共享语义空间的新兴技术识别: 以自动驾驶汽车为例[J]. 数据分析与知识发现, 2022, 6 (Z1): 55-66.
[20] GROOTENDORST M. BERTopic: Neural topic modeling with a class-based TF-IDF procedure[J]. arXiv Preprint arXiv: 2203. 05794, 2022.
[21] REIMERS N, GUREVYCH I. Sentence-BERT: Sentence Umbeddings using Siamese BERT-Networks[J]. arXiv Preprint arXiv: 1908. 10084, 2019.
[22] MCINNES L, HEALY J, ASTELS S. hdbscan: Hierarchical density based clustering[J]. The Journal of Open Source Software, 2017, 2 (11): 205.
[23] HUANG L, CHEN X, NI X X, et al. Tracking the dynamics of co-word networks for emerging topic identification[J]. Technological Forecasting and Social Change, 2021, 170: 120944.
[24] LIU Y Q, WANG X F, LEI X P. Quality estimation of patent based on text mining and its empirical research[J]. Computer Engineering and Applications, 2007, 43 (33): 12-14.
[25] PORTER A L, GARNER J, CARLEY S F, et al. Emergence scoring to identify frontier R&D topics and key players[J]. Technological orecasting and Social Change, 2019, 146 (C): 628-643.
[26] 王连喜, 蒋盛益, 李霞, 等. “一带一路”: 研究热点与新兴主题发展分析[J]. 情报杂志, 2019, 38 (02): 71-77.
[27] 国家知识产权局办公室. 国家知识产权局办公室关于印发《战略性新兴产业分类与国际专利分类参照关系表 (2021) (试行) 》的通知[EB/OL]. (2021-02-10) [2023-04-12]. https://www.cnipa.gov.cn/art/2021/2/10/art_75_156716. html.
[28] COHAN A, FELDMAN S, BELTAGY I, et al. Specter: Document-level representation learning using citation-informed transformers[J]. arXiv Preprint arXiv: 2004. 07180, 2020.
[29] 国务院办公厅. 国务院办公厅关于印发新能源汽车产业发展规划 (2021—2035年) 的通知[EB/OL]. (2020-11-02) [2023-04-12]. http://www.gov.cn/zhengce/content/2020-11/02/content_5556716.htm.
[30] 国家发展改革委, 国家能源局. 国家发展改革委 国家能源局关于印发《“十四五”新型储能发展实施方案》的通知[EB/OL]. (2022-03-22) [2023-04-12]. http://www.gov.cn/zhengce/zhengceku/2022-03/22/5680417/files/41a50cec48e84cc4adfca855c3444f6b.pdf.
[31] WNEVC. 全球新能源汽车前沿及创新技术评选[EB/OL]. (2022-08-27) [2023-04-12]. http://www.wnevc.org.cn/CN/Appraisal/#pxtab-im.
[32] Gartner. Hype Cycle for Transportation and Smart Mobility, 2021[EB/OL]. (2021-07-12) [2023-04-12]. https://www.gartner.com/en/documents/4003467.
Digital Object Identifier (DOI)
10.19809/j.cnki.kjqbyj.2025.01.012
Recommended Citation
WANG, Dakun and HUA, Bolin
(2025)
"Research on Emerging Technology Topic Identification Based on BERTopic,"
Journal of Scientific Information Research: Vol. 7:
Iss.
1, Article 12.
DOI: 10.19809/j.cnki.kjqbyj.2025.01.012
Available at:
https://eng.kjqbyj.com/journal/vol7/iss1/12