Scientific Information Research
Keywords
government data; data quality; open government data; reliability; evaluation
Abstract
[Purpose/significance]Due to various problems found in current government data quality,it is necessary to build an effective quality evaluation index for government data.[Method/process]Literature on government data were comprehensively analyzed,and the indicators for data quality evaluation were identified,and clustered into three dimensions.Then the weights of each dimension and indicator were determined with AHP method.[Result/conclusion]An evaluation indicator system for government data quality that includes three dimensions of data source,data set and data environment and 15 indicators including reliability,normalization,authenticity,accuracy,and adaptability has been constructed.Its usability was tested with the open government data of Shanghai and Zhejiang province. On this basis,this paper puts forward some suggestions for governments to implement data quality management.
First Page
17
Recommended Citation
HU, Qiandai and WANG, Fang
(2021)
"Constructing an Evaluation Indicator System for Government Data Quality,"
Scientific Information Research: Vol. 3:
Iss.
3, Article 2.
Available at:
https://eng.kjqbyj.com/journal/vol3/iss3/2
Reference
[1] 中共中央,国务院.中共中央 国务院关于构建更加完善的要素市场化配置体制机制的意见[EB/OL].(2020-04-09)[2020-12-10].http://www.gov.cn/zhengce/2020-04/09/content_5500622.htm. [2] KULKARNI A.A Study on Metadata Management and Quality Evaluation in Big Data Management[J].Engineering Technology & Applied Science Research,2016,4(07):455-459. [3] 宗威,吴锋.大数据时代下数据质量的挑战[J].西安交通大学学报(社会科学版),2013,33(05):38-43. [4] 汪应洛,黄伟,朱志祥.大数据产业及管理问题的一些初步思考[J].科技促进发展,2014(01):15-19. [5] 郭路生,刘春年.大数据时代应急数据质量治理研究[J].情报理论与实践,2016,39(11):101-105. [6] 王宏志.大数据质量管理:问题与研究进展[J].科技导报,2014,32(34):78-84. [7] WANG F.Understanding the dynamic mechanism of interagency government data sharing[J].Government Information Quarterly,2018,35(04):536-546. [8] UBALDI B,PEREZ A R.OECD Open Government Data Report Executive Summary(2020)[EB/OL].(2020-11-27)[2021-1-26].https://3zwybbflfi7t4qzstwgqz2yw6u-jj2cvlaia66be-www-oecd-ilibrary-org.translate.goog/deliver/9789264305847-en.pdf?itemId=/content/publication/9789264305847-en&mimeType=pdf. [9] 王芳,阴宇轩,刘汪洋,等.我国城市政府运用大数据提升治理效能评价研究[J].图书与情报,2020(02):81-93. [10] 杨青云,赵培英,杨冬青,等.数据质量评估方法研究[J].计算机工程与应用,2004(09):3-4,15. [11] 王瑞云,贾君枝.基于用户适用度的开放数据质量提升研究[J].数字图书馆论坛,2018(12):18-26. [12] WANG F,ZHAO A,ZHAO H,et al.Building a Holistic Taxonomy Model for OGD-Related Risks:Based on a Lifecycle Analysis[J].Data Intelligence,2019(01):309-332. [13] 刘冰,庞琳.国内外大数据质量研究述评[J].情报学报,2019,38(02):217-226. [14] 王娟.国内外政府开放数据质量研究述评[J].图书馆理论与实践,2019(12):27-31. [15] HUANG K T,LEE Y W,WA NG R Y.Quality Information and Knowledge Management[M].New Jersey:Prentice Hall,1999. [16] KAHN B K,STRONG D M.Product and service performance model for information quality:an update [C]//Proceedings of the 3rd International Conference on Information Quality.Cambridge:MIT,1998:102-115. [17] FRANCALANCI C,PERNICI B.Data Quality Assessment from the Users Perspective[C]//International Workshop on Information Quality in Information Systems.Paris:SIGMOD,2004. [18] GRYNA F M,BINGHAM R S,JURAN J M.Quality Control Handbook[M].New York:McGraw Hill,1974. [19] 蔡莉,朱扬勇.大数据质量[M].上海:上海科学技术出版社,2017:7-8. [20] AEBI D,PERROCHON L.Towards improving data quality[C]// Proc.of the international conference on information systems and management of data.New York:ACM,1993. [21] SUKUMAR R,RAMACHANDRAN N,FERRELL R K.'Big Data'in healthcare:How good is it?[J].International Journal of Health Care Quality Assurance,2015:2-9. [22] 莫祖英.国内外信息质量研究述评[J].情报资料工作,2015(02):29-36. [23] RAO D,GUDIVADA V N,RAGHAVAN V V.Data quality issues in big data[C]//Proceedings of IEEE International Conference on Big Data.IEEE,2015:2654-2660. [24] IMMONEN A,PÄÄKKÖNEN P,OVASKA E.Evaluating the Quality of Social Media Data in Big Data Architecture[J].IEEE Access,2015(03):2028-2043. [25] 宋金玉,陈爽,郭大鹏,等.数据质量及数据清洗方法[J].指挥信息系统与技术,2013,4(05):63-70. [26] TOIVONEN M.Big Data Quality Challenges in the Context of Business Analytics[D].Helsinki:University of Helsinki,2015:47-48. [27] CAI L,ZHU Y Y.The Challenges of Data Quality and Data Quality Assessment in the Big Data Era[J].Data Science Journal,2015(14):1-10. [28] WANG R Y,D M STRONG.Beyond Accuracy:What Data Quality Means to Data Consumers[J].Journal of Management Information Systems,1996,12(04):5-34. [29] ABDULLAH N,ISMAIL S A,SOPHIAYATI S,et al.Data quality in big data:A review[J].International Journal of Advances in Soft Computing and its Applications,2015:17-27. [30] TALEB I,EL KASSABI H T,SERHANI M A,et al.Big data quality:A quality dimensions evaluation[C]// Proceedings of the 2016 International IEEE Conferences on Ubiquitous Intelligence & Computing,Advanced and Trusted Computing,Scalable Computing and Communications,Cloud and Big Data Computing,Internet of People,and Smart World Congress. Toulouse:IEEE,2016:759-765. [31] MERINO J,CABALLERO I,RIVAS B,et al.A Data Quality in Use Model for Big Data[J].Future Generation Computer Systems,2016(63):123-130. [32] CABALLERO I,SERRANO M,PIATTINI M.A data quality in use model for big data[C]//Proceedings of the International Conference on Conceptual Modeling.Heidelberg:Springer,2014:65-74. [33] PHYSICA,HEIDELBERG.Part of the Frontiers in Statistical Quality Control book series[M]//LENZ H J,BOROWSKI E.Business data quality control:a step by step procedure.New York:Physica-Verlag HD,2012:374. [34] RICHARD Y W,REDDY M P,HENRY B K.Toward quality data:An attribute-based approach[J].Decision Support System,1995,13(3-4):349-372. [35] BALLOU D P,PAZER H L.Modelling Data and Process Quality in Multi-Input,Multi-Output Information System[J].Management Science,1985,31(02):150-162. [36] MCGILVRAY D.数据质量管理工程实践[M].刁兴春,曹建军,张健美,等译.北京:电子工业出版社,2010:29-31. [37] KHUSHALI Y D.Big data quality modeling and validation[D].CA:San Jose State University,2018. [38] MILLER H.The multiple dimensions of information quality[J].Information System Management,1996,13(02):79-82. [39] BATINI C,CAPPIELLO C,FRANCALANCI C,et al.Methodologies for data quality assessment and improvement[J].ACM Computing Surveys,2009,41(03):16.1-16.52. [40] Lee S H.Measuring Correlation of Information Quality Dimensions using Six Sigma based Product Perspective[D].Adelaide:University of South Australia,1997. [41] 马一鸣.政府大数据质量评价体系构建研究[D].长春:吉林大学,2016. [42] 莫祖英.大数据质量测度模型构建[J].情报理论与实践,2018,41(03):11-15. [43] 张绍华,潘蓉,宗宇伟.大数据治理与服务[M].上海:上海科学技术出版社,2016:120. [44] 查先进,陈明红.信息资源质量评估研究[J].中国图书馆学报,2010,36(02):46-55. [45] 陈武.基于数据资源整合平台的数据质量提升技术研究与应用[J].中国管理信息化,2017,20(23):189-191. [46] 屈文建,唐晶,陈旦芝.高校科研数据质量控制架构与机制研究[J].情报理论与实践,2018,41(11):45-50. [47] 辛金国,张亮亮.大数据背景下统计数据质量影响因素分析[J].统计与决策,2017(19):64-67. [48] 王芳,陈锋.国家治理进程中的政府大数据开放利用研究[J].中国行政管理,2015(11):6-12. [49] OGDP.Open Government Data Principles:8 Principles of Open Government Data[EB/OL].(2007-12-08)[2020-12-10].https://www.mendeley.com/catalogue/0c7324d6-916a-36eb-84e5-9d28aa751b44. [50] BERNERS-LEE T.Linked data: design issues [EB/OL].(2015-08-03)[2020-12-10].https://www.w3.org/DesignIssues/LinkedData.html. [51] VETRÒ A,CANOVA L,TORCHIANO M,et al.Open data quality measurement framework:Definition and application to Open Government Data[J].Government Information Quarterly,2016,33(02):325-337. [52] ATTARD J,ORLANDI F,SCERRI F,et al.A systematic review of open government data initiatives[J].Government Information Quarterly,2015,32(04):399-418. [53] OVIEDO E,MAZON J N,ZUBCOFF J J.Towards a data quality model for open data portals[C]// Naiguata,Venezuela:39th Latin American Computing Conference (CLEI),2013. [54] VELJKOVIĆ N,BOGDANOVIĆ-DINIĆ S,Stoimenov L.Benchmarking open government: an open data perspective[J].Government Information Quarterly,2014,31(02):278-290. [55] ZUIDERWIJK ANNEKE,JANSSEN MARIJN.Participation and data quality in open data use:Open data infrastructurese valuated[C]//Portsmouth,England:15th European Conference on eGovernment(ECEG),Univ Portsmouth,2015. [56] 郑磊,吕文增.地方政府开放数据的评估框架与发现[J].图书情报工作,2018,62(22):32-44. [57] 郑磊,关文雯.开放政府数据评估框架、指标与方法研究[J].图书情报工作,2016,60(18):43-55. [58] 莫祖英,邝苗苗.基于用户视角的政府开放数据质量评价模型及实证研究[J].大学图书情报学刊,2020,38(04):84-89. [59] 谭必勇,陈艳.我国开放政府数据平台数据质量研究:以十省、市为研究对象[J].情报杂志,2017,36(11):99-105. [60] 张晓娟,谭婧.我国省级政府数据开放平台元数据质量评估研究[J].电子政务,2019(03):58-71. [61] 王今,马海群.政府开放数据质量的用户满意度评价研究[J].现代情报,2016,36(09):4-9. [62] 马海群,唐守利.基于结构方程的政府开放数据网站服务质量评价研究[J].现代情报,2016,36(09):10-15,33. [63] 曹海军,李明.中国政府数据开放平台服务质量评价:基于熵权TOPSIS的实证分析[J].上海行政学院学报,2020,21(04):55-64. [64] 李晓彤,翟军,郑贵福.我国地方政府开放数据的数据质量评价研究:以北京、广州和哈尔滨为例[J].情报杂志,2018,37(06):141-145. [65] 翟军,陶晨阳,李晓彤.开放政府数据质量评估研究进展及启示[J].图书馆,2018(12):74-79. [66] 韦忻伶,安小米,李雪梅,等.开放政府数据评估体系述评:特点分析[J].图书情报工作,2017,61(18):119-127. [67] 翁士洪,林晨晖,早克然·库地热提.突发事件政府数据开放质量评估研究:新冠病毒疫情的全国样本实证分析[J].电子政务,2020(05):2-13. [68] 李凡星.基于数据质量的政府开放数据平台评估探究[D].南京:南京大学,2017. [69] 刘博浩.我国开放政府数据质量评价研究[D].郑州:郑州大学,2019. [70] 邵艳红.我国政府开放数据质量评价指标体系构建研究[D].保定:河北大学,2019. [71] 黄滔滔.开放数据背景下政务信息资源数据质量评价及提升策略研究[D].湘潭:湘潭大学,2020. [72] 郎艳娜.我国省级政府开放数据平台数据质量评价研究[D].保定:河北大学,2019. [73] 杨峰,史琦,姚乐野.基于用户主体认知的政府社交媒体信息质量评价:政务微博的考察[J].情报杂志,2015,34(12):181-185. [74] 洪学海,王志强,杨青海.面向共享的政府大数据质量标准化问题研究[J].大数据,2017,3(03):44-52. [75] 王芳,赵洪,马嘉悦,等.数据科学视角下数据溯源研究与实践进展[J].中国图书馆学报,2019,45(05):79-100. [76] 白清礼.我国政府公开信息的质量评估指标体系构建[J].图书馆理论与实践,2016(11):55-60. [77] 王芳,储君,张琪敏,等.跨部门政府数据共享:问题、原因与对策[J].图书与情报,2017(05):54-62. [78] 曹瑞昌,吴建明.信息质量及其评价指标体系[J].情报探索,2002(04):6-9. [79] REDMAN,THOMAS C,FOREWORD BY-GODFREY,et al. Data Quality for the Information Age[M].United States:Artech House,1997:33-50. [80] LOSHIN D.The Practitioner's guide to data quality improvement[M].Burlington:Morgan Kaufmann:133. [81] STVILIA B,GASSER L,TWIDALE M B,et al.A Framework for Information Quality Assessment[J].Journal of the American Society for Information Science and Technology,2007,58(12):1720-1733. [82] 刘冰,卢爽.基于用户体验的信息质量综合评价体系研究[J].图书情报工作,2011,55(22):56-59. [83] 童楠楠.我国政府开放数据的质量控制机制研究[J].情报杂志,2019,38(01):135-141. [84] 陈朝兵,程申.政府数据开放监管的国际经验与中国路径[J].图书情报工作,2020,64(12):49-57.