论文标题
将出版物与项目一级的资金联系起来:FP7项目报告的出版物的精选数据集
Linking Publications to Funding at Project Level: A curated dataset of publications reported by FP7 projects
论文作者
论文摘要
将出版物与项目级别的资金明确链接的数据集是对资金计划的评估文献分析的基础。缺乏对资金贡献的出版物的数据,对欧盟资助计划的影响的分析通常会感到沮丧。在这里,我们介绍了欧盟根据第七框架计划资助的项目报告的学术出版物数据集。该数据集是通过首先合并来自不同报告渠道的数据并通过系统地将记录与外部权威来源匹配并分配外部标识符来创建的。最初的数据集有305K记录链接到一个或多个项目,其中69%具有数字对象识别(DOI)。通过数据质量保证,我们验证了93%的初始记录(283K),并将其中90%的DOI分配给其中90%(245K)。生成的数据集具有245K独特的DOI(链接到一个或多个项目)。据我们所知,这是赠款持有人报告的框架计划的第一个全面和精心策划的数据集。该数据集只能归功于欧盟资助项目使用的报告系统中的重大改进和投资。 该数据集可用欧盟开放数据门户:https://data.europa.eu/data/datasets/cordisfp7projects
Datasets explicitly linking publications to funding at project level are the basis of evaluative bibliometric analysis of funding programmes. Analysis of the impact of the EU funding programmes has been often frustrated by the lack of data on publications to which the funding has contributed. Here we present a dataset of scholarly publications reported by the projects funded by the European Union under the 7th Framework Programme. The dataset was created by first consolidating data from different reporting channels and validating the records by systematically matching them to external authoritative sources and assigning them external identifiers. The initial dataset had 305k records linked to one or more projects out of which 69% had a digital object identify (doi). Through the data quality assurance, we validate 93% of the initial records (283k) and assign a doi to 90% of them of them (245k). The resulting dataset has 245k unique dois (linked to one or more projects). It is, to our knowledge, the first comprehensive and curated dataset of scholarly outputs of the Framework Programme as reported by the grant holders. The dataset could only be created thanks to significant improvements and investments made in the reporting systems used by EU funded projects. The dataset is available EU open data portal: https://data.europa.eu/data/datasets/cordisfp7projects