The exploration of information extraction and analysis about science and technology policy in China
Date | 07 August 2017 |
DOI | https://doi.org/10.1108/EL-10-2016-0235 |
Pages | 709-723 |
Published date | 07 August 2017 |
Author | Wen Zeng,Changqing Yao,Hui Li |
Subject Matter | Information & knowledge management,Information & communications technology,Internet |
The exploration of information
extraction and analysis about
science and technology policy
in China
Wen Zeng, Changqing Yao and Hui Li
Institute of Scientic and Technical Information of China, Beijing, China
Abstract
Purpose –Science and technology policy plays an important role in promoting the development of economic
and social development in China. At present, the research on science and technology policy is mainly focused
on the basic theories and some quantitative research. The analyses for contents of massive science and
technology policies are relatively less. This paper makes use of semantic technologies to extract and analyze
the relatively important information from massive science and technology policies. The purpose of this paper
is to facilitate users to quickly and effectively obtain valuable information from the massive science and
technology policies. The key methods and study results are presented in the paper. The study results can
provide references for further study and application in China.
Design/methodology/approach –The paper presented the analysis model and method for science and
technology policy in China. The terms and sentences are the important information in the science and
technology policy. The study adopted the technology of natural language processing to analyze the linguistics
characteristics of terms and combined with statistical analyses to extract the terms from Chinese science and
technology policy. Then, the authors designed an algorithm, calculated and analyzed the important sentences
in Chinese science and technology policies. The experiments were run on the Java test platform.
Findings –This paper put forward the analysis model and method for science and technology policy in China.
The study obtained the following conclusions: term extraction of science and technology policy: the paper analyzed
characteristic of terms in Chinese science and technology policy and designed a method of extracting a term that
was suitable for the science and technology policy. The calculation of important sentences for science and
technology policy: the paper designed an algorithm and calculated the importance of the sentences to obtain
valuable information from the massive science and technology policies.
Research limitations/implications –In our methods, there are some defects to be improved or solved
in the future. For example, the precision of algorithm needs to be improved. The signicance of this paper is
to propose and use the analysis model to process Chinese science and technology policy; we can provide an
auxiliary tool to help policy beneciaries. Enterprises and individuals can be more effective to extraction and
mining information from massive science and technology policy and nd the target policy.
Practical implications –To verify the effectiveness of the method, the paper selected the real policies
about the new energy vehicles as experimental data; at the same time, the paper added uncorrelated policies.
It used the proposed analysis model of science and technology policy to calculate and nd out the relatively
important sentences. The results of study showed that the proposed method can obtain better performance. It
veried the validity of this method. The model and method have been applied to actual retrieval system.
Social implications –The proposed model and method in the paper have been applied to actual retrieval
system for users.
Originality/value –The paper proposed the new analysis model and method to analyze science and
technology policies in China. The presented model and method are a new attempt. According to the
The project of this article is supported by the National Social Science Fund Project in China (Grant
No.14BTQ038): Research on Information Analysis Method and Integrated Platform Based on Fact-type
Scientic and Technical Big Data.
The current issue and full text archive of this journal is available on Emerald Insight at:
www.emeraldinsight.com/0264-0473.htm
Information
extraction and
analysis
709
Received 31 October 2016
Revised 15 April 2017
Accepted 16 April 2017
TheElectronic Library
Vol.35 No. 4, 2017
pp.709-723
©Emerald Publishing Limited
0264-0473
DOI 10.1108/EL-10-2016-0235
To continue reading
Request your trial