The exploration of information extraction and analysis about science and technology policy in China

Date07 August 2017
DOIhttps://doi.org/10.1108/EL-10-2016-0235
Pages709-723
Published date07 August 2017
AuthorWen Zeng,Changqing Yao,Hui Li
Subject MatterInformation & knowledge management,Information & communications technology,Internet
The exploration of information
extraction and analysis about
science and technology policy
in China
Wen Zeng, Changqing Yao and Hui Li
Institute of Scientic and Technical Information of China, Beijing, China
Abstract
Purpose Science and technology policy plays an important role in promoting the development of economic
and social development in China. At present, the research on science and technology policy is mainly focused
on the basic theories and some quantitative research. The analyses for contents of massive science and
technology policies are relatively less. This paper makes use of semantic technologies to extract and analyze
the relatively important information from massive science and technology policies. The purpose of this paper
is to facilitate users to quickly and effectively obtain valuable information from the massive science and
technology policies. The key methods and study results are presented in the paper. The study results can
provide references for further study and application in China.
Design/methodology/approach The paper presented the analysis model and method for science and
technology policy in China. The terms and sentences are the important information in the science and
technology policy. The study adopted the technology of natural language processing to analyze the linguistics
characteristics of terms and combined with statistical analyses to extract the terms from Chinese science and
technology policy. Then, the authors designed an algorithm, calculated and analyzed the important sentences
in Chinese science and technology policies. The experiments were run on the Java test platform.
Findings This paper put forward the analysis model and method for science and technology policy in China.
The study obtained the following conclusions: term extraction of science and technology policy: the paper analyzed
characteristic of terms in Chinese science and technology policy and designed a method of extracting a term that
was suitable for the science and technology policy. The calculation of important sentences for science and
technology policy: the paper designed an algorithm and calculated the importance of the sentences to obtain
valuable information from the massive science and technology policies.
Research limitations/implications In our methods, there are some defects to be improved or solved
in the future. For example, the precision of algorithm needs to be improved. The signicance of this paper is
to propose and use the analysis model to process Chinese science and technology policy; we can provide an
auxiliary tool to help policy beneciaries. Enterprises and individuals can be more effective to extraction and
mining information from massive science and technology policy and nd the target policy.
Practical implications To verify the effectiveness of the method, the paper selected the real policies
about the new energy vehicles as experimental data; at the same time, the paper added uncorrelated policies.
It used the proposed analysis model of science and technology policy to calculate and nd out the relatively
important sentences. The results of study showed that the proposed method can obtain better performance. It
veried the validity of this method. The model and method have been applied to actual retrieval system.
Social implications The proposed model and method in the paper have been applied to actual retrieval
system for users.
Originality/value The paper proposed the new analysis model and method to analyze science and
technology policies in China. The presented model and method are a new attempt. According to the
The project of this article is supported by the National Social Science Fund Project in China (Grant
No.14BTQ038): Research on Information Analysis Method and Integrated Platform Based on Fact-type
Scientic and Technical Big Data.
The current issue and full text archive of this journal is available on Emerald Insight at:
www.emeraldinsight.com/0264-0473.htm
Information
extraction and
analysis
709
Received 31 October 2016
Revised 15 April 2017
Accepted 16 April 2017
TheElectronic Library
Vol.35 No. 4, 2017
pp.709-723
©Emerald Publishing Limited
0264-0473
DOI 10.1108/EL-10-2016-0235

To continue reading

Request your trial

VLEX uses login cookies to provide you with a better browsing experience. If you click on 'Accept' or continue browsing this site we consider that you accept our cookie policy. ACCEPT