Integrating semantic metadata in P2P‐based digital libraries

Published date01 May 2005
DOIhttps://doi.org/10.1108/01435120510596071
Pages218-229
Date01 May 2005
AuthorHao Ding
Subject MatterLibrary & information science
Integrating semantic metadata in
P2P-based digital libraries
Hao Ding
Information Management Group, Norwegian University of Science and
Technology, Trondheim, Norway
Abstract
Purpose To propose methods for expressing semantics and operating semantics in largely
distributed environment, such as peer-to-peer (P2P) based digital libraries (DLs) where heterogeneous
schemas may exist and the relationships among them must be explicated for better performance in
information searching.
Design/methodology/approach – In conventional solutions, a mediator is adopted to create and
maintain the matching between relevant terms such that distinct but relevant metadata schemas can
be integrated according to the mapping relationships in the mediator. However, such solutions suffer
some problems originated from the static matching in mediator. This paper proposes to use facts to
express the relationships among heterogeneous schemas and conduct the reasoning dynamically by
using inference engines.
Findings – It is justified to use facts and inference engines to express and operate the semantics
among heterogeneous but relevant information resources. The user can choose to convert only part of
the XML document into facts if she can unpeel deeply nested XML tags. Additionally, it is possible for
the user to manually edit (assert, update or retract) the facts as needed in the reasoning.
Research limitations/implications – The study assumes that peers are clustered according to
shared topics or interest. An exhaust evaluation has not been conducted.
Practical implications – Each node can publish its schema to the involved peer community such
that other peers can automatically discover the specific schema. A local matchmaking engine is
adopted as well in order to automatically generate the relations between its own schema and the
retrieved ones.
Originality/value This paper provides a framework for semantic data integration in P2P
networks.
Keywords Digital libraries,Internet, Information networks
Paper type Research paper
Introduction
Integrating heterogeneous data and information is a ubiquitous problem. Cooperative
digital libraries (DLs), scientific communities, and average people with common
interests are inclined to make their collections online as well as accessing the others’
information resources. With the exponential growth of online resources, it is easy for
the user to get overwhelmed by the information flood. In order to search and process
the information in a more efficient way, schemas are introduced to describe the basic
structural information of the collections. Meanwhile, users prefer to have their own
annotating schemas for their collections because they feel more accustomed to the
semantic interpretations. Besides, since different DLs may be aimed at different users,
domains or even topics, it is almost impossible to describe everything just in one huge
schema. Instead, a large volume of heterogeneous schemas are created. Therefore,
integrating data and information in such a large-scale and heterogeneous environment
becomes a challenge.
The Emerald Research Register for this journal is available at The current issue and full text archive of this journal is available at
www.emeraldinsight.com/researchregister www.emeraldinsight.com/0143-5124.htm
LM
26,4/5
218
Library Management
Vol. 26 No. 4/5, 2005
pp. 218-229
qEmerald Group Publishing Limited
0143-5124
DOI 10.1108/01435120510596071

To continue reading

Request your trial

VLEX uses login cookies to provide you with a better browsing experience. If you click on 'Accept' or continue browsing this site we consider that you accept our cookie policy. ACCEPT