would be a useful tool for facilitating multilingual
discovery of cultural heritage information.
The fact that a link between records has been
discovered is valuable information in itself, and this
linkage may be recorded in one or both datasets.
These external links may later be used for enriching
datasets "on the fly" or for monitoring changes to the
linked dataset. An example of this approach is the
datos.bne.es service from the National Library of
Spain, which uses already established links to
VIAF to enrich its records with links to
the authority records of other national libraries
(Vila-Suero et al., 2013).
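The idea of reusing an existing link to a hub dataset can be
sketched as follows. This is a minimal illustration, not the
datos.bne.es implementation: the function name enrich_via_hub,
the dictionary layout and all example.org URIs are hypothetical,
and the hub data is held in memory instead of being fetched
from a real service.

```python
def enrich_via_hub(record, hub_clusters):
    """Return a copy of `record` whose links also include the records
    that the hub groups together with it (hypothetical data layout)."""
    enriched = dict(record)
    links = list(record["links"])
    for hub_uri in record["links"]:
        # Records that the hub lists for this cluster; empty if the
        # link does not point to a known hub cluster.
        for other_uri in hub_clusters.get(hub_uri, []):
            if other_uri != record["id"] and other_uri not in links:
                links.append(other_uri)
    enriched["links"] = links
    return enriched


# Illustrative data only: one local record already linked to a hub cluster.
local_record = {
    "id": "http://example.org/library-a/authority/1",
    "links": ["http://example.org/hub/cluster/42"],
}
hub_clusters = {
    "http://example.org/hub/cluster/42": [
        "http://example.org/library-a/authority/1",
        "http://example.org/library-b/authority/7",
        "http://example.org/library-c/authority/9",
    ],
}
enriched = enrich_via_hub(local_record, hub_clusters)
```

Because the function returns a new record and leaves the stored
one unchanged, the enrichment can be recomputed whenever the hub
data changes.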
Linked Data is a technique for publishing data
on the Web in a way that facilitates object
interlinking and data access "on the fly"
(Berners-Lee, 2006; Bizer et al., 2009). Data is
published so that its identifiers (URIs) can be
dereferenced (i.e. users can retrieve structured
information about the identified objects online by
making HTTP requests), and the published data can
include the URIs of linked objects.
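Dereferencing a URI can be sketched with the Python standard
library. This is only an illustration under stated assumptions:
the function names are hypothetical, the requested media type
(application/rdf+xml) is one common choice and not something the
text prescribes, and whether a given server honours the Accept
header depends on that server.

```python
import urllib.request


def build_dereference_request(uri, media_type="application/rdf+xml"):
    """Prepare an HTTP GET request that asks the server for a
    structured (RDF) representation of the identified object."""
    return urllib.request.Request(
        uri, headers={"Accept": media_type}, method="GET"
    )


def dereference(uri, media_type="application/rdf+xml", timeout=10):
    """Send the request and return the response body as text.
    Requires network access; redirects are followed by urlopen."""
    request = build_dereference_request(uri, media_type)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset)
```

Calling dereference() with the URI of a published record would
return its current description, which is what makes access
"on the fly" possible.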
Information published as Linked Data (e.g. the
LCSH dataset used in the experiment) is well suited
for data enrichment: (1) the data is published on the
Web, making it possible for users to find it, reuse it
and link to it; (2) the Linked Data model makes it
easy to enrich records with new information; and (3)
the records have web-accessible URI identifiers
for accessing up-to-date information about them.
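Point (2) can be illustrated by treating a record as a set of
(subject, property, value) statements, so that enrichment amounts
to adding statements about the same URI. The sketch below is an
assumption-laden example and not the procedure used in the
experiment: the function name, the example.org URIs, the sample
labels and the choice to store a remote preferred label as a
local alternative label are all hypothetical. Only the SKOS
property URIs are taken from the SKOS vocabulary.

```python
SKOS = "http://www.w3.org/2004/02/skos/core#"


def enrich_record(local_triples, remote_triples, local_uri, remote_uri,
                  property_map):
    """Return the local statements plus (a) the link to the remote record
    and (b) remote values copied under the mapped local property."""
    enriched = set(local_triples)
    # Record the discovered link itself.
    enriched.add((local_uri, SKOS + "exactMatch", remote_uri))
    for subject, prop, value in remote_triples:
        if subject == remote_uri and prop in property_map:
            enriched.add((local_uri, property_map[prop], value))
    return enriched


# Illustrative data only; labels are (text, language tag) pairs.
LOCAL = "http://example.org/local/subject/1"
REMOTE = "http://example.org/remote/subject/9"
local_triples = {(LOCAL, SKOS + "prefLabel", ("Bibliotēkas", "lv"))}
remote_triples = {(REMOTE, SKOS + "prefLabel", ("Libraries", "en"))}

enriched = enrich_record(
    local_triples, remote_triples, LOCAL, REMOTE,
    property_map={SKOS + "prefLabel": SKOS + "altLabel"},
)
```

Since the result is a set, repeating the enrichment with the same
remote data adds nothing new.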
The National Library of Latvia is in the process
of publishing its authority data as Linked Data.
Once this dataset is published, it will enable the
benefits listed above, such as the opportunity for
other users to explore and link to NLL's authority
data. The data published by NLL's Linked Data
service will be enriched with additional information,
including Linked Data from other data sources.
6 CONCLUSIONS
Dataset interlinking creates new opportunities for
data quality improvement and data enrichment.
This paper discussed principles for dataset
linking and improvement, and presented results of
an experiment for linking and enriching library
authority data.
The experiment was conducted using the
National Library of Latvia authority file and Linked
Data from the Library of Congress. The experiment
helped us to identify and fix data quality issues in the
NLL-SH dataset and to enrich it using information
from matching LCSH records. Links between
taxonomy records from the two datasets may be
used for multilingual discovery of bibliographic
data.
Datasets that are published as Linked Data are
especially useful for data enrichment as their records
are available "on the fly" and may include links to
other related datasets. The National Library of
Latvia is in the process of publishing its authority
file as Linked Data, making it possible for users
worldwide to reuse it and to interlink it with other
datasets.
ACKNOWLEDGEMENTS
This research is a part of the project "Competence
Centre of Information and Communication
Technologies" by IT Competence Centre, contract
No. L-KC-11-0003, co-financed by European Regional
Development Fund, Research No. 1.18 “Data array
quality analysis and enhancement technologies”.
More information: http://www.itkc.lv/.
REFERENCES
Berners-Lee, T. (2006). Linked Data – Design Issues.
W3C [online, accessed: 2015-03-31]. Available from:
http://www.w3.org/DesignIssues/LinkedData.
Bizer, C., Heath, T. and Berners-Lee, T. (2009). Linked
Data – The Story So Far. International Journal on
Semantic Web and Information Systems, 5(3), 1-22.
Hickey, T. B. and Toves, J. (2014). Managing Ambiguity
in VIAF. D-Lib Magazine, 20(7), 3.
Miles, A. and Bechhofer, S. (eds.) (2009). SKOS Simple
Knowledge Organization System Reference. W3C
Recommendation. Available from:
http://www.w3.org/TR/skos-reference.
Stūrmane, A., Eglīte, E. and Jankevica-Balode, M. (2014).
Subject Metadata Development for Digital Resources
in Latvia. Cataloging & Classification Quarterly,
52(1), 20-31.
Summers, E., Isaac, A., Redding, C. and Krech, D. (2008).
LCSH, SKOS and Linked Data. In Proceedings of the
2008 International Conference on Dublin Core and
Metadata Applications (DC-2008), pp. 25-33. Dublin
Core Metadata Initiative.
Vila-Suero, D., Villazón-Terrazas, B. and Gómez-Pérez,
A. (2013). datos.bne.es: a Library Linked Dataset.
Semantic Web, 4(3), 307-313.
DATA 2015 - 4th International Conference on Data Management Technologies and Applications