of SPARQL query and semantically similar
attributes. Semantically similar attributes are
generated by the M/R module. Each partitioned
answered set is stored in a separate file with unique
key value for direct access.
4 USE CASE - SEMANTIC MUSIC
SERVICE
This section describes the example of SWP
construction with our proposed M/R execution
modules. We developed a semantic music service
and the data navigator browsing a music metadata
with a FLEX based user interface. When user types
a keyword, it can be accessed in musical entities
such as artist, album and song and some metadata
including artist’s birth day and release date of album
or song as Fig.3.
Figure 3: Semantic music service.
In this implementation, we applied only improved
data storing process to reduce processing time of
making RDF triple indexes. Firstly, we gathered a
music database supplied by KBS (Korea
Broadcasting System), an open API crawling from
madiadb.com database and RDF formatted
Musicbrainz dataset. We covered 8 million songs
from MusicBrainz and 1 millon from KBS and
ManiaDB, 0.5 million artist profiles and 1 million
albums including each relationships data.
From these dataset, we made over 10 million
RDF files based on the simple music ontology
(http://wiki.musicontology.com) and generated over
200 million triples using ten Hadoop instances of the
iCube cloud(https://www.icubecloud.com) platform
made by NEXR in South Korea.
In the future, we will implement proposed
modules for helping reasoning and data accessing
steps and evaluate our workflow compared with pre-
existing methods. Also, the other important issues
involving massive computation like pre-raking of
answer set will be dealt with in this project.
ACKNOWLEDGEMENTS
This work was supported in part by MKE & KEIT
through the Development of Independent
Component based Service-Oriented Peta-Scale
Computing Platform Project.
REFERENCES
Lei, Y., Uren, V., Motta, E., 2006. SemSearch: A Search
Engine for the Semantic Web, In proceeding of
European Semantic Conference 2006.
Hartmann, J.; Sure, Y.; Volz, R.; Studer, R., 2003.
Extended OntoWeb.org Portal, OntoWeb Deliverable
6.4
Lausen, H., Ding, Y., Stollberg, M., Fensel, D., Hernández,
R., Han, S., 2005. Semantic web portals: state-of-the-
art survey Export, Journal of Knowledge
Management, Vol. 9, No. 5, 2005.
Staab, S., Angele, J., Decker, S., Erdmann, M., Hotho, A.,
Maedche, A., Schnurr, H., Studer, R., Sure, Y., 2000.
Semantic Community Web Portals. The International
Journal of Computer and Telecommunications
Networking archive. Volume 33, Issue 1-6, 2000.
Bizer, C., Schultz, A., Berlin SPARQL Benchmark, 2009
(BSBM), http://www4.wiwiss.fu-
berlin.de/bizer/BerlinSPARQLBenchmark
Dean, J., Ghemawat, S., 2008. MapReduce: Simplified
data processing on large clusters, Communications of
the ACM, v.51 n.1, January 2008.
Guo, Y., Pan, Z., Heflin., J., 2005. LUBM: A Benchmark
for OWL Knowledge Base Systems, Journal of Web
Semantics 3(2), 2005.
Urbani, J., Kotoulas, S., Oren, E., Harmelen, F., 2009.
Scalable distributed reasoning using MapReduce. In
proceeding of ISWC 2009.
Weaver, J., Hendler, J., 2009. Parallel materialization of
the finite RDFS Closure for Hundreds of Millions of
Triples, International Semantic Web Conference 2009.
Naumann, F., Bilke, A., Bleiholder, J., Weis, M., 2006.
Data fusion in three steps:Resolving schema, tuple,
and value inconsistencies, IEEE Data Eng. Bull., 2006.
KEOD 2010 - International Conference on Knowledge Engineering and Ontology Development
488