Domain Ontology for Time Series Provenance

Lucélia de Souza

1,2

, Maria Salete Marcon Gomes Vaz

2,3

and Marcos Sfair Sunye

Computer Science Department, State University of Center-West, Guarapuava, Paraná, Brazil

Informatics Department, Federal University of Paraná, Curitiba, Paraná, Brazil

Informatics Department, State University of Ponta Grossa, Ponta Grossa, Paraná, Brazil

Keywords: OWL, Modules, Time Series Processing, Trend Extraction, Nonstationary Time Series.

Abstract: Time series data are generated all the time with a volume without precedent, constituting themselves of a

points sequence spread out over time, usually at time regular intervals. Time series analysis is different from

data analysis, given its intrinsic nature, where observations are dependent and the observations order is

important for analysis. The knowledge about the data which will be analyzed is relevant in an analysis

process, but this knowledge is not always explicit and easy to interpret in many information resources. Time

series can be semantically enriched where provenance information using ontologies allows to representing

and inferring knowledge. The main contribution of this paper is to present a domain ontology developed by

modular design for time series provenance, which adds semantic knowledge and contributes to the choice of

appropriate statistical methods for an important step of time series analysis that is the trend extraction

(detrending). Trend is a time series component that needs be extracted because it can hide other phenomena,

as well as the most statistical methods are developed for stationary time series. With this work, is intended

to contribute for semantically improving the decision making about trend extraction step, facilitating the

preprocessing phase of time series analysis.

1 INTRODUCTION

The scientific knowledge generation, in several

domains, is related with the time series analysis,

from which is extracted useful information. Time

series data are characterized by way as they were

generated and collected, usually at time regular

intervals (Chandler and Scott, 2011; Cryer and

Chan, 2008).

Time series analysis is usually done in two

phases, preprocessing and data analysis, both

containing processing steps in order to obtain

scientific knowledge. Time series analysis is

different from data analysis, given its intrinsic

nature, where observations are dependent or

correlated and the observations order is important

for analysis. Statistical procedures and traditional

techniques based on assumptions of independent and

identically distributed data are not applied in time

series. This way, are necessary different methods of

analysis (Cryer and Chan, 2008).

In time series analysis, provenance information,

such as What the observation type of time series?,

How the time series were generated?, What is the

decomposition model used?, What assumptions were

considered?, How the time series data can be

classified according to assumptions?, What the trend

type considered?, among other information,

allowing the researcher to interpret the data better

and to use appropriate statistical methods,

specifically developed regarding its characteristics.

Hair et al (2010) asserts that the knowledge

about data that will be analyzed is important in an

analysis process. However, according to Hebeler et

al (2009), this is not present in several information

resources. Such knowledge is not always explicit

and easy to interpret. As well as in data analysis,

time series also can be semantically enriched where

provenance information using ontologies allows

representing and inferring knowledge.

This paper describes Time Series Ontology

(namespace tso:), a domain ontology (a module in

Ontology Web Language - OWL) with the definition

of main concepts and relationships involving time

series provenance. The proposed ontology adds

semantic knowledge in time series, contributing to

choose of appropriate statistical methods for an

important step of analysis that is the trend extraction,

also called detrending (Wu et al, 2007; Meinl, 2011).

Besides of this section, Section 2 describes time

217

de Souza L., Marcon Gomes Vaz M. and Sfair Sunye M..

Domain Ontology for Time Series Provenance.

DOI: 10.5220/0004886502170224

In Proceedings of the 16th International Conference on Enterprise Information Systems (ICEIS-2014), pages 217-224

ISBN: 978-989-758-028-4

 2014 SCITEPRESS (Science and Technology Publications, Lda.)

series. The Section 3 relates ontologies and the

Section 4 reports time series provenance. Section 5

presents the development of the domain ontology.

Section 6 brings a comparison with related work.

Finally, Section 7 describes the conclusions and

future works perspectives.

2 TIME SERIES

Time series are an ordered sequence of observations,

usually obtained at time regular intervals (Chandler

and Scott, 2011; Cryer and Chan, 2008). Time series

are correlated data, being preferable the use of

analysis methods specifically developed to this data

type. Chandler and Scott (2011) explain that the

choice of appropriated methods of analysis depends

on questions of interest, where the knowledge about

time series data is essential.

The first step in any time series analysis is the

careful observation of plotted data over the time.

This procedure frequently suggests statistical

methods of analysis, as well as the statistics that

summarize the information about the data.

A time series can have the following classical

components (Meinl, 2011): Seasonal, Cyclic,

Irregular and Trend. In environmental sciences, the

trend is defined by Chandler and Scott (2011) as a

long-term temporal variation in statistics properties

of a process. However, the period of trend is

dependent of each application.

In environmental applications, among the

possible reasons for trend analysis stands out the

analysis of systems, where long-term changes can to

obscure aspects of real interest (Chandler and Scott,

2011). In this case, the first step in the time series

analysis is the identification and extraction of trend

to clearly understand the inter-relationships of data.

This paper is related with this reason, where trend is

clearly defined and needs to be extracted, otherwise

to make difficult the knowledge discovery process.

To many analyses, trend extraction is an

important step because it can to hide other

phenomena, as well as the most statistical methods

are developed for stationary time series (Montesino-

Pouzols and Lendasse, 2010). It making a stationary

time series means to extract all the deterministic

features, such as statistics measures of mean and

variance, in such way that correlations turn

themselves independent over time.

The need of modeling these and others

characteristics must be considered in order to

determine an adequate strategy in the analysis

(Chandler and Scott, 2011). The definitions and the

main features of the time series presented contribute

for the choice of appropriate statistical methods.

Such descriptions are considered in development of

the domain ontology as way of adding semantic

knowledge. The next section describes Ontologies

used to generation of time series provenance.

3 ONTOLOGIES

Ontology is a formal and explicit specification of a

shared conceptualization (Borst 1997). Kiryakov

(2006) defines ontology as a set of classes

representing concepts in domain, relationships

between concepts, axioms used to modeling

restrictions and rules and instances of classes,

constituting a knowledge base.

Guarino (1998) classifies ontologies according to

the generality level, where domain ontologies

constitute vocabularies about a generic domain. In

this work the domain of ontology is related with

time series provenance.

Although Resource Description Framework -

RDF and OWL were not developed to given support

for numeric concepts, because they depend on

schema definition and are based on the eXtensible

Markup Language - XML, the set of upper level

ontologies Semantic Web for Earth and

Environmental Terminology - SWEET

<http://sweet.jpl.nasa.gov/ontology/> is a good

example to integrate mathematical knowledge with

scientific application domains (Lange, 2013). In this

work some statements from SWEET are reused and

extended.

In the Semantic Web defined by Berners-Lee

(2001), the W3C Standard defines the effort of the

Linking Open Data (LOD) community where is

increased the number of data providers that

publishing and interlinking data on the web.

The actual web of data consists of billion of RDF

triples, in several domains. The DBpedia knowledge

base <http://wiki.dbpedia.org/About> is a central

hub of interlinking of the emerging data web, which

extracts structured information from Wikipedia and

makes them available on the web. In this work, some

instances of the classes of knowledge base are

associated with definitions from DBpedia, allowing

semantic interoperability.

The modularization involves identifying one or

more modules in ontology. A module is considered

as one significant and self-contained part of

ontology. Although there is no universal way for

modularization, the choosing of a particular

technique must be guided by application

ICEIS2014-16thInternationalConferenceonEnterpriseInformationSystems

218

requirements (Suárez-Figueroa et al, 2012). The next

section describes four use cases related with time

series provenance.

4 TIME SERIES PROVENANCE

There are many definitions for provenance in

literature (Moreau, 2010). Usually, it refers to origin

or source of something. The knowledge about the

data source in domains where the volume of data

that need to be analyzed is excessive, it contributes

to prove the correctness of resultant data, enabling

the understanding about how the data were

generated. Tan (2007) comments that provenance

information is considered as important as the result

itself.

Besides of ways coarse-grain and fine-grain to

generate provenance (Tan, 2007), another approach

to provenance considers the use of semantic

information based on ontologies, modeling concepts

and relationships used in the generation of

provenance, contributing with inferences to

discovering of implicit knowledge by means of

languages as RDF and OWL. This approach

provides as advantages the semantic description of

the context, improvements in queries and proofs of

origin and looking of interoperability of generated

data (Moreau et al, 2011).

In this work, time series provenance contributes

to the researcher to get information about the origin

and other characteristics of time series, helping and

facilitating the decision making about the use of

appropriate statistics procedures. In the following,

are presented four use cases involving real time

series data, which are related in Section 5.

- Use Case 1: in this case, it is important for the

researcher to known, by instance, if discrete

nonstationary time series presenting extreme events

such as outliers (Hair et al, 2010). This information

contributes to choose of an appropriate statistic

method for use, as a robust method (Chandler and

Scott, 2011).

- Use Case 2: another case is when the

researcher can identify the appropriate statistical

technique considering the observation type

(regularly or irregularly sampled). For analysis, the

autocorrelation function needs multiple pairs of

observations to quantify the serial dependence. In

this situation, the time series need to be regularly

spaced and with little bit of missing data. This

approach is not appropriate for observations in

intervals highly irregular, needing alternatives

techniques (Chandler and Scott, 2011).

- Use Case 3: also it is important to the

researcher to know the generator process and in

which statistic measure occurs the nonstationarity of

the time series, that is, in the mean or in the

variance. The nonstationarity in the mean can be

removed by differencing, for instance. However, in

inhomogeneous time series in the variance

(heteroscedastic time series), to reduce

nonstationarity, other transformations in the data are

needed (Wei, 2006).

- Use Case 4: another case is about the time

series decomposition model. This information

contributes to researcher to choose appropriate

statistical methods to trend removal of the time

series. For instance, if the time series were

decomposed additively (Yaffee and McGee, 2000),

the estimated trend is subtracted from data. To a

multiplicative decomposition model, this is done by

the division of time series by values of trend. In the

next section the ontology for time series provenance

is presented, developed as an OWL module.

5 ONTOLOGY FOR TIME

SERIES PROVENANCE

The methodology for development of domain

ontology is based on Ontology Development 101

(Noy et al., 2001). In parallel to this classic

methodology, the modularization of the ontologies

was considered (Suárez-Figueroa et al, 2012), where

ontologies are developed in separated parts, from the

self-contained form, being important to a sub-

domain or task, allowing scalability. Applied to

Ontology Engineering, modularity is central for

reducing the complexity of understanding and

maintenance, querying and reasoning over modules

(Kutz and Hois, 2012).

The modular design describes Time Series

Ontology (namespace tso:) related to time series

provenance and Detrend Ontology (namespace do:)

which describes statistical methods for trend

estimation. These modules (ontologies) are used in

the Detrend Provenance Model (namespace dpm:)

(De Souza et al, 2014) that reuse and extend the

Open Provenance Model - OPM (namespace opmo:)

(Moreau et al, 2011) as means of generating

semantic knowledge about detrending time series.

The Detrend Ontology and the Detrend Provenance

Model are not described in this paper.

It was identified, along with experts, a set of

competence questions that the ontology should be

able to answer, involving intrinsic features about

DomainOntologyforTimeSeriesProvenance

219

time series data and its components. They were

identified based on conceptual W7 Model (Ram and

Liu, 2009), which contributes to define, capture and

to use data provenance, presenting seven inter-

connected elements: What?, When?, Where?, How?,

Who?, Which? and Why?. These elements can be

used to track events that affect the data during its

lifetime. This provenance model is general and

extensible for capture provenance semantics for data

in different domains (Ram and Liu, 2009).

From these questions, the classes and its

relationships were identified, as well as the

instances. Restrictions on the classes and relations

are declared using axioms and/or rules, providing

semantics and allowing inferences by a reasoner in

the knowledge base. The elements from ontologies

are represented in this paper between parentheses.

The reuse from ontologies SWEET utilizes

namespaces of its sub-ontologies. For instance, the

subclass (phen:StochasticProcess) is declared as a

disjoint subclass of (tso:NonStochasticProcess). The

object property (rela:hasPhenomena) is reused and

(tso:hasDynamicalPhenomena) was created and

extended in (tso:hasStochasticProcess) and

(tso:hasNonStochasticProcess) which are declared as

disjoint properties.

The Time Series Ontology describes time series

related to nonstationary processes, which presenting

trends. These time series are the rule and are not the

exception in several application domains.

In relation to scope, are not included statistical

methods to transform the time series, which are

modeled in detrend module. About scalability, on

the one hand, the ontology is extensible due to reuse

of the triples from set of Ontologies SWEET, which

can be extended based on these ontologies.

The Classes Diagram developed in Ontograf

Plugin from Protégé 4.1, presents the main classes

related with the class (tso:TimeSeriesData) and the

respective provenance elements of W7 Model

(Figure 1). The classes represent the main

definitions and features of time series, including:

processes that generated them and time related;

analysis type associated and assumptions

considered; observation type done and how they can

be classified according to assumptions, knowledge

domain, collection, scientific instrument or generator

software related; models and types of decomposition

of time series and its components, as well as the

event component and the mathematical property

associated.

All classes are noted by means of the tag

(rdfs:comment), identifying which is the source of

the definition. This contributes to the understanding

of the concepts, as well as allows us to known which

is the provenance of the definitions. The data

properties and object properties are also noted by

means of this tag. Also the tag (rdfs:label) is used for

labeling the elements from this module.

The association of instances with DBpedia

resources allow, besides knowing its provenance,

obtaining more information about the data. This

way, it is possible to obtain semantic interoperability

about such concepts with LOD.

The ontology presents a classification as the

assumptions declared by means of defined rules.

Below some rules are presented that infers

knowledge about the time series.

tso:TimeSeriesData(?x),

tso:hasObservationType(?x,

tso:Regularly_Spaced) ->

tso:hasTimeSeriesAssumption(?x

tso:Homogeneity)

(1)

tso:TimeSeriesData(?x),

tso:hasTimeSeriesAssumption(?x

, tso:Homogeneity) ->

tso:hasObservationType(?x,

tso:Regularly_Spaced)

(2)

tso:TimeSeriesData(?x),

tso:hasTimeSeriesAssumption(?x

, tso:Homogeneity) ->

tso:HomogeneousTimeSeries(?x)

(3)

tso:TimeSeriesData(?x),

tso:hasTrendType(?x,

tso:Deterministic_Trend) ->

tso:NonStationaryTimeSeries(?x

)

(4)

The two first rules are related with the observation

type of time series (regularly or irregularly spaced)

and its classification, as the type of time series

related. If the observation is regularly spaced, the

time series are declared as presenting the

Homogeneity Assumption.

Also if the researcher declares that the time

series presenting the Homogeneity Assumption, they

are inferred how regularly spaced time series. The

opposite also occurs, when the observation is

irregularly spaced, the time series are inferred as

presenting the Heterogeneity Assumption.

According to third rule, if time series present the

Homogeneity Assumption, they are classified as

being of the type Homogeneous, belonging to class

(tso:HomogeneousTimeSeries). The same occurs

with the Heterogeneity Assumption.

When the time series presenting some type of

trend, for instance deterministic, according to fourth

rule, the same are inferred in the class

(tso:NonStationaryTimeSeries), where the trend

ICEIS2014-16thInternationalConferenceonEnterpriseInformationSystems

220

Figure 1: TimeSeriesData Class and relationships with conceptual elements of the W7 Model.

component needs be extracted because it can hide

other phenomena, as well as the most statistical

methods are developed for stationary time series.

Figure 2 shows the extension from SWEET

Ontology related with the class

(phen:StochasticProcess), extended with the classes

(tso:NonStationaryProcess), (tso:StationaryProcess)

and its respective subclasses. According to a defined

rule, if the nonstationarity occurs in the mean and

the generator process of time series is considered

(tso: Stationary_Difference_Process), the inference

done is (tso:Stochastic_Trend). In the same way, if

the generator process of time series is

(tso:Stationary_Trend_Process), the inference upon

the trend type related is (tso:Deterministic_Trend),

done by Pellet reasoner (Sirin et al, 2007).

The information about the trend type is relevant

for the researcher to choose appropriate statistical

methods for detrending time series. In Figure 2(a),

the trend can be removed by Differencing, where the

same can be subtracted from time series. And in

Figure 2(b), the trend can be estimated by adjusting

of deterministic method as Regression Analysis that

fits a model to the trend, which is subsequently

removed from them.

Considering the Uses Cases (Section 4), Figure 3

presents a query about Use Case 1, showing which

nonstationary time series and its mathematical

property presenting some extreme event component.

Such knowledge contributes for the researcher to

choose statistical methods specifically developed for

these features as a robust method. The association of

the instances with DBpedia allows getting more

knowledge, besides contributing to semantic

interoperability.

About Use Case 2, Figure 4 shows features of

time series such as observation type, observation

interval and missing data percent. This knowledge

allows quantifying the autocorrelation function that

measures the dependence among sucessive

observations.

In Use Case 3 (Figure 5), the knowledge about in

which process and statistical measure the

nonstationarity occurs allows choosing an

appropriate method to extract it. About the Use Case

4 (Figure 6) describes about decomposition model,

bringing knowledge about the way as the trend can

be removed, in this case, by subtraction due to

additive decomposition model.

The ontology was evaluated by Ontologists and

Experts of time series area. For evaluation’s

applicability, were developed the following

documents: commitment term, describing the

purpose of the evaluation and about ethical

questions; list of competence questions; feedback

from evaluators about nomenclature, sources of

definitions and concordance in relation to the reuse.

The feedback from evaluators was analyzed and

considered in this module.

The semantic knowledge about time series

provenance, contributes meaningfully with the

analysis process, as the choice of appropriate

statistical methods that considers its characteristics.

DomainOntologyforTimeSeriesProvenance

221

Figure 2: TimeSeriesData class and its relationship with the phen:StochasticProcess class.

Figure 3: Features of nonstationary time series showing

extreme events, associated with DBpedia.

Figure 4: Observation type, missing data percent and

observation interval of time series.

6 RELATED WORKS

Henson et al (2009) presents an ontological

representation of time series observations for

Figure 5: Generator process and which statistical measure

the nonstationarity occurs in time series.

Figure 6: Model and type of Decomposition and

components related.

Semantic Sensor Web. It is described as time series

observations can be modeled in ontology, in order to

solve problems related to integration and queries.

This work is related to the modeling of observations

of time series from O&M XML Model. It is

presented as OWL allows data restrictions better

than XML, and also promotes semantic

ICEIS2014-16thInternationalConferenceonEnterpriseInformationSystems

222

interoperability. Nevertheless, this ontology

describes observations and measurements and does

not model intrinsic characteristics of time series.

The concept of semantic time series is defined by

Bozic (2011), where technologies from Semantic

Web are combined with time series processing

models, making possible its use in new applications.

However, this work proposes a generic language of

processing for simulation and modeling of the

semantic time series.

Other two works of the same author are Bozic

and Winiwarter (2013) and (Bozic and Winiwarter,

2012). The first is an extension of the second,

presenting a showcase about semantic time series

process. The showcase presents the functionality of

the Time Series Semantic Language (TSSL),

demonstrating as this technology can improve the

time series processing by the usage of a dedicated

language in a community building. This work shows

high practical impact in the time series process,

giving in new data source for applications of

semantic web.

Bozic and Winiwarter (2012) propose Time

Series Semantic Language – TSSL, a generic

community building language for semantic time

series, allowing observing data flux as data sensor

with additional information, tagging postings of

scientists with a specific search topic. The TSSL

architecture supports high level of expressivity, user-

friendly syntax, extensibility, allowing significant

data models. It presents the Time Series Processor -

TSP, which coordinates all workflow process of

time series. The main contribution of this paper is

the semantic time series processing, related to time

series data and also its meaning, creating new

information by means of links among different data

structures.

Comparing with related works, the same do not

describe intrinsic features of time series data. The

OWL module proposed in this work can be extended

and contributes to the researcher to know and

understanding better the data (time series),

facilitating the decision making about as turn them

stationary, improving semantically an important step

of preprocessing phase of time series analysis that is

the trend extraction.

7 CONCLUSIONS

The scientific contribution of this paper presents two

aspects. First, about ontology engineering, is

presented a study of case related to modular design

of ontologies, presenting the development of a

module related to time series provenance, developed

by separated way of related statistical procedures.

In this case, the modularity decrease the

modeling complexity, facilitate individual

evaluation, promoting the reuse and extensibility of

modules. Another contribution of this nature is the

generation of provenance information for a special

type of data, characterized by showing temporal

dependence of observations. Also, the definition of

the competence questions using expressions of the

W7 model, contribute to obtain knowledge about

time series provenance.

Second, the contribution that stands out is in time

series area, enriching them semantically, improving

the analysis process, facilitating the decision making

about appropriate statistical procedures and

contributing for the scientific knowledge generation.

The applicability of proposal domain ontology is

related with nonstationary time series, which

presenting trends. In development of ontology, was

considered the orthogonality of concepts, where the

same are decomposing in its components parts,

facilitating its extensibility.

The proposal ontology presents as main

advantages: the generation of provenance

information about time series; the reuse of

statements from set of Ontologies SWEET, allowing

semantic interoperability and extensibility; the

definition of classes using nomenclature from

bibliographies of time series analysis, contributing to

the understanding of concepts, which can be

visualized by an online documentation developed;

the association, when applicable, of the instances

with DBpedia enables to increase the ontology

definitions and contributes for semantic

interoperability with linked open data; and the use of

rules allows to infer more semantic knowledge. The

ontology also contributes to the choice of the

appropriate statistical methods, facilitating the

decision making in detrending step.

With the modular development is necessary to

select subject for module composition. In this work,

data and methods are considered as separated

modules, which are combined in the detrend

provenance model. Also, in this context, stands out

that the wide range of time series analysis area turns

it difficult the understanding about concepts and its

relationships.

The main contribution of this paper is based on

intersection of the following key topic of search:

time series and its components, provenance,

ontologies and the semantic web, resulting in the

generation of domain ontology for time series

provenance, as a means to enrich them semantically,

DomainOntologyforTimeSeriesProvenance

223

allowing logic inferences and the development of

queries.

As future works stands out the development of

an online environment using the proposal ontology,

contributing to facilitate and enrich semantically the

trend extraction step of preprocessing phase of the

time series analysis.

REFERENCES

Berners-Lee, T., Hendler, J., Lassila, O., 2001. The

semantic web. Scientific American, 284(5):34–43.

Borst, W. N., 1997. Construction of Engineering

Ontologies. Thesis. University of Tweenty Centre for

Telematica and Information Technology, Enschede,

Nederland.

Bozic, B., 2011. Simulation and Modeling of Semantically

Enriched Time Series. 19th International Congress on

Modeling and Simulation, Perth, Australia.

Bozic, B., Winiwarter, W., 2012. Community Building

Based on Semantic Time Series. iiWAS 2012:213-222.

Bozic B., Winiwarter, W., 2013. A Showcase of Semantic

Time Series Processing. IJWIS. Volume 9. Number

Chandler, R., Scott, M., 2011. Statistical Methods for

Trend Detection and Analysis In the Environmental

Sciences. First Edition. John Wiley & Sons, Ltd.

Cryer, J. D., Chan, Kung-Sik, 2008. Time Series Analysis

With Applications in R. Second Edition. Springer.

De Souza, L., Vaz, M. S. M. G., Sunye, M. S., 2014.

Modular Development of Ontologies for Provenance

in Detrending Time Series. Accept for presentation in

ITNG 2014, 7-9 April. Nevada, Las Vegas, EUA.

Guarino, N., 1998. Formal Ontology and Information

Systems. Nicola Guarino, editor, Proceedings of

FOIS’98, Trento, Italy, 6-8 June 1998, pages 3–15,

Amsterdam. IOS Press.

Hair, J. F. Jr., Black, W.C., Babin, B.J., Anderson, R. E.,

2010. Multivariate Data Analysis. 7th edition. Pearson

Prentice Hall.

Hebeler, J., Fisher, M., Blace, R., Perez-Lopez, A., Dean,

M., 2009. Semantic Web Programming. John Wiley &

Sons Inc., Chichester, West Sussex, Hoboken, NJ.

Henson, Cory, Neuhaus, H., Sheth, A., P., Thirunarayan,

K., Buyya, R., 2009. An Ontological Representation of

Time Series Observations on the Semantic Sensor

Web. ESWC 2009. Herkalion, Greece.

Kiryakov, A., 2006. Ontologies for Knowledge

Management. DAVIES, J. et al. (Eds). Semantic Web

Technologies: trends and research in ontology-based

systems, pages 115–138.

Kutz, O., Hois, J. 2012. Modularity in ontologies. Guest

Editorial. Applied Ontology. 7, 109-112. IOS Press.

Lange, C., 2013. Ontologies and Languages for

Representing Mathematical Knowledge on the

Semantic Web. Semantic Web, vol. 4, nr 2, pages 119-

158.

Meinl, T., 2011. A Novel Wavelet Based Approach for

Time Series Data Analysis. PhD Thesis. Ref. Prof. Dr.

Svetlozar Rachev. Karlsruhe.

Montesino-Pouzols, F., Lendasse, A., 2010. Effect of

Different Detrending Approaches on Computational

Intelligence Models of Time Series. IJCNN, pages 1–

8. IEEE.

Moreau, Luc, 2010. The Foundations for Provenance on

the Web. Foundations and Trends in Web Science.

Volume 2, Numbers 2-3 (2010). Pages 99-241.

Moreau, L., Clifford, B., Freire, J. Futrelle, J. Gil, Y.,

Groth, P., Kwasnikowska, N., Miles, S., Missier, P.,

Myers, J., Plale, B., Simmhan, Y., Stephan, E., den

Bussche, J. V., 2011. The Open Provenance Model

Core Specification (v1.1). Future Gener. Comput.

Syst., 27(6):743–756.

Noy, N. F., McGuinness, D, L., 2001. Ontology

Development 101: A Guide to Creating Your First

Ontology. Development, 32(1):1–25.

Ram, S., Liu, J., 2009. A New Perspective on Semantics

of Data Provenance. SWPM, 2009.

Sirin, E., Parsia, B., Grau, B.C., Kalyanpur, A., Katz, Y.

2007. Pellet: A practical OWL-DL Reasoner. Web

Semant., 5(2):51–53.

Suárez-Figueroa, M. C., Gómez-Pérez, A., Motta, E.,

Gangemi, A. 2012. Ontology Engineering in a

Networked World. Berlin, Springer.

Tan, W.C., 2007. Provenance in Databases: Past, Current,

and Future. IEEE Data E. Bull., 30(4):3-12.

Wei, W. S. W., 2006. Time Series Analysis. Univariate

and Multivariate Methods, 2

edition. Pearson

Education.

Wu, Z., Huang, N. E., Long, S. R., Peng, Chung-Kang,

2007. On the Trend, Detrending, and Variability of

Nonlinear and Nonstationary Time Series. Proc. of the

National Academy of Sciences, 104(38):14889–14894.

Yaffee, R. A., McGee, M., 2000. Introduction to Time

Series Analysis and Forecasting with Applications of

SAS and SPSS. Acad. Press, Inc. Orlando, FL, USA.

ICEIS2014-16thInternationalConferenceonEnterpriseInformationSystems

224