to retrieve only consultants.
Types are very important for query building, as
we can propose to the user the list of types in which
he can pick to refine its query.
Extracting the answer is not trivial too, as user
could skip several intermediary nodes, because of not
knowing it or simply to have a simpler query. Thus
asking for a consultant skilled in java programming
language could return a graph containing a node
“object languages” that gather several languages like
java, c++, c#, etc. More generally, simple queries
could return complex chains of dependencies: the
graph matching algorithm has to deal with this.
3.5 Result Interpretation
The query’s result is a set of sub-graphs extracted
from the global graph and matching the query graph.
Representing this result in a human readable manner
is not so trivial. It is easy to use a graphical
representation where nodes are boxes and links are
arrows. This form is acceptable for a human reading
as long as the result set is not too big, but becomes
unusable if there are hundreds or thousands of nodes.
Addressing this issue is not quite simple. Many
studies have been done to try to solve it and many
approaches exist (Shengqi et al., 2014), (
Bergmann et
al., 2014) with different approaches. However there is
not any universal good approach. Each need may
have own adapted representation. The goal is here to
determine a good representation in existing tools, and
if needed (and possible), adapt it to closely cover the
specific need of our project.
3.6 Self-learning
Automatic learning concerns the design, analysis,
development and implementation of methods
allowing to a machine to evolve in a systematic
process, and so fulfill the tasks difficult or impossible
to fill by more conventional algorithmic means.
There are some kinds of self-learning algorithms.
In our project we develop a self-learning method
based on the user feedback. This method refers to a
class of automatic learning problems, where the aim
is to learn from experience, to optimize a quantitative
reward over time.
The user feedback acts as a reward, and is used to
improve the search algorithm, which in turn will be
able to provide more accurate results.
4 CONCLUSION
Most of the techniques described above to achieve the
goal of opportunity detection are recent research
subjects. Some partial answers already exist, but it
remains a lot of issues, difficulties and weakness in
the big data mining and in the graph based knowledge
representation. This research will try, using a test case
of business opportunity detection, to address some of
them and to propose original solutions to increase the
efficiency and accuracy of the knowledge mining and
restitution in the big data.
REFERENCES
Bergmann, G., Hegedüs, Á., Gerencsér, G., & Varró, D.,
2014, ‘Graph Query by Example’, in CMSEBA in
conjunction with MoDELS, pp. 17-24.
Ching-Yung Lin., 2014. ‘Graph Computing and linked big
data’, Keynote speech at International Conference on
Semantic Computing.
Diebold, F. X., 2012, ‘A Personal Perspective on the
Origin(s) and Development of Big Data: The
Phenomenon, the Term, and the Discipline’, Penn
Institute for Economic Research, Department of
Economics, University of Pennsylvania.
Fan, S., Lau, R. Y. & Zhao, J. L., 2015, ‘Demystifying big
data analytics for business intelligence through the lens
of marketing mix’, Big Data Research, vol. 2, no 1, pp.
28-32.
Fan, W. & Bifet, A., 2012, ‘Mining big data: Current status,
and forecast to the future’, ACM SIGKDD
Explorations Newsletter, Vol. 14, no 2, pp. 1-5.
Howe, A. D., Costanzo, M., Fey, P., Gojobori, T., Hannick,
L., Hide, W., Rhee, S. Y., 2008, ‘Big data: The future
of biocuration’. Nature, Vol. 455, no 7209, pp. 47–50.
Laurila, J. k. et al., 2012, ‘The Mobile Data Challenge: Big
Data for Mobile Computing Research’, Nokia
Workshopp, in conjunction with Int. Conf. on Pervasive
Computing, no EPFL-CONF-192489
Letouzé, E., 2012. Big data for development: Challenges
and opportunities, UN Global Pulse.
Probst, L. et al., 2013, ‘Big data Analytics and decision
making’, Business Innovation Observatory, European
Commission.
Shengqi Yang, Yinghui Wu, Huan Sun, and Xifeng Yan,
2014, ‘Schemaless and structureless graph querying’,
Proc. VLDB Endow. Vol. 7, no 7 pp. 565-576.
Trelles, O., Prins, P., Snir, M. & C.Jansen, R., 2011, ‘Big
data, but are we ready?’, Nature, Vol. 12, no 224.
Valerie Bönström, Annika Hinze, Heinz Schweppe, 2003.
‘Storing RDF as a Graph’, 1st Latin American Web
Congress, pp.27-36
World Economic Forum, 2012. Big Data, Big Impact: New
Possibilities for International Development.