Figure 1: Integration of bioinformatics data and services
using mashup technology.
We are considering using an open source mashup
server such as the WSO2 Mashup Server
(http://wso2.org/projects/mashup), which supports
the different data formats commonly used by biological
databases. The JackBe Presto Enterprise Mashup Server
(http://www.jackbe.com/) is also a potential
candidate, as it provides many interesting features.
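As a rough illustration of what a mashup over heterogeneous formats involves, the sketch below merges records from two hypothetical sources, one returning XML and one returning JSON, into a single view keyed by gene identifier. The payloads, the field names, and the function mash_up are assumptions made for illustration only; the sketch does not use the WSO2 or JackBe APIs.

```python
import json
import xml.etree.ElementTree as ET

# Hypothetical payloads standing in for responses from two biological databases.
SEQUENCE_XML = """
<entries>
  <entry id="G1"><sequence>ATGC</sequence></entry>
  <entry id="G2"><sequence>GGCA</sequence></entry>
</entries>
"""
ANNOTATION_JSON = (
    '[{"id": "G1", "function": "kinase"},'
    ' {"id": "G3", "function": "transporter"}]'
)


def mash_up(xml_text, json_text):
    """Merge both sources into one dict keyed by gene identifier."""
    merged = {}
    # Source 1: XML entries carrying a sequence per identifier.
    for entry in ET.fromstring(xml_text).findall("entry"):
        merged.setdefault(entry.get("id"), {})["sequence"] = entry.findtext("sequence")
    # Source 2: JSON records carrying a functional annotation per identifier.
    for record in json.loads(json_text):
        merged.setdefault(record["id"], {})["function"] = record["function"]
    return merged


combined = mash_up(SEQUENCE_XML, ANNOTATION_JSON)
```

Identifiers present in only one source are kept with the fields that are available, so the combined view is a union of both sources rather than an intersection.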
4 CONCLUSIONS
As a result of worldwide biological research
activities, data is spread across disparate databases
and organizations in various formats. Data mining
and analysis require comprehensive integration of
these heterogeneous data. The need for data and
services integration is widely recognized in the
bioinformatics community. Successful data
integration is one of the keys to enhanced
productivity in biology and biopharmaceutical R&D.
In this paper, we have provided an overview of
existing approaches to data and application
integration in bioinformatics. We have also
discussed the promise of two emerging technologies,
service-oriented architecture and mashups.
Furthermore, we have presented our work in
progress on implementing an architecture that relies
on a mashup server and Web services for
integrating data and applications in the life sciences.
ICSOFT 2010 - 5th International Conference on Software and Data Technologies