CyanoFactory Knowledge Base & Synthetic Biology - A Plea for Human Curated Bio-databases

Gabriel Kind, Eric Zuchantke, Röbbe Wünschiers

Abstract

Life science research today is shaped by two conditions: interdisciplinarity and high-throughput experimentation. The former yields datasets of highly diverse data types, while the latter yields massive amounts of data. Both aspects are reflected in the byte-growth of public bio-databases and in the sheer number of specialised databases and databases of databases (i.e. data warehouses). We provide insight into the development of a biodata knowledge base (dubbed CyanoFactory KB) targeted at bio-engineers in the field of synthetic biology and exemplify the need for data-type-specific data curation and cross-linking. CyanoFactory KB is unique in incorporating experimental data from a broad range of scientific methods, all based on one strain of Synechocystis sp. PCC 6803. The knowledge base can be accessed upon request via cyanofactory.hs-mittweida.de.


Paper Citation


in Harvard Style

Kind G., Zuchantke E. and Wünschiers R. (2015). CyanoFactory Knowledge Base & Synthetic Biology - A Plea for Human Curated Bio-databases. In Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2015) ISBN 978-989-758-070-3, pages 237-242. DOI: 10.5220/0005285802370242


in Bibtex Style

@conference{bioinformatics15,
author={Gabriel Kind and Eric Zuchantke and Röbbe Wünschiers},
title={CyanoFactory Knowledge Base \& Synthetic Biology - A Plea for Human Curated Bio-databases},
booktitle={Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2015)},
year={2015},
pages={237--242},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005285802370242},
isbn={978-989-758-070-3},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2015)
TI - CyanoFactory Knowledge Base & Synthetic Biology - A Plea for Human Curated Bio-databases
SN - 978-989-758-070-3
AU - Kind G.
AU - Zuchantke E.
AU - Wünschiers R.
PY - 2015
SP - 237
EP - 242
DO - 10.5220/0005285802370242