For each population ratio column the maximum
value, the mean and the standard deviation were
extracted and are presented in Table 7 and Table 8.
The prediction performance of the Bayesian
inference shows a slightly better performance when
we analyze the linear tendency lines of the means of
ratios of each population (Figure 2), since it is lower
in the graph and nearer from 0, although, comparing
Table 7 and Table 8, the Standard Deviation in the
ANN approach is lower. Overall, performance of
both methods is similar.
6 CONCLUSIONS
In this work we provided a complete statistical
analysis method of a complex biological database,
including a method to mixture qualitative and
quantitative data, which can be used in several
inference models. Also, the regression model
associated to the compositional data analysis is a
powerful statistical tool to understand several
biological population data.
The Bayesian method may be improved and
other prior distributions for the parameters and/or
other error distributions in (2) can be used for a
better prediction performance. The ability to
evaluate the significance of each variable is an
important tool to maximize experiments resources
and understand biological processes. It is expected
with the improvement of the Bayesian inference
method, that less data could be necessary to train the
algorithm to acquire good regression parameters. It
is important to notice that previously knowledge of
the problem can be very useful to model the problem
and determine the best distributions for the
problems.
REFERENCES
Aitchison, J., 1982. The statistical analysis of
compositional data. Journal of the Royal Statistical
Society, pp. p. 139-177.
Aitchison, J., 1986. The statistical analysis of
compositional data. Chapman and Hall.
Aitchison, J. & Shen, S. M., 1980. Logistic-normal
distributions: Some properties and uses. Biometrika,
Issue 67, pp. 261-272.
da Silva, I. N., Spatti, D. H. & Flauzino, R. A., 2010.
Redes Neurais Artificiais para engenharia e ciências
aplicadas. I ed. São Paulo: ArtLiber.
Gelfand, A. E., Carlin, J. B., Stern, H. S. & Rubin, D. B.,
1995. Bayesian Data Analysis. Issue 85, pp. 398-409.
Gelfand, A. E. & Smith, A. F. M., 1990. Sampling based
approaches to calculating marginal densities. Journal
of the American Statistical Association, Issue 85, pp.
398-409.
Iyengar, M. & Dey, D. K., 1996. Bayesian Analysis of
Compositional Data. Department of Statistics,
University of Connecticut, Storrs, CT.
Iyengar, M. & Dey, D. K., 1998. Box-Cox transformations
in Bayesian analysis of compositional data.
Environmetrics, Issue 9, pp. 657-671.
López-Flores, R., Romaní, A. M. & Quintana, X. D.,
2011. Phytoplankton composition in shallow water
ecosystems: influence of environmental gradients and
nutrient availability, In 4
th
international Workshop on
Compositional Data Analysis.
Lynch, S. M., 2007. Introduction to Applied Bayesian
Statistics and Estimation for Social Scientists.
s.l.:Springer .
OpenBUGS, 2009. OpenBUGS, accessed 10 Feb 2013,
<http://www.openbugs.info/w.cgi/FrontPage>.
Rayens, W. S. & Srinivasan, C., 1991a. Box-Cox
transformations in the analysis of compositional data.
Journal of Chemometrics, Issue 5, pp. 227-239.
Rayens, W. S. & Srinivasan, C., 1991b. Estimation in
compositional data. Journal of Chemometrics, Issue 5,
pp. 361-374.
Roberts, G. O. & Smith, A. F. M., 1993. Bayesian
methods via the Gibbs sampler and related Markov
Chain Monte Carlo methods. Journal of the Royal
Statistical Society, 55(1), pp. 3-23.
The MathWorks, Inc., 1994-2013. Neural Network
Toolbox, accessed 20 Jan 2013, <http://
www.mathworks.com/products/neural-network/descri
ption2.html>.
Tjelmeland, H. & Lund, K. V., 2003. Bayesian modelling
of spatial compositional data, preprint n.1. Journal of
Applied Statistics, Issue 30, pp. 87-100.
Univesity of California - Irvine, 1999. COIL 1999
Competition Data, accessed 10 Jul 2013,
<http://kdd.ics.uci.edu/databases/coil/coil.data.html>.
IJCCI2013-InternationalJointConferenceonComputationalIntelligence
488