measured in the SSN, which are PPI networks built
from the expressed genes without any other biological
information. The results seem to indicate that there
are signatures embedded in the topology dynamics,
modelled through the SSN, which can distinguish
cancer from non-cancer cells for each type of cancer.
This new methodology of creating SSN allows the
capture of the topology dynamics of the system
through the set of samples and allows data to be
reduced and be computationally manageable, keeping
the more informative data, which is supported by the
good results obtained. We consider that this novel
approach is worth and gives different contributions
compared to previous works, namely: the number of
considered topological properties is much higher; the
exclusive use of topological properties (global and
local) with good binary classification results
obtained; the topological dynamics of the system
captured through each sample, different from other
works that use time or states for example, which can
contribute to the capture of different signatures.
The results obtained show that classification
models should be different according to the cancer
disease type considered. More, the knowledge of
which features are more informative can be used, in
the future, to look for signatures based in these
features that could help in the identification of certain
cancer types. Two of the most discriminative features
obtained were the size of the largest clique and motifs
of size 4 and 5. Cliques being fully connected
subnetworks where genes are functionally related and
highly expressed were considered by some
researchers as gene signatures (Pradhan et al., 2012).
The relative frequency and z-score of some motifs as
local topological properties measures, showed to be
discriminatory features, indicating that there are clues
that some small subnetworks could help to distinguish
cancer samples. Adding more biological information
to the more discriminative features found in the
classification, may reveal important signatures like
subgraphs markers of cancer diseases. This approach
also seems worth to be further explored.
Finally the proposed methodology for creating
SSN is a novel contribution that can be extended to
other types of networks, besides PPIs, adding
information that can differentiate samples and capture
their topological dynamics helping to uncover new
signatures that can be biologically relevant for the
identification of diseases.
ACKNOWLEDGEMENTS
This work has received support from the RD-
CONNECT European project (EC contract number
305444).
REFERENCES
Amberger, J. S., Bocchini, C. A., Schiettecatte, F., Scott, A.
F. & Hamosh, A., 2015. Omim. Org: Online Mendelian
Inheritance In Man (Omim®), An Online Catalog Of
Human Genes And Genetic Disorders. Nucleic Acids
Research, 43, D789-D798.
Arrais, J. P. & Oliveira, J. L., 2011. Using Biomedical
Networks To Prioritize Gene-Disease Associations.
Open Access Bioinformatics, 1, 123-130.
Ay, A., Gong, D. & Kahveci, T., 2014. Network-Based
Prediction Of Cancer Under Genetic Storm. Cancer
Informatics, 13, 15.
Barabási, A.-L., Gulbahce, N. & Loscalzo, J., 2011.
Network Medicine: A Network-Based Approach To
Human Disease. Nature Reviews. Genetics, 12, 56-68.
Barabasi, A.-L. & Oltvai, Z. N., 2004. Network Biology:
Understanding The Cell's Functional Organization.
Nature Reviews Genetics, 5, 101-113.
Barter, R., Schramm, S.-J., Mann, G. & Yang, Y. H., 2014.
Network-Based Biomarkers Enhance Classical
Approaches To Prognostic Gene Expression
Signatures. Bmc Systems Biology, 8, S5.
Chen, D. & Yang, H., 2014. Comparison Of Gene
Regulatory Networks Of Benign And Malignant Breast
Cancer Samples With Normal Samples. Genetics And
Molecular Research: Gmr, 13, 9453.
Chuang, H. Y., Lee, E., Liu, Y. T., Lee, D. & Ideker, T.,
2007. NetworkBased Classification Of Breast Cancer
Metastasis. Molecular Systems Biology, 3, 140.
Consortium, T. U., 2014. Activities At The Universal
Protein Resource (Uniprot). Nucleic Acids Research,
42, D191-D198.
Demsar, J., Zupan, B. & Leban, G., 2007. Orange: From
Experimental Machine Learning To Interactive Data
Mining. White Paper, Faculty Of Computer And
Information Science, University Of Ljubljana (2004).
Dennis, G., Jr., Sherman, B. T., Hosack, D. A., Yang, J.,
Gao, W., Lane, H. C. & Lempicki, R. A., 2003. David:
Database For Annotation, Visualization, And
Integrated Discovery. Genome Biol, 4, P3.
Dezfuly, M. & Sajedi, H,. 2015. Predict Survival Of
Patients With Lung Cancer Using An Ensemble Feature
Selection Algotithm And Classification Methods In
Data Mining. Journal Of Information, 1, 1-11.
Dominietto, M., Tsinoremas, N. & Capobianco, E., 2015.
Integrative Analysis Of Cancer Imaging Readouts By
Networks. Molecular Oncology, 9, 1-16.
Farkas, I. J., Korcsmáros, T., Kovács, I. A., Mihalik, Á.,
Palotai, R., Simkó, G. I., Szalay, K. Z., Szalay-Beko,
M., Vellai, T. & Wang, S., 2011. Network-Based Tools
For The Identification Of Novel Drug Targets. Sci
Signal, 4, Pt3.
Furey, T. S., Cristianini, N., Duffy, N., Bednarski, D. W.,
Schummer, M. & Haussler, D. 2000. Support Vector