loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Bráulio Roberto Gonçalves Marinho Couto 1 ; Macelo Matos Santoro 2 and Marcos Augusto dos Santos 2

Affiliations: 1 Centro Universitário de Belo Horizonte / UNI-BH, Brazil ; 2 UFMG, Brazil

Keyword(s): Genomics, Matrix analysis, BLAST, SVD.

Related Ontology Subjects/Areas/Topics: Algorithms and Software Tools ; Bioinformatics ; Biomedical Engineering ; Pattern Recognition, Clustering and Classification ; Sequence Analysis

Abstract: The dominant methods to search for relevant patterns in protein sequences are based on character-by-character matching, performed by software known as BLAST. In this paper, sequences are recoded as p-peptide frequency matrix that is reduced by singular value decomposition (SVD). The objective is to evaluate the association between statistics used by BLAST and similarity metrics used by SVD (Euclidean distance and cosine). We chose BLAST as a standard because this string-matching program is widely used for nucleotide searching and protein databases. Three datasets were used: mitochondrial-gene sequences, non-identical PDB sequences and a Swiss-Prot protein collection. We built scatter graphs and calculated Spearman correlation () with metrics produced by BLAST and SVD. Euclidean distance was negatively correlated with bit score (>-0.6) and positively correlated with E value (>+0.7). Cosine had negative correlation with E value (>-0.7) and positive correlation with bit score (>+0. 8). Besides, we made agreement tests between SVD and BLAST in classifying protein families. For the mitochondrial gene database, we achieved a kappa coefficient of 1.0. For the Swiss-Prot sample there is an agreement higher than 80%. The fact that SVD has a strong correlation to BLAST results may represent a possible core technique within a broader algorithm. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.226.104.30

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Roberto Gonçalves Marinho Couto, B.; Matos Santoro, M. and Augusto dos Santos, M. (2011). SINGULAR VALUE DECOMPOSITION (SVD) AND BLAST - Quite Different Methods Achieving Similar Results. In Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms (BIOSTEC 2011) - BIOINFORMATICS; ISBN 978-989-8425-36-2; ISSN 2184-4305, SciTePress, pages 189-195. DOI: 10.5220/0003162301890195

@conference{bioinformatics11,
author={Bráulio {Roberto Gon\c{C}alves Marinho Couto}. and Macelo {Matos Santoro}. and Marcos {Augusto dos Santos}.},
title={SINGULAR VALUE DECOMPOSITION (SVD) AND BLAST - Quite Different Methods Achieving Similar Results},
booktitle={Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms (BIOSTEC 2011) - BIOINFORMATICS},
year={2011},
pages={189-195},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003162301890195},
isbn={978-989-8425-36-2},
issn={2184-4305},
}

TY - CONF

JO - Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms (BIOSTEC 2011) - BIOINFORMATICS
TI - SINGULAR VALUE DECOMPOSITION (SVD) AND BLAST - Quite Different Methods Achieving Similar Results
SN - 978-989-8425-36-2
IS - 2184-4305
AU - Roberto Gonçalves Marinho Couto, B.
AU - Matos Santoro, M.
AU - Augusto dos Santos, M.
PY - 2011
SP - 189
EP - 195
DO - 10.5220/0003162301890195
PB - SciTePress