skip to main content
10.1145/1273496.1273506acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlConference Proceedingsconference-collections
Article

Structural alignment based kernels for protein structure classification

Published:20 June 2007Publication History

ABSTRACT

Structural alignments are the most widely used tools for comparing proteins with low sequence similarity. The main contribution of this paper is to derive various kernels on proteins from structural alignments, which do not use sequence information. Central to the kernels is a novel alignment algorithm which matches substructures of fixed size using spectral graph matching techniques. We derive positive semi-definite kernels which capture the notion of similarity between substructures. Using these as base more sophisticated kernels on protein structures are proposed. To empirically evaluate the kernels we used a 40% sequence non-redundant structures from 15 different SCOP superfamilies. The kernels when used with SVMs show competitive performance with CE, a state of the art structure comparison program.

References

  1. Berman, H. M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T. N., Weissig, H., Shindyalov, I. N., & Bourne, P. E. (2000). The protein data bank. Nucleic Acids Research, 28, 235--242.Google ScholarGoogle ScholarCross RefCross Ref
  2. Bertsimas, D., & Tsitsiklis, J. (1997). Introduction to linear optimization. Athena Scientific. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Bhattacharya, S., Bhattacharyya, C., & Chandra, N. (2006). Projections for fast protein structure retrieval. BMC Bioinformatics, 7 suppl., 5:S5.Google ScholarGoogle Scholar
  4. Bourne, P. E., & Shindyalov, I. N. (1998). Protein structure alignment by incremental combinatorial extension of optimal path. Protein Engineering, 11, 739--747.Google ScholarGoogle ScholarCross RefCross Ref
  5. Bourne, P. E., & Shindyalov, I. N. (2003). Protein structure comparison and alignment. In P. E. Bourne and H. Weissig (Eds.), Structural bioinformatics, 321--337. Wiley-Liss.Google ScholarGoogle Scholar
  6. Eidhammer, I., Jonassen, I., & Taylor, W. R. (2000). Structure comparison and structure patterns. Journal of Computational Biology, 7, 685--716.Google ScholarGoogle ScholarCross RefCross Ref
  7. Haussler, D. (1999). Convolution kernels on discrete structures (Technical Report). University of California, Santa Cruz.Google ScholarGoogle Scholar
  8. Holm, L., & Sander, C. (1993). Protein structure comparison by alignment of distance matrices. Journal of Molecular Biology, 233, 123--138.Google ScholarGoogle ScholarCross RefCross Ref
  9. Horn, B. K. P. (1987). Closed form solution of absolute orientation using unit quaternions. Journal of the Optical Society of America, 4, 629--642.Google ScholarGoogle ScholarCross RefCross Ref
  10. J.-P. Vert, H. Saigo, T. A. (2004). Kernel methods in computational biology, chapter Local alignment kernels for biological sequences, 131--154. MIT Press.Google ScholarGoogle Scholar
  11. Jaakkola, T., Diekhaus, M., & Haussler, D. (1999). Using the fisher kernel method to detect remote protein homologies. 7th Intell. Sys. Mol. Biol., 149--158. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Leslie, C., & Kwang, R. (2004). Fast string kernels using inexact matching for protein sequences. Journal of Machine Learning Research, 5, 1435 -- 1455. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Murzin, A. G., Brenner, S. E., Hubbard, T., & Chothia, C. (1995). Scop: a structural classification of proteins database for the investigation of sequences and structures. Journal of Molecular Biology, 247, 536--540.Google ScholarGoogle ScholarCross RefCross Ref
  14. Pardalos, P., Rendl, F., & Wolkowicz, H. (1994). The quadratic assignment problem: a survey and recent developments. In P. Pardalos and H. Wolkowicz (Eds.), Quadratic assignment and related problems (new brunswick, NJ, 1993), 1--42. Providence, RI: Amer. Math. Soc.Google ScholarGoogle Scholar
  15. Schölkopf, B., Weston, J., Eskin, E., Leslie, C. S., & Noble, W. S. (2002). A kernel approach for learning from almost orthogonal patterns. ECML (pp. 511--528). Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Umeyama, S. (1988). An eigendecomposition approach to weighted graph matching problems. IEEE transactions on pattern analysis and machine intelligence, 10, 695--703. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Wang, C., & Scott, S. D. (2005). New kernels for protein structural notif discovery and function classification. International Conference on Machine Learning. Google ScholarGoogle ScholarDigital LibraryDigital Library
  1. Structural alignment based kernels for protein structure classification

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        ICML '07: Proceedings of the 24th international conference on Machine learning
        June 2007
        1233 pages
        ISBN:9781595937933
        DOI:10.1145/1273496

        Copyright © 2007 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 20 June 2007

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate140of548submissions,26%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader