Knowledge Discovery in Biological Datasets Using a Hybrid Bayes Classifier/Evolutionary Algorithm

Michael L. Raymer, Leslie A. Kuhn, William F. Punch

Research output: Contribution to conferencePoster

Abstract

A key element of bioinformatics research is the extraction of meaningful information from large experimental data sets. Various approaches, including statistical and graph theoretical methods, data mining, and computational pattern recognition, have been applied to this task with varying degrees of success. We have previously shown that a genetic algorithm coupled with a k-nearest-neighbors classifier performs well in extracting information about protein-water binding from X-ray crystallographic protein structure data. Using a novel classifier based on the Bayes discriminant function, we present a hybrid algorithm that employs feature selection and extraction to isolate salient features from large biological data sets. The effectiveness of this algorithm is demonstrated on various biological and medical data sets.

Original languageAmerican English
DOIs
StatePublished - Nov 1 2001
EventProceedings of the 2nd Annual IEEE International Symposium on Bioinformatics Bioengineering -
Duration: Nov 1 2001 → …

Conference

ConferenceProceedings of the 2nd Annual IEEE International Symposium on Bioinformatics Bioengineering
Period11/1/01 → …

Keywords

  • Feature Extraction
  • Feature Selection
  • Genetic Algorithms
  • Pattern Classification
  • Bayes Classifier
  • Curse of Dimensionality
  • Protein Solvation

Disciplines

  • Bioinformatics
  • Communication
  • Communication Technology and New Media
  • Computer Sciences
  • Databases and Information Systems
  • Life Sciences
  • OS and Networks
  • Physical Sciences and Mathematics
  • Science and Technology Studies
  • Social and Behavioral Sciences

Cite this