Classification of bio-data with small data set using additive factor model and SVM

Hyeyoung Park, Minkook Cho

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Bio-data, which are obtained from human individuals, have been one of main applications of pattern classification these days. A critical property of bio-data classification is the small number of data in each class due to high cost of obtaining data from each individuals. Since most classification methods are based on the distribution of data in each class, the lack of data can be a main cause of low classification performance of conventional classifiers. To solve this problem, we propose a modified additive factor model for bio-data which has two factors; the individual factor and the environment factor. Under the proposed model, we estimate the distribution of environment factor which gives robust information even in case of small data set. We then define new similarity measures using the information. The similarity measure is applied to nearest neighbor method for classification. We also use the support vector machines (SVM) to find a sophisticated similarity measure. Through computational experiments, we confirm that the proposed model and similarity measure is appropriate enough to show better classification performance compared to conventional similarity measure as well as conventional SVM classifier.

Original languageEnglish
Title of host publicationAI 2006
Subtitle of host publicationAdvances in Artificial Intelligence - 19th Australian Joint Conference on Artificial Intelligence, Proceedings
PublisherSpringer Verlag
Pages770-779
Number of pages10
ISBN (Print)9783540497875
DOIs
StatePublished - 2006
Event19th Australian Joint Conference onArtificial Intelligence, AI 2006 - Hobart, TAS, Australia
Duration: 4 Dec 20068 Dec 2006

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4304 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference19th Australian Joint Conference onArtificial Intelligence, AI 2006
Country/TerritoryAustralia
CityHobart, TAS
Period4/12/068/12/06

Keywords

  • Additive factor model
  • Bio-data classification
  • Class factor
  • Data generation model
  • Environment factor
  • Similarity function

Fingerprint

Dive into the research topics of 'Classification of bio-data with small data set using additive factor model and SVM'. Together they form a unique fingerprint.

Cite this