A novel approach for effective learning of cluster structures with biological data applications

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Recently DNA microarray gene expression studies have been actively performed for mining unknown biological knowledge hidden under a large volume of gene expression data in a systematic way. In particular, the problem of finding groups of co-expressed genes or samples has been largely investigated due to its usefulness in characterizing unknown gene functions or performing more sophisticated tasks, such as modeling biological pathways. Nevertheless, there are still some difficulties in practice to identify good clusters since many clustering methods require user's arbitrary selection of the number of target clusters. In this paper we propose a novel approach to systematically identifying good candidates of cluster numbers so that we can minimize the arbitrariness in cluster generation. Our experimental results on both synthetic dataset and real gene expression dataset show the applicability and usefulness of this approach in microarray data mining.

Original languageEnglish
Title of host publicationData Mining and Bioinformatics - First International Workshop, VDMB 2006, Revised Selected Papers
PublisherSpringer Verlag
Pages2-13
Number of pages12
ISBN (Print)3540689702, 9783540689706
DOIs
StatePublished - 2006
Event1st International Workshop on Data Mining and Bioinformatics, VDMB 2006 - Seoul, Korea, Republic of
Duration: 11 Sep 200611 Sep 2006

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4316 LNBI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference1st International Workshop on Data Mining and Bioinformatics, VDMB 2006
Country/TerritoryKorea, Republic of
CitySeoul
Period11/09/0611/09/06

Fingerprint

Dive into the research topics of 'A novel approach for effective learning of cluster structures with biological data applications'. Together they form a unique fingerprint.

Cite this