TY - JOUR
T1 - Microarray expression analysis using seed-based clustering method
AU - Shin, Miyoung
AU - Park, Seon Hee
PY - 2005
Y1 - 2005
N2 - Clustering methods have been often used to find biologically relevant groups of genes or conditions based on their expression levels. Since many functionally related genes tend to be co-expressed, by identifying groups of genes with similar expression profiles, the functionalities of unknown genes can be inferred from those of known genes in the same group. In this paper we address a novel clustering approach, called seed-based clustering, where seed genes are first systematically chosen by computational analysis of their expression profiles, and then the clusters are generated by using the seed genes as initial values for &-means clustering. The seed-based clustering method has strong mathematical foundations and requires only a few matrix computations for seed extraction. As a result, it provides stability of clustering results by eliminating randomness in the selection of initial values for cluster generation. Our empirical results reported here indicate that the entire clustering process can be systematically pursued using seed-based clustering, and that its performance is favorable compared to current approaches.
AB - Clustering methods have been often used to find biologically relevant groups of genes or conditions based on their expression levels. Since many functionally related genes tend to be co-expressed, by identifying groups of genes with similar expression profiles, the functionalities of unknown genes can be inferred from those of known genes in the same group. In this paper we address a novel clustering approach, called seed-based clustering, where seed genes are first systematically chosen by computational analysis of their expression profiles, and then the clusters are generated by using the seed genes as initial values for &-means clustering. The seed-based clustering method has strong mathematical foundations and requires only a few matrix computations for seed extraction. As a result, it provides stability of clustering results by eliminating randomness in the selection of initial values for cluster generation. Our empirical results reported here indicate that the entire clustering process can be systematically pursued using seed-based clustering, and that its performance is favorable compared to current approaches.
KW - Clustering algorithm
KW - Gene expression data analysis
KW - Microarray data
KW - Seed extraction
UR - http://www.scopus.com/inward/record.url?scp=33646420895&partnerID=8YFLogxK
U2 - 10.4028/0-87849-958-x.343
DO - 10.4028/0-87849-958-x.343
M3 - Article
AN - SCOPUS:33646420895
SN - 1013-9826
VL - 277-279
SP - 343
EP - 348
JO - Key Engineering Materials
JF - Key Engineering Materials
IS - I
ER -