TY - JOUR
T1 - Identifying novel transcripts and novel genes in the human genome by using novel SAGE tags
AU - Chen, Jianjun
AU - Sun, Miao
AU - Lee, Sanggyu
AU - Zhou, Guolin
AU - Rowley, Janet D.
AU - Wang, San Ming
PY - 2002/9/17
Y1 - 2002/9/17
N2 - The number of genes in the human genome is still a controversial issue. Whereas most of the genes in the human genome are said to have been physically or computationally identified, many short cDNA sequences identified as tags by use of serial analysis of gene expression (SAGE) do not match these genes. By performing experimental verification of more than 1,000 SAGE tags and analyzing 4,285,923 SAGE tags of human origin in the current SAGE database, we examined the nature of the unmatched SAGE tags. Our study shows that most of the unmatched SAGE tags are truly novel SAGE tags that originated from novel transcripts not yet identified in the human genome, including alternatively spliced transcripts from known genes and potential novel genes. Our study indicates that by using novel SAGE tags as probes, we should be able to identify efficiently many novel transcripts/novel genes in the human genome that are difficult to identify by conventional methods.
AB - The number of genes in the human genome is still a controversial issue. Whereas most of the genes in the human genome are said to have been physically or computationally identified, many short cDNA sequences identified as tags by use of serial analysis of gene expression (SAGE) do not match these genes. By performing experimental verification of more than 1,000 SAGE tags and analyzing 4,285,923 SAGE tags of human origin in the current SAGE database, we examined the nature of the unmatched SAGE tags. Our study shows that most of the unmatched SAGE tags are truly novel SAGE tags that originated from novel transcripts not yet identified in the human genome, including alternatively spliced transcripts from known genes and potential novel genes. Our study indicates that by using novel SAGE tags as probes, we should be able to identify efficiently many novel transcripts/novel genes in the human genome that are difficult to identify by conventional methods.
UR - http://www.scopus.com/inward/record.url?scp=0037126025&partnerID=8YFLogxK
U2 - 10.1073/pnas.192436499
DO - 10.1073/pnas.192436499
M3 - Article
C2 - 12213963
AN - SCOPUS:0037126025
SN - 0027-8424
VL - 99
SP - 12257
EP - 12262
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 19
ER -