TY - JOUR
T1 - Evaluation of the clinical data dictionary (CiDD)
AU - Lee, Myung Kyung
AU - Park, Hyeoun Ae
AU - Min, Yul Ha
AU - Kim, Younglan
AU - Min, Hyo Ki
AU - Ham, Sung Woo
PY - 2010
Y1 - 2010
N2 - Objectives: The purpose of the study was to evaluate content coverage and data quality of the Clinical Data Dictionary (CiDD) developed by the Center for Interoperable EHR (CiEHR). Methods: A total of 12, 994 terms were collected from 98 clinical forms of a tertiary cancer center hospital with 500 beds. After data cleaning, 9, 418 terms were mapped with the data items of the CiDD by the research team, and validated by 30 doctors and nurses at the research hospital. Results: Mapping results were classified into five categories: lexically mapped; semantically mapped; mapped to either a broader term or a nar-rower term; mapped to more than one term and not mapped. In terms of coverage, out of 9, 418 terms, 6, 750 (71.7%) terms were mapped; 4, 319 (45.9%) terms were lexically mapped; 2, 431 (25.8%) were semantically mapped; 281 (3.0%) terms were mapped to a broader term; 43 (0.5%) were mapped to a narrower term; and 550 (5.8%) were mapped to more than one term. In terms of data quality, the CiDD has problems such as errors in concept namingand representation, redundancy in syn-onyms, inadequate synonyms, and ambiguity in meaning. Conclusions: Although the CiDD has terms covering 72% of local clinical terms, the CiDD can be improved by cleaning up errors and redundancies, adding textual definitions or use cases of the concept, and arranging the concepts in a hierarchy.
AB - Objectives: The purpose of the study was to evaluate content coverage and data quality of the Clinical Data Dictionary (CiDD) developed by the Center for Interoperable EHR (CiEHR). Methods: A total of 12, 994 terms were collected from 98 clinical forms of a tertiary cancer center hospital with 500 beds. After data cleaning, 9, 418 terms were mapped with the data items of the CiDD by the research team, and validated by 30 doctors and nurses at the research hospital. Results: Mapping results were classified into five categories: lexically mapped; semantically mapped; mapped to either a broader term or a nar-rower term; mapped to more than one term and not mapped. In terms of coverage, out of 9, 418 terms, 6, 750 (71.7%) terms were mapped; 4, 319 (45.9%) terms were lexically mapped; 2, 431 (25.8%) were semantically mapped; 281 (3.0%) terms were mapped to a broader term; 43 (0.5%) were mapped to a narrower term; and 550 (5.8%) were mapped to more than one term. In terms of data quality, the CiDD has problems such as errors in concept namingand representation, redundancy in syn-onyms, inadequate synonyms, and ambiguity in meaning. Conclusions: Although the CiDD has terms covering 72% of local clinical terms, the CiDD can be improved by cleaning up errors and redundancies, adding textual definitions or use cases of the concept, and arranging the concepts in a hierarchy.
KW - Clinical terminology
KW - Data dictionary
KW - Semantic interoperability
KW - Standard terms
UR - http://www.scopus.com/inward/record.url?scp=84878687410&partnerID=8YFLogxK
U2 - 10.4258/hir.2010.16.2.82
DO - 10.4258/hir.2010.16.2.82
M3 - Article
AN - SCOPUS:84878687410
SN - 2093-3681
VL - 16
SP - 82
EP - 88
JO - Healthcare Informatics Research
JF - Healthcare Informatics Research
IS - 2
ER -