TY - JOUR
T1 - A guideline for the statistical analysis of compositional data in immunology
AU - Yoo, Jinkyung
AU - Sun, Zequn
AU - Greenacre, Michael
AU - Ma, Qin
AU - Chung, Dongjun
AU - Kim, Young Min
N1 - Publisher Copyright:
© 2022. The Korean Statistical Society, and Korean International Statistical Society. All rights reserved.
PY - 2022
Y1 - 2022
N2 - The study of immune cellular composition has been of great scientific interest in immunology because of the generation of multiple large-scale data. From the statistical point of view, such immune cellular data should be treated as compositional. In compositional data, each element is positive, and all the elements sum to a constant, which can be set to one in general. Standard statistical methods are not directly applicable for the analysis of compositional data because they do not appropriately handle correlations between the compositional elements. In this paper, we review statistical methods for compositional data analysis and illustrate them in the context of immunology. Specifically, we focus on regression analyses using log-ratio transformations and the alternative approach using Dirichlet regression analysis, discuss their theoretical foundations, and illustrate their applications with immune cellular fraction data generated from colorectal cancer patients.
AB - The study of immune cellular composition has been of great scientific interest in immunology because of the generation of multiple large-scale data. From the statistical point of view, such immune cellular data should be treated as compositional. In compositional data, each element is positive, and all the elements sum to a constant, which can be set to one in general. Standard statistical methods are not directly applicable for the analysis of compositional data because they do not appropriately handle correlations between the compositional elements. In this paper, we review statistical methods for compositional data analysis and illustrate them in the context of immunology. Specifically, we focus on regression analyses using log-ratio transformations and the alternative approach using Dirichlet regression analysis, discuss their theoretical foundations, and illustrate their applications with immune cellular fraction data generated from colorectal cancer patients.
KW - Compositional data
KW - Compositional regression
KW - Dirichlet regression
KW - Immuno-oncology
KW - Immunology
KW - Log-ratio transformation
UR - http://www.scopus.com/inward/record.url?scp=85135863907&partnerID=8YFLogxK
U2 - 10.29220/CSAM.2022.29.4.453
DO - 10.29220/CSAM.2022.29.4.453
M3 - Article
AN - SCOPUS:85135863907
SN - 2287-7843
VL - 29
SP - 453
EP - 469
JO - Communications for Statistical Applications and Methods
JF - Communications for Statistical Applications and Methods
IS - 4
ER -