Abstract
Over the past decades, variable selection for high-dimensional data has drawn increasing attention. With a large number of predictors, there rises a big challenge for model fitting and prediction. In this paper, we develop a new Bayesian method of best subset selection using a hybrid search algorithm that combines a deterministic local search and a stochastic global search. To reduce the computational cost of evaluating multiple candidate subsets for each update, we propose a novel strategy that enables us to calculate exact marginal likelihoods of all neighbor models simultaneously in a single computation. In addition, we establish model selection consistency for the proposed method in the high-dimensional setting in which the number of possible predictors can increase faster than the sample size. Simulation study and real data analysis are conducted to investigate the performance of the proposed method.
| Original language | English |
|---|---|
| Pages (from-to) | 1991-2007 |
| Number of pages | 17 |
| Journal | Computational Statistics |
| Volume | 36 |
| Issue number | 3 |
| DOIs | |
| State | Published - Sep 2021 |
Keywords
- Bayesian variable selection
- Best subset selection
- High-dimensional regression analysis
- Hybrid search algorithm
Fingerprint
Dive into the research topics of 'Bayesian selection of best subsets via hybrid search'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver