TY - JOUR
T1 - Is There a Universal Dimensionality Reduction Technique for Feature Extraction?–A Comparative Analysis
AU - Balasubramaniam, Anandkumar
AU - Balasubramaniam, Thirunavukarasu
AU - Paul, Anand
AU - Han, Dong Seog
AU - Nayak, Richi
N1 - Publisher Copyright:
© 2025 IETE.
PY - 2025
Y1 - 2025
N2 - The demand for high-dimensional data processing in machine learning has led to the increasing use of dimensionality reduction techniques. These techniques aim to extract the most important information from high-dimensional data, reducing it to a lower-dimensional representation that can be easily processed by machine learning algorithms. However, with the availability of a multitude of dimensionality reduction techniques and heterogeneous datasets, it can be challenging for researchers to select the most appropriate one for their specific application. This research conducts a comparative analysis to identify the distinctive behaviors of various dimensionality reduction techniques under different data situations. The state-of-the-art linear and non-linear dimensionality reduction techniques are analyzed. The study also analyses the performance of each technique in terms of its ability to extract meaningful, interpretable, and low-dimensional features from high-dimensional data. The analysis results provide insights into each technique's strengths and weaknesses and highlight the most appropriate technique when dealing with heterogeneous datasets for different machine-learning tasks. We use multiple tabular, text, and image datasets to validate our findings.
AB - The demand for high-dimensional data processing in machine learning has led to the increasing use of dimensionality reduction techniques. These techniques aim to extract the most important information from high-dimensional data, reducing it to a lower-dimensional representation that can be easily processed by machine learning algorithms. However, with the availability of a multitude of dimensionality reduction techniques and heterogeneous datasets, it can be challenging for researchers to select the most appropriate one for their specific application. This research conducts a comparative analysis to identify the distinctive behaviors of various dimensionality reduction techniques under different data situations. The state-of-the-art linear and non-linear dimensionality reduction techniques are analyzed. The study also analyses the performance of each technique in terms of its ability to extract meaningful, interpretable, and low-dimensional features from high-dimensional data. The analysis results provide insights into each technique's strengths and weaknesses and highlight the most appropriate technique when dealing with heterogeneous datasets for different machine-learning tasks. We use multiple tabular, text, and image datasets to validate our findings.
KW - Classification
KW - Dimensionality reduction
KW - Feature extraction
KW - Machine learning
KW - Pattern distinctiveness
KW - Reconstruction
UR - https://www.scopus.com/pages/publications/105019661087
U2 - 10.1080/02564602.2025.2573465
DO - 10.1080/02564602.2025.2573465
M3 - Review article
AN - SCOPUS:105019661087
SN - 0256-4602
JO - IETE Technical Review (Institution of Electronics and Telecommunication Engineers, India)
JF - IETE Technical Review (Institution of Electronics and Telecommunication Engineers, India)
ER -