Abstract
Scene recognition remains a notable challenge in image processing. Unlike object recognition, where targets typically have relatively consistent forms, scene images exhibit far greater variability. This research introduces an approach that combines image and text data to improve scene recognition performance. An image tagging model extracts textual descriptions of the objects in a scene image, indicating which components are present. A pre-trained encoder then converts this text into features that complement the visual features derived from the scene image. Together, these features give a comprehensive description of the scene’s content, and a dynamic integration network is designed to manage and prioritize information from the text and image data. The proposed framework identifies crucial elements by adjusting its focus on text or image features depending on the scene’s context. Consequently, the framework enhances scene recognition accuracy and provides a more holistic understanding of scene composition. Image tagging improves the image model’s ability to analyze and interpret intricate scene elements, and dynamic integration further increases the accuracy and functionality of the scene recognition system.
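The abstract does not specify how the dynamic integration network weighs text against image features, so the following is only a minimal sketch of one common pattern, a per-dimension sigmoid gate, and should not be read as the authors' design. The function name `gated_fusion`, the gate parameters `gate_weights` and `gate_bias`, and the assumption that both feature vectors share one dimension are all hypothetical choices made for illustration.

```python
import math
import random


def sigmoid(x: float) -> float:
    """Logistic function mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def gated_fusion(image_feat, text_feat, gate_weights, gate_bias):
    """Blend two equal-length feature vectors with a per-dimension gate.

    Assumed form (not taken from the paper):
        gate  = sigmoid(W @ [image; text] + b)
        fused = gate * image + (1 - gate) * text
    A gate near 1 favours the image feature, a gate near 0 the text feature.
    """
    if len(image_feat) != len(text_feat):
        raise ValueError("image and text features must share a dimension")
    # The gate sees both modalities, so its value depends on the scene content.
    joint = list(image_feat) + list(text_feat)
    fused, gates = [], []
    for row, bias, img, txt in zip(gate_weights, gate_bias, image_feat, text_feat):
        g = sigmoid(sum(w * x for w, x in zip(row, joint)) + bias)
        gates.append(g)
        fused.append(g * img + (1.0 - g) * txt)
    return fused, gates


if __name__ == "__main__":
    rng = random.Random(0)
    dim = 4
    # Stand-ins for encoder outputs; real features would come from trained models.
    image_feat = [rng.uniform(-1, 1) for _ in range(dim)]
    text_feat = [rng.uniform(-1, 1) for _ in range(dim)]
    gate_weights = [[rng.uniform(-1, 1) for _ in range(2 * dim)] for _ in range(dim)]
    gate_bias = [0.0] * dim
    fused, gates = gated_fusion(image_feat, text_feat, gate_weights, gate_bias)
    print("gates:", [round(g, 3) for g in gates])
    print("fused:", [round(f, 3) for f in fused])
```

In a trained system the gate parameters would be learned jointly with the classifier, so the balance between modalities would shift from scene to scene. Here they are random placeholders.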
| Original language | English |
|---|---|
| Article number | 3102 |
| Journal | Mathematics |
| Volume | 13 |
| Issue number | 19 |
| State | Published - Oct 2025 |
Keywords
- context-aware
- dynamic integration
- scene recognition
Article title: Context-Aware Dynamic Integration for Scene Recognition