Niyonkuru, Venant and Sekou, Sylla and Sinzinkayo, Jimmy Jackson (2025) Features extraction for image identification using computer vision. World Journal of Advanced Research and Reviews, 27 (1). pp. 1341-1351. ISSN 2581-9615
Abstract
This study examines various feature extraction techniques in computer vision, the primary focus of which is on Vision Transformers (ViTs) and other approaches such as Generative Adversarial Networks (GANs), deep feature models, traditional approaches (SIFT, SURF, ORB), and non-contrastive and contrastive feature models. Emphasizing ViTs, the report summarizes their architecture, including patch embedding, positional encoding, and multi-head self-attention mechanisms with which they overperform conventional convolutional neural networks (CNNs). Experimental results determine the merits and limitations of both methods and their utilitarian applications in advancing computer vision.
Item Type: | Article |
---|---|
Official URL: | https://doi.org/10.30574/wjarr.2025.27.1.2647 |
Uncontrolled Keywords: | Feature Extraction; Positional Embeddings; Self-Attention; Vision Transformers (ViTs) |
Date Deposited: | 01 Sep 2025 13:45 |
Related URLs: | |
URI: | https://eprint.scholarsrepository.com/id/eprint/5057 |