Multimodal machine learning for catalogue metadata correction in online retail

Jeganathan, Yaswanth (2025) Multimodal machine learning for catalogue metadata correction in online retail. World Journal of Advanced Engineering Technology and Sciences, 16 (1). pp. 475-483. ISSN 2582-8266

[thumbnail of WJAETS-2025-1241.pdf] Article PDF
WJAETS-2025-1241.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download ( 589kB)

Abstract

The quality of catalogue metadata affects the success of the e-commerce platforms at every level in search accuracy, customer satisfaction, and many other. This review examines the implementation of multimodal machine learning (MML) in correcting catalogue metadata, with the emphasis being put on the combination of the input of textual, visual, and structured data. It describes the theoretical underpinnings, model frameworks, fusion policies, and benchmarking procedures that are being actively used in the research. As empirical evidence, it has proven that MML methods performed more soundly than unimodal baselines at accuracy, F1 scores, and tasks involving metadata imputation. Another essential struggle, namely modality misalignment, interpretability, and domain generalization, is also mentioned in the review. The directions of future work are addressed, which include multilingual support, explainable AI, knowledge graph integration, and active learning. The current paper serves as a multifaceted guide to inform researchers and practitioners who are interested in enhancing the accuracy of metadata in a very large-scale retail setting of a digital nature.

Item Type: Article
Official URL: https://doi.org/10.30574/wjaets.2025.16.1.1241
Uncontrolled Keywords: Multimodal Machine Learning; Metadata Correction; E-Commerce; Product Catalogs; Cross-Modal Fusion
Depositing User: Editor Engineering Section
Date Deposited: 22 Aug 2025 08:55
Related URLs:
URI: https://eprint.scholarsrepository.com/id/eprint/5249