Building an LLM Agent for Life Sciences Literature QA and Summarization

PAULRAJ, NISHANTH JOSEPH (2025) Building an LLM Agent for Life Sciences Literature QA and Summarization. World Journal of Advanced Research and Reviews, 26 (2). pp. 657-668. ISSN 2581-9615

[thumbnail of WJARR-2025-1665.pdf] Article PDF
WJARR-2025-1665.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download ( 521kB)

Abstract

This article explores the development of a specialized artificial intelligence agent that combines Large Language Models (LLMs) with Retrieval-Augmented Generation (RAG) techniques to address the challenges of biomedical literature search and synthesis. The unprecedented growth of published research in life sciences has created an information crisis that traditional search methods cannot effectively manage. Researchers face significant challenges including overwhelming volume, domain-specific terminology barriers, difficulty in making cross-study connections, and severe time constraints. The proposed LLM+RAG architecture offers a comprehensive solution featuring specialized document processing for scientific papers, biomedical-specific vector embeddings, advanced retrieval strategies, and sophisticated reasoning capabilities. The system integrates with PubMed and other biomedical databases while providing natural language interfaces that significantly reduce the cognitive burden for researchers. Domain-specific optimizations such as biomedical entity recognition, relationship extraction, and specialized embeddings further enhance performance across diverse research scenarios. Evaluation through benchmark testing, expert validation, and citation accuracy assessment demonstrates the system's ability to provide comprehensive, accurate information while substantially reducing literature review time. This article represents a transformative tool for biomedical researchers, potentially revolutionizing how scientific discovery progresses in the life sciences.

Item Type: Article
Official URL: https://doi.org/10.30574/wjarr.2025.26.2.1665
Uncontrolled Keywords: Biomedical Literature Search; Large Language Models; Retrieval-Augmented Generation; Knowledge Graphs; Scientific Information Extraction
Depositing User: Editor WJARR
Date Deposited: 27 Jul 2025 16:08
Related URLs:
URI: https://eprint.scholarsrepository.com/id/eprint/2613