Leveraging large language models for automated performance appraisals: Opportunities and challenges

Kuchibhotla, Sri (2025) Leveraging large language models for automated performance appraisals: Opportunities and challenges. International Journal of Science and Research Archive, 14 (3). pp. 1268-1273. ISSN 2582-8185

[thumbnail of IJSRA-2025-0803.pdf] Article PDF
IJSRA-2025-0803.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download ( 500kB)

Abstract

One major issue with traditional performance appraisals is inefficiency, bias and subjectivity. Oftentimes large language models (LLMs) like GPT-4 offer a promising approach to standardize performance evaluations which leverage structured and unstructured feedback for data-driven assessments. In this study, a data set with structured and unstructured data is taken and fed into GPT-4 to analyze self-evaluations and mid-year performance reviews to automate the appraisal process and compare it to human evaluations. Although GPT-4 is generally accurate and is similar to human assessment, the main challenge lies in the non-quantifiable factors such as workplace dynamics and lack of emotional intelligence. Although AI models have a much more accurate prediction rate than manual performance appraisals, there is always a need for a human-in-the-loop (HITL) approach to help AI perform better. This study focuses on how human-in-the-loop (HITL) can help AI-based performance appraisals by bringing in non-quantifiable factors such as workplace dynamics and conflict resolution within the employee data.

Item Type: Article
Official URL: https://doi.org/10.30574/ijsra.2025.14.3.0803
Uncontrolled Keywords: Artificial Intelligence; Large Language Models in HR; Performance Appraisals; Human Resources; AI Bias; Human-in-the-Loop
Depositing User: Editor IJSRA
Date Deposited: 17 Jul 2025 17:02
Related URLs:
URI: https://eprint.scholarsrepository.com/id/eprint/1213