Kuchibhotla, Sri (2025) Leveraging large language models for automated performance appraisals: Opportunities and challenges. International Journal of Science and Research Archive, 14 (3). pp. 1268-1273. ISSN 2582-8185
![IJSRA-2025-0803.pdf [thumbnail of IJSRA-2025-0803.pdf]](https://eprint.scholarsrepository.com/style/images/fileicons/text.png)
IJSRA-2025-0803.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial Share Alike.
Abstract
One major issue with traditional performance appraisals is inefficiency, bias and subjectivity. Oftentimes large language models (LLMs) like GPT-4 offer a promising approach to standardize performance evaluations which leverage structured and unstructured feedback for data-driven assessments. In this study, a data set with structured and unstructured data is taken and fed into GPT-4 to analyze self-evaluations and mid-year performance reviews to automate the appraisal process and compare it to human evaluations. Although GPT-4 is generally accurate and is similar to human assessment, the main challenge lies in the non-quantifiable factors such as workplace dynamics and lack of emotional intelligence. Although AI models have a much more accurate prediction rate than manual performance appraisals, there is always a need for a human-in-the-loop (HITL) approach to help AI perform better. This study focuses on how human-in-the-loop (HITL) can help AI-based performance appraisals by bringing in non-quantifiable factors such as workplace dynamics and conflict resolution within the employee data.
Item Type: | Article |
---|---|
Official URL: | https://doi.org/10.30574/ijsra.2025.14.3.0803 |
Uncontrolled Keywords: | Artificial Intelligence; Large Language Models in HR; Performance Appraisals; Human Resources; AI Bias; Human-in-the-Loop |
Depositing User: | Editor IJSRA |
Date Deposited: | 17 Jul 2025 17:02 |
Related URLs: | |
URI: | https://eprint.scholarsrepository.com/id/eprint/1213 |