05-03-2025 | Artificial Intelligence | Original Article

A cross-sectional study to evaluate responses generated by two AI software programs for common patient queries about laparoscopic repair of inguinal hernia

Authors: Meeran Banday, Kirat Kaur

Published in: Updates in Surgery

Abstract

This study aimed to evaluate the quality and accuracy of responses provided by two user-interactive AI chatbots, ChatGPT and ChatSonic, to patient queries regarding laparoscopic repair of inguinal hernias, and to determine the suitability of these chatbots for addressing such queries. Ten questions regarding laparoscopic repair of inguinal hernias were developed and presented to ChatGPT 4.0 and ChatSonic. Responses were evaluated by two experienced surgeons blinded to the source, using the Global Quality Score (GQS) and the modified DISCERN score to gauge response quality and reliability. ChatGPT gave high-quality responses (GQS of 4 or 5) for all ten questions according to one evaluator, and for seven of ten questions according to the other. Similarly, ChatGPT showed high reliability (modified DISCERN score of 4 or 5) for nine responses according to one evaluator, and for three responses according to the other, with only slight agreement between evaluators for both GQS (kappa = 0.20) and modified DISCERN scores (kappa = 0.08). ChatSonic also provided high-quality and reliable responses for a majority of questions, albeit to a lesser extent than ChatGPT, and both demonstrated limited concordance in responses (p > 0.05). Overall, both ChatGPT and ChatSonic demonstrated potential utility in providing responses to patient queries about hernia surgery. However, due to inconsistencies in reliability and quality, ongoing refinement and validation of AI-generated medical information remain necessary before widespread clinical adoption.
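The kappa values reported above measure how far two evaluators agree beyond what chance alone would produce. The abstract does not state which kappa variant was used or how the scores were grouped, so the following is only a minimal sketch, assuming unweighted Cohen's kappa computed on raw 1 to 5 scores. The function name `cohen_kappa` and both rating lists are hypothetical illustrations, not the study's method or data.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two raters scoring the same items."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must score the same non-empty item set")

    n = len(rater_a)

    # Observed agreement: fraction of items given identical scores.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # Chance agreement: for each category, the product of the two raters'
    # marginal proportions, summed over all categories either rater used.
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    categories = counts_a.keys() | counts_b.keys()
    expected = sum(counts_a[c] * counts_b[c] for c in categories) / (n * n)

    if expected == 1.0:
        # Both raters used one identical category for every item.
        return 1.0

    return (observed - expected) / (1.0 - expected)


# Hypothetical scores for ten responses (NOT the study's ratings).
evaluator_1 = [5, 5, 4, 4, 5, 4, 5, 5, 4, 4]
evaluator_2 = [5, 4, 4, 3, 5, 3, 4, 5, 3, 4]

kappa = cohen_kappa(evaluator_1, evaluator_2)
print(f"kappa = {kappa:.2f}")
```

In this made-up example the evaluators give identical scores on half of the items, yet kappa stays low because much of that agreement is expected by chance when both raters use only a few categories. This is one way a chatbot can be rated highly by each evaluator while agreement between them remains slight.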
Metadata
Publisher
Springer International Publishing
Published in
Updates in Surgery
Print ISSN: 2038-131X
Electronic ISSN: 2038-3312
DOI
https://doi.org/10.1007/s13304-025-02158-5