Arch Craniofac Surg Search

CLOSE


Arch Craniofac Surg > Volume 25(4); 2024 > Article
Zhang, Dimock, Gupta, and Chen: Reply: Comment on “The new frontier: utilizing ChatGPT to expand craniofacial research”
Reply:
We would like to respond to the comment by Daungsupawong and Wiwanitkit on our published article “The new frontier: utilizing ChatGPT to expand craniofacial research” [1]. Our study demonstrated ChatGPT’s ability to generate novel systematic review ideas within the field of craniofacial surgery. Our results showed an average 57.5% total accuracy across both general and specific topics, 39% accuracy for general topics, and 76% for specific topics.
The authors in the reply letter discussed two main points: (1) the need for further investigation of ChatGPT’s ability to generate specialized research ideas and (2) the usage of only four well-known medical resources for the cross-referencing and assessment of ChatGPT’s accuracy. We will address each point individually below: we agree with the commenters that ChatGPT’s accuracy and dependability in producing research ideas were called into question by this study. The inability of ChatGPT to produce novel general topics highlights the difficulty of acquiring a specialized knowledge base, such as in craniofacial surgery, and the need to examine whether later iterations of ChatGPT with larger and more up-to-date training databases will perform better.
Regarding the decision to utilize four major medical literature databases to cross-reference ChatGPT’s accuracy for generating novel research ideas; we believe that the current blend of broad and specialized medical literature databases is optimal and provides an accurate reflection of the state of literature currently available. In any type of literature search or review, a balance needs to be achieved between minimizing the manual search burden for the investigators and not missing relevant references, thereby reducing the validity of the research [2]. However, the authors agree that increasing the number of databases searched would contribute to a more comprehensive search. It is possible that by increasing the number of databases searched, ChatGPT’s accuracy might decrease slightly when previously considered novel ideas are found in the newly added medical literature databases.
Overall, we agree with the commentors’ observation that artificial intelligence (AI) in craniofacial research needs to be studied further. With the release of more advanced large language mode in recent months, their abilities to generate general and specialized topics need to be examined.

Notes

Conflict of interest

No potential conflict of interest relevant to this article was reported.

Funding

None.

ACKNOWLEDGEMENTS

AI declaration: the author used a language editing computational tool in preparation of the article.

REFERENCES

1. Zhang A, Dimock E, Gupta R, Chen K. The new frontier: utilizing ChatGPT to expand craniofacial research. Arch Craniofac Surg 2024;25:116-22.
crossref pmid pmc pdf
2. Bramer WM, Rethlefsen ML, Kleijnen J, Franco OH. Optimal database combinations for literature searches in systematic reviews: a prospective exploratory study. Syst Rev 2017;6:245.
crossref pmid pmc pdf


ABOUT
ARTICLE CATEGORY

Browse all articles >

BROWSE ARTICLES
AUTHOR INFORMATION
Editorial Office
Dept. of Plastic and Reconstructive Surgery Chonnam National University Medical School, 42 Jebong-ro, Dong-gu, Gwangju 61469, Korea
Tel: +82-62-220-6354    Fax: +82-62-220-6357    E-mail: office_acfs@kcpca.or.kr                

Copyright © 2024 by Korean Cleft Palate-Craniofacial Association.

Developed in M2PI

Close layer
prev next