2023
DOI: 10.1001/jamaophthalmol.2023.3119
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation and Comparison of Ophthalmic Scientific Abstracts and References by Current Artificial Intelligence Chatbots

Abstract: ImportanceLanguage-learning model–based artificial intelligence (AI) chatbots are growing in popularity and have significant implications for both patient education and academia. Drawbacks of using AI chatbots in generating scientific abstracts and reference lists, including inaccurate content coming from hallucinations (ie, AI-generated output that deviates from its training data), have not been fully explored.ObjectiveTo evaluate and compare the quality of ophthalmic scientific abstracts and references gener… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
7
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(7 citation statements)
references
References 16 publications
0
7
0
Order By: Relevance
“…Chatbots can generate average quality scientific abstracts (41.7% correct) but remain plagued by fake data and references, when not provided with a data set. 85 GPT-4 scores slightly better than GPT-3.5 with lower fake score and hallucination rates (Table 5). Chatbots can assist people with relatively weak writing or language skills to prepare written assignments both faster and of higher quality.…”
Section: Performance In Other Potential Applicationsmentioning
confidence: 93%
See 1 more Smart Citation
“…Chatbots can generate average quality scientific abstracts (41.7% correct) but remain plagued by fake data and references, when not provided with a data set. 85 GPT-4 scores slightly better than GPT-3.5 with lower fake score and hallucination rates (Table 5). Chatbots can assist people with relatively weak writing or language skills to prepare written assignments both faster and of higher quality.…”
Section: Performance In Other Potential Applicationsmentioning
confidence: 93%
“…But there is a growing concern that AI chatbots are being abused in writing essays, scientific abstracts and even manuscripts. 85 With the number of factual errors these chatbots generate and their apparently comprehensive response, it is important for authors to know their limitations and pitfalls and for publishers/editors to identify AI-generated text in manuscripts. 86 GPT-4 can categorise refractive surgery candidates to their ideal procedures (68%-88% correct) with low to moderate agreement (0.399-0.610) with clinicians.…”
Section: Performance In Other Potential Applicationsmentioning
confidence: 99%
“…Studies have shown that abstracts automatically generated by ChatGPT contain natural language and content that causes scientists to perceive them as having been written by humans rather than artificial intelligence [35 && ]. However, limitations of ChatGPT in generating biomedical abstracts of presenting 'hallucinated' or inaccurately cited information [36]. This suggests that LLMs could support researchers by generating initial outlines or introductory drafts.…”
Section: Scientific Writingmentioning
confidence: 99%
“…The influence of ChatGPT is attributed to its conversational prowess and its performance, which approaches or matches human-level competence in cognitive tasks, spanning various domains including medicine. 16 ChatGPT has achieved commendable results in the United States Medical Licensing Examinations, leading to discussions about the readiness of LLM applications for integration into clinical [17][18][19] , educational [20][21][22] , and research 23 environments.…”
Section: Introductionmentioning
confidence: 99%