Sunday, 24 November 2024

Hospitals adopt AI transcription tool, but accuracy concerns grow

Hospitals use OpenAI's Whisper for medical transcription, but accuracy concerns rise as AI "hallucinations" emerge, raising patient care risks.

Hospitals nationwide increasingly use an AI transcription tool powered by OpenAI’s Whisper model to record and summarise patient meetings. While the tool shows promise in easing doctors’ documentation burden, researchers have raised concerns about its accuracy. Evidence suggests the tool sometimes “hallucinates” – a term for AI systems producing information that sounds plausible but is incorrect. In these cases, Whisper has been shown to generate completely fabricated phrases, which may be particularly troubling in medical settings.

Widespread use of Whisper in healthcare

According to ABC News, the transcription tool is developed by Nabla, a healthcare tech company that estimates its software has processed approximately 7 million medical conversations for more than 30,000 clinicians and 40 health systems. While many doctors and healthcare providers report that the transcription tool improves efficiency, Nabla acknowledges the model’s potential for inaccuracies and states it is working to address the hallucination issue.

Whisper’s hallucinations range from inserting random, unrelated statements to inventing medical conditions that do not exist. Nabla has confirmed that it is aware of these limitations and has reassured its clients that it is improving the model to ensure greater accuracy in clinical settings.

Study reveals concerning hallucinations in transcriptions

A recent study by researchers from Cornell University, the University of Washington, and other institutions explored Whisper’s performance under various conditions, including moments of silence and speech from people affected by language disorders, such as aphasia. The researchers found that the model occasionally inserted words or sentences that had no corresponding audio input, creating phrases with no basis in the conversation. Examples of these hallucinations include fabricated conditions and irrelevant comments, such as “Thank you for watching!” – a phrase likely drawn from Whisper’s exposure to millions of hours of videos during its training.

The study found that Whisper hallucinated in about 1% of the transcriptions, a seemingly small percentage but one that can have serious implications in healthcare. While the researchers primarily used samples from TalkBank’s AphasiaBank, they argue that the tool’s tendency to generate content during silent pauses could affect various clinical situations, especially those involving communication difficulties.

OpenAI’s response and ongoing research

OpenAI is aware of these issues and has responded to the researchers’ findings, promising ongoing improvements. OpenAI spokesperson Taya Christianson emphasised that the company is actively refining Whisper to reduce hallucinations. OpenAI has also set strict usage guidelines for its API, advising against using Whisper in high-stakes decision-making contexts without additional checks. OpenAI’s model card for Whisper likewise advises developers against applying it in sensitive areas where accuracy is critical.

Despite Whisper’s potential as a transcription tool, its limitations may leave healthcare providers hesitant to rely on it entirely for medical documentation. For now, hospitals and clinicians may need to review transcriptions thoroughly, especially in sensitive situations where accuracy is paramount.
