Let's get started. Thanks to everyone for joining us today on such short notice. This session is part of our special NP interest group. Today, it's our great pleasure to have Dr. Aring from the Chinese Academy of Medical Science and Peking Union Medical College. Dr. Long and I have been colleagues at NIH, collaborating on various AI and genomics projects related to medicine.
Dr. Long completed his PhD at the University of Michigan and his postdoc training at NIH. He is a clinician trained in hematology, and he has won several awards, including the NIH Fellows Award for Research Excellence and the NCI Informatics Tool Challenge Award. Today, Dr. Long will discuss our recent work on outpatient reception and large language model (LLM) collaboration frameworks validated in clinical settings. Please join me in welcoming Dr. Long.
Thank you for the introduction. I'm honored to present our collaborative research here. We have been working on an innovative framework involving AI, particularly large language models (LLMs), to enhance outpatient reception.
The Chinese medical system presents several unique challenges. Most high-quality medical services are centralized in government-based hospitals in big cities like Beijing and Shanghai. These hospitals are often heavily populated, leading to long waiting times and overworked staff, including reception nurses who are typically underpaid, undertrained, and have high turnover rates.
Our research aims to address the inefficiencies in outpatient reception, focusing on using LLMs to manage queries and streamline operations. Our real-world medical dialogue dataset includes nearly 40,000 minutes of conversation data from two medical centers, offering valuable insights to train our models.
We initiated a real-world medical dialogue project, capturing conversations at 10 different reception sites. The project provided valuable statistics, such as the number of cases handled per hour and specific inefficiencies like repeated Q&A and negative emotional exchanges. These statistics highlight the complexities faced by reception nurses.
Our goal is to use LLMs to address redundant and straightforward queries, allowing reception nurses to focus on more complex issues. We developed a framework with context-specific training, showing significant improvements in response accuracy, empathy, and efficiency compared to human nurses.
We evaluated our system using a six-dimensional matrix focusing on factuality, integrity, empathy, readability, and safety. The LLM-based responses showed better or comparable performance across these dimensions.
To ensure objectivity, we conducted a randomized controlled trial (RCT). The trial involved two groups: one managed by LLMs with human oversight for flagged responses, and the other managed entirely by human nurses. The LLM group showed higher patient and nurse satisfaction and reduced repeated Q&A and negative emotional exchanges.
Our research confirms the potential of LLMs to handle outpatient reception more efficiently and empathetically. This collaboration framework not only improves patient experience but also reduces the burden on medical staff.
We are excited about the future applications of this technology, including its deployment in more complex medical scenarios.
This research was supported by numerous collaborators and organizations, including the Chinese Academy of Medical Science and Peking Union Medical College. Special thanks to the dedicated clinical staff and technical advisors who made this project possible.
The inefficiencies and high workload in Chinese outpatient reception prompted this investigation, focusing on using AI to assist reception nurses.
Conversations at 10 different reception sites were recorded, providing nearly 40,000 minutes of dialogue data for training and analysis.
The LLM framework significantly improves accuracy, empathy, and efficiency in handling patient queries compared to human nurses.
The framework was evaluated through a six-dimensional matrix and a randomized controlled trial, showing higher patient and nurse satisfaction.
Yes, an internal alert system flags unsafe or uncertain responses, which are then reviewed by human nurses.
Future applications include more complex medical scenarios and broader deployment across different medical centers for improved efficiency and patient care.
In addition to the incredible tools mentioned above, for those looking to elevate their video creation process even further, Topview.ai stands out as a revolutionary online AI video editor.
TopView.ai provides two powerful tools to help you make ads video in one click.
Materials to Video: you can upload your raw footage or pictures, TopView.ai will edit video based on media you uploaded for you.
Link to Video: you can paste an E-Commerce product link, TopView.ai will generate a video for you.