Google Says Its Med-Gemini AI Healthcare Models Beat GPT-4

https://www.forbes.com/sites/talpatalon/2024/05/01/med-geminis-lions-roar/

Untitled #196, 2018, Digital C-Print by Simen Johan // Med-Gemini’s Lion’s Roar. Each of my stories ... [+] includes an original artwork. Clicking on them will take you to their websites. I am grateful to the artists for sharing their work.

© Simen Johan, Courtesy Yossi Milo, New York

The race for tailored medical AI models is heating up. Google and DeepMind released a paper on Monday describing Med-Gemini, a group of advanced AI models targeting healthcare applications. The models are still in the research phase, but the authors claim that Med-Gemini is outperforming competing models such as OpenAI’s GPT-4. However, the latter is not lagging behind in the medical arena, recently expanding its collaboration with Moderna, a large pharmaceutical company.

Med-Gemini’s striking leap forward, if validated in real-world settings, is its ability to capture context and temporality, such as potentially to understands the background and setting of symptoms as well as the timing and sequence of their onset. This is a known pitfall in existing health-related AI models. It’s true that we physicians are notorious for our abbreviations and lack of uniformity in documentation. Nonetheless, the true challenge in training medical algorithms is not the textual complexity—but rather the contextual one.

A simple example is one any parent of a toddler knows well: having to visit a pediatrician for your youngster’s fever and rash. The doctor will always ask, which came first, the fever or the rash? Did it spread from the head down or the legs up? These simple characteristics can differentiate a mild and self-limiting disease, like Roseola, from a potentially life threatening one, such as meningococcal meningitis.

These seemingly straightforward questions, with their multidimensionality and time-series characteristics, can throw an AI model completely off with the slightest inaccuracy.

This exact contextuality seems to have been tackled by Med-Gemini through breaking away from the massive undertaking of building an all-encompassing general medical model. Instead, Google’s developers have adopted a vertical-by-vertical approach of related models, referred to as a “family” of models, each optimizing a specific medical domain or scenario, such as image analysis in the fields of radiology and pathology, signal interpretation such as deciphering electrocardiogram exams or long-context understanding such as reading lengthy medical records. This, according to researchers, has resulted in improved and nuanced accuracy, and a more transparent reasoning, providing some interpretable feedback, such as why a suggested diagnosis is the most likely one.

As doctors are expected to keep abreast of recent research, Google seems to hold Med-Gemini to the same standard. The new model also incorporates a significant additional layer—a web-based search of up-to-date information, allowing augmentation of data with external knowledge, integrating online results into the model.

Though Med-Gemini has leveraged diverse data sources, such as excerpts from health records, X-rays, photos of skin lesions, medical exam prep question and others, it is still important to remember what has yet to happen: a real-world validation on actual production-level data in an everyday clinic setting, or at least a prospective double blind randomized clinical trial.

Multimodal models have provided AI powered health advancements. Yet, the burden of proof is still to be demonstrated in real-life clinical settings.

{
"by": "biscuit1v9",
"descendants": 84,
"id": 40244681,
"kids": [
40245373,
40245236,
40244971,
40246135,
40245400,
40245258
],
"score": 30,
"time": 1714717076,
"title": "Google Says Its Med-Gemini AI Healthcare Models Beat GPT-4",
"type": "story",
"url": "https://www.forbes.com/sites/talpatalon/2024/05/01/med-geminis-lions-roar/"
}
{
"author": "Dr. Tal Patalon, MD, LLB, MBA",
"date": "2024-05-02T16:14:37.000Z",
"description": "The race for tailored medical AI models is heating up. Google just released a new paper describing Med-Gemini, a family of advanced AI models targeting healthcare.",
"image": "https://imageio.forbes.com/specials-images/imageserve/6632748cb4e7c2b9f9d4ba8c/0x0.jpg?format=jpg&height=900&width=1600&fit=bounds",
"logo": "https://logo.clearbit.com/forbes.com",
"publisher": "Forbes",
"title": "Google Says Its Med-Gemini AI Healthcare Models Beat GPT-4",
"url": "https://www.forbes.com/sites/talpatalon/2024/05/01/med-geminis-lions-roar/"
}
{
"url": "https://www.forbes.com/sites/talpatalon/2024/05/01/med-geminis-lions-roar/",
"title": "Google Says Its Med-Gemini AI Healthcare Models Beat GPT-4",
"description": "Untitled #196, 2018, Digital C-Print by Simen Johan // Med-Gemini’s Lion’s Roar. Each of my stories ... [+] includes an original artwork. Clicking on them will take you to their websites. I am grateful to...",
"links": [
"https://www.forbes.com/sites/talpatalon/2024/05/01/med-geminis-lions-roar/"
],
"image": "https://imageio.forbes.com/specials-images/imageserve/6632748cb4e7c2b9f9d4ba8c/0x0.jpg?format=jpg&height=600&width=1200&fit=bounds",
"content": "<div><figure><figcaption><p>Untitled #196, 2018, Digital C-Print by Simen Johan // Med-Gemini’s Lion’s Roar. Each of my stories <span>... [+]</span><span> includes an original artwork. Clicking on them will take you to their websites. I am grateful to the artists for sharing their work.</span></p><small>© Simen Johan, Courtesy Yossi Milo, New York</small></figcaption></figure>\n<p>The race for tailored medical AI models is heating up. Google and DeepMind released <a href=\"https://arxiv.org/abs/2404.18416\" target=\"_blank\" title=\"https://arxiv.org/abs/2404.18416\">a paper</a> on Monday describing Med-Gemini, a group of advanced AI models targeting healthcare applications. The models are still in the research phase, but the authors claim that Med-Gemini is outperforming competing models such as OpenAI’s GPT-4. However, the latter is not lagging behind in the medical arena, recently expanding its collaboration with <a href=\"https://www.mobihealthnews.com/news/openai-expands-partnership-moderna-customizable-gpts?mkt_tok=NDIwLVlOQS0yOTIAAAGSzhSEc9nXvWh5ayX6ZgeDHlYoRKSHuIYSmGzrID21ZMYI9vKErusWIVy7gSExPuWiiarMcbqTmN7jyGb1VNBpgylu6dbyZGkJ4-NN4WEYo68\\\" target=\"_blank\" title=\"https://www.mobihealthnews.com/news/openai-expands-partnership-moderna-customizable-gpts?mkt_tok=NDIwLVlOQS0yOTIAAAGSzhSEc9nXvWh5ayX6ZgeDHlYoRKSHuIYSmGzrID21ZMYI9vKErusWIVy7gSExPuWiiarMcbqTmN7jyGb1VNBpgylu6dbyZGkJ4-NN4WEYo68\\\">Moderna</a>, a large pharmaceutical company.</p>\n<p>Med-Gemini’s striking leap forward, if validated in real-world settings, is its ability to capture context and temporality, such as potentially to understands the background and setting of symptoms as well as the timing and sequence of their onset. This is a known pitfall in existing health-related AI models. It’s true that we physicians are notorious for our abbreviations and lack of uniformity in documentation. Nonetheless, the true challenge in training medical algorithms is not the textual complexity—but rather the <em>con</em>textual one.</p>\n<p>A simple example is one any parent of a toddler knows well: having to visit a pediatrician for your youngster’s fever and rash. The doctor will always ask, which came first, the fever or the rash? Did it spread from the head down or the legs up? These simple characteristics can differentiate a mild and self-limiting disease, like Roseola, from a potentially life threatening one, such as meningococcal meningitis.</p>\n<p>These seemingly straightforward questions, with their multidimensionality and time-series characteristics, can throw an AI model completely off with the slightest inaccuracy.</p>\n<p>This exact contextuality seems to have been tackled by Med-Gemini through breaking away from the massive undertaking of building an all-encompassing general medical model. Instead, Google’s developers have adopted a vertical-by-vertical approach of related models, referred to as a “family” of models, each optimizing a specific medical domain or scenario, such as image analysis in the fields of radiology and pathology, signal interpretation such as deciphering electrocardiogram exams or long-context understanding such as reading lengthy medical records. This, according to researchers, has resulted in improved and nuanced accuracy, and a more transparent reasoning, providing some interpretable feedback, such as why a suggested diagnosis is the most likely one.</p>\n<p>As doctors are expected to keep abreast of recent research, Google seems to hold Med-Gemini to the same standard. The new model also incorporates a significant additional layer—a web-based search of up-to-date information, allowing augmentation of data with external knowledge, integrating online results into the model.</p>\n<p>Though Med-Gemini has leveraged diverse data sources, such as excerpts from health records, X-rays, photos of skin lesions, medical exam prep question and others, it is still important to remember what has yet to happen: a real-world validation on actual production-level data in an everyday clinic setting, or at least a prospective double blind randomized clinical trial.</p>\n<p><a href=\"https://arxiv.org/abs/2404.18416\" target=\"_blank\" title=\"https://arxiv.org/abs/2404.18416\">Multimodal models </a>have provided AI powered health advancements. Yet, the burden of proof is still to be demonstrated in real-life clinical settings.</p>\n</div>",
"author": "@TalPatalon",
"favicon": "https://i.forbesimg.com/48X48-F.png",
"source": "forbes.com",
"published": "2024-05-09T12:24:33.919-04:00",
"ttr": 105,
"type": "article"
}