{"id":407535,"date":"2026-01-14T00:48:07","date_gmt":"2026-01-14T00:48:07","guid":{"rendered":"https:\/\/www.newsbeep.com\/ca\/407535\/"},"modified":"2026-01-14T00:48:07","modified_gmt":"2026-01-14T00:48:07","slug":"next-generation-medical-image-interpretation-with-medgemma-1-5-and-medical-speech-to-text-with-medasr","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/ca\/407535\/","title":{"rendered":"Next generation medical image interpretation with MedGemma 1.5 and medical speech to text with MedASR"},"content":{"rendered":"<p>                Improved performance for medical imaging use cases<\/p>\n<p data-block-key=\"2ajpj\">MedGemma was designed from the ground up as a multimodal model, reflecting the multimodal nature of medicine. MedGemma 1 included support for interpreting two-dimensional medical images, including chest X-rays, dermatology images, fundus images and histopathology patches.<\/p>\n<p data-block-key=\"7rogg\">With MedGemma 1.5, we are expanding support for high-dimensional medical imaging, starting with three-dimensional volume representations of <a href=\"https:\/\/en.wikipedia.org\/wiki\/CT_scan\" target=\"_blank\" rel=\"noopener noreferrer nofollow\">CT imaging<\/a> and <a href=\"https:\/\/en.wikipedia.org\/wiki\/Magnetic_resonance_imaging\" target=\"_blank\" rel=\"noopener noreferrer nofollow\">MRI<\/a>, as well as whole-slide <a href=\"https:\/\/en.wikipedia.org\/wiki\/Digital_pathology\" target=\"_blank\" rel=\"noopener noreferrer nofollow\">histopathology imaging<\/a>. Developers can create applications in which multiple slices (for CT or MRI) or multiple patches (for histopathology) are provided as input along with a prompt that describes the task.<\/p>\n<p data-block-key=\"b8dq3\">On internal benchmarks, the baseline absolute accuracy of MedGemma 1.5 improved by 3% over MedGemma 1 (61% vs. 58%) on classification of disease-related CT findings and by 14% (65% vs. 51%) on classification of disease-related MRI findings, averaged over findings. Additionally, on an internal diverse benchmark of histopathology slides and associated findings, the fidelity of MedGemma 1.5\u2019s predictions, based on <a href=\"https:\/\/en.wikipedia.org\/wiki\/ROUGE_(metric)\" target=\"_blank\" rel=\"noopener noreferrer nofollow\">ROUGE-L<\/a> score on cases with exactly one histopathology slide, improved by 0.47 over MedGemma 1 (0.49 vs. 0.02), matching the 0.498 score achieved by the task-specific <a href=\"https:\/\/www.modernpathology.org\/article\/S0893-3952(25)00184-X\/fulltext\" target=\"_blank\" rel=\"noopener noreferrer nofollow\">PolyPath model<\/a>.<\/p>\n<p data-block-key=\"a6o0m\">This new high-dimensional support is the natural evolution of <a href=\"https:\/\/research.google\/blog\/taking-medical-imaging-embeddings-3d\/\" rel=\"nofollow noopener\" target=\"_blank\">CT foundation<\/a>, our previous API-based tool for generation of CT embeddings. To our knowledge, MedGemma 1.5 is the first public release of an open multimodal large language model that can interpret high-dimensional medical data while also retaining the ability to interpret general 2D data and text. Although these capabilities are in their early stages and remain imperfect, developers will achieve improved results by fine-tuning MedGemma models on their own data, and we hope to continually improve MedGemma models over time. We\u2019ve released tutorial notebooks that illustrate how to use this high dimensional image capability for CT (<a href=\"https:\/\/github.com\/Google-Health\/medgemma\/blob\/main\/notebooks\/high_dimensional_ct_hugging_face.ipynb\" target=\"_blank\" rel=\"noopener noreferrer nofollow\">Hugging Face<\/a>, <a href=\"https:\/\/github.com\/Google-Health\/medgemma\/blob\/main\/notebooks\/high_dimensional_ct_model_garden.ipynb\" target=\"_blank\" rel=\"noopener noreferrer nofollow\">Model Garden<\/a>) and histopathology (<a href=\"https:\/\/github.com\/Google-Health\/medgemma\/blob\/main\/notebooks\/high_dimensional_pathology_hugging_face.ipynb\" target=\"_blank\" rel=\"noopener noreferrer nofollow\">Hugging Face<\/a>, <a href=\"https:\/\/github.com\/Google-Health\/medgemma\/blob\/main\/notebooks\/high_dimensional_pathology_model_garden.ipynb\" target=\"_blank\" rel=\"noopener noreferrer nofollow\">Model Garden<\/a>).<\/p>\n","protected":false},"excerpt":{"rendered":"Improved performance for medical imaging use cases MedGemma was designed from the ground up as a multimodal model,&hellip;\n","protected":false},"author":2,"featured_media":261632,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[34],"tags":[49,48,84,392],"class_list":{"0":"post-407535","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-healthcare","8":"tag-ca","9":"tag-canada","10":"tag-health","11":"tag-healthcare"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/posts\/407535","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/comments?post=407535"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/posts\/407535\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/media\/261632"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/media?parent=407535"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/categories?post=407535"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/tags?post=407535"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}