Educational content on VJHemOnc is intended for healthcare professionals only. By visiting this website and accessing this information you confirm that you are a healthcare professional.

The Multiple Myeloma Channel is supported with funding from Sanofi (Gold) and Legend Biotech (Bronze).

VJHemOnc is an independent medical education platform. Supporters, including channel supporters, have no influence over the production of content. The levels of sponsorship listed are reflective of the amount of funding given to support the channel.

Share this video  

ASH 2025 | Unstructured serum electrophoresis data limits real-world AI models in multiple myeloma

Ehsan Malek, MD, Roswell Park Comprehensive Cancer Center, Buffalo, NY, discusses an evaluation of 384 institutions across the United States that found that unstructured serum electrophoresis data limits the development of real-world artificial intelligence (AI) models for multiple myeloma (MM). Of the 1,725 patients included in the analysis, 950 (55.1%) had at least one coded M-protein value recorded, and only 360 (20.9%) had a coded value recorded within a 3-month window near their myeloma diagnosis. Dr Malek notes that this issue can be addressed by pathology departments converting text-embedded values to discrete lab values to provide data compatible with AI model training. This interview took place at the 67th ASH Annual Meeting and Exposition, held in Orlando, FL.

These works are owned by Magdalen Medical Publishing (MMP) and are protected by copyright laws and treaties around the world. All rights are reserved.

Transcript

In this project, we try to answer a very practical question that is how much our data infrastructure is ready for AI models and machine learning models in the future. As you know, AI models and machine learning are changing the landscape of cancer screening, risk stratification, and therapy decision-making. In myeloma, M-spike or monoclonal protein is the main lab variable that measures disease volume and we assess treatment response based on that...

In this project, we try to answer a very practical question that is how much our data infrastructure is ready for AI models and machine learning models in the future. As you know, AI models and machine learning are changing the landscape of cancer screening, risk stratification, and therapy decision-making. In myeloma, M-spike or monoclonal protein is the main lab variable that measures disease volume and we assess treatment response based on that. Therefore, assessing monoclonal protein or M-spike across the board in all electronic medical records is very important to train very robust models. 

We asked this question that how ready are we to use this M-spike across the U.S. Institutes. We looked at 384 institutes across the U.S. with more than 1,700 patient charts reviewed through the Health Tree Foundation. And we found out that only half of the patients really have capturable M-spikes in all electronic medical records. Worse than that, if you look at the time around diagnosis, only one-fifth of patients have discernible lab values that can be captured for training AI models. So this is a challenge. This will be a challenge that limits the AI model’s robustness. Worse than that, we looked at the socioeconomic factors of these patients, and we found out that if you are from a high socioeconomic status, the probability of having discrete lab values that can be used in AI is much higher. It means that if we at the current moment train AI models, we miss the low socioeconomic patients and therefore AI models will be suboptimal for them. So this is the bad news. However, the good news is it’s a very fixable situation. Easily, for a pathology department, they can move the value from text-embedded to a discrete lab value, and that will solve the problem. I think we have to raise awareness, and that’s the purpose of our project, that we raise awareness to make sure the future AI models are trained for all of the patients.

 

This transcript is AI-generated. While we strive for accuracy, please verify this copy with the video.

Read more...