Current Pathology Foundation Models are unrobust to Medical Center Differences
Abstract
Pathology Foundation Models (FMs) hold great promise for healthcare. Before they can be used in clinical practice, it is essential to ensure they are robust to variations between medical centers. We measure whether pathology FMs focus on biological features like tissue and cancer type, or on the well known confounding medical center signatures introduced by staining procedure and other differences. We introduce the Robustness Index. This novel robustness metric reflects to what degree biological features dominate confounding features. Ten current publicly available pathology FMs are evaluated. We find that all current pathology foundation models evaluated represent the medical center to a strong degree. Significant differences in the robustness index are observed. Only one model so far has a robustness index greater than one, meaning biological features dominate confounding features, but only slightly. A quantitative approach to measure the influence of medical center differences on FM-based prediction performance is described. We analyze the impact of unrobustness on classification performance of downstream models, and find that cancer-type classification errors are not random, but specifically attributable to same-center confounders: images of other classes from the same medical center. We visualize FM embedding spaces, and find these are more strongly organized by medical centers than by biological factors. As a consequence, the medical center of origin is predicted more accurately than the tissue source and cancer type. The robustness index introduced here is provided with the aim of advancing progress towards clinical adoption of robust and reliable pathology FMs.
Community
This paper evaluates the robustness of 10 current pathology foundation models against differences between medical centers (hospitals / clinics), caused by differences in image staining processes and differences between scanners. All 10 models are found to strongly encode the medical center; for 9 out of the 10 models, more strongly than the biological information of tissue type of cancer type. A novel metric, the robustness index, is introduced to measure foundation model robustness, and can advance the development of more robust pathology foundation models.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Atlas: A Novel Pathology Foundation Model by Mayo Clinic, Charit'e, and Aignostics (2025)
- A Survey on Computational Pathology Foundation Models: Datasets, Adaptation Strategies, and Evaluation Tasks (2025)
- A Knowledge-enhanced Pathology Vision-language Foundation Model for Cancer Diagnosis (2024)
- Molecular-driven Foundation Model for Oncologic Pathology (2025)
- Rethinking Foundation Models for Medical Image Classification through a Benchmark Study on MedMNIST (2025)
- Distilling foundation models for robust and efficient models in digital pathology (2025)
- Is Self-Supervision Enough? Benchmarking Foundation Models Against End-to-End Training for Mitotic Figure Classification (2024)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper