This section collects references to the most significant technological developments in the use of AI in medicine and healthcare.
This section collects references to the most significant technological developments in the use of AI in medicine and healthcare.
Benchmarking study evaluating how susceptible LLMs are to medical misinformation across clinical-note and social-media contexts.
Study of gender bias in automatically generated French clinical cases, showing over-generation of male cases and mismatch with real disorder prevalence.
This paper analyses data poisoning vulnerabilities in healthcare AI systems, showing that a small number of malicious samples can compromise models across architectures, highlighting risks for safety, robustness, and clinical reliability.
This paper explores how artificial intelligence systems designed for medical imaging are capable of indirectly learning demographic information about patients (gender, age, race) and using it as “shortcuts” in diagnoses. This finding compromises the quality of diagnoses for certain groups and indicates unpredictable behavior in fairness metrics when these models are used in hospitals with a patient population different from the one in which the model was trained. Although methods for correcting these biases by removing these variables have been proposed, there is no definitive solution to this problem, as this data remains essential in many clinical contexts and requires a detailed assessment by the medical team based on the specific case.
Dataset derived from ToMi for evaluating whether LLMs can use Theory-of-Mind inferences to choose appropriate actions.
This paper examines how the performance of AI-based medical decision tools is measured, showing that common metrics can be misleading and that clinicians need multiple complementary measures to properly evaluate these systems.
This paper explains the potential biases that may arise throughout the lifecycle of artificial intelligence systems, taking an iterative approach to the processes of data generation, model development, and deployment.
We provide a definition of the typical issues associated with each stage and a specific mitigation strategy.