10.1101/2025.07.12.664517
Evaluating large language models in biomedical data science challenges through a classroom experiment
2025-07-17