Fostering Multidisciplinary Collaboration in Artificial Intelligence and Machine Learning Education: Tutorial Based on the AI-READI Bootcamp
Background: The integration of artificial intelligence (AI) and machine learning (ML) into biomedical research requires a workforce fluent in both computational methods and clinical applications. Structured, interdisciplinary training opportunities remain limited, creating a gap between data scientists and clinicians. The National Institutes of Health’s Bridge2AI initiative launched the Artificial Intelligence–Ready and Exploratory Atlas for Diabetes Insights (AI-READI) Data Generation Project to address this gap. AI-READI is creating a multimodal, FAIR (Findable, Accessible, Interoperable, and Reusable) dataset—including ophthalmic imaging, physiologic measurements, wearable sensor data, and survey responses—from approximately 4,000 participants with or at risk for type 2 diabetes. In parallel, AI-READI established a yearlong mentored research program that begins with a two-week immersive summer bootcamp to provide foundational AI/ML skills grounded in domain-relevant biomedical data. Objective: To describe the design, iterative refinement, and outcomes of the AI-READI Bootcamp, and to share lessons for creating future multidisciplinary AI/ML training programs in biomedical research. Methods: Held annually at UC San Diego, the bootcamp combines 80 hours of lectures, coding sessions, and small-group mentorship. Year 1 introduced Python programming, classical ML techniques (e.g., logistic regression, convolutional neural networks), and data science methods such as principal component analysis and clustering, using public datasets. In Year 2, the curriculum was refined based on structured participant feedback—reducing cohort size to increase individualized mentorship, integrating the AI-READI dataset (including retinal images and structured clinical variables), and adding modules on large language models and FAIR data principles. Participant characteristics and satisfaction were assessed through standardized pre- and post-bootcamp surveys, and qualitative feedback was analyzed thematically by independent coders. Results: Seventeen participants attended Year 1 and seven attended Year 2, with an instructor-to-student ratio of approximately 1:2 in the latter. Across both years, post-bootcamp evaluations indicated high satisfaction, with Year 2 participants reporting improved experiences due to smaller cohorts, earlier integration of the AI-READI dataset, and greater emphasis on applied learning. In Year 2, mean scores for instructor effectiveness, staff support, and overall enjoyment were perfect (5.00/5.00). Qualitative feedback emphasized the value of working with domain-relevant, multimodal datasets; the benefits of peer collaboration; and the applicability of skills to structured research projects during the subsequent internship year. Conclusions: The AI-READI Bootcamp illustrates how feedback-driven, multidisciplinary training embedded within a longitudinal mentored research program can bridge technical and clinical expertise in biomedical AI. Core elements—diverse trainee cohorts, applied learning with biomedical datasets, and sustained mentorship—offer a replicable model for preparing health professionals for the evolving AI/ML landscape. Future iterations will incorporate additional pre-bootcamp onboarding modules, objective skill assessments, and long-term tracking of research engagement and productivity.