Short Courses

Short Course 1. Information Extraction and Document Preparation in Clinical Trial Development Using RAG-based LLMs

March 23, 2026 (Full day, 9AM-4:30PM)

Instructors: Zhaohua Lu, Daiichi Sankyo; Arlina Shen, Biomedical Data Science Masters Student at Stanford University School of Medicine

Abstract:

Target Audience

Prerequisite Knowledge

Short Bios:

Zhaohua Lu Zhaohua Lu is an Associate Director of Biostatistics at Daiichi Sankyo and a Ph.D.-trained statistician with over ten years of experience in statistical modeling, data science, and clinical trial design. Before joining industry, he served for five years as a Faculty Member in the Department of Biostatistics at St. Jude Children’s Research Hospital, where he worked extensively with large-scale neuroimaging, genomics, and natural language datasets. He has authored more than 40 peer-reviewed clinical papers and 30 methodological publications, advancing innovative statistical and AI/ML methods and translating them into practical biomedical and clinical applications.

Arlina Shen Arlina Shen is a second-year Master’s student in Biomedical Data Science at Stanford University, where she specializes in statistical methodology for clinical research. Her work centers on designing and analyzing clinical trials with innovative approaches, including adaptive and rank-based methods for composite endpoints in progressive diseases such as ALS. Arlina is particularly interested in improving trial efficiency and fairness by integrating real-world evidence with randomized controlled trials, bridging the gap between traditional study designs and practical clinical applications. Her research aims to advance drug evaluation strategies and contribute to more equitable and effective therapeutic development.


Short Course 2. Trust by Design: Applied AI Governance for Pharma with Hands-On Application Exercises

March 23, 2026 (Morning, 9AM-12PM)

Instructors: Rebecca D. Jones-Taha, WaterworksAI

Abstract:

Short Bios:

Dr. Rebecca Taha Rebecca Taha, PhD, MBA, CEO & Founder of waterworksAI, is a strategic leader in the life sciences industry. With a deep understanding of scientific research and business operations, Dr. Taha guides waterworksAI in delivering innovative, technology-based solutions that address critical challenges in drug development. With over 20 years of experience in the industry, she has a proven track record of delivering impactful solutions.

Dr. Taha and her team develop and evaluate innovative GenAI applications in the pharmaceutical industry, focusing on ensuring that AI-powered tools deliver reliable and safe outcomes while improving the efficiency, effectiveness, and cost of drug development. Prior to founding waterworksAI, Dr. Taha served small, medium, and large pharma and biotech organizations in the strategic development and implementation of their clinical development programs. Dr. Taha received her MS and PhD from the University of Kentucky in Statistics and Gerontology, respectively, and her MBA from the Kelley School of Business.


Short course 3. Personally Identifiable Information Redaction Agent: LLM as a Judge, REGEX

March 23, 2026 (Afternoon, 1:30PM-4:30PM)

Instructors: Andrew Semmes, Moderna

Abstract:

Short Bios:

Andrew Semmes Andrew Semmes is the Associate Director of Pharmacovigilance Artificial Intelligence and PV Transformation at Moderna, where he leads AI adoption and digital transformation initiatives within Clinical Safety & Pharmacovigilance (CSPV). His work focuses on leveraging AI to enhance pharmacovigilance processes, automate workflows, and improve operational efficiency while ensuring GxP compliance. He spearheads enterprise-wide initiatives to enable Moderna’s AI infrastructure to scale effectively in highly regulated environments. Prior to joining Moderna, Andrew was a strategy and analytics consultant at Deloitte, where he helped pharmaceutical and biotech companies integrate AI into pharmacovigilance, automate adverse event triage, and optimize regulatory workflows. He led efforts to develop safety and compliance systems, streamline business processes within digital transformations, and drive data strategy initiatives that generated global cost savings. Andrew holds a Master’s in Information Science with a focus on Data Science and a Bachelor’s in Information Science with a concentration in User Experience, both from Cornell University.


Short course 4. CGM-AI: Causal Generalist Medical AI

March 23, 2026 (Full day, 9AM-4:30PM)

Instructors: Hongtu Zhu, UNC; Qiao Liu, Yale

Abstract:

Short Bios:

Qiao Liu Dr. Qiao Liu is an Assistant Professor in the Department of Biostatistics at Yale University. His is also the core faculty member of Yale Computational Biology and Biomedical Informatics Program. His research lies at the intersection of statistics, artificial intelligence, and computational biology, where he develops practical statistical and AI-driven tools with both theoretical and applied significance. His work leverages advances in generative AI to address high-dimensional statistical challenges, including Bayesian computation and causal inference, with broad applications in single-cell genomics, multi-omics data integration, pharmacogenomics, and genomic large language models. Dr. Liu has authored over 40 publications in leading international journals and conferences. His contributions have been recognized with prestigious honors, including the NIH Pathway to Independence Award.


Short course 5. Python for Clinical Development with AI Applications

March 23, 2026 (Morning, 9AM-12PM)

Instructors: Yilong Zhang, Meta; Pingye (Eric) Zhang, Eikon Therapeutics; Yuting Xu, Merck

Abstract:

Open source programming languages are rapidly transforming drug discovery, research, and development by offering powerful capabilities for study design, data analysis, visualization, and clinical reporting. The emergence of AI tools is also creating new opportunities. This workshop introduces practical strategies for using Python to prepare tables, listings, and figures (TLFs) in clinical study reports (CSRs), with a focus on how AI can accelerate the development lifecycle.

This workshop is designed for clinical programmers, statisticians, and data scientists interested in exploring Python as an alternative approach for clinical trial analysis and reporting. Participants will gain hands-on experience with reproducible workflows and AI in the loop. By the end of the session, attendees will have a clear roadmap for starting a Python project with AI for clinical trial analysis and reporting.

The workshop is based on the open source book Python for Clinical Study Reports and Submission and is organized into three modules:

Short Bios:

Yilong Zhang Yilong Zhang, PhD, is the Manager of the Health Quantitative Science team at Meta. He previously worked as a statistician at Merck & Co., Inc. on late-stage clinical trial development. His interests include developing R and Python packages to improve clinical trial analysis, reporting, and regulatory submission. Yilong has published over 25 peer-reviewed articles and holds a PhD in Biostatistics from New York University.

Eric Zhang Eric Zhang is the Director of Biostatistics at Eikon Therapeutics, where he supports oncology programs spanning early to late-stage clinical development. He earned his Ph.D. in Biostatistics from the University of Southern California and began his career at Merck, where he served as the lead statistician for late-stage oncology trials and played a pivotal role in the regulatory filing and approval of pembrolizumab (KEYTRUDA) for RCC and HN cancer. Following his time at Merck, Eric worked at BeiGene and Gilead Sciences. In addition to his project work, Eric is also an active participant in scientific discussions across industry and regulatory bodies, and has published extensively in both statistical and medical journals.