Mentorship

AIF-GEN Project | 2024 – Present
Lead Supervisor
Supervised a student team in the creation of AIF-GEN, a novel platform for generating synthetic preference data to study the continual alignment of LLMs via reinforcement learning. This work culminated in a publication at the ICML 2025 CodeML workshop (with my mentee as first author and myself as senior author), and a full journal article is in preparation for JMLR.
MSc Thesis | 2024 – Present
Primary supervisor (Co. with Prof. Doina Precup)
Co-supervisor of an MSc student’s thesis investigating architectural choices (e.g., Mixture of Experts, Attention) and neural plasticity mechanisms for my continual RL framework.
Neuro-Inspired RL Projects | 2024 – Present
Research Mentor
Providing mentorship and research direction to a junior PhD students on projects exploring brain-inspired RL. This collaboration resulted in two co-authored manuscripts on modeling the striatum’s functions and learning variable timescales for adaptation.
Inter-Lab Collaboration | 2025 – Present
Reinforcement Learning Advisor
Serving as the primary RL advisor for a PhD student in another lab on a project to discover novel failure modes in Vision-Language Models (VLMs) through adversarial question generation.