AIF-GEN Project| 2024 – Present Lead Supervisor
Supervised a student team in the creation of AIF-GEN, a novel platform for generating synthetic preference data to study the continual alignment of LLMs via reinforcement learning. This work culminated in a publication at the ICML 2025 CodeML workshop (with my mentee as first author and myself as senior author), and a full journal article is in preparation for JMLR.
MSc Thesis| 2024 – Present Primary supervisor (Co. with Prof. Doina Precup)
Co-supervisor of an MSc student’s thesis investigating architectural choices (e.g., Mixture of Experts, Attention) and neural plasticity mechanisms for my continual RL framework.
Neuro-Inspired RL Projects| 2024 – Present Research Mentor
Providing mentorship and research direction to a junior PhD students on projects exploring brain-inspired RL. This collaboration resulted in two co-authored manuscripts on modeling the striatum’s functions and learning variable timescales for adaptation.
Inter-Lab Collaboration| 2025 – Present Reinforcement Learning Advisor
Serving as the primary RL advisor for a PhD student in another lab on a project to discover novel failure modes in Vision-Language Models (VLMs) through adversarial question generation.