Lovedeep Gondara

Lovedeep Gondara

Head of AI R&D | Vanguard

Adjunct Professor | University of British Columbia

About

I am a machine learning researcher, currently serving as Head of AI R&D at Vanguard. I also hold an Adjunct Professor position at University of British Columbia. My current work focuses on language models (large and small) and their applications in various domains. My broader research interests include general machine learning, differential privacy, and multimodal machine learning.

Research Interests

Language Models

Large and small language models with applications across various domains, particularly in resource-constrained environments. Agentic methods and model grounding.

Privacy-Preserving ML

Differential privacy and privacy-preserving machine learning techniques for sensitive data applications.

Applied Machine Learning

Applying machine learning to solve complex real-world problems using approaches for text, vision, tabular, and multimodal data. Focus on translating research into practical solutions.

Contact Information

Vanguard
Enterprise AI and Research (EAiR)
Toronto, ON, Canada

University of British Columbia
School of Population and Public Health
E-mail: lovedeep.gondara@ubc.ca

Research Areas

Machine Learning, Large Language Models, Differential Privacy, Deep Learning, Decentralized Learning, Statistics, Application of ML and LLMs to various domains.

Education

Ph.D., Computer Science
Simon Fraser University, Burnaby, BC, Canada
2022
M.S., Computer Science
University of Illinois, Springfield, Illinois, USA
2015
Post-Baccalaureate Certificate (Applied Statistics/Data Analytics)
University of the Fraser Valley, Abbotsford, BC, Canada
2013
B.Tech, Computer Science
Punjab Technical University, Punjab, India
2011

Work Experience

Head of AI R&D
Vanguard, Enterprise AI and Research (EAiR), Toronto, ON, Canada
August, 2025 - Present
Research Scientist — Team Lead
Provincial Health Services Authority, Vancouver, BC Canada
June, 2023 - August, 2025
• Led the ML/AI endeavors at the BC Cancer Registry
• Designed and developed NLP models for data abstraction, classification, and summarization
• Led and guided the complete lifecycle of LLM development, including data collection and curation, pretraining, supervised finetuning, and alignment
• Developed and deployed RAG based and agentic methods to streamline operations
• Considered ethical implications in large language models, such as bias mitigation, fairness, privacy, and security concerns
• Collaborated with internal and external entities for research and to integrate NLP technologies into real-world applications and solutions
• Disseminated findings via research papers, technical reports, and documentation at conferences, workshops, and meetings
• Supervised visiting researchers and students
Adjunct Professor
University of British Columbia, Vancouver, BC Canada
January, 2024 - Present
• Adjunct professor at the School of Population and Public Health
• Guest lectures in areas of expertise
• Chair MSc and PhD dissertation exams as needed
• Supervise graduate students
(Bio)Statistician/Data Scientist
British Columbia Cancer Agency, Vancouver, BC Canada
June, 2013 - June, 2023
• Design and analysis of clinical trials and retrospective studies using various models from statistics and machine learning
• Design and implementation of interactive dashboards to serve models for visualizing trends, performance monitoring, and predictions in real time
• Develop and implement state-of-the-art machine learning and statistical models for end-to-end operational and research pipelines
• Provide technical expertise regarding data architecture and data standards for the development and maintenance of databases to facilitate reporting and research
• Coordinate internal and external data requests for planning and research
• Participate in recruitment, selection, and training of junior data scientists, interns, and summer students
Team Lead, Statistics
British Columbia Cancer Agency, Vancouver, BC Canada
Dec, 2018 - Mar, 2021
• Lead a team of statisticians and data scientists within Cancer Surveillance and Outcomes, Population Oncology
• Provide leadership to team members via technical expertise and the facilitation and demonstration of the principles of team work and collaboration
• Manage all incoming projects and ensure deliverables by assigning the projects to and coordinating with different team members

Recent Selected Publications

2025:
1. Gondara, L., Arbour, G., Ng, R., Simkin, J., & Devji, S. (2025). Bridging AI Innovation and Healthcare Needs: Lessons Learned from Incorporating Modern NLP at The BC Cancer Registry. arXiv preprint arXiv:2508.09991.

2. Gondara, L., Simkin, J., Sayle, G., Devji, S., Arbour, G., & Ng, R. Small or Large? Zero-Shot or Finetuned? Guiding Language Model Choice for Specialized Applications in Healthcare. arXiv:2504.21191.

3. Beheshti, M., Gondara, L., & Zachary, I. Leveraging Language Models for Automated Patient Record Linkage. arXiv:2504.15261.

4. Gondara, L., Simkin, J., Devji, S.Arbour, G., & Ng, R. ELM: Ensemble of Language Models for Predicting Tumor Group from Pathology Reports. arXiv:2503.21800.

2024:
5. Gondara, L., Simkin, J., & Devji, S. Clinical Trial Design Approach to Auditing Language Models in Health Care Setting. JCO Clinical Cancer Informatics, 9, e2400331.

6. Gondara, L., Simkin, J., Arbour, G., Devji, S., & Ng, R. Classifying Tumor Reportability Status From Unstructured Electronic Pathology Reports Using Language Models in a Population-Based Cancer Registry Setting. JCO Clinical Cancer Informatics, 8, e2400110.

2023:
7. Gondara, L., & Wang, K. PubSub-ML: A Model Streaming Alternative to Federated Learning. Proceedings on Privacy Enhancing Technologies, 2, 464–479.

Click here for complete list