Researcher - Reinforcement Learning & LLM

TLDR

Design advanced reinforcement learning systems for large-scale optimization and enable LLMs to achieve continual, agentic self-improvement while contributing to scientific research.

Huawei Canada has an immediate 12-month contract opening for a Researcher.

About the team:

Welcome to the Advanced Wireless Technology Wireless Lab, an epitome of innovation located in Ottawa, Canada. Here, amid the dynamic panorama of technological progress, our team consists mainly of seasoned graduate computer engineers and computer scientists. With diverse experiences ranging from fresh perspectives to decades-long industry immersion, we are united by our fervour for pioneering wireless solutions.

About the job: 

  • Design and implement advanced reinforcement learning for large-scale graph-structured optimization and complex scheduling problems

  • Enabling Large Language Models (LLMs) to learn from experience, interaction, and environment feedback, moving beyond static fine-tuning toward continual, agentic self-improvement

  • Collaborate closely with researchers to prototype, implement, and integrate research ideas into production-grade code

  • Stay current with literature and recent advances in the AI domain for high-performance computing

  • Contribute to scientific papers

Requirements

About the ideal candidate:

  • PhD degree in Computer Science or related fields or master's degree with comparable experience

  • Background in machine learning, deep learning and practical or research experience in reinforcement learning, self-supervised learning, or language model fine-tuning

  • Strong theoretical and practical expertise in advanced reinforcement learning (agentic RL, multi-agent RL or Meta-RL)

  • Familiarity with LLMs or generative AI methods and their integrations into structured reasoning or decision systems

  • Experience in application-driven research, demonstrated in projects or publications

  • Excellent communication skills, self-motivated, with creative thinking and attention to details.

Huawei aims to support a French-speaking work environment for its employees in Quebec. We have taken steps to avoid requiring a language other than French for this position. However, proficiency in English is essential for this role for the following reasons:

The person will be required to communicate regularly with colleagues located outside Quebec, where English is the primary language used for communication between offices. In addition, the nature of the tasks related to this position, which falls within a highly specialized field of artificial intelligence, also requires knowledge of English.

Huawei Technologies Canada specializes in developing advanced data analytics platforms and innovative programming technologies. Targeted at enhancing public capacity and driving AI/ML advancements, Huawei Canada focuses on creating next-generation operating systems and optimizing performance across embedded systems.

View all jobs
Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Researcher Q&A's
Report this job
Apply for this job