Huawei Canada has an immediate Co-op opening for an Engineer.
About the team:
The Intelligent Cloud Infrastructure Lab aims to innovate technologies, algorithms, systems, and platforms for next-generation cloud infrastructure. The lab addresses scalability, performance, and resource utilization challenges in existing cloud services while preparing for future challenges with appropriate technologies and architectures. Additionally, the lab aims to understand industry dynamics and technology trends to create a robust ecosystem.
About the job:
Understand AI System and Infrastructure technology landscape, and identify scalability/performance issues or challenges of current LLM/multi-modal LLM systems
Initiate and charter innovation projects to build or re-architect AI infrastructure platform, and plan milestones accordingly
Provide/contribute a scalable and high-performance architecture design or re-design for the infrastructure system that is optimized for AI training and inferencing, which includes but not limited to cluster management and scheduling, LLM model deployment, elastic LLM as well as AI container cold/warm start-up optimization, and so on.
Collaborate with internal and external teams to deliver the project or project features that improve our overall system scalability and performance.
The target annual compensation (based on 2080 hours per year) ranges from $58,000 to $104,000 depending on education, experience and demonstrated expertise.
About the ideal candidate:
Bachelors, Master/PhD degree in Computer Science, Computer Engineering
Experience in building large scale and high-performance distributed system
Experience in Nvidia TensorRT and/or Triton servers. Experience in container virtualization technologies
Knowledge & experience in distributed system design & development, including serverless technologies
Work experience in one or more of the following technologies: vLLM, Ray, SGLang, Kubernetes, TensorRT-LLM, Pytorch framework, Cuda libraries, GPU technologies
Work experience in one or more of the following programming languages: C/C++, Go, Java, Rust, python, C#
Have excellent interpersonal and communication skills to collaborate with multiple teams and build strong partnerships effectively
Demonstrated success working on software engineering problems that span multiple products
Skills Required
- Bachelors, Master/PhD degree in Computer Science or Computer Engineering
- Experience in building large scale and high-performance distributed systems
- Experience in Nvidia TensorRT and/or Triton servers
- Knowledge & experience in distributed system design & development
- Work experience in container virtualization technologies
- Work experience in one or more of the specified technologies
- Work experience in one or more of the specified programming languages
- Excellent interpersonal and communication skills
What We Do
Founded in 1987, Huawei is a leading global provider of information and communications technology (ICT) infrastructure and smart devices. We are committed to bringing digital to every person, home and organization for a fully connected, intelligent world. We have approximately 197,000 employees and we operate in over 170 countries and regions, serving more than three billion people around the world. In Canada, Huawei conducts innovative and leading edge research in 5G technologies, along with advanced development of emerging cloud, device and network technologies & services. While our renowned Canada Research Centre in the thriving technology landscape of Ottawa, Ontario continues to grow rapidly in size and strategic product initiatives, additional presence has also been established across Canada with R&D facilities in Vancouver, Edmonton, Waterloo, Markham, Montreal, and a R&D office in Quebec City.

.png)






