Research Engineer - Performance Optimization

Reposted 3 Days Ago
Be an Early Applicant
Palo Alto, CA
In-Office
180K-250K Annually
Mid level
HR Tech • Information Technology
The Role
The Research Engineer will optimize implementation of models and systems for data processing and training, focusing on performance tuning across large-scale GPU infrastructures.
Summary Generated by Built In
Job Title: Research Engineer - Performance Optimization
Position Type: Full time
Location: Palo Alto, CA, USA
Salary Range: $180,000 - $250, 000 (USD)
Job ID#: 156204
Job Description:

We are looking for engineers with significant problem solving experience in PyTorch, CUDA and distributed systems. You will work with Research Scientists to build & train cutting edge foundation models on thousands of GPUs.

Multimodal Generative models such as Diffusion Models and GANs.

Good to have experience building inference / demo prototype code (incl. Gradio, Docker etc.)

Responsibilities
  • Ensure efficient implementation of models & systems for data processing, training, inference and deployment

  • Identify and implement optimization techniques for massively parallel and distributed systems

  • Identify and remedy efficiency bottlenecks (memory, speed, utilization) by profiling and implementing high-performance CUDA, Triton, C++ and PyTorch code

  • Work closely together with the research team to ensure systems are planned to be as efficient as possible from start to finish

  • Build tools to visualize, evaluate and filter datasets

  • Implement cutting-edge product prototypes based on multimodal generative AI

Requirements:
  • Experience training large models using Python & Pytorch, including practical experience working with the entire development pipeline from data processing, preparation & data loading to training and inference.

  • Experience optimizing and deploying inference workloads for throughput and latency across the stack (inputs, model inference, outputs, parallel processing etc.)

  • Experience with profiling CPU & GPU code in PyTorch, including Nvidia Nsight or similar.

  • Experience writing & improving highly parallel & distributed PyTorch code, with familiarity in DDP, FSDP, Tensor Parallel, etc.

  • Experience writing high-performance parallel C++. Bonus if done within an ML context with PyTorch, like for data loading, data processing, inference code.

  • Experience with high-performance Triton / CUDA and writing custom PyTorch kernels. Top candidates will be able to utilize tensor cores; optimize performance with CUDA memory and other similar skills.

  • Good to have experience working with Deep learning concepts such as Transformers & Multimodal Generative models such as Diffusion Models and GANs.

  • Good to have experience building inference / demo prototype code (incl. Gradio, Docker etc.)

About Us:
Founded in 2009, IntelliPro is a global leader in talent acquisition and HR solutions. Our commitment to delivering unparalleled service to clients, fostering employee growth, and building enduring partnerships sets us apart. We continue leading global talent solutions with a dynamic presence in over 160 countries, including the USA, China, Canada, Singapore, Japan, Philippines, UK, India, Netherlands, and the EU.
IntelliPro, a global leader connecting individuals with rewarding employment opportunities, is dedicated to understanding your career aspirations. As an Equal Opportunity Employer, IntelliPro values diversity and does not discriminate based on race, color, religion, sex, sexual orientation, gender identity, national origin, age, genetic information, disability, or any other legally protected group status. Moreover, our Inclusivity Commitment emphasizes embracing candidates of all abilities and ensures that our hiring and interview processes accommodate the needs of all applicants. Learn more about our commitment to diversity and inclusivity at https://intelliprogroup.com/.
Compensation: The pay offered to a successful candidate will be determined by various factors, including education, work experience, location, job responsibilities, certifications, and more. Additionally, IntelliPro provides a comprehensive benefits package, all subject to eligibility.

Top Skills

C++
Cuda
Distributed Systems
Docker
Gradio
PyTorch
Triton
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Santa Clara, CA
638 Employees
Year Founded: 2009

What We Do

IntelliPro Group Inc. is one of the fastest growing IT services and HR solutions companies in Americas & APAC. We provide comprehensive IT services to help clients with IT Strategic Planning, Implementation, Deployment, IT Support on Artificial Intelligence, Big Data, Cloud Computing, Mobile Application Development, Data Mining and Business Intelligence, Enterprise Data Warehouse, and more.

Besides our established IT services, our new business now is quickly extending to one-stop HR Solution Services, including Oversea Branch Setup Consulting, Compensation & Benefits Policy Consulting, Payroll Management Service, Talent Recruiting, and Employer Branding to satisfy our clients’ fast business expansion requirement.

We have built our business on our company-wide commitment to continually overdeliver on the high expectations of our clients, employees, and business partners. The secret to our success is that our unified team works harder, faster, smarter, and more collaboratively than anyone else in the talent acquisition business. In addition to the immense talent and proprietary technology, IntelliPro Group is proud to offer continual professional development and extraordinary benefits to both consultants and full-time employees.

Similar Jobs

In-Office
Palo Alto, CA, USA
200K-280K Annually
Hybrid
2 Locations
213000 Employees
37-66 Hourly
Hybrid
Los Angeles, CA, USA
213000 Employees
90K-150K Annually
Hybrid
3 Locations
213000 Employees
37-66 Hourly

Similar Companies Hiring

Compa Thumbnail
Software • Other • HR Tech • Business Intelligence • Artificial Intelligence
Irvine, CA
60 Employees
Scrunch AI Thumbnail
Software • SEO • Marketing Tech • Information Technology • Artificial Intelligence
Salt Lake City, Utah
Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account