Machine Learning Systems Research Intern, PhD, Summer 2026

Reposted Yesterday
Be an Early Applicant
Boston, MA, USA
In-Office
Internship
Cloud • Information Technology • Internet of Things • Software • Consulting • Infrastructure as a Service (IaaS) • Automation
Creating better technology the open source way
The Role
Interns will research and implement LLM optimizations, conduct experiments on model efficiency, and collaborate with engineers on machine learning workflows.
Summary Generated by Built In
Job Summary

At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. We are seeking a highly motivated summer intern to join our Machine Learning Research Team. As an intern, you will work on cutting-edge AI inference and model optimization techniques, and contribute to research and engineering efforts that make LLMs faster and more efficient. This is an exciting opportunity to gain hands-on experience in applied machine learning research while working with leading experts in the field.

Responsibilities
  • Research and implement techniques for LLM inference and LLM optimizations.

  • Conduct experiments to evaluate the impact of optimization methods on model accuracy, latency, and throughput.

  • Collaborate with researchers and engineers to integrate optimizations into real-world machine learning workflows.

  • Document findings and contribute to technical reports, blog posts, or research publications.

Requirements
  • Currently pursuing a Ph.D. degree in Computer Science, Electrical Engineering, Machine Learning, or a related field.

  • Strong programming skills in C++, CUDA, and Python.

  • Experience with tensor math libraries such as PyTorch.

  • Familiarity with AI model optimization techniques such as quantization (e.g., INT4, FP8), pruning, and knowledge distillation.

  • Deep understanding and experience in GPU performance optimizations.

  • Excellent knowledge of large language model architectures

  • Strong analytical and problem-solving skills.

  • Excellent communication skills and ability to work in a team-oriented research environment.

  • Background in efficient inference techniques for large-scale language models or computer vision models.

  • Prior experience contributing to open-source ML frameworks or research publications.

  • ​1 or more co-authored papers at a top tier conference like NeurIPS, ICLR, ACL, CVPR, MLSys is a big plus.

Why work with us
  • Hands-on experience with state-of-the-art AI inference optimization research.

  • Mentorship from leading experts in machine learning and model efficiency.

  • Opportunity to contribute to research papers, patents, or open-source projects.

  • Competitive stipend and potential for full-time opportunities.

About Red Hat

Red Hat is the world’s leading provider of enterprise open source software solutions, using a community-powered approach to deliver high-performing Linux, cloud, container, and Kubernetes technologies. Spread across 40+ countries, our associates work flexibly across work environments, from in-office, to office-flex, to fully remote, depending on the requirements of their role. Red Hatters are encouraged to bring their best ideas, no matter their title or tenure. We're a leader in open source because of our open and inclusive environment. We hire creative, passionate people ready to contribute their ideas, help solve complex problems, and make an impact.

Inclusion at Red Hat
Red Hat’s culture is built on the open source principles of transparency, collaboration, and inclusion, where the best ideas can come from anywhere and anyone. When this is realized, it empowers people from different backgrounds, perspectives, and experiences to come together to share ideas, challenge the status quo, and drive innovation. Our aspiration is that everyone experiences this culture with equal opportunity and access, and that all voices are not only heard but also celebrated. We hope you will join our celebration, and we welcome and encourage applicants from all the beautiful dimensions that compose our global village.

Equal Opportunity Policy (EEO)
Red Hat is proud to be an equal opportunity workplace and an affirmative action employer. We review applications for employment without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, veteran status, genetic information, physical or mental disability, medical condition, marital status, or any other basis prohibited by law.


Red Hat does not seek or accept unsolicited resumes or CVs from recruitment agencies. We are not responsible for, and will not pay, any fees, commissions, or any other payment related to unsolicited resumes or CVs except as required in a written contract between Red Hat and the recruitment agency or party requesting payment of a fee.

Red Hat supports individuals with disabilities and provides reasonable accommodations to job applicants. If you need assistance completing our online job application, email [email protected]. General inquiries, such as those regarding the status of a job application, will not receive a reply.

Skills Required

  • Pursuing a Ph.D. degree in a relevant field
  • Strong programming skills in C++, CUDA, and Python
  • Experience with tensor math libraries such as PyTorch
  • Familiarity with AI model optimization techniques such as quantization, pruning, and knowledge distillation
  • Deep understanding in GPU performance optimizations
  • Knowledge of large language model architectures
  • Strong analytical and problem-solving skills
  • Excellent communication skills and teamwork
  • Background in efficient inference techniques for LLM or computer vision models
  • Prior experience contributing to open-source ML frameworks or research publications
  • 1 or more co-authored papers at a top tier conference like NeurIPS, ICLR, ACL, CVPR, MLSys

Red Hat Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Red Hat and has not been reviewed or approved by Red Hat.

  • Healthcare Strength Healthcare coverage is presented as comprehensive, spanning medical, dental, and vision along with life and disability coverage. Access to HSA/FSA options and broadly positive reception of health benefits support the view that healthcare is a core strength.
  • Leave & Time Off Breadth Time-off offerings are described as generous, with substantial PTO for new hires plus additional recharge days and an end-of-year shutdown for many non-critical roles. Paid volunteer time, holidays, sick days, and supportive expectations around taking time off reinforce the breadth of leave benefits.
  • Strong & Reliable Incentives The rewards package includes performance bonuses and a recurring quarterly bonus program tied to company and individual performance. Availability of ESPP participation further adds to incentive pathways beyond base pay.

Red Hat Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Raleigh, NC
20,000 Employees
Year Founded: 1993

What We Do

At Red Hat, we connect an innovative community of customers, partners, and contributors to deliver an open source stack of trusted, high-performing solutions. We offer cloud, Linux, middleware, storage, and virtualization technologies, together with award-winning global customer support, consulting, and implementation services. Red Hat is a rapidly growing company supporting more than 90% of Fortune 500 companies.

Why Work With Us

Red Hatters freely exchange different viewpoints, contribute ideas, and solve problems together. Our love of collaboration, accountability, a sense of community, and a measure of autonomy combine to create a powerful force that fosters innovation and makes Red Hat a great place to work.

Gallery

Gallery

Similar Jobs

Hex Logo Hex

Software Engineer

Artificial Intelligence • Big Data • Software • Analytics • Business Intelligence • Big Data Analytics
Remote or Hybrid
3 Locations
160 Employees
176K-220K Annually

Mission Cloud Logo Mission Cloud

Account Executive

Artificial Intelligence • Cloud • Information Technology • Machine Learning • Consulting • Generative AI • Big Data Analytics
In-Office or Remote
10 Locations
300 Employees
110K-137K Annually

Mission Cloud Logo Mission Cloud

Senior Account Executive

Artificial Intelligence • Cloud • Information Technology • Machine Learning • Consulting • Generative AI • Big Data Analytics
In-Office or Remote
10 Locations
300 Employees
110K-137K Annually

PwC Logo PwC

US Tech-Delivery Senior Manager

Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Hybrid
41 Locations
370000 Employees
91K-322K Annually

Similar Companies Hiring

Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York City, NY
100 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account