Senior Platform/MLOps Engineer

Posted 3 Hours Ago
Be an Early Applicant
Hiring Remotely in USA
Remote
150K-170K Annually
Senior level
Software
The Role
The Senior Platform/MLOps Engineer will design and implement scalable AI/ML infrastructure, manage deployment pipelines, and optimize GPU workloads in Kubernetes to support manufacturing operations.
Summary Generated by Built In
RETHINK MANUFACTURING  
 
The only way to ignite change is to build the best team. At Bright Machines®, we’re innovators and experts in our craft who have joined together to create a new category of manufacturing that will help transform the industry. We believe software and data are the answer, thoughtfully applied to solve our customers’ unique challenges. Through intelligent automation, we give factories newfound flexibility, scalability, and resilience. We deliver products to meet the demands of today while building a platform to take advantage of what comes next.  
 
Working with us means you’ll have the opportunity to make lasting, impactful changes for our company and our customers. If you’re ready to apply your exceptional skills to create the factory of the future, we’d love to speak with you.

ABOUT THE ROLE
 

Platform Engineers at Bright Machines are responsible for defining and implementing the systems that make Software Defined Manufacturing possible and that power our flexible robotic manufacturing lines. Our robots, and the software that controls them, are deployed in a variety of factory conditions and help support the manufacturing operations for some of the biggest names in the industry. 

As a Senior Platform/MLOps Engineer, you will build scalable systems that are foundational to the Bright Machines technology stack. With a focus on our AI/ML infrastructure, you will design, implement, and maintain our training pipelines, model deployments, and inference app workloads. Our computer vision and deep learning models are used for defect detection, classification, and visual validation, providing end-to-end inspection solutions that deliver consistent, accurate results under real-world factory conditions. You will collaborate with the Smart Robotics team and the Platform Engineering team to design, implement, and deploy our GPU workloads in kubernetes. If you are ready to apply exceptional engineering practices and build the platform that will define the next generation in manufacturing, this is your opportunity to “Be Bright”.

WHAT YOU WILL BE DOING

    • Design, implement, and maintain reliable, scalable, and secure infrastructure, applications, and tooling, with a focus on our ML/AI pipelines and workloads

    • Write clean, maintainable code, and perform peer code-reviews 

    • Write clear and concise documentation and engage in cross-team communication and knowledge sharing

    • Work with other team members to investigate design approaches, prototype new technology and evaluate technical feasibility

    • Pair with adjacent teams to understand how your frameworks and infrastructure are actually used in the field, continuously improving them and leveraging recent advances to improve developer velocity

WHAT YOU WILL BRING

    • At least 5+ years of experience in Platform Engineering, DevOps, or Site Reliability Engineering (SRE).

    • B.S. or M.S. degree (or equivalent) in Computer Science, Engineering, or a related field

    • Proficiency in at least one modern programming languages (Python, Javascript, C#, Go, etc)

    • Demonstrated industry best-practices in MLOps

    • Proficiency with CI/CD tools and GitOps workflows

    • Familiarity with running GPU workloads in kubernetes

    • Strong knowledge of Kubernetes (self-hosted and managed) and modern k8s paradigms (e.g. CNCF)

    • Proficiency with Infrastructure as Code tools (Terraform, etc) and configuration management tools (Ansible, etc)

    • Familiarity with observability stacks (Prometheus, Grafana, OpenTelemetry)

    • Travel 25% 

IT WOULD BE GREAT IF YOU HAD

    • Experience in air-gapped or extremely strict security environments

      • Experience communicating with users, technical leaders and management to collect requirements, describe system designs, and architecting software systems that meets your stakeholders needs

      • Knowledge and demonstrated application of software engineering best practices relating to the SDLC including code reviews, SCM, CI/CD, testing, and operations

      • Demonstrated ability to mentor and grow other team members

BE EMPOWERED TO CHANGE AN INDUSTRY 

Bright Machines is a next-generation, AI-enabled manufacturer focused on data center infrastructure assembly operations. Bright Machines uses its proprietary AI-based robotics and software to assemble AI infrastructure hardware products (i.e., data center servers) for hyperscalers and leading Original Equipment Manufacturers (OEMs). With its new AI factory, Bright Machines addresses increasing market demands for computing power due to the surge of AI and the U.S. national mandate to reshore manufacturing by building data center infrastructure at scale with higher quality and shorter time-to-market.
Bright Machines is headquartered in San Francisco, California, with an integration center in Guadalajara, Mexico. The company has been recognized as one of Forbes’ AI 50, awarded “Best AI-based Solution for Manufacturing” by AI Breakthrough, named a “Technology Pioneer” by the World Economic Forum, and highlighted by several other leading technology and innovation organizations.

Skills Required

  • At least 5+ years of experience in Platform Engineering, DevOps, or Site Reliability Engineering (SRE)
  • B.S. or M.S. degree in Computer Science, Engineering, or a related field
  • Proficiency in at least one modern programming languages
  • Demonstrated industry best-practices in MLOps
  • Proficiency with CI/CD tools and GitOps workflows
  • Familiarity with running GPU workloads in kubernetes
  • Strong knowledge of Kubernetes and modern k8s paradigms
  • Proficiency with Infrastructure as Code tools and configuration management tools
  • Familiarity with observability stacks
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
San Francisco, CA
459 Employees
Year Founded: 2018

What We Do

We deliver intelligent, software-defined manufacturing by bringing together our flexible factory robots with intelligent software, production data and machine learning. Our growing, global team of more than 300 robotics, software and manufacturing industry veterans believe building physical things should be as seamless and simple as creating digital products. Our software-defined manufacturing platform helps customers innovate faster to meet the growing demands of a new era of manufacturing.

Similar Jobs

Zscaler Logo Zscaler

Director Product Marketing, Zscaler Private Access

Cloud • Information Technology • Security • Software • Cybersecurity
Easy Apply
Remote or Hybrid
Location, WV, USA
8697 Employees
193K-275K Annually

GameChanger Logo GameChanger

Staff Software Engineer

Computer Vision • Digital Media • Kids + Family • Mobile • Software • Sports
Remote
United States
260 Employees
200K-230K Annually

GRAIL Logo GRAIL

Consultant

Artificial Intelligence • Big Data • Healthtech • Machine Learning • Software • Biotech
Remote or Hybrid
Menlo Park, CA, USA
918 Employees
118K-156K Annually

GRAIL Logo GRAIL

Consultant

Artificial Intelligence • Big Data • Healthtech • Machine Learning • Software • Biotech
Remote or Hybrid
Menlo Park, CA, USA
918 Employees
94K-125K Annually

Similar Companies Hiring

Fairly Even Thumbnail
Hardware • Other • Robotics • Sales • Software • Hospitality
New York, NY
30 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Software
US
100 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account