Technical Lead - Senior ML Infrastructure Software Engineer

Sorry, this job was removed Sorry, this job was removed at 06:04 p.m. (CST) on Tuesday, Apr 08, 2025
Hiring Remotely in USA
Remote
Software
The Role

Technical Lead - Senior Machine Learning Infrastructure Software Engineer

Location: Hybrid in New York City, or US remote.


About Flip.shop:

Welcome to Flip.shop, where innovation meets the social commerce revolution! Fresh off our Series C funding round, we've raised $144 million, propelling our valuation to an impressive $1.05 billion. We’re redefining the shopping experience by giving consumers a voice in a space dominated by tech giants. Join us on this exhilarating journey where your technical skills will play a pivotal role in shaping the future of social commerce!


Why Join Us?

At Flip.shop, you’ll be at the forefront of innovation in social commerce. This isn’t just a job—it’s a chance to build infrastructure that empowers our AI-driven platform to scale and deliver personalized shopping experiences. You will have the opportunity to directly partner, work with and learn from the very best engineers and scientists who joined us from some of the leading big-tech companies! 

If you thrive in a fast-paced, collaborative environment where you can develop high-performance systems, we want to hear from you!


Role Overview:

We are seeking an experienced ML Infrastructure Lead to design, build, and optimize the infrastructure that powers our machine learning systems. You’ll drive the scalability, reliability, and performance of our recommendation and ads systems. This role involves leading the design, implementation, and optimization of our serving infrastructure to support high-throughput, low-latency workloads.

Furthermore, you'll ensure the efficient deployment, scaling, and monitoring of machine learning models, and will help streamline the development lifecycle. This role offers the opportunity to create scalable, production-level systems that support real-time recommendations and drive business growth.


You will work closely with our engineering and machine learning leaders to ensure our platform can scale efficiently and reliably as we grow.


Key Responsibilities:

  • Infrastructure Development: Design and implement scalable ML infrastructure for deploying, monitoring, and maintaining machine learning models in production environments. Ensure high availability, reliability, and performance of serving and infra systems.
  • Tooling & Automation: Build tools to automate workflows for model training, testing, and deployment, ensuring that machine learning models can move quickly from development to production.
  • Cloud Infrastructure: Leverage cloud platforms to create efficient, scalable systems for large-scale machine learning workloads.
  • Performance Optimization: Ensure the infrastructure supports high-performance model inference at scale, with a focus on minimizing latency and maximizing throughput.
  • Collaboration: Work closely with data scientists, machine learning engineers, and DevOps teams to create seamless integration between development and production environments.
  • Monitoring & Maintenance: Build robust monitoring systems to track model performance and infrastructure health, ensuring reliability and uptime of machine learning services.
  • Security & Compliance: Implement best practices in infrastructure security, data privacy, and compliance, particularly when handling sensitive user data.

Requirements:

  • Education: Bachelor’s or Master’s degree in Computer Science, Engineering, or related field.
  • Experience: 7+ years of experience in infrastructure engineering, DevOps, or similar domains, with a focus on supporting machine learning workflows in production.
  • Technical Skills: Strong proficiency in cloud platforms (AWS, GCP, or Azure), containerization (Docker, Kubernetes), CI/CD pipelines, and infrastructure-as-code tools (Terraform, Ansible). Experience with SageMaker is a bonus. 
  • ML Workflow Knowledge: Experience working with machine learning frameworks (TensorFlow, PyTorch, or similar) and expertise with MLOps practices.
  • Performance & Scalability: Proven track record of optimizing infrastructure for performance, scalability, and reliability in production environments.
  • Collaboration: Strong teamwork skills, with the ability to partner with ML engineers and data scientists to streamline workflows.
  • Communication: Ability to communicate complex infrastructure solutions to technical and non-technical stakeholders.
  • Problem-Solving: Passion for solving infrastructure challenges that support real-time machine learning at scale.

Preferred Qualifications:

  • Experienced with using node.js for backend development
  • Experienced with infrastructure & tools of AWS
  • Experienced with message Queue such as RabbitMQ.

Why You’ll Love Working Here:

At Flip.shop, you’ll have the opportunity to build the backbone of our AI-driven platform, working on cutting-edge infrastructure that powers personalized shopping experiences for millions of users. Your work will directly contribute to scaling our machine learning systems, ensuring they run efficiently in a high-performance production environment. This is your chance to have a lasting impact and help Flip.shop shape the future of social commerce.


Ready to Build the Future?

If you're passionate about building scalable infrastructure and driving innovation in machine learning at scale, join us at Flip.shop! Let’s redefine the future of online shopping together.


Compensation & Benefits:

Base salary and total compensation will vary based on factors including but not limited to location, experience, and performance. Please note the base salary is just one component of the company’s total rewards package for exempt employees. Other rewards may include equity, bonuses, long term incentives, a PTO policy, and other progressive benefits.

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Los Angeles, CA
69 Employees
On-site Workplace
Year Founded: 2019

What We Do

Flip is a vertically integrated social commerce platform delivering a revolutionary shopping experience in the US. Users discover products by flipping through instantly shoppable content posted by the Flip community, and order through a one-click checkout, with free same-day shipping. Our patented technology allows every shopper to become a creator by posting video reviews of their purchases and earning commissions based on engagement and sales through their content.

Similar Jobs

Tempus AI Logo Tempus AI

Medical Science Liaison (MSL) Manager - South US

Artificial Intelligence • Big Data • Healthtech • Machine Learning • Analytics • Biotech
Easy Apply
Remote
Hybrid
Chicago, IL, USA
2482 Employees
150K-220K Annually

Tempus AI Logo Tempus AI

Medical Science Liaison (MSL) Manager - North US

Artificial Intelligence • Big Data • Healthtech • Machine Learning • Analytics • Biotech
Easy Apply
Remote
Hybrid
Chicago, IL, USA
2482 Employees
150K-220K Annually

Dandy Logo Dandy

Senior Software Engineer II, SRE

Computer Vision • Healthtech • Information Technology • Logistics • Machine Learning • Software • Manufacturing
Remote
USA
1200 Employees

Mondelēz International Logo Mondelēz International

Associate Manager, Shopper Insights

Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
Remote
Hybrid
New Worke, NJ, USA
90000 Employees
95K-131K Annually

Similar Companies Hiring

True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees
Caliola Engineering Thumbnail
Software • Machine Learning • Hardware • Defense • Data Privacy • App development • Aerospace
Colorado Springs, CO
53 Employees
Red 6 Thumbnail
Virtual Reality • Software • Hardware • Defense • Aerospace
Orlando, Florida
113 Employees
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account