Principal Site Reliability Engineer

Posted 20 Days Ago
Be an Early Applicant
Hiring Remotely in São Paulo
In-Office or Remote
Senior level
Software
The Role
Lead infrastructure and reliability strategy for an AI-driven SaaS platform. Design scalable, resilient systems, optimize CI/CD and deployments, lead incident response, automate operational workflows, mentor engineers, and make strategic build-vs-buy decisions to improve stability and performance.
Summary Generated by Built In
Company Description

Are you ready to lead infrastructure strategy for a cutting‑edge AI‑driven SaaS platform? We are looking for a Principal Site Reliability Engineer with a proven track record in scaling, optimizing, and securing cloud‑based systems. This senior role offers the opportunity to shape the reliability and performance of a platform used by finance teams worldwide.

In this role, you will be part of a dynamic engineering environment where your expertise will directly influence product stability and growth. You will work with advanced cloud technologies, automation tools, and AI-driven solutions, contributing to projects that push the boundaries of innovation.

If you are ready to take on strategic responsibility and make a tangible impact, apply now and join us in building the future of reliable, scalable systems.

CUSTOMER
Sigma Software is partnering with a fast‑growing AI‑driven SaaS platform serving finance and accounting teams in high‑growth businesses. The platform automates critical workflows — from billing and collections to revenue recognition and reporting, ensuring compliance and accelerating cash flow. Leveraging advanced AI, it reduces manual work, increases operational efficiency, and supports scalability for customers worldwide.

PROJECT

The project focuses on building and scaling an AI-powered SaaS solution for finance automation. It integrates advanced machine learning models with robust cloud infrastructure to deliver secure, compliant, and high‑performance services. The engineering culture emphasizes automation, resilience, and operational excellence.

Job Description

  • Define and lead infrastructure and reliability strategy across the platform
  • Design scalable, resilient systems in collaboration with engineering teams
  • Optimize build, testing, and deployment processes for speed and stability
  • Establish and uphold best practices for CI/CD, monitoring, and observability
  • Lead incident response and drive continuous improvement post‑incident
  • Automate workflows to reduce operational toil and risk
  • Mentor engineers and foster a culture of operational excellence
  • Make strategic build‑vs‑buy decisions balancing speed, quality, and sustainability

Qualifications

  • At least 8 years of experience in Site Reliability Engineering or DevOps roles, including 2+ years in a Principal or Lead position
  • Proven experience in infrastructure modernization and scaling initiatives for high‑growth environments
  • Strong proficiency in Python
  • Deep expertise in cloud platforms and container orchestration tools such as AWS ECS and EKS
  • Solid experience in CI/CD pipeline design and optimization using tools like GitHub Actions and Buildkite
  • Proficiency in infrastructure‑as‑code tools such as Terraform
  • Strong knowledge of monitoring, observability, and performance optimization practices
  • Upper-Intermediate level of spoken and written English

WOULD BE A PLUS

  • Experience with monorepos (Turborepo, pnpm)
  • Familiarity with modern TypeScript tools (swc, biome, oxc)
  • Knowledge of NestJS, NextJS, and testing frameworks (Jest, Vitest)

Additional Information

PERSONAL PROFILE

  • Excellent leadership, communication, and decision‑making abilities
  • Ability to work independently and make pragmatic build‑vs‑buy decisions in fast‑paced environments

Top Skills

Python,Aws Ecs,Aws Eks,Github Actions,Buildkite,Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
New York, New York
1,516 Employees

What We Do

Sigma Software Group, an award-winning and trusted IT partner, has been serving customers for over 21 years, providing comprehensive IT solutions to various businesses, ranging from startups to established software product houses. As one of Europe's substantial IT consultancies, it brings together a dedicated workforce of over 2,100 professionals in 40 offices across 19 countries. With a diverse client base, including more than 300 enterprises, including Fortune 500 stalwarts, Sigma Software Group is a preferred choice for developing solutions that help businesses create cutting-edge products while meeting their unique needs.

Sigma Software Group operates as a dynamic ecosystem of tech companies, offering 25 ready-to-implement innovative products and 40+ value-added services. Furthermore, Sigma Software Group is committed to fostering innovation through initiatives such as the Sigma Software Labs business incubator, Sigma Software University, the SID Venture Partners VC Fund, UA Tech Network, Techosystem, the European Business Association, and other collaborative efforts.

Since 2015, Sigma Software Group has consistently earned recognition on the IAOP's prestigious World's Top 100 Outsourcing list. The company's accomplishments have also been acknowledged by prominent global media outlets such as Forbes, CNBC, The Times, and Reuters

Similar Jobs

Easy Apply
Remote
Brazil
359 Employees
7K-12K Hourly

Circle (Community) Logo Circle (Community)

Head of Media

Artificial Intelligence • Consumer Web • Digital Media • Information Technology • Social Impact • Software
Easy Apply
Remote
31 Locations
250 Employees
150K-220K Annually

Zscaler Logo Zscaler

Account Executive

Cloud • Information Technology • Security • Software • Cybersecurity
Easy Apply
Remote or Hybrid
Brazil
8697 Employees

Circle (Community) Logo Circle (Community)

Lead Product Designer

Artificial Intelligence • Consumer Web • Digital Media • Information Technology • Social Impact • Software
Easy Apply
Remote
31 Locations
250 Employees
140K-170K Annually

Similar Companies Hiring

Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account