Site Reliability Engineer

Posted Yesterday
Be an Early Applicant
Hiring Remotely in USA
Remote
114K-148K Annually
Senior level
Software • Financial Services
The Role
Ensure platform reliability, performance, and availability by implementing observability, automating infrastructure, participating in on-call rotations and post-mortems, partnering with Product and Engineering, designing scalable architectures, mentoring teammates, and integrating Dynatrace with Azure DevOps and Jira while supporting compliance (SOC/FedRAMP).
Summary Generated by Built In

Site Reliability Engineer 


Location: 
Remote, United States 

Employment Type: Full-Time 

Benefits Offered: Vision, Medical, Life, Dental, 401K

Gross Annual Base Salary: USD 114,000 - 148,000 
Additional variable compensation and benefits may apply. Total compensation is based on experience, skills, and location using objective, job-related criteria. 


Summary

As a Site Reliability Engineer, you will focus on ensuring the platform and services customers rely on are reliable, performant, and highly available. If you enjoy staying at the forefront of technology and automating infrastructure deployments, then this is the job for you. This vital role within Cloud Services requires knowledge and experience designing, implementing, and monitoring scalable and secure cloud services. The employee is expected to work well in a small team and willing to share responsibilities with other team members as needed. You will interact with internal staff, managers, and customers to implement and maintain operations. A passion for technology and learning, and the ability to grow others are vital for success in this role.


Primary Duties and Responsibilities 

  • Implement application/infrastructure observability solutions to ensure desired application availability, reliability, and performance.
  • Participate in regular On-Call rotations and share details related to incidents and their resolution through post-mortem reports and regular review meetings.
  • Proactively partner with Product and Engineering teams to identify, develop, deploy, and maintain reliable systems and services.
  • Influence and create new designs, architectures, standards, and methods for large-scale systems.
  • Sustain a high level of reliability for key services and automated systems.
  • Automate processes to improve reliability, performance, and availability.
  • Update technical documentation, workflows, and knowledge base articles.
  • Provide feedback in pull requests and peer coding reviews.
  • Implement codified automated solutions that build integrations between Dynatrace, Azure DevOps and Jira.
  • Solid knowledge in focused areas of OneStream Software.
  • Ability to mentor others in several technical areas.
  • Understanding practical use of SOC/FedRAMP controls to assist Compliance and Security teams.

Required Education and Experience

  • BS/BA in computer science, engineering, or technology-related field (or equivalent work experience).
  • Proven work experience as a Site Reliability Engineer or in a similar role.
  • 6+ years of cloud infrastructure and software development experience.
  • 2+ years hands on experience of Azure Kubernetes Services (AKS) with container-based deployment skills or other platforms such as OpenShift, GKS, EKS.
  • Advanced understanding of APM and observability tools such as Dynatrace, AppInsights, DataDog, Log Analytics, New Relic, Prometheus and Grafana.
  • Advanced understanding of Infrastructure-as-Code (IaC) concepts and tooling (Terraform, CloudFormation templates, Bicep or ARM templates) on Microsoft Azure, Amazon Web Services (AWS), or Google Cloud Platform (GCP).
  • Deep knowledge of Configuration Management/Orchestration utilities such as Ansible, PowerShell DSC, Chef, and Puppet.
  • Advanced understanding of cloud concepts including elasticity, security, and identity management.
  • Well versed familiarity with Agile Development methodologies utilizing Jira or Azure DevOps Boards.
  • 6+ years of hands-on experience with the following technologies, tools, and concepts:
    • Automating processes using PowerShell, Bash, CLI, REST APIs, python, ARM Templates or other scripting languages.
    • Comfortable leveraging source control tools such as Git, Azure DevOps, or GitHub.
    • Knowledge of container orchestration platforms such as Kubernetes, OpenShift, AKS, GKS or helm.
    • Microsoft Azure, Amazon Web Services (AWS) or Google Cloud (GCP).

Preferred Education and Experience

  • Experience working for a cloud service provider (CSP), managed service provider (MSP), or SaaS provider.
  • 6+ years of relevant Azure experience deploying and managing leveraging Infrastructure-as-Code (IAC) concepts.
  • Experience with Microsoft and .NET (.NET, C#, SQL).
  • Experience writing efficient and reliable code in a development environment.
  • Debian, Ubuntu, Alpine or other distributions of the Linux operating systems.
  • Deep knowledge and understanding of containerized applications, with special attention to reliability and monitoring of those containerized applications.

Knowledge, Skills, and Abilities 

  • Deal well with ambiguous/undefined problems.
  • Ability to self-motivate and work independently.
  • Strong organizational and prioritization skills.
  • Ability to find and apply effective solutions to emerging problems and challenges.
  • Strong attention to detail.
  • Comfortable communicating with all levels of management and engineering.
  • Ability to get up to speed quickly with modern technologies and services.
  • Ability to multitask on a variety of projects.

Travel 

  • Travel Requirement: Travel is not expected to exceed 5%.


Who We Are 

OneStream is how today’s Finance teams can go beyond just reporting on the past and Take Finance Further™ by steering the business to the future. It’s the only enterprise finance platform that unifies financial and operational data, embeds AI for better decisions and productivity, and empowers the CFO to become a critical driver of business strategy and execution. Our vision is to be the operating system for modern finance, digitizing core financial functions and empowering the CFO to become a critical driver of business strategy. To learn more visit www.onestream.com. 


Why Join The OneStream Team 

  • Transparency around corporate structure, salary, and benefits. 
  • Core value of customer success. 
  • Variety of project work (not industry-specific).  
  • Strong culture and camaraderie. 
  • Multiple training opportunities. 


Benefits at OneStream

OneStream employees are passionate, hardworking individuals who go above and beyond to keep our customers happy and follow through on our mission statement. They consistently deliver the best and in turn, we make every effort to keep them cared for and happy. A sample of the benefits we provide are: 

  • Excellent Medical Plan. 
  • Dental & Vision Insurance. 
  • Life Insurance. 
  • Short & Long Term Disability. 
  • Vacation Time. 
  • Paid Holidays. 
  • Professional Development. 
  • Retirement Plan. 

All candidates must be legally authorized to work for any company in the country where this position is located without sponsorship. 

OneStream is an Equal Opportunity Employer. 


#LI-CS1

#LI-Remote

Equal Opportunity Employer/Protected Veterans/Individuals with Disabilities
This employer is required to notify all applicants of their rights pursuant to federal employment laws. For further information, please review the Know Your Rights notice from the Department of Labor.

Skills Required

  • BS/BA in computer science, engineering, or technology-related field (or equivalent experience)
  • Proven work experience as a Site Reliability Engineer or similar role
  • 6+ years of cloud infrastructure and software development experience
  • 2+ years hands-on experience with Azure Kubernetes Services (AKS) or platforms like OpenShift, GKS, EKS
  • Advanced understanding of APM and observability tools (Dynatrace, AppInsights, DataDog, Log Analytics, New Relic, Prometheus, Grafana)
  • Advanced understanding of Infrastructure-as-Code tooling (Terraform, CloudFormation, Bicep, ARM Templates) on Azure/AWS/GCP
  • Deep knowledge of configuration management/orchestration (Ansible, PowerShell DSC, Chef, Puppet)
  • 6+ years hands-on experience automating with PowerShell, Bash, CLI, REST APIs, Python, ARM Templates or other scripting languages
  • Experience with source control tools (Git, Azure DevOps, GitHub)
  • Knowledge of container orchestration platforms and helm
  • Familiarity with cloud concepts including elasticity, security, and identity management on Azure/AWS/GCP
  • Well-versed with Agile methodologies using Jira or Azure DevOps Boards
  • Solid knowledge in focused areas of OneStream Software
  • Ability to participate in On-Call rotations and produce incident post-mortems
  • All candidates must be legally authorized to work in the country where the position is located without sponsorship
  • Experience working for a CSP, MSP, or SaaS provider
  • 6+ years of relevant Azure experience deploying and managing leveraging IaC concepts
  • Experience with Microsoft and .NET (C#, SQL)
  • Experience with Debian, Ubuntu, Alpine or other Linux distributions
  • Deep knowledge and understanding of containerized applications reliability and monitoring

OneStream Software Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about OneStream Software and has not been reviewed or approved by OneStream Software.

  • Strong & Reliable Incentives Incentive structures in sales and customer-facing roles are robust, with OTEs commonly positioned at or above market and commission plans praised as strong. Variable components and incentive awards are prominent, enabling higher earnings when attainment and territory align.
  • Healthcare Strength Core coverage includes comprehensive medical, dental, and vision insurance, supplemented by an EAP and wellness perks. Employer-facing materials and third‑party summaries consistently position health coverage as a solid element of the package.
  • Leave & Time Off Breadth Time-off programs span vacation, separate sick time, volunteer time, and paid holidays, with a one‑month paid sabbatical after five years. This breadth extends beyond standard PTO alone.

OneStream Software Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Rochester, MI
0 Employees

What We Do

OneStream Software provides a market-leading intelligent finance platform that reduces the complexity of financial operations. OneStream™ unleashes the power of finance by unifying corporate performance management (CPM) processes such as planning, financial close & consolidation, reporting, and analytics through a single, extensible solution. We empower the enterprise with financial and operational insights to support faster and more informed decision-making. All in a cloud platform designed to continually evolve and scale with your organization. OneStream is an independent software company backed by private equity investors KKR, D1 Capital Partners, Tiger Global, and IGSB. Our primary mission is to deliver 100% customer success, which we’ve done successfully since our inception. To learn more visit www.onestreamsoftware.com.

Similar Jobs

Dropbox Logo Dropbox

Site Reliability Engineer

Artificial Intelligence • Cloud • Consumer Web • Productivity • Software • App development • Data Privacy
Remote
United States
2500 Employees
223K-302K Annually

NBCUniversal Logo NBCUniversal

Site Reliability Engineer

AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Remote or Hybrid
Orlando, FL, USA
68000 Employees

ServiceNow Logo ServiceNow

Site Reliability Engineer

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Santa Clara, CA, USA
29000 Employees
166K-290K Annually

Sprinter Health Logo Sprinter Health

Site Reliability Engineer

Artificial Intelligence • Healthtech • Logistics • Social Impact • Software • Telehealth
Remote or Hybrid
2 Locations
500 Employees
160K-255K Annually

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
31 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account