Software Site Reliability Engineer

Golden, CO, USA
In-Office
103K-136K Annually
Senior level
Manufacturing
The Role
The Site Reliability Engineer will ensure the reliability, security, and support of Databricks applications while collaborating with various teams to optimize data workflows and incident management.
Summary Generated by Built In

It's exciting to work for a company that makes the world measurably better.

We're committed to bringing safety, quality, and customer focus to the business of advanced ceramics manufacturing.

Job Title

Software Site Reliability Engineer

As the Site Reliability Engineer, you will support CoorsTek's Databricks application and data product strategy by ensuring solutions built, migrated, and deployed on Databricks are reliable, secure, observable, supportable, and cost-effective in production. This role is not solely focused on monitoring and operational support. You will actively develop automation, platform tooling, deployment pipelines, observability capabilities, and reliability solutions that reduce operational toil and improve the scalability of Databricks-hosted applications and data products.
This role sits within Data & Analytics and partners closely with Architecture, Cybersecurity, Infrastructure, Manufacturing IT/OT, Enterprise Applications, citizen developers, and business teams. As the Databricks Site Reliability Engineer, you will support production reliability for Databricks-hosted applications (Pattern B), analytics products, workflows, and AI-enabled solutions.
In this role, you will help CoorsTek move quickly without creating unmanaged technical debt by contributing to and improving support patterns, monitoring standards, deployment practices, runbooks, incident response, and operational guardrails for Databricks solutions created by both business-enabled citizen development and IT delivery teams.

Roles and Responsibilities

  • Support production reliability, operational readiness, and lifecycle support for Databricks-hosted applications, data products, dashboards, notebooks, jobs, workflows, APIs, and AI-enabled solutions.

  • Support applications migrated to Databricks, built directly in Databricks, or promoted from citizen development and IT development into governed production patterns.

  • Execute intake, review, handoff, support, and release practices for Pattern B Databricks applications, including minimum requirements before production deployment.

  • Partner with citizen developers, IT developers, data engineers, enterprise architects, and business stakeholders to convert prototypes into reliable, monitored, documented, and supportable services.

  • Implement and maintain observability standards, including logging, alerting, health checks, SLIs/SLOs, lineage, usage monitoring, cost monitoring, and operational dashboards.

  • Respond to incidents, coordinate troubleshooting, participate in root cause analysis, and support corrective actions for failed jobs, broken pipelines, access issues, performance issues, data refresh failures, and application outages.

  • Maintain and update runbooks, support procedures, escalation paths, ownership models, service catalogs, and knowledge articles for Databricks applications and data products.

  • Partner with Data & Analytics on Databricks workflows, Delta Lake, Unity Catalog, data lineage, permissions, SQL warehouses, jobs, clusters, serverless capabilities, and performance tuning.

  • Partner with Cybersecurity and Architecture to ensure Databricks solutions meet standards for identity, access, secrets management, logging, data classification, responsible AI, and least-privilege access.

  • Support CI/CD, testing, environment promotion, release controls, rollback procedures, and change management for Databricks applications and related Azure or integration components.

  • Identify recurring failure patterns and assist with automating manual support work, reducing operational toil, and creating reusable templates and standards.

  • Advise teams on production-ready design, including resiliency, scalability, maintainability, cost control, data quality checks, monitoring hooks, and clear ownership.

  • Collaborate with manufacturing, finance, supply chain, quality, and other business teams to understand impact, prioritize recovery, and maintain trust in critical Databricks-supported solutions.

  • Support governance for citizen-built solutions by ensuring business-created applications have appropriate documentation, testing evidence, security review, support model, and IT transition plan before broad use.

  • Monitor and resolve issues across service health, support metrics, incidents, problem records, platform risks, and improvement backlog items for Databricks applications and data products.

  • Design and develop automation, self-healing workflows, monitoring integrations, and operational tooling using Python and cloud-native technologies.

Job Requirements

Education:

  • Bachelor's degree in Computer Science, Information Technology, Data Engineering, Software Engineering, Systems Engineering, or a related field required.

  • Master's degree preferred.

Experience:

  • 5 or more years of progressive experience in site reliability engineering, data platform engineering, cloud operations, DevOps, software engineering, data engineering, or production application support.

  • 3 or more years supporting cloud, data, analytics, application, or platform services in production environments preferred.

  • Experience with Databricks, Delta Lake, Unity Catalog, SQL, Python, PySpark, notebooks, jobs/workflows, SQL warehouses, clusters, or lakehouse architecture.

  • Experience operating applications through incident management, problem management, change management, monitoring, release management, and production readiness practices.

  • Preferred experience with Azure, CI/CD pipelines, Git-based development, infrastructure patterns, logging, alerting, automation, and support runbooks.

  • Preferred experience supporting data pipelines, analytics products, dashboards, APIs, AI-enabled applications, or business-critical reporting environments.

Functional / Technical Knowledge, Skills & Abilities:

  • Strong understanding of SRE, DevOps, IT operations, and production support practices, including reliability, observability, automation, incident response, and operational excellence.

  • Working knowledge of Databricks platform capabilities, including Delta tables, notebooks, workflows/jobs, SQL, Unity Catalog, lineage, permissions, compute configuration, and governed access patterns.

  • Ability to troubleshoot Databricks jobs, pipelines, notebooks, SQL queries, permissions, data refreshes, performance issues, and environment or integration failures.

  • Ability to write and review SQL and Python; PySpark, scripting, API, and automation experience preferred.

  • Ability to define operational readiness standards for applications created by citizen developers, IT teams, consultants, and data engineering teams.

  • Strong understanding of monitoring, alerting, logging, service health, SLOs, runbooks, release controls, rollback planning, and root cause analysis.

  • Ability to balance speed, business enablement, cybersecurity, supportability, cost control, and long-term platform sustainability.

  • Ability to partner effectively with Data & Analytics, Cybersecurity, Architecture, Infrastructure, Enterprise Applications, Manufacturing IT/OT, and business stakeholders.

  • Strong documentation and communication skills, including support models, knowledge articles, architecture notes, production checklists, escalation paths, and operational dashboards.

  • Ability to manage multiple production priorities, operate calmly during incidents, drive follow-through on corrective actions, and influence teams without direct authority.

Preferred Certifications:

  • Relevant Databricks certifications, including Data Engineer, Data Analyst, Machine Learning, or Lakehouse Fundamentals, preferred.

  • Relevant Microsoft Azure, DevOps, cloud engineering, cybersecurity, ITIL, SRE, observability, or data engineering certifications preferred.

  • ITIL Foundation, Azure Administrator, Azure Developer, GitHub, Terraform, Kubernetes, or related platform operations certifications are a plus.

Additional Position Details:

  • Location: Golden, CO (on-site)

  • Work Authorization: Requires U.S. Person status (a U.S. citizen, Green Card holder, or protected refugee/asylee)

Target Hiring Range

Annual Salary: USD 103,040.00 - USD 136,013.00

Actual compensation is commensurate with experience, skills, and education. CoorsTek strives to give all qualified applicants equal opportunity and to make selection decisions on job-related factors. Do not provide any information on the application that would indicate your race, color, religion, national origin, sex, age, disability, sexual orientation, gender identity, pregnancy, genetic information, veteran status, or any other status protected by law or regulation.

If you like working for a company that makes a real difference in the world, you'll enjoy your career with us!

The Company
Golden, Colorado
1,928 Employees
Year Founded: 1910

What We Do

CoorsTek is the international partner of choice for companies requiring the unique, high-performance properties of products manufactured from engineered technical ceramics and advanced materials. CoorsTek delivers outstanding value through unsurpassed expertise in materials engineering; broad research, development, and manufacturing capabilities; collaborative relationships; and operational excellence. For more information about CoorsTek, including product information and company history since 1910, visit CoorsTek.com.
