Responsibilities
- Act as a senior custodian of the production promotion process across the software platform estate.
- Work closely with Technical Leads and QA to define and evolve promotion practices that emphasise quality, performance, and operational readiness.
- Define and evolve observability standards across metrics, logging, tracing, and alerting.
- Ensure systems are instrumented to support rapid diagnosis, learning, and recovery.
- Drive continuous improvement in platform reliability, performance, and release confidence.
- Partner with engineering, architecture, and platform teams to embed operability and resilience into system design.
- Lead and participate in on-call and rota-based operational support for production systems.
- Coordinate and continuously improve incident management practices, including post-incident reviews and preventative actions.
- Act as a senior technical authority for production readiness, operational risk, and release confidence.
- Mentor SREs and senior engineers, raising reliability and operational standards across teams.
- Influence architectural and platform decisions with a strong operational and delivery lens while remaining hands-on.
Skills
- Strong hands-on experience operating distributed, cloud-hosted SaaS platforms at scale.
- Professional experience with at least one modern programming language.
- Experience working with or supporting .NET-based systems is highly beneficial.
- Strong experience with Microsoft Azure, including core platform services, networking, identity, and security.
- Deep expertise in observability tooling and practices. Experience improving production promotion, deployment, and release processes.
- Experience with Infrastructure as Code and automation-driven operations.
- Strong understanding of failure modes, resilience patterns, and recovery strategies. Ability to influence senior stakeholders through technical credibility and pragmatism.
Minimum Qualifications
- Based In East Coast Time Zone
- Typically, 8+ years of experience in SRE, platform, operational, or software engineering roles with a large amount of these spent in multi-tenant environments.
- Experience supporting production systems with formal on-call or rota responsibility.
- Experience in leading and mentoring a team of SRE engineers, with an emphasis on professional and personal growth.
- Experience enabling regular, multi-service production releases at scale.
- Right to work in the country of employment.
Similar Jobs
What We Do
StarCompliance is the world's leading provider of compliance software to the global financial industry. Our clients include asset managers, broker-dealers, private equity firms, insurance providers, investment banks, and diversified financial institutions. Our scalable, easy-to-use solutions provide a 360-degree view of employee and business activity to help firms monitor and reduce risk, meet regulatory obligations, gain efficiencies, and drive employee adoption. Our Employee Conflicts of Interest suite provides clients a single place for monitoring and mitigating potential employee conflicts, covering: personal trading activity; insider trading; private investments, gifts and entertainment spending; outside business activities; and political donations. The STAR Mobile app supports personal trading pre-clearance requests and gifts and entertainment spending submissions, and allows compliance officers and employee supervisors to review and approve those requests and submissions on-the-go. Compliance Control Room centralizes all firm deal-related activity—automatically surfacing critical data that might otherwise be missed—and allowing for easier conflict searches, so deals can be cleared faster and with greater confidence.









