Lead Associate Principal, Software Engineering: DevOps

Posted 24 Days Ago
Be an Early Applicant
Chicago, IL, USA
Hybrid
141K-233K Annually
Senior level
Big Data • Cloud • Fintech • Information Technology • Financial Services
We clear and settle trades for the options industry.
The Role
The role involves leading DevOps initiatives, collaborating across teams to drive site reliability, implementing CI/CD pipelines, managing infrastructure automation, and ensuring system observability and operational excellence.
Summary Generated by Built In

To be considered for this position, applications and resumes are accepted only through our careers site by directly applying to the posted job. We do not accept unsolicited resumes or sales solicitations from staffing agencies. Any OCC employee wishing to submit a referral must do so through their Workday account. Any resume submitted outside of an active job posting will not be considered for employment.

What You'll Do

Successful candidate will collaborate with various product, infrastructure, operations, security, and production control teams to elicit and fulfill technical requirements, while driving site reliability, system observability, and operational excellence across the platform.

Primary Duties and Responsibilities:

To perform this job successfully, an individual must be able to perform each primary duty satisfactorily.

  • Guides the implementation using CI/CD pipelines in Kubernetes environment

  • Directs review, configuration, and execution of Terraform and Ansible automation pipelines delivered by product teams

  • Guides the setup of common infrastructure platforms like multi-region Kubernetes and Kafka clusters

  • Elicits requirements for application deployment and sizing to manage expected workloads

  • Defines and enforces Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Error Budgets in collaboration with product teams

  • Leads blameless post-mortems and drives resolution of action items to reduce repeat incidents

  • Designs and implements observability frameworks covering metrics, logs, and distributed tracing across all platform services

  • Drives toil reduction initiatives by identifying and automating repetitive operational work

  • Partners with product teams to embed reliability requirements and non-functional requirements (NFRs) early in the software development lifecycle

  • Monitors application performance and tunes systems working with product teams

  • Confers with product team leads and practitioners to create deployment and reliability plans

  • Confers with Enterprise Architecture and Renaissance architecture teams to devise implementation architecture

  • Promotes standards across application configuration towards the highest security posture

  • Collaborates with access management and security teams on setting up roles and permissions using least privilege strategies

  • Collaborates with integration/performance testing teams to leverage integrated release testing in the Release Acceptance environment

  • Collaborates with production controls teams on monitoring, failover, logging, and alerting strategies

  • Owns and continuously improves incident response runbooks, on-call rotations, and escalation procedures

  • Conducts capacity planning and load forecasting to proactively address scalability needs

  • Implements and validates infrastructure failover scenarios

  • Confers with Network team on all connectivity plans and issue resolution (including between on-premises and AWS)

  • Follows and enables program-level agile practices for efficient collaboration and delivery

  • Develops documentation for ORT technical infrastructure, architecture, and reliability support

Supervisory Responsibilities

  • None

Qualifications:

The requirements listed are representative of the knowledge, skill, and/or ability required. Reasonable accommodations may be made to enable individuals with disabilities to perform the primary functions.

  • [Required] Understanding of Kanban and/or Agile methodologies

  • [Required] Familiarity with SRE principles as defined by Google SRE practices (error budgets, toil elimination, reliability hierarchy)

  • [Required] Able to succeed in a fast-paced environment with frequent changes

  • [Required] Comfortable communicating with both technical and non-technical audiences

  • [Required] Self-starter — takes initiative to research, learn, and deliver; anticipates the play

  • [Required] Team player — humble, collaborative, and focused on making the entire team succeed

Technical Skills & Background

  • [Required] AWS EC2, Kubernetes, Kafka, Jenkins, Terraform, Ansible, Hashicorp Vault

  • [Required] Observability tooling such as Prometheus, Grafana, OpenTelemetry, Datadog, or equivalent

  • [Required] Incident management platforms and on-call tooling (e.g., PagerDuty, OpsGenie)

  • [Required] Microservices and streaming data-intensive application architecture

  • [Required] Application architecture, networking, and security in the cloud

  • [Required] Setting up platforms in AWS for high-performance requirements

  • [Required] Broad experience in API-based development

  • [Required] Git and Artifactory for sourcing artifacts

  • [Required] Multi-AZ, multi-region failover architecture

  • [Required] Chaos engineering principles and tooling (e.g., Chaos Monkey, Gremlin, LitmusChaos)

  • [Required] Fluent with different data formats and structures: JSON, Protobuf, Avro

  • [Required] SQL and NoSQL databases, in-memory data stores

  • [Required] Java/Python/Scala/Golang software development

  • [Required] Two or more of the following: web/mobile application development, Unix/Linux environments, event-driven systems, transaction processing systems, distributed and parallel systems, large software system development, security software development, public-cloud platforms

  • [Required] Fluent in industry best practices, software patterns, and architecture principles

  • [Required] Enterprise architecture frameworks such as TOGAF

  • [Required] Ability to define and document architecture strategies, designs, and requirements across all enterprise architecture domains

  • [Required] Ability to define service-based, component architectures and demonstrate visualization of enterprise architecture concepts

Certifications

  • [Preferred] AWS Certified Solutions Architect / DevOps Engineer

  • [Preferred] Kubernetes, Kafka certification

  • [Preferred] Google Cloud Professional — Site Reliability Engineer or equivalent SRE-focused certification

  • [Preferred] Project/program management certifications

Education & Training

  • [Required] BS degree in Computer Science, similar technical field, or equivalent experience

  • [Required] 7+ years of experience building large-scale, data-centric solutions

  • [Required] 7+ years of recent experience participating on a DevOps or SRE team, or as product owner for such a team

About Us

The Options Clearing Corporation (OCC) is the world's largest equity derivatives clearing organization. Founded in 1973, OCC is dedicated to promoting stability and market integrity by delivering clearing and settlement services for options, futures and securities lending transactions. As a Systemically Important Financial Market Utility (SIFMU), OCC operates under the jurisdiction of the U.S. Securities and Exchange Commission (SEC), the U.S. Commodity Futures Trading Commission (CFTC), and the Board of Governors of the Federal Reserve System. OCC has more than 100 clearing members and provides central counterparty (CCP) clearing and settlement services to 19 exchanges and trading platforms. More information about OCC is available at www.theocc.com.

Benefits

A highly collaborative and supportive environment developed to encourage work-life balance and employee wellness. Some of these components include:

  • A hybrid work environment, up to 2 days per week of remote work
  • Tuition Reimbursement to support your continued education
  • Student Loan Repayment Assistance
  • Technology Stipend allowing you to use the device of your choice to connect to our network while working remotely
  • Generous PTO and Parental leave
  • 401k Employer Match
  • Competitive health benefits including medical, dental and vision

Visit https://www.theocc.com/careers/thriving-together for more information.

Compensation

  • The salary range listed for any given position is exclusive of fringe benefits and potential bonuses. If hired at OCC, your final base salary compensation will be determined by factors such as skills, experience and/or education.
  • In addition, we believe in the importance of pay equity and consider internal equity of our current team members as part of any final offer.
  • We typically do not hire at the maximum of the range in order to allow for future and continued salary growth. We also offer a substantial benefits package as noted on www.theocc.com/careers
  • All employees may be eligible for a discretionary bonus. Discretionary bonuses are based on various factors, including, but not limited to, company and individual performance and are not guaranteed.

Salary Range

$140,800.00 - $232,500.00

Incentive Range

8% to 15%

This position is eligible for an annual discretionary incentive compensation award, for which the target range is listed above (see Incentive Range). The amount of such award, if any, will be based on various factors, including without limitation, both individual and company performance.

Step 1
When you find a position you're interested in, click the 'Apply' button. Please complete the application and attach your resume.  

Step 2
You will receive an email notification to confirm that we've received your application.

Step 3
If you are called in for an interview, a representative from OCC will contact you to set up a date, time, and location. 

For more information about OCC, please click here.

OCC is an Equal Opportunity Employer

Skills Required

  • Understanding of Kanban and/or Agile methodologies
  • Familiarity with SRE principles as defined by Google SRE practices
  • Experience building large-scale, data-centric solutions
  • Experience participating on a DevOps or SRE team, or as product owner for such a team
  • BS degree in Computer Science or similar technical field

What the Team is Saying

Bailey
Daniel
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Chicago, IL
1,200 Employees
Year Founded: 1973

What We Do

As the foundation for secure markets, OCC is a customer-driven organization that delivers world-class Risk Management, Clearing, and Settlement Services for a sophisticated mix of financial products that includes standard options, stock loans, and futures contracts.

Why Work With Us

We're bound together by values and behaviors that shape the way we work and live, from team projects to after-hours events and to making a difference in our communities. OCC colleagues thrive in an atmosphere of intellectual curiosity, creative problem-solving and effective interaction.

Gallery

Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery

OCC Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

A hybrid work environment, up to 2 days per week of remote work

Typical time on-site: 3 days a week
Company Office Image
HQChicago, IL
Company Office Image
Dallas, TX
Company Office Image
Washington, DC
Learn more

Similar Jobs

OCC Logo OCC

Lead Associate Principal, Security Engineering

Big Data • Cloud • Fintech • Information Technology • Financial Services
Hybrid
2 Locations
1200 Employees
145K-237K Annually

OCC Logo OCC

Associate Principal A, Software Engineering: Software Development Test (SDET)

Big Data • Cloud • Fintech • Information Technology • Financial Services
Hybrid
Chicago, IL, USA
1200 Employees
105K-175K Annually

OCC Logo OCC

Lead Associate Principal, Cloud Engineering

Big Data • Cloud • Fintech • Information Technology • Financial Services
Hybrid
Chicago, IL, USA
1200 Employees
143K-228K Annually

OCC Logo OCC

Lead Assoc Principal, Quantitative Risk Management

Big Data • Cloud • Fintech • Information Technology • Financial Services
Hybrid
Chicago, IL, USA
1200 Employees
129K-230K Annually

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account