Site Reliability Engineer II

Posted 9 Days Ago
The Street, Town of Knox, NY, USA
In-Office
94K-172K Annually
Mid level
Financial Services
The Role
The SRE II will develop and scale systems, improve service reliability using AI/ML, collaborate with teams on observability and incident response, and work on GCP migration.
Summary Generated by Built In

This is a hybrid role, with 2 days on site.
The role is located in NYC, with Chicago, IL as an alternative location.

We are looking for local candidates only.

Working days: Tuesday-Saturday
Working hours: 9am-5pm EST

Description
Site Reliability Engineer II (Tuesday - Saturday)

CME Group is seeking an SRE II to help build, operate, and scale systems in our Markets portfolio. Markets SREs work on products and applications related to CME’s Globex trading platform. Our systems deliver an exceptional combination of low-latency performance and rock-solid reliability to seamlessly handle the world’s busiest trading days.
The successful candidate will work alongside senior engineers to learn how we observe, monitor, automate, and improve Production service reliability. As we evolve our operations, we are increasingly emphasizing the integration of Artificial Intelligence (AI) and Machine Learning (ML) to drive smarter, more predictive reliability and reduce operational toil.
Key Responsibilities:

  • Work alongside product teams and senior engineers to assist with building out observability, monitoring, and alerting for key services.
  • Implement AI-driven reliability solutions, including anomaly detection, predictive alerting, and root cause analysis in production environments.
  • Collaborate with engineers and product teams to ensure requirements are understood, planned carefully, and implemented safely.
  • Participate in on-call rotation and assist in incident response under guidance from senior engineers.
  • Write scripts and tools to reduce toil and improve velocity, including building or integrating intelligent auto-remediation and capacity forecasting systems.
  • Leverage LLMs and Generative AI to enhance incident management, automate runbooks, and streamline log analysis.
  • Contribute to disaster recovery (DR) and systems resiliency testing & improvements.
  • Support the migration of Markets applications to Google Cloud Platform (GCP).
  • Collaborate with cross-functional teams to improve system performance and operational efficiency.

What We’re Looking For (Required):

  • A keen interest in SRE, automation, and intelligent operations (AIOps).
  • Experience with Linux-based systems.
  • Programming and scripting skills (Python, Bash, etc.).
  • Strong problem-solving and analytical abilities.
  • Excellent communication and teamwork skills.
  • Eagerness to learn and adapt in a fast-paced trading environment.

Preferred / Desirable Qualifications:

  • AI/ML for Operations: Demonstrated hands-on experience applying AI/ML techniques to improve operational efficiency, reliability, or observability.
  • AIOps Platforms: Experience using platforms such as Dynatrace, New Relic, Moogsoft, BigPanda, or integrating open-source tools (e.g., Prometheus with ML models).
  • Generative AI Tooling: Experience with LLMs for operations, incident management, or log analysis (e.g., using LangChain, LlamaIndex, or tools like PagerDuty AIOps).
  • Cloud Platforms: Experience with cloud-based platforms; experience with Google Cloud Platform (GCP), GCE, and/or GKE is a strong bonus.
  • Traditional Observability: Experience with metrics & monitoring tools like OpenTelemetry, Splunk, Prometheus, and Grafana.
  • Systems Architecture: Experience with Kubernetes and knowledge of working with distributed systems.
  • Core Concepts: Basic knowledge of networking (HTTP/TCP/UDP/IP) and message-oriented middleware.
  • Industry & Process: Experience in financial markets and working in an Agile environment.

Why CME Group:

  • Be part of a global leader in financial services technology.
  • Work on cutting-edge technology and intelligent operations in a collaborative, innovative culture.
  • Competitive compensation and benefits package.
  • Opportunity to grow and advance your career in SRE with an organization that is transforming its approach to system reliability.

Join CME Group and play a crucial role in ensuring the stability and performance of our Markets applications while contributing to our GCP migration and AIOps evolution. Apply now to be a part of our dynamic SRE team!

CME Group is committed to offering a competitive total rewards package for our employees that recognizes their contributions to the business and reflects our long-term investment in their future. The pay ranges for this role, based on location, are: Chicago: $93,900-$156,500; New York/New Jersey: $103,200-$172,000. Actual salary offered will be dependent on a wide array of factors including but not limited to: relevant experience, skills, education and comparison to internal employees (where relevant). Our compensation program also includes an annual target bonus opportunity for all employees, as well as the opportunity to become an owner in the company through our broad-based equity program. Through our benefits program, we strive to offer flexibility, value and choice. From comprehensive health coverage, to a retirement package that includes both a 401(k) and an active pension plan, to highly competitive education reimbursement provisions, paid time off and a mental health benefit, CME Group offers a holistic benefits package for our team and their dependents.

CME Group: Where Futures are Made

CME Group is the world’s leading derivatives marketplace. But who we are goes deeper than that. Here, you can impact markets worldwide. Transform industries. And build a career by shaping tomorrow. We invest in your success and you own it – all while working alongside a team of leading experts who inspire you in ways big and small. Problem solvers, difference makers, trailblazers. Those are our people. And we’re looking for more.

At CME Group, we embrace our employees' unique experiences and skills to ensure that everyone’s perspectives are acknowledged and valued. As an equal-opportunity employer, we consider all potential employees without regard to any protected characteristic.

Important Notice: Recruitment fraud is on the rise, with scammers using misleading promises of job offers and interviews to solicit money and personal information from job seekers. CME Group adheres to established procedures designed to maintain trust, confidence and security throughout our recruitment process.

Skills Required

  • Experience with Linux-based systems
  • Programming and scripting skills (Python, Bash, etc.)
  • Strong problem-solving and analytical abilities
  • Excellent communication and teamwork skills
  • Keen interest in SRE, automation, and intelligent operations

The Company
HQ: Chicago, IL
3,291 Employees

What We Do

As the world's leading derivatives marketplace, CME Group (www.cmegroup.com) is where the world comes to manage risk. CME Group exchanges offer the widest range of global benchmark products across all major asset classes, including futures and options based on interest rates, equity indexes, foreign exchange, energy, agricultural commodities, metals, weather and real estate. CME Group brings buyers and sellers together through its CME Globex® electronic trading platform and its trading facilities in New York and Chicago. CME Group also operates CME Clearing, one of the world’s leading central counterparty clearing providers, which offers clearing and settlement services for exchange-traded contracts, as well as for over-the-counter derivatives transactions through CME ClearPort®. These products and services ensure that businesses everywhere can substantially mitigate counterparty credit risk in both listed and over-the-counter derivatives markets.
