Site Reliability Engineer (SRE)

Reposted 10 Days Ago
Easy Apply
Be an Early Applicant
Hiring Remotely in EST
Remote
Senior level
Fintech • Payments • Financial Services
Experience a better way to move money
The Role
As a Site Reliability Engineer, you will ensure system reliability, lead incident management, optimize infrastructure operations, and drive continuous improvements.
Summary Generated by Built In
About Us

OpenFX is on a mission to move money as freely as data, unrestricted by time zones, banking hours, or legacy systems. We are building the infrastructure that powers the next generation of cross-border payment systems for institutions. Our early team comes with experience from J.P. Morgan, Goldman Sachs, FalconX, PayPal, Affirm, Kraken, and Nium, and we’re backed by Accel, Lightspeed, NfX, and other top-tier investors.

As a Site Reliability Engineer, you will ensure the reliability, availability, and performance of OpenFX’s systems. This is a hands-on, high-impact role at the intersection of DevOps and incident response. You will participate in on-call rotations covering U.S. operating hours, triage production issues in real time, and work with engineering pods to quickly resolve or escalate incidents.

This role is ideal for a reliability-focused engineer with a strong DevOps background, excellent troubleshooting skills, and a bias for ownership when it comes to uptime and incident management.

Responsibilities & Expectations

Incident Response & Reliability

  • Serve as first responder for production incidents during U.S. operating hours (±2h EST).
  • Lead triage during outages, analyzing logs, metrics, and traces to identify root causes.
  • Drive incident postmortems and follow-ups to prevent recurrence.
  • Communicate clearly and quickly during incidents to internal stakeholders.

Infrastructure & Operations

  • Own reliability outcomes across all OpenFX systems, with a focus on uptime, latency, and error budgets.
  • Enhance observability through logging, metrics, alerting, and dashboards.
  • Optimize on-call processes and ensure smooth handoffs across IST, EST, and PST coverage.
  • Partner with DevOps and engineering pods to implement fixes or approve production changes.

Continuous Improvement

  • Proactively identify systemic reliability risks and propose improvements.
  • Contribute automation and tooling to reduce manual incident handling.
  • Champion best practices in reliability engineering and operational excellence.
Must-Have Qualifications
  • 5+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering.
  • Proven experience leading incident response, running postmortems, and communicating during outages.
  • Strong background with cloud infrastructure (AWS preferred), container orchestration (Kubernetes, ECS), and Infrastructure-as-Code (Terraform, CloudFormation).
  • Familiarity with observability stacks (e.g., Prometheus, Grafana, Datadog, ELK, OpenTelemetry).
  • Ability to triage errors at both the infrastructure and application level, and escalate effectively when deeper intervention is required.
  • Ownership mindset with strong communication skills in high-pressure situations.
Nice-to-Have Qualifications
  • Experience in fintech, trading systems, or other low-latency/high-availability environments.
  • Coding/scripting ability in Python, Go, or similar (for automation, not feature development).
  • Familiarity with CI/CD pipelines and release engineering.
  • Experience working in a follow-the-sun on-call rotation across global teams.
What We Offer
  • Competitive salary and benefits package.
  • Equity in a rapidly growing company.
  • Opportunity to work on mission-critical infrastructure in fintech.
  • A collaborative team culture with a bias toward ownership and outcomes.
  • The chance to make a direct impact on the resilience of global financial infrastructure.

We are committed to building a diverse and inclusive workplace. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status

Top Skills

AWS
CloudFormation
Datadog
Ecs
Elk
Go
Grafana
Kubernetes
Opentelemetry
Prometheus
Python
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: New York, New York
15 Employees
Year Founded: 2024

What We Do

OpenFX is redefining the future of global money movement.

We enable money to flow across borders as effortlessly as data, unbound by time zones, legacy systems, or banking hours. Our FX infrastructure transforms how finance teams operate, delivering cross-border transfers with industry-leading spreads, real-time settlement and 24/7 availability.

We are a team of serial entrepreneurs from Affirm, BofA, Charles Schwab, Goldman Sachs, Intuit, JPM, Kraken, Microsoft, Meta, PayPal, Slack.

Similar Jobs

GitLab Logo GitLab

Site Reliability Engineer

Cloud • Security • Software • Cybersecurity • Automation
Easy Apply
Remote
28 Locations
2500 Employees
In-Office or Remote
30 Locations
22 Employees

GitLab Logo GitLab

Site Reliability Engineer

Cloud • Security • Software • Cybersecurity • Automation
Easy Apply
Remote
28 Locations
2500 Employees
In-Office or Remote
30 Locations
45 Employees

Similar Companies Hiring

Camber Thumbnail
Social Impact • Healthtech • Fintech
New York, NY
53 Employees
Rain Thumbnail
Web3 • Payments • Infrastructure as a Service (IaaS) • Fintech • Financial Services • Cryptocurrency • Blockchain
New York, NY
40 Employees
Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account