Staff Site Reliability Engineer, Database

Posted 25 Days Ago
Hiring Remotely in San Mateo, CA, USA
In-Office or Remote
Senior level
Fintech • Information Technology
The Role
As a Site Reliability Engineer at Alpaca, you will ensure system reliability and performance, troubleshoot issues, and collaborate with teams to design scalable features.
Summary Generated by Built In

Who We Are:

Alpaca is a US-headquartered self-clearing broker-dealer and brokerage infrastructure for stocks, ETFs, options, crypto, fixed income, 24/5 trading, and more. Our recent Series D funding round brought our total investment to over $320 million, fueling our ambitious vision.

Amongst our subsidiaries, Alpaca is a licensed financial services company, serving hundreds of financial institutions across 40 countries with our institutional-grade APIs. This includes broker-dealers, investment advisors, wealth managers, hedge funds, and crypto exchanges, totalling over 9 million brokerage accounts.

Our global team is a diverse group of experienced engineers, traders, and brokerage professionals who are working to achieve our mission of opening financial services to everyone on the planet. We're deeply committed to open-source contributions and fostering a vibrant community, continuously enhancing our award-winning, developer-friendly API and the robust infrastructure behind it.

Alpaca is proudly backed by top-tier global investors, including Portage Ventures, Spark Capital, Tribe Capital, Social Leverage, Horizons Ventures, Unbound, SBI Group, Derayah Financial, Elefund, and Y Combinator.


Our Team Members:

We're a dynamic team of 380+ globally distributed members who thrive working from our favorite places around the world, with teammates spanning the USA, Canada, Japan, Hungary, Nigeria, Brazil, the UK, and beyond!
We're searching for passionate individuals eager to contribute to Alpaca's rapid growth. If you align with our core values—Stay Curious, Have Empathy, and Be Accountable—and are ready to make a significant impact, we encourage you to apply.

Your Role:

As a Site Reliability Engineer (SRE) at Alpaca, you will ensure the reliability, scalability, and performance of our systems and services. You will work closely with development, operations and devops teams to build and maintain robust applications, ensuring they run smoothly and efficiently. This role requires a blend of software engineering and operations skills, with a strong ability to troubleshoot technical issues and resolve problems before they impact our users.


Things You Get To Do:
  • Triage difficult technical problems and implement solutions
  • Improve our observability stack (monitoring, logging, profiling)
  • Incident Management: Respond to and resolve incidents in a timely manner, conducting post-incident reviews to identify and implement improvements.
  • Collaboration: Work closely with development teams to ensure new features and services are designed with reliability and scalability in mind.
  • Capacity Planning: Monitor system capacity and performance, making recommendations and implementing changes to handle future growth.
Who you are (must-haves):
  • 5+ years of experience in Site Reliability Engineering, Performance Engineering, or similar roles.
  • 5+ years of experience with multi-terabyte scale PostgreSQL clusters.
  • Proven track record of managing and maintaining large-scale, high-availability, and high-performance PostgreSQL database.
  • Experience designing and implementing SLIs, SLOs, and SLAs for internal systems and databases.
  • Experience with troubleshooting PostgreSQL performance problems and slow queries.
  • Extensive experience with efficient schema design and efficient query design.
  • Experience migrating multi-terabyte tables into more efficient schemas.
  • Proficient with Go.
  • Proficient with Prometheus.
  • Proficient with Linux.
  • Knowledgeable in trading/fintech domains.
  • Experience with low-latency systems.
  • Experience with distributed tracing.
  • Experience scaling PostgreSQL clusters rapidly.
  • Experience with pgx, gorm, or sqlc.
How We Take Care of You:
  • Competitive Salary & Stock Options
  • Health Benefits
  • New Hire Home-Office Setup: One-time USD $500
  • Monthly Stipend: USD $150 per month via a Brex Card

Alpaca is proud to be an equal opportunity workplace dedicated to pursuing and hiring a diverse workforce.

Recruitment Privacy Policy

Skills Required

  • 5+ years of experience in Site Reliability Engineering or similar roles
  • 5+ years of experience with multi terabyte scale PostgreSQL clusters
  • Proficient with Go
  • Proficient with Prometheus
  • Proficient with Linux
  • Experience with troubleshooting PostgreSQL performance problems
  • Experience designing and implementing SLIs, SLOs, and SLAs for internal systems and databases
  • Experience migrating multi-terabyte tables into more efficient schemas
  • Knowledgeable in trading/fintech domains
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
San Mateo, CA
132 Employees
Year Founded: 2015

What We Do

Alpaca's mission is to unlock asset management for the people. We are a technology company that modularizes the world’s asset management activities. Alpaca’s products enable anyone to build and connect applications and algorithms to buy and sell stocks with zero commissions. We believe that everyone should have fair access to financial markets, regardless of who we are or where we are from. *Securities are offered through Alpaca Securities LLC (alpaca.markets)*

Similar Jobs

Akamai Technologies Logo Akamai Technologies

Site Reliability Engineer

Cloud • Security • Software • Cybersecurity
In-Office or Remote
2 Locations
10285 Employees
95K-171K Annually

Cox Enterprises Logo Cox Enterprises

Human Resources Business Partner

Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
Remote or Hybrid
United States
50000 Employees
67K-101K Annually

Cox Enterprises Logo Cox Enterprises

Customer Success Manager

Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
Remote or Hybrid
United States
50000 Employees
92K-154K Annually

Jasper Logo Jasper

Designer

Artificial Intelligence • Marketing Tech • Software • Generative AI • Automation
Remote
United States
220 Employees
160K-200K Annually

Similar Companies Hiring

Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account