Principal DevOps Engineer, Infrastructure Performance

Posted 7 Days Ago
Easy Apply
Hiring Remotely in United States
Remote or Hybrid
Senior level
Automotive • Fintech • Hardware • Payments • Travel • Financial Services
The Role
Design and build a cloud-based observability platform, troubleshoot performance issues, improve monitoring tools, and scale infrastructure. Lead operational improvements in a collaborative environment.
Summary Generated by Built In

Upgrade helps customers move in the right direction with affordable and responsible financial products. Since 2017, we’ve helped over 7 million customers access over $40 billion in consumer credit. With a relentless focus on improving our customers' financial well-being, we build products that put more money in their pocket and support their journey toward a better financial future. We’re backed by some of the most prominent technology investors and were most recently valued at $6.3B.

We’re consistently recognized for our collaborative and inclusive culture. Most recently, we were named one of the World’s Top Fintech Companies by CNBC, Best Places to Work by Built In, Best Places to Work by the San Francisco Business Times, America’s Greatest Workplaces by Newsweek, Best Startup Employer by Forbes, and Healthiest Employers by Phoenix Business Journal. 

We’re looking for new team members who get excited about designing and delivering new and better products. Come join us and help build a better financial future for millions of people.


What You'll Do:
  • Build a resilient, secure, and efficient cloud based observability platform.
  • Monitor and troubleshoot platform issues, including finding solutions to reduce known issues.
  • Build and scale the observability infrastructure to meet rapidly increasing demand.
  • Develop and improve operational practices and procedures.
  • Sample projects:
    • Improve database monitoring: develop custom prometheus exporters in Go for use cases that go beyond what is possible with SQL exporter. Create Grafana dashboards and alerts for these new metrics.
    • MCP servers for observability: deploy MCP server to integrate our observability stack with our LLM tools.
What We Look For:
  • 8+ years of relevant production-level experience.
  • Experience with VictoriaMetrics.
  • Experience with Sumologic.
  • Experience with tracing tools (e.g. OpenTelemetry, Honeycomb, Tempo).
  • Experience with profiling tools (e.g. Pyroscope)
  • Knowledge of cloud monitoring, logging and cost management tools.
  • Programming/scripting knowledge (Go, Java, or Python) and understanding of JVM concepts.
  • In-depth knowledge of AWS services, hands-on experience in AWS provisioning using terraform.
  • Experience with containerized applications and Kubernetes / EKS. Creating and updating / maintaining Helm charts.
  • Understanding of microservices architecture and debugging/investigation techniques.
  • Strong understanding of systems, networking and troubleshooting techniques.
  • Experience in automated build pipeline, continuous integration and continuous deployment.
  • Ability to operate in an agile, entrepreneurial start-up environment.
  • Experience with running Linux in production.
Our Tech Stack:
  • Monitoring: VictoriaMetrics, Grafana, Prometheus, OpenTelemetry, Honeycomb, Sumologic.
  • Infrastructure as code: Terraform.
  • CD: GitOps, ArgoCD, ArgoRollouts.
  • CI: Tekton.
  • Scripting: Bash.
  • Programming: Golang (preferred).
  • AWS: EKS, Cloudwatch, S3, DynamodDB, RDS, SNS, SQS, Lambda.

What We Offer You: 

  • Competitive salary and stock option plan.
  • 100% paid coverage of medical, dental and vision insurance.
  • Flexible PTO.
  • Learning stipend for personal growth and development. 
  • Paid parental leave.
  • Health & wellness initiatives. 

#LI-Remote  #BI-Remote

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Upgrade does not accept unsolicited resumes from staffing agencies, search firms, or any third parties. Any resume submitted to any employee of Upgrade without a prior written agreement in place will be considered the property of Upgrade, and Upgrade will not be obligated to pay any referral or placement fee. Agencies must obtain advance written approval from Upgrade's Talent Acquisition department to submit resumes and only in conjunction with a valid, fully executed agreement. English is required for all positions, as it involves interacting with staff at Upgrade's offices worldwide.

Top Skills

Argocd
Argorollouts
AWS
Bash
Eks
Gitops
Go
Grafana
Honeycomb
Java
Kubernetes
Opentelemetry
Prometheus
Pyroscope
Python
Sumologic
Tekton
Tempo
Terraform
Victoriametrics

What the Team is Saying

Vicky Choy
Seti Momayez
Nelson Lobo
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
1,950 Employees
Year Founded: 2017

What We Do

Upgrade helps customers move in the right direction with affordable and responsible financial products. Since 2017, we’ve helped over 7 million customers access over $40 billion in consumer credit. With a relentless focus on improving our customers' financial well-being, we build products that put more money in their pocket and support their journey toward a better financial future.

We’re consistently recognized for our innovative technology, rapid growth, and inclusive culture. Most recently, we were named one of the World’s Best Fintech Companies by CNBC, Best Places to Work by Built In, Best Places to Work by the San Francisco Business Times, Best Places to Work in Fintech by American Banker, Best Employer by Forbes, and America’s Greatest Workplaces by Newsweek. Our technology and products have also earned us spots on the World's Top 250 Fintech Companies by CNBC, Deloitte Technology Fast 500, and Fintech Breakthrough Awards.

We’re looking for new team members who get excited about designing and delivering new and better products. Come join us and help build a better financial future for millions of people.

Gallery

Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery

Upgrade, Inc. Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Typical time on-site: 2 days a week
Company Office Image
HQSan Francisco, CA
Atlanta, GA
Company Office Image
Irvine, CA
Company Office Image
Montréal, QC
Company Office Image
Phoenix, AZ
Learn more

Similar Jobs

Upgrade, Inc. Logo Upgrade, Inc.

Director, Financial Institution Partnerships - Northeast

Automotive • Fintech • Hardware • Payments • Travel • Financial Services
Easy Apply
Remote or Hybrid
United States

Upgrade, Inc. Logo Upgrade, Inc.

Quality Assurance Automation Engineer

Automotive • Fintech • Hardware • Payments • Travel • Financial Services
Easy Apply
Remote or Hybrid
United States

Upgrade, Inc. Logo Upgrade, Inc.

Senior Model Risk Analyst

Automotive • Fintech • Hardware • Payments • Travel • Financial Services
Easy Apply
Remote or Hybrid
United States

Upgrade, Inc. Logo Upgrade, Inc.

Senior Manager, Credit Risk

Automotive • Fintech • Hardware • Payments • Travel • Financial Services
Easy Apply
Remote or Hybrid
United States

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account