Infrastructure Engineer/SRE

Hiring Remotely in Canada
Remote
Senior level
Artificial Intelligence • Other • Sales • Software
The Role
Design and advance core infrastructure for engineering, ensure Kubernetes reliability, automate operations, and support AI infrastructure.

Cresta unlocks the true potential of the customer experience, turning every conversation into a competitive advantage. Cresta’s unified AI platform combines conversational AI agents, real-time human agent augmentation, and comprehensive conversation intelligence to drive revenue and efficiency gains across every channel. The world’s leading companies, including United Airlines, Cox Communications, and Marriott, use Cresta to power world-class customer experiences every day. 

Born from the Stanford AI Lab, Cresta has raised more than $270 million from the world’s leading investors, including a16z, Greylock, and Sequoia. Cresta’s leadership includes some of the leading minds in AI today. Our CEO, Ping Wu, founded and led Google's Contact Center AI and Vertex AI platforms before joining Cresta to build the future of AI-driven customer experiences.

Over the next few years, AI is going to redefine how people all over the world interact with businesses every day. Come build that future at Cresta.

About the role:
As a member of the infrastructure team, you are responsible for designing, building, and advancing the core infrastructure that allows the engineering team to execute quickly, productively, and securely. You will join a collaborative but highly autonomous working environment in which each member has a defined role with clear expectations, as well as the freedom to pursue projects they find interesting.
Responsibilities:
  • Developer Toolchain. Partner with engineers to build dev tools that empower developer workflows and deployment infrastructure.
  • Ensure reliability of multi-cloud Kubernetes clusters and pipelines.
  • Build metrics, logging, analytics, and alerting for performance and security across all endpoints and applications.
  • Build infrastructure-as-code deployment tooling and supporting services on multiple cloud providers.
  • Automate operations and engineering. Focus on automation so we can spend energy where it matters.
  • Build machine learning infrastructure that enables AI teams to train, test, and deploy on large-scale datasets.
What we are looking for:
  • 5+ years of experience in DevOps, Site Reliability Engineering, Production Engineering, or an equivalent field.
  • Deep proficiency with coding languages such as Golang or Python.
  • Deep familiarity with container-related security best practices.
  • Production experience working with Kubernetes, and a deep understanding of the Kubernetes ecosystem, including popular open-source tooling such as cert-manager or external-dns. Experience with GPU-enabled clusters is a bonus.
  • Production experience with Kubernetes templating tools such as Helm or Kustomize.
  • Production experience with IaC tools such as Terraform or CloudFormation.
  • Production experience working with AWS and services such as IAM, S3, EC2, and EKS.
  • Production experience with other cloud providers such as Google Cloud and Azure is a bonus.
  • Production experience with database software such as PostgreSQL.
  • Experience with GitOps tooling such as Flux or Argo.
  • Experience with CI/CD such as GitHub Actions.

Perks & Benefits:

  • We offer Cresta employees a variety of medical, dental, and vision plans, designed to fit your and your family’s needs.
  • Paid parental leave to support you and your family
  • Monthly Health & Wellness allowance
  • Work from home office stipend to help you succeed in a remote environment
  • Lunch reimbursement for in-office employees 
  • PTO: 3 weeks in Canada 

Compensation for this position includes a base salary, equity, and a variety of benefits. Actual base salaries will be based on candidate-specific factors, including experience, skillset, and location, as well as local minimum pay requirements where applicable. Your recruiter can provide further details.

This posting will be used to fill a newly-created role.

We have noticed a rise in recruiting impersonations across the industry, where scammers attempt to access candidates' personal and financial information through fake interviews and offers. All Cresta recruiting email communications come only from the @cresta.ai domain. Any outreach claiming to be from Cresta via other sources should be ignored. If you are uncertain whether you have been contacted by an official Cresta employee, reach out to [email protected]

The Company
HQ: San Francisco, CA
112 Employees
Year Founded: 2017

What We Do

Cresta is for sales and customer service teams who need to close the performance gap between their top performers and the rest. Our real-time expertise AI helps contact center agents unlock their full potential by uncovering expert behaviors from every customer conversation and amplifying them with real-time assistance and coaching. By nudging best practices around objection responses, expectation setting, troubleshooting, and more, Cresta frees agents to focus on what really matters: their customer interactions. Cresta brings together industry-leading AI experts, proven leadership, and top-tier investors including Sequoia, Andreessen Horowitz, Greylock Partners, Andy Bechtolsheim, Mark Leslie, and Vivi Nevo.
