Cloud Ops Engineer

2 Locations
In-Office
Mid level
Artificial Intelligence • Fintech • Hardware • Information Technology • Sales • Software • Transportation
Unlock the potential of the physical economy.
The Role
Own incident management, run on-call and post-incident processes, manage central on-call integrations, analyze MTTR and reliability metrics, improve change management and automation, collaborate to standardize operational practices, and leverage AI for incident analysis and automation.
Summary Generated by Built In

Who we are:

Motive empowers the people who run physical operations with tools to make their work safer, more productive, and more profitable. For the first time ever, safety, operations, and finance teams can manage their drivers, vehicles, equipment, and fleet-related spend in a single system. Combined with industry-leading AI, the Motive platform gives you complete visibility and control, and significantly reduces manual workloads by automating and simplifying tasks.

Motive serves nearly 100,000 customers – from Fortune 500 enterprises to small businesses – across a wide range of industries, including transportation and logistics, construction, energy, field service, manufacturing, agriculture, food and beverage, retail, and the public sector.

Visit gomotive.com to learn more.

About the Role:

As a Cloud Ops Engineer in the Platform Engineering organization, you will be a core member of the team responsible for the operational health, reliability, and observability of the entire Motive platform. Your mission is to ensure that our globally distributed, highly available systems recover from issues quickly and to help prevent future ones. You will lead incident response, build world-class monitoring systems, and drive automation across all operational workflows. This is a critical role that improves the daily lives of engineers and ensures a consistent, high-quality experience for all our customers across our complex tech stacks, from our core SaaS product to our mobile and AI-powered embedded systems. If you are passionate about high-leverage work, automation, continuously improving critical systems and processes, and being involved across all areas of engineering, and you can be a calm, steady presence in the middle of incidents, this is the perfect role for you.

What You'll Do:
  • Own and refine the incident management lifecycle: act as incident commander, run communication and triage, and lead post-incident analysis and follow-ups to drive continuous service improvement.
  • Manage the central on-call solution and its integrations with monitoring and other platforms, used by over 100 teams, leveraging automation and self-serve tools such as Terraform.
  • Analyze operational statistics (MTTR, incident frequency, service-level data) to identify trends and prioritize reliability initiatives and teams’ focus.
  • Improve change management processes and automation to reduce both risk and friction.
  • Collaborate with engineering teams across the organization to standardize operational practices and develop automated workflows.
  • Leverage AI for incident analysis, alert and issue resolution, and automation.
What We're Looking For:

We are looking for an individual with prior experience in cloud operations, site reliability engineering, or a similar field, who has a passion for improving system reliability and operational processes in a large-scale, distributed environment.

  • Experience managing and participating in a 24/7 on-call rotation and incident response process.
  • Experience with on-call systems such as Rootly, PagerDuty, or Opsgenie.
  • Experience with monitoring and observability tools (e.g., Datadog, New Relic, Grafana).
  • Ability to communicate clearly and to manage incidents, action items, and communications with stakeholders ranging from engineers to directors, including public-facing messaging.
  • Experience with IT Service Management tools (Jira/JSM) for ticket and change management.
  • 3+ years of experience in an incident response role.

Bonus Skills to have:
  • Experience with Infrastructure as Code (IaC) tools such as Terraform.
  • Scripting and automation skills in at least one modern language (Python, Go, Bash); AI coding assistance is welcome.
  • Prior experience in an Ops or SRE team supporting a diverse cloud product.

Creating a diverse and inclusive workplace is one of Motive's core values. We are an equal opportunity employer and welcome people of different backgrounds, experiences, abilities and perspectives. 

Please review our Candidate Privacy Notice here.

UK Candidate Privacy Notice here.

The applicant must be authorized to receive and access those commodities and technologies controlled under U.S. Export Administration Regulations. It is Motive's policy to require that employees be authorized to receive access to Motive products and technology. 

Top Skills

Rootly, PagerDuty, Opsgenie, Datadog, New Relic, Grafana, Jira, Jira Service Management (JSM)

The Company
HQ: San Francisco, CA
4,000 Employees
Year Founded: 2013

What We Do

Motive builds technology to improve the safety, productivity, and profitability of businesses that power the physical economy. The Motive Automated Operations Platform combines IoT hardware with AI-powered applications to automate vehicle and equipment tracking, driver safety, compliance, maintenance, spend management, and more. Motive serves more than 120,000 businesses, across a wide range of industries including trucking and logistics, construction, oil and gas, food and beverage, field service, agriculture, passenger transit, and delivery. Visit gomotive.com to learn more.

Why Work With Us

We work hard, with humility, and we see our efforts rewarded in tangible ways every day. At Motive, you’ll have the chance to make a difference for the drivers who keep our world moving. We trust each other to do great work because together we can achieve collective goals that reach beyond the physical economy.

Motive Offices

Remote Workspace

Employees work remotely.

Typical time on-site: None
HQ: San Francisco, CA
MX
GB
Austin, TX
Bengaluru, IN
Buffalo, NY
Nashville, TN
Taipei City, Taiwan
Vancouver, BC
