Cloud Operations Engineer

Reposted 17 Days Ago
Hiring Remotely in Costa Rica
Remote
Mid level
Information Technology • Software • Travel • Hospitality
Cloudbeds is building the first SaaS platform capable of powering every lodging business on the planet.
The Role
As a NOC Engineer, you'll monitor critical systems, validate alerts, manage incidents, and collaborate with engineering teams to ensure operational stability in a cloud environment.
Summary Generated by Built In

What Makes Us Unique 

At Cloudbeds, we're not just building software, we’re transforming hospitality. Our intelligently designed platform powers properties across 150 countries, processing billions in bookings annually. From independent properties to hotel groups, we help hoteliers transform operations and uplevel their commercial strategy through a unified platform that integrates with hundreds of partners. And we do it with a completely remote team. Imagine working alongside global innovators to build AI-powered solutions that solve hoteliers' biggest challenges. Since our founding in 2012, we've become the World's Best Hotel PMS Solutions Provider and landed on Deloitte's Technology Fast 500 again in 2024 – but we're just getting started. 


Job Title: Cloud Operations Engineer

How You'll Make an Impact:

As a Cloud Operations Engineer, you’ll be the frontline support for our global infrastructure, playing a key role in ensuring 24/7 operational stability across our AWS-based environment. Your core responsibilities will include monitoring critical systems through platforms such as Datadog, PagerDuty, and CloudWatch, rapidly validating alerts, and escalating verified incidents based on clearly defined protocols.

You’ll execute operational tasks, follow documented procedures for common issues, and manage standard maintenance activities. You'll also have opportunities to collaborate directly with senior engineers across SRE, DevOps, and Infrastructure teams, contributing to the resolution of a wide range of technical challenges and gaining exposure to complex, real-world systems.

Acting as the central communication point during incidents, you’ll maintain clear, timely updates to stakeholders and facilitate smooth transitions between engineering and support teams.

Our Network Operations Team:

You’ll be joining a brand-new team at the ground level, helping shape the future of SaaS operations for a company undergoing exciting growth. Working closely with SRE, DevOps, Security, and various Workload teams, you’ll be at the heart of collaborative problem-solving and operational innovation. It’s a rare chance to build, influence, and grow in a highly visible and impactful role.

This role offers a rare opportunity to gain deep, hands-on experience in cloud operations and incident management while working alongside high-performing engineering teams. You'll build the foundation for growth into specialized areas like SRE, DevOps, or Infrastructure Engineering, with direct exposure to real-world systems at scale.

What You Bring to the Team:

  • Support Kubernetes (EKS) environments by performing operational checks, validating pod health, reviewing logs, and assisting with incident triage during deployments and scaling events
  • Assist with CI/CD pipeline operations by supporting deployments, rollbacks, and release verification in collaboration with DevOps and platform engineering teams using ArgoCD and GitHub
  • Execute Infrastructure as Code changes and standard operating procedures using Terraform across cloud infrastructure and application services
  • Monitor, triage, and validate incidents using observability and alerting tools such as PagerDuty, Datadog, Amazon CloudWatch, Prometheus, and Grafana, escalating to SRE, DevOps, or application teams as appropriate
  • Execute documented runbooks and SOPs to resolve common operational issues, including basic AWS troubleshooting, infrastructure access requests (SSO, VPN, IAM), and deployment support
  • Perform routine operational tasks such as configuration changes, maintenance activities, and standard change requests across cloud infrastructure and application services
  • Contribute to operational excellence by maintaining and improving runbooks, updating documentation, and participating in post-incident reviews (RCA) to drive reliability improvements

What Sets You Up for Success:

  • 3-4 years of hands-on experience in DevOps, Site Reliability Engineering (SRE), or related operational roles with a focus on cloud infrastructure
  • Practical experience with Amazon EKS (Elastic Kubernetes Service) or other managed Kubernetes platforms, including container orchestration and operational management
  • Hands-on experience with CI/CD and GitOps deployment tools, particularly ArgoCD, Flux, or similar automation platforms
  • Experience using Infrastructure as Code tools, specifically Terraform, for managing and automating cloud infrastructure
  • Foundational understanding of the AWS service ecosystem including core infrastructure services (EC2, S3, RDS, IAM, VPC)
  • Strong written and verbal communication skills in English, with the ability to provide clear, timely updates during high-pressure incidents
  • Detail-oriented, with strong documentation skills and the ability to collaborate effectively across multiple teams in a fully remote, global environment

Bonus Skills to Stand Out (Optional):

  • Experience with monitoring and observability tools such as Datadog, Prometheus, Grafana, or Amazon CloudWatch
  • Prior experience working in a 24/7 operations environment with hands-on use of PagerDuty or similar on-call and alerting systems
  • Ability to write (not just read) Bash or Python scripts for automation tasks



What to Expect - Your Journey with Us 

Behind Cloudbeds' revolutionary technology is a team redefining what's possible in hospitality. We're 650+ employees across 40+ countries, bringing together elite engineers, AI architects, world-class designers, and hospitality veterans to solve challenges others haven't dared to tackle. Our diverse team speaks 30+ languages, but we all share one language: a passion for innovation and travel. From pioneering breakthroughs in machine learning to revolutionizing how hotels operate, we're not just watching the future of hospitality unfold – we're coding it, designing it, writing it, and shipping it. If you're ready to work alongside some of the brightest minds in tech who are obsessed with using AI to transform a trillion-dollar industry, this is your chance to be part of something extraordinary.

Learn more online at cloudbeds.com

Company Awards to Check Out! 
  • Best All-In-One Hotel Management System | HotelTechAwards (2025)
  • Overall 10 Best Places to Work | HotelTechAwards (2025)
  • Most Loved Workplace® Certified (2024) 
  • Top 10 People’s Choice (2024)
  • Deloitte Technology Fast 500 (2024)
Discover our Benefits:
  • Remote First, Remote Always 
  • PTO in accordance with local labor requirements
  • Free use of 2 corporate apartments for team members (San Diego & São Paulo)
  • Monthly Wellness Fridays - enjoy an extra long weekend every month
  • Full Paid Parental Leave
  • Home office stipend based on country of residency
  • Professional development courses in Cloudbeds University
  • Access to professional development, including manager training, upskilling and knowledge transfer.
Everyone is Welcome - A Culture of Inclusion

Cloudbeds is proud to be an Equal Opportunity Employer that celebrates the diversity in our global team! We do not discriminate based upon race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.

Cloudbeds is committed to the full inclusion of all qualified individuals. As part of this commitment, Cloudbeds will ensure that persons with disabilities are provided reasonable accommodations in the hiring process. We encourage deaf, hard of hearing, deaf-blind, and deaf-disabled individuals to apply. If reasonable accommodation is needed to participate in the job application or interview process or to perform essential job functions, please contact our HR team by phone at 858-201-7832 or via email at [email protected]. Cloudbeds will provide an American Sign Language (ASL) interpreter where needed as a reasonable accommodation for the hiring processes.

To all Staffing and Recruiting Agencies: Our Careers Site is only for individuals seeking a job at Cloudbeds. Staffing, recruiting agencies, and individuals being represented by an agency are not authorized to use this site or to submit applications, and any such submissions will be considered unsolicited. Cloudbeds does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Cloudbeds employees, or any other company location. Cloudbeds is not responsible for any fees related to unsolicited resumes/applications.

Top Skills

AWS
Bash
Cloudwatch
Datadog
Docker
Grafana
JIRA
Kubernetes
Memcached
MySQL
Nginx
Pagerduty
Postgres
Prometheus
Python
Redis
Terraform

The Company
HQ: San Diego, CA
780 Employees
Year Founded: 2012

What We Do

Cloudbeds provides a transformative technology platform upon which any lodging provider, from luxury hotels to campgrounds, can build and run their business. We help lodging businesses reach travelers from every corner of the globe and empower them to spend more time with their guests, and less time worrying about technology. We've brought together the smartest minds from around the world to innovate new technology that challenges the status quo and makes the world a more connected place.

Why Work With Us

Cloudbeds has embodied a remote-first culture since the beginning (since before remote was cool!): our 100% remote workforce (and 75% of management!) has the option to come into one of our 2 offices (San Diego HQ, São Paulo), but doing so has never been required. Working here means working anywhere - with talented peers around the world.
