DevOps Engineer

Posted Yesterday
3 Locations
In-Office or Remote
Mid level
Artificial Intelligence • Information Technology • Software
The Role
Design and maintain scalable cloud infrastructure, develop automation tools, implement CI/CD pipelines, and ensure system reliability in a AI startup setting.
Summary Generated by Built In
Why this role exists

We’re looking for an Operations Engineer to help us design, build, and maintain the infrastructure powering our 10,000+ GPU cloud platform. You’ll be responsible for keeping our systems highly available, secure, and performant while working closely with backend, frontend, and infrastructure teams to enable rapid development and deployment.

As part of the operations team, you’ll lead efforts in automation, monitoring, scaling, and reliability engineering to support our fast-growing user base and platform demands.

This is an ideal opportunity for someone excited to take ownership, drive large scale deployment, move fast, and shape the foundation of a high-impact AI startup.

What you’ll do
  • Design, build, and maintain scalable infrastructure across multi-cloud and on-prem GPU environments.

  • Develop automation scripts and tools for provisioning, monitoring, and managing systems.

  • Implement robust CI/CD pipelines to support rapid development and deployment cycles.

  • Monitor and improve system performance, reliability, and security.

  • Troubleshoot infrastructure issues and respond to incidents with a focus on root cause analysis.

  • Collaborate with engineering teams to ensure seamless integration of backend systems and APIs.

  • Leverage AI-assisted coding and DevOps tools (e.g., GitHub Copilot, ChatGPT, Cursor IDE) to accelerate operations workflows and increase reliability.

You’ll thrive here if you
  • 3+ years of experience as a DevOps, SRE, or Operations Engineer in production environments.

  • Proficiency with cloud platforms (AWS, GCP, Azure) and infrastructure-as-code tools (Terraform, Ansible, Pulumi, etc.).

  • Strong experience with containerization and orchestration (Docker, Kubernetes).

  • Knowledge of networking, security, and distributed systems.

  • Experience building and maintaining CI/CD pipelines (Jenkins, GitHub Actions, GitLab CI, etc.).

  • Familiarity with monitoring and observability tools (Prometheus, Grafana, ELK stack, etc.).

  • Experience using AI-assisted coding tools (e.g., Copilot, ChatGPT) and openness to integrating them into daily workflows.

  • Startup mindset: self-motivated, comfortable with ambiguity, and excited to wear multiple hats.

Compensation
  • Competitive salary — commensurate with your experience and aligned with industry standards

  • Meaningful equity — be part of the upside as we build a category-defining company. Your grant will align with your role and the experience you bring.

Top Skills

Ai-Assisted Coding Tools
Ansible
AWS
Azure
Docker
Elk Stack
GCP
Github Actions
Gitlab Ci
Grafana
Jenkins
Kubernetes
Prometheus
Pulumi
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Taipei City, Taipei
3 Employees
Year Founded: 2024

What We Do

Zettabyte is a global innovator in AI data centre infrastructure and full-stack GPU software solutions, delivering high-performance, energy-efficient AI computing to sovereigns and enterprises. Zettabyte’s state-of-the-art Zware software suite integrates the entire AI computing infra stack, utilizing custom hardware, advanced liquid cooling, and sovereign AI infrastructure that prioritizes cybersecurity and energy efficiency, satisfying the demands of modern AI workloads. Zettabyte has Foxconn, Wistron and Pegatron as its investors and these GPU server makers build over 75% of the world’s AI GPU servers powering AI LLMs, agents and applications.

Similar Jobs

stakefish Logo stakefish

Devops Engineer

Blockchain • Information Technology
In-Office or Remote
5 Locations

stakefish Logo stakefish

Devops Engineer

Blockchain • Information Technology
In-Office or Remote
5 Locations

CrowdStrike Logo CrowdStrike

Distribution Alliances Manager - North Asia (Remote, TWN)

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
Taiwan
15-15

CrowdStrike Logo CrowdStrike

Regional Sales Manager

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
Taiwan

Similar Companies Hiring

Credal.ai Thumbnail
Software • Security • Productivity • Machine Learning • Artificial Intelligence
Brooklyn, NY
Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees
PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account