Senior Cloud Infrastructure Engineer

Reposted 13 Days Ago
Headquarters, AZ
In-Office
180K-250K Annually
Senior level
Artificial Intelligence • Information Technology • Software
The Role
The role involves designing, deploying, and maintaining cloud infrastructure, automating processes, ensuring systems reliability, and mentoring junior engineers.
Summary Generated by Built In
About LanceDB

LanceDB is a developer-friendly, open-source data lake for multimodal AI. From hyper-scalable vector search to advanced retrieval for RAG, from streaming training data to interactive exploration of large-scale AI datasets, LanceDB is the best foundation for your AI application, and powers some of the most groundbreaking applications and challenging requirements today.

About the role

We’re seeking a seasoned Cloud Infrastructure Engineer with deep expertise in automation, infrastructure-as-code (IaC), and cloud platform management. You’ll design, deploy, and maintain robust cloud environments while collaborating with cross-functional teams to streamline CI/CD pipelines, enhance system reliability, and drive operational excellence.

As a Cloud Infrastructure Engineer at LanceDB, your responsibilities will include:

  • Design & Build Cloud Infrastructure: Architect and manage secure, scalable cloud environments (AWS, Azure, GCP) using IaC tools like Terraform and CloudFormation.

  • Automate Everything: Develop and maintain automation scripts to streamline deployments, monitoring, and system operations.

  • Systems Reliability: Implement monitoring/alerting solutions (Prometheus, Grafana, Datadog) to proactively address performance bottlenecks and ensure 99.9% uptime.

  • Security & Compliance: Enforce security policies, manage secrets (Vault, AWS KMS), and ensure compliance with industry standards (GDPR, SOC2).

  • Troubleshoot & Optimize: Resolve complex infrastructure issues and lead cost-optimization initiatives for cloud resources.

  • Collaborate & Mentor: Partner with software engineering teams to integrate DevOps practices into SDLC and mentor junior engineers on IaC and cloud best practices.

Requirements
  • 10+ years in DevOps, Cloud Infrastructure, or SRE roles, with hands-on experience in public cloud platforms (AWS, Azure, GCP, Heroku).

  • Strong experience operating and supporting production distributed systems and/or databases-as-a-service in a public cloud service provider, where it was the primary product for the company. This excludes being a user of an cloud service provider's database such as RDS or BigQuery. Bonus points for experience crafting multitenant solutions. This is a hard requirement; applicants without this experience will not qualify for this role.

  • Experience designing and managing complex production environments using Kubernetes and Helm. This is a hard requirement; applicants without this experience will not qualify for this role.

  • Expertise in IaC tools (Puppet, Terraform, Ansible, CloudFormation) and configuration management.

  • Deep understanding of networking, security, and cloud architecture best practices.

  • Experience with monitoring tools (Prometheus, Grafana) and logging systems (ELK, Splunk).

  • Strong knowledge of CI/CD tools (GitHub Actions) and containerization (Docker, Kubernetes).

  • You like working with a small, high-caliber team with a lot of autonomy and drive, and you can iterate fast

Nice to have
  • You’ve made substantial contributions to open-source projects (e.g., Puppet modules, Terraform providers).

  • You design and automate single-command deployments for complex, globally distributed systems to ensure consistency, reliability, and scalability across multi-cloud or hybrid environments.

  • You fearlessly challenge the status quo and dismiss mediocre engineering as unacceptable.

  • You have worked on distributed large-scale systems, with a good understanding of how to using tracing tools to identify bottlenecks.

    • Experience building large-scale semantic search and/or caching systems is especially relevant.

Why Join Us

You’ll join a world-class team of open-source builders (co-authors of pandas, and contributors to HDFS, Arrow, Iceberg, and HBase) working on cutting-edge AI infrastructure. You’ll collaborate on systems that power next-generation AI workloads while shaping how LanceDB operates and scales production environments.

Top Skills

Ansible
AWS
Aws Kms
Azure
CloudFormation
Datadog
Docker
Elk
GCP
Github Actions
Grafana
Kubernetes
Prometheus
Puppet
Splunk
Terraform
Vault
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, California
29 Employees
Year Founded: 2022

What We Do

LanceDB is a developer-friendly, open source database for multimodal AI. From hyper scalable vector search to advanced retrieval for RAG, from streaming training data to interactive exploration of large scale AI datasets, LanceDB is the best foundation for your AI application.

Similar Jobs

Oscar Health Logo Oscar Health

Senior Software Engineer

Healthtech • Insurance
In-Office
Tempe, AZ, USA
2200 Employees
181K-237K Annually

PayPal Logo PayPal

Staff Software Engineer

Fintech • Payments
In-Office
3 Locations
34450 Employees
170K-292K Annually

Intel Corp Logo Intel Corp

Infrastructure Engineer

Artificial Intelligence • Cloud • Information Technology • Software • Semiconductor
In-Office
Phoenix, AZ, USA
141941 Employees
140K-197K Annually

Similar Companies Hiring

Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account