Top Remote Infrastructure Engineer Jobs

Reposted 3 Days AgoSaved
Remote
USA
160K-210K Annually
Junior
160K-210K Annually
Junior
Automotive • Machine Learning • Robotics • Software • Transportation
Design, operate, and scale ML data and training pipelines for autonomous driving. Build distributed training, orchestration, CI/CD and infrastructure-as-code; maintain data/metadata stores and tooling to support large-scale model training and evaluation across cloud and cluster environments.
Top Skills: C++LinuxPythonPyTorch
Reposted 3 Days AgoSaved
In-Office or Remote
Opportunity, WA, USA
212K-286K Annually
Senior level
212K-286K Annually
Senior level
Software
The Staff Software Engineer will lead technical strategy and execution in cloud infrastructure, design scalable systems, and enhance reliability across all engineering teams while mentoring other engineers.
Top Skills: AWSAzureGCPGoJava
Reposted 3 Days AgoSaved
Remote
United States
200K-275K Annually
Senior level
200K-275K Annually
Senior level
Artificial Intelligence • Hardware • Machine Learning • Natural Language Processing • Software • Generative AI
Lead development and optimization of the compiler stack for ML systems, collaborate across teams, integrate and deploy products, map ML operations to hardware, and drive compiler infrastructure innovation and performance debugging.
Top Skills: MlirPyTorchSambanova SuiteSn40LTensorFlow
4 Days AgoSaved
Remote
USA
164K-220K Annually
Senior level
164K-220K Annually
Senior level
Robotics • Software
Own reliability across vehicle and cloud stacks for AUV operations: onboard Jetson/ROS2 compute, topside systems, cloud ingestion/processing and customer platform. Build automation, observability, runbooks, and self-recovery to reduce on-call toil; manage AWS infrastructure, IaC, container orchestration, and reliability targets. Participate in shared 12-hour on-call shifts and field deployments, mentor team on operational excellence.
Top Skills: AWSBashContainerizationDockerGoGrafanaIamJetsonKubernetesLinuxPrometheusPythonRosRos 2Terraform
Reposted 4 Days AgoSaved
In-Office or Remote
8 Locations
Senior level
Senior level
Artificial Intelligence • Cloud • Information Technology • Software
Design and operate large-scale GPU infrastructure for distributed AI training, ensuring reliability, performance, and efficient customer partnerships.
Top Skills: AnsibleCudaDeepspeedFsdpGpuHelmInfinibandKubernetesLinuxMegatronNcclNvidia A100Nvidia B200Nvidia H100NvlinkPyTorchRoceTerraform
Reposted 4 Days AgoSaved
In-Office or Remote
8 Locations
Senior level
Senior level
Artificial Intelligence • Cloud • Information Technology • Software
The Site Reliability Engineer will provision and manage Kubernetes clusters, build automation tools, debug customer issues, and improve infrastructure reliability.
Top Skills: AnsibleBashDatadogGoGrafanaHelmKubernetesLokiPrometheusPythonTerraform
Reposted 4 Days AgoSaved
In-Office or Remote
3 Locations
Senior level
Senior level
Artificial Intelligence • Cloud • Information Technology • Software
As a Software Engineer in AI Infrastructure, you will design and develop core platform components, build APIs and services, enhance performance, and automate tooling while collaborating across teams and improving system reliability.
Top Skills: AnsibleGoHelmKubernetesPythonTerraform
5 Days AgoSaved
Remote
USA
Senior level
Senior level
Healthtech • Insurance • Financial Services
Lead platform reliability: define SLOs/error budgets, own observability and deploy pipelines, harden integrations with dental systems, operate LLM-driven workflows safely, build incident practices, and raise engineering reliability across the company.
Top Skills: AnthropicAWSCi/CdCrewaiDatadogDockerEcsGoGoogle Vertex AiKubernetesLangchainLlamaindexMastraNode.jsOpenaiPostgresPythonReactTerraformTypescript
5 Days AgoSaved
Remote or Hybrid
3 Locations
266K-395K Annually
Senior level
266K-395K Annually
Senior level
Artificial Intelligence • Cloud • Machine Learning • Infrastructure as a Service (IaaS)
Design, develop, and maintain high-performance distributed storage software and protocol APIs (file, block, object). Build scalable, resilient storage services, integrate with hardware (NVMe, GPU-direct), troubleshoot production data center issues, and participate in full SDLC for on-prem storage solutions.
Top Skills: CC++Ci/CdDockerDpuFibre ChannelGoGpuGpu-Direct StorageInfinibandIscsiKubernetesLinux Kernel InternalsLustreNfsNvmePythonRoceS3SmbSwift
5 Days AgoSaved
Remote or Hybrid
3 Locations
314K-465K Annually
Expert/Leader
314K-465K Annually
Expert/Leader
Artificial Intelligence • Cloud • Machine Learning • Infrastructure as a Service (IaaS)
Lead design and implementation of high-performance distributed storage systems across object, block, and file paradigms. Drive architecture, mentor engineers, integrate storage with networking/compute/DPUs, optimize protocol performance, troubleshoot production data center issues, build benchmarking and observability tooling, and collaborate on cross-functional AI infrastructure deployments.
Top Skills: BlktraceBpftraceCC++CephDaosDpdkDpus (Nvidia Bluefield)EbpfFibre ChannelFioGoGpu-Direct StorageGrafanaInfiniband)IscsiKubernetesLustreMinioNfsNvmeNvme-OfPerfPrometheusRdma (RoceRustS3SmbSpdk
Reposted 5 Days AgoSaved
In-Office or Remote
Colorado Springs, CO, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
The Software Engineer will manage data collection for AI model training, enhance the ingestion pipeline on GCP, and collaborate with the AI team to improve dataset quality and efficiency.
Top Skills: BashDockerGCPPythonTerraform
Reposted 5 Days AgoSaved
In-Office or Remote
Norfolk, VA, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
The role involves managing data collection for AI model training, extending cloud infrastructure, and collaborating with scientists to enhance data quality and scale.
Top Skills: BashDockerGCPPythonTerraform
New

Cut your apply time in half.

Use ourAI Assistantto automatically fill your job applications.

Use For Free
Application Tracker Preview
Reposted 5 Days AgoSaved
In-Office or Remote
Seattle, WA, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
The role involves data collection for model training, extending ingestion pipeline infrastructure, and collaborating on dataset roadmap execution.
Top Skills: BashDockerGCPPythonTerraform
Reposted 5 Days AgoSaved
In-Office or Remote
Washington, DC, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
Join the AI team at Speechify to develop data collection processes, enhance cloud infrastructure, and collaborate on data strategies for next-gen models.
Top Skills: DockerGCPPythonTerraform
Reposted 5 Days AgoSaved
In-Office or Remote
Sunnyvale, CA, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
As a Software Engineer focused on Data at Speechify, you'll manage data collection and ingestion for model training, enhance cloud infrastructure, and collaborate closely with AI scientists to optimize data quality and cost-efficiency for next-gen products.
Top Skills: BashDockerGCPPythonTerraform
Reposted 5 Days AgoSaved
In-Office or Remote
San Francisco, CA, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
The Software Engineer will manage data collection for AI model training, operate cloud infrastructure, and collaborate with the AI team to enhance data quality and scale.
Top Skills: BashDockerGCPPythonTerraform
Reposted 5 Days AgoSaved
In-Office or Remote
Philadelphia, PA, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
The role involves data collection for AI model training, managing cloud infrastructure, and collaborating with scientists to enhance data quality and processing.
Top Skills: BashDockerGCPPythonTerraform
Reposted 5 Days AgoSaved
In-Office or Remote
Des Moines, IA, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
The role involves data collection for model training, managing and extending the cloud infrastructure on GCP, and collaborating with scientists to enhance data quality and scalability.
Top Skills: BashDockerGCPPythonTerraform
Reposted 5 Days AgoSaved
In-Office or Remote
Honolulu, HI, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
This role involves data collection for model training, operating cloud infrastructure, and collaborating with AI team members to enhance data quality and efficiency.
Top Skills: BashDockerGCPPythonTerraform
Reposted 5 Days AgoSaved
In-Office or Remote
Tacoma, WA, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
The Software Engineer will manage data collection for AI model training, operate cloud infrastructure, and work closely with scientists to enhance data quality and efficiency.
Top Skills: BashDockerGCPPythonTerraform
Reposted 5 Days AgoSaved
In-Office or Remote
Milwaukee, WI, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
The Software Engineer will enhance data collection for AI model training, manage cloud infrastructure, and collaborate on dataset roadmap at Speechify.
Top Skills: BashDockerGCPPythonTerraform
Reposted 5 Days AgoSaved
In-Office or Remote
Fremont, CA, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
The role involves data collection for model training, cloud infrastructure operation and extension, and collaborative roadmap crafting for datasets to enhance Speechify's products.
Top Skills: BashDockerGCPPythonTerraform
Reposted 5 Days AgoSaved
In-Office or Remote
Buffalo, NY, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
The role involves sourcing audio data, managing cloud infrastructure on GCP, and collaborating with the AI team to enhance data quality and processing for model training.
Top Skills: BashDockerGCPPythonTerraform
Reposted 5 Days AgoSaved
In-Office or Remote
Chicago, IL, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
The Software Engineer will manage data collection for model training, improve cloud infrastructure on GCP, and collaborate with data scientists to enhance dataset quality and scale.
Top Skills: BashDockerGCPPythonTerraform
Reposted 5 Days AgoSaved
In-Office or Remote
Louisville, KY, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
The role involves managing data collection and ingestion for AI model training, operating cloud infrastructure, and collaborating on dataset roadmap development.
Top Skills: BashDockerGCPLinuxPythonTerraform
All Filters
JobType
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account