Top Remote Infrastructure Engineer Jobs

Reposted 9 Days AgoSaved
Remote
United States
144K-200K Annually
Senior level
144K-200K Annually
Senior level
Software
Lead company-wide transformation of test automation and quality engineering: define vision and roadmap, architect scalable testing frameworks across server/web/mobile/desktop, integrate AI/LLM for test generation and flakiness management, establish observability and quality gates, mentor engineers, and partner with leadership to embed quality across the SDLC.
Top Skills: Ai/Llm TechnologiesApi TestingCi/CdCloud-Based Test EnvironmentsCypressDetoxDistributed SystemsDockerGoKubernetesObservabilityPerformance TestingPlaywrightPostgresReactReact NativeSecurity TestingTypescript
10 Days AgoSaved
In-Office or Remote
2 Locations
180K-230K Annually
Mid level
180K-230K Annually
Mid level
Aerospace • Software
Build and maintain privacy-focused telecommunications infrastructure and deployment tooling. Own full lifecycle development, monitoring/instrumentation, high-availability systems, FedRAMP accreditation, and integration of telecom components. Shape engineering practices and balance short-term needs with long-term roadmap.
Top Skills: 4G5GAWSAzureFedrampGCPGoIp Multimedia Subsystem (Ims)JavaKotlinMobile Core NetworkMultimedia ProtocolsPythonRust
10 Days AgoSaved
Remote
United States
200K-240K Annually
Mid level
200K-240K Annually
Mid level
eCommerce • Machine Learning
Build and scale the platform infrastructure to support mortgage origination: design cloud architecture, deploy and operate Kubernetes-based systems, manage databases, implement observability and incident response, and create internal developer tooling to improve reliability and engineering velocity.
Top Skills: AnsibleAWSC#DockerGCPGoHelmKafkaKubernetesLinuxOpensearchPostgresRedisTerraform
10 Days AgoSaved
In-Office or Remote
26 Locations
108K-173K Annually
Senior level
108K-173K Annually
Senior level
Healthtech
Design and deliver complex physical and cloud data center infrastructures (network, compute, storage, telephony, middleware, database). Lead capacity planning, lifecycle management, standards, and vendor partnerships. Provide expert support during incidents, produce engineering documentation, and mentor junior engineers.
Top Skills: Spring BootTomcatWebsphere Application Server
10 Days AgoSaved
In-Office or Remote
27 Locations
131K-209K Annually
Senior level
131K-209K Annually
Senior level
Healthtech
Lead design and engineering of complex physical and cloud data center network and telephony infrastructure. Drive requirements analysis, capacity planning, standards, vendor collaboration, project leadership, operational readiness, incident resolution, and mentorship to ensure scalable, performant network solutions.
Top Skills: BgpData Center InfrastructureEvpnLayer 3 Leaf-SpineMulti-Vrf Wan RoutingOspfPublic CloudTelephony SystemsVnfVxlanWi-FiZone-Based Firewalling
10 Days AgoSaved
Remote
USA
113K-188K Annually
Senior level
113K-188K Annually
Senior level
Consulting
Design, automate, deploy, and secure AWS and hybrid infrastructure using Terraform and CI/CD tooling. Build reusable IaC modules, maintain DevOps pipelines, implement logging/monitoring, and ensure compliance with zero-trust and frameworks (NIST/CIS/ISO). Collaborate across engineering, security, networking, and data teams to support mission continuity, DR, and documentation for federal environments.
Top Skills: AWSAws CodepipelineBashChatgptCisCursorDockerGitGithub CopilotIso 27000JenkinsKiroNistPythonTerraform
10 Days AgoSaved
In-Office or Remote
Denton, TX, USA
Senior level
Senior level
Edtech
Lead evolution of an internal developer platform into an AI-first, self-service system. Architect, build, and operate secure multi-tenant AWS EKS/Kubernetes infrastructure, developer tooling (CLIs, SDKs, portals), IaC/GitOps workflows, observability, and AI-assisted automation for provisioning, incident triage, and remediation while collaborating with application, SRE, and security teams.
Top Skills: APIsAws EksAws IamBackstageCli ToolingEvent-Driven ArchitecturesGitopsGoKubernetesLlmsLoggingMetricsOpenai ApiOpentofuPortPythonSdksTerraformTracingYaml
Reposted 10 Days AgoSaved
In-Office or Remote
2 Locations
240K-500K Annually
Senior level
240K-500K Annually
Senior level
Software
Lead the Core Infrastructure team, focusing on Kubernetes platform ownership, multi-region expansion, AI infrastructure development, and enhancing security and performance.
Top Skills: Ai InfrastructureEksKubernetesNetworkingObservability
10 Days AgoSaved
Remote or Hybrid
3 Locations
266K-395K Annually
Senior level
266K-395K Annually
Senior level
Software
Design, develop, and maintain high-performance distributed storage software and protocols (file, block, object). Integrate with hardware (NVMe, DPUs, GPU-direct), optimize performance and scalability, troubleshoot production data centers, collaborate with networking, compute, control plane and observability teams, and lead/mentor engineering teams.
Top Skills: CephDpuFibre ChannelGpudirect StorageIscsiKubernetesLustreNetappNfsNvidia SupernicNvmeRdmaS3SmbVastWeka
11 Days AgoSaved
Remote
2 Locations
118K-140K Annually
Mid level
118K-140K Annually
Mid level
Artificial Intelligence • Energy • Industrial • Infrastructure as a Service (IaaS)
Perform process design and rigorous review for gas, steam, and reciprocating power systems; validate PFDs/P&IDs, heat and mass balances, and datasheets; support system integration (fuel, cooling, exhaust, emissions), analyze plant performance and heat rate, drive FEED through commissioning, set scopes/schedules, and contribute to HAZOP/LOPA safety studies. Travel up to 20%.
Top Skills: Aspen HysysBluebeamEdrHtri
11 Days AgoSaved
In-Office or Remote
State Road, IL, USA
75K-158K Annually
Expert/Leader
75K-158K Annually
Expert/Leader
Information Technology • Consulting • Defense
Design, deploy, and maintain Azure Virtual Desktop (AVD) VDI solutions. Automate AVD tasks with scripts and workbooks, manage images and autoscaling, enforce security/compliance (STIGs, NIST, FISMA, FedRAMP), monitor costs and performance, troubleshoot Azure/AVD issues, and document environments.
Top Skills: Active Directory (Ad Ds)AdcsAfdsAnsibleAzureAzure AutoscaleAzure Monitor WorkbooksAzure Virtual Desktop (Avd)BashCitrixCommand ShellFedrampFismaNist 800-53PythonSsoStigsVMwareWindows Pki
11 Days AgoSaved
In-Office or Remote
9 Locations
125K-130K Annually
Senior level
125K-130K Annually
Senior level
Artificial Intelligence • Big Data • Cloud • Software • Analytics • Infrastructure as a Service (IaaS) • Big Data Analytics
Operate, monitor, and maintain Astronomer's managed Airflow platform and underlying cloud/Kubernetes infrastructure. Troubleshoot customer environments, participate in on-call rotation, build monitoring/automation, improve observability, and work directly with customers to meet SLAs and drive reliability.
Top Skills: Apache AirflowAWSAzureCi/CdGCPIacKubernetesLinuxPython
New

Track Smarter, Apply Better.

Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.

Use For Free
Application Tracker Preview
11 Days AgoSaved
In-Office or Remote
2 Locations
136K-252K Annually
Senior level
136K-252K Annually
Senior level
Cloud • Information Technology • Internet of Things • Professional Services • Software
Design, build, and operate large-scale on-prem Kubernetes platforms for AI/ML workloads. Own control plane and etcd, develop controllers/operators in Go, enable GPU workloads, implement IaC and AIOps, optimize performance, and provide on-call incident response while collaborating with ML teams.
Top Skills: AiopsAnthosBare-MetalCrdsEtcdGo (Golang)GpuGpu-Enabled EnvironmentsInfrastructure As CodeKubernetesKubernetes ControllersLlmsMl/Ai PlatformsMlopsObservabilityOpenshiftOperatorsPythonTelemetryWebhooks
11 Days AgoSaved
In-Office or Remote
3 Locations
137K-278K Annually
Senior level
137K-278K Annually
Senior level
Cloud • Information Technology • Internet of Things • Professional Services • Software
Design, build, and operate large-scale on-prem Kubernetes platforms (OpenShift/Anthos) for AI/ML and GPU workloads. Own control plane and etcd lifecycle, build controllers/operators in Go, automate via IaC, improve observability and resource utilization, enable ML training/inference/LLM deployments, participate in on-call incident response, and mentor engineers.
Top Skills: AnthosCrdEtcdGoGpuKubeflowKubernetesKubernetes ControllersMlflowOpenshiftOperatorsWebhooks
11 Days AgoSaved
In-Office or Remote
3 Locations
127K-252K Annually
Senior level
127K-252K Annually
Senior level
Cloud • Information Technology • Internet of Things • Professional Services • Software
Design, build, and operate large-scale on-prem Kubernetes (OpenShift/Anthos) platforms for AI/ML, managing control plane and cluster lifecycle, enabling GPU workloads, building controllers/operators in Go/Python, implementing IaC and AIOps automation, and supporting on-call incident response.
Top Skills: AiopsAnthosCustom Resource Definitions (Crds)EtcdGo (Golang)GpuInfrastructure As CodeKubernetesKubernetes Controllers/OperatorsLlmsMlopsOpenshiftPythonWebhooks
11 Days AgoSaved
In-Office or Remote
3 Locations
137K-278K Annually
Senior level
137K-278K Annually
Senior level
Cloud • Information Technology • Internet of Things • Professional Services • Software
Design, build, and operate large-scale on-prem Kubernetes platforms (OpenShift/Anthos) for AI/ML workloads. Own control plane and etcd lifecycle, build operators and controllers in Go, enable GPU training/inference, implement IaC and automation, improve observability and platform reliability, participate in on-call incident response, and mentor engineers.
Top Skills: AiopsAnthosCrdsEtcdGoGpuInfrastructure As CodeKubeflowKubernetesKubernetes ControllersKubernetes OperatorsLogsMlflowObservability (MetricsOpenshiftTraces)Webhooks
11 Days AgoSaved
Remote or Hybrid
7 Locations
Senior level
Senior level
Artificial Intelligence • Software • PropTech • Generative AI
Lead architecture and rewrites for a scalable, resilient AI-first platform. Own backend systems, data storage performance, deployment pipelines, cloud infrastructure via IaC, observability, incident response, and analytics. Mentor engineers and enable rapid customer onboarding and AI-driven automation.
Top Skills: Ai InfrastructureAWSCi/CdGCPInfrastructure As CodeLlm-Based SystemsNode.jsPulumiReactTerraformTypescript
Reposted 11 Days AgoSaved
In-Office or Remote
Reston, VA, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
The Software Engineer will build and operate the cloud infrastructure for data ingestion, collaborating with scientists to improve dataset quality, scale, and cost efficiency for AI model training.
Top Skills: BashDockerGCPPythonTerraform
Reposted 11 Days AgoSaved
In-Office or Remote
Spokane, WA, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
As a Software Engineer in Data Infrastructure, you'll enhance audio data ingestion pipelines, operate cloud infrastructure, and collaborate with scientists on data strategies to support AI model training.
Top Skills: BashDockerGCPPythonTerraform
Reposted 11 Days AgoSaved
In-Office or Remote
Memphis, TN, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
The role involves data collection for model training, integrating infrastructure and engineering to build datasets at large scales, collaborating with scientists to improve data quality and cost efficiency, and crafting a dataset roadmap.
Top Skills: BashDockerGCPInfrastructure-As-CodePython
Reposted 11 Days AgoSaved
In-Office or Remote
Fort Worth, TX, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
The Software Engineer will manage data collection for model training, enhance cloud infrastructure, and collaborate with AI Scientists to improve data sourcing and processing efficiency.
Top Skills: BashDockerGCPPythonTerraform
Reposted 11 Days AgoSaved
In-Office or Remote
Saint Paul, MN, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
The Software Engineer role focuses on data collection for model training, enhancing data ingestion pipelines, and collaborating with AI Scientists to improve data quality and efficiency.
Top Skills: BashDockerGCPPythonTerraform
Reposted 11 Days AgoSaved
In-Office or Remote
New Orleans, LA, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
Join Speechify as a Software Engineer responsible for data collection and management to support AI model training and enhance product offerings.
Top Skills: BashDockerGCPLinuxPythonTerraform
Reposted 11 Days AgoSaved
In-Office or Remote
Reno, NV, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
The role involves data collection and infrastructure management for model training at Speechify, focusing on data ingestion pipelines and collaboration with AI scientists to optimize datasets.
Top Skills: BashDockerGCPPythonTerraform
Reposted 11 Days AgoSaved
In-Office or Remote
Burlington, VT, USA
140K-200K Annually
Senior level
140K-200K Annually
Senior level
Software
As a Software Engineer on the Data team, you will manage data collection for model training, extend cloud infrastructure, and collaborate with scientists to enhance data quality and efficiency.
Top Skills: DockerGCPPythonTerraform
All Filters
JobType
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account