Top Tech Jobs & Startup Jobs

2 Days Ago
In-Office
Dallas, TX, USA
Senior level
Artificial Intelligence • Cloud • Machine Learning • Infrastructure as a Service (IaaS)
Lead and operate incident and problem management practices: own major incident response, coordinate cross-functional teams, perform root cause analysis, maintain known error records, analyze trends and KPIs (MTTR, SLA), and drive long-term remediation and continuous improvement using Jira Service Management.
Top Skills: ITIL, ITSM, Jira, Jira Service Management
2 Days Ago
In-Office
Oxfordshire, VA, USA
Junior
Artificial Intelligence • Cloud • Machine Learning • Infrastructure as a Service (IaaS)
Proactively generate and qualify B2B leads across marine, defence, and industrial sectors. Build and maintain a sales pipeline, support sales cycles from outreach to proposal, manage customer follow-ups, attend international events, use CRM to track interactions, and help prepare commercial materials while gathering market intelligence to inform strategy.
Top Skills: CRM, Energy Storage Systems
2 Days Ago
In-Office
Dallas, TX, USA
Senior level
Artificial Intelligence • Cloud • Machine Learning • Infrastructure as a Service (IaaS)
Design, implement, and operate GPU-accelerated Kubernetes clusters for HPC/AI workloads. Build custom operators/controllers, integrate NVIDIA device plugins and MIG, optimize scheduling and GPU utilisation, implement observability and security policies, maintain GitOps CI/CD and infrastructure-as-code, and support performance tuning and incident response.
Top Skills: Argo CD, DCGM, DCGM Exporter, Flux CD, Gatekeeper, GitOps, Go, Grafana, Helm, Kube-Scheduler Plugins, Kubernetes, Kustomize, MIG, Multi-Instance GPU, Multus, NVIDIA CNI, NVIDIA Device Plugin, NVIDIA GPU Operator, NVML, OPA, OpenTelemetry, Prometheus, Python, RBAC, Slurm, Terraform, Volcano
2 Days Ago
In-Office
Dallas, TX, USA
Senior level
Artificial Intelligence • Cloud • Machine Learning • Infrastructure as a Service (IaaS)
Design and implement automation frameworks and tooling to provision and manage large-scale data center and cloud networks. Develop Python-based software, IaC (Ansible, Terraform, Jinja2), CI/CD integrations, APIs, observability, and self-service interfaces. Collaborate with network, platform, and security teams, participate in on-call rotations, and drive end-to-end production ownership and reliability improvements.
Top Skills: ACL, Ansible, API Design, Arista, BGP, CI/CD, Cisco, DNS, Docker, Firewall Object Groups, Graph Databases, HTTP, Jinja2, Juniper, Kafka, Kubernetes, Load Balancing, Network Security, NoSQL, OSPF, Policy-as-Code, Python, RabbitMQ, Relational Databases, Secrets Management, Segment Routing, Service Discovery, TCP/IP, Terraform, VLAN, VRF
2 Days Ago
In-Office
Groves, TX, USA
Entry level
Artificial Intelligence • Cloud • Machine Learning • Infrastructure as a Service (IaaS)
Provide first-line IT support to users: troubleshoot hardware, software, and connectivity issues; escalate complex incidents; document tickets and resolutions; maintain and configure user devices and basic systems; assist with onboarding and account management.
2 Days Ago
In-Office
Dallas, TX, USA
Senior level
Artificial Intelligence • Cloud • Machine Learning • Infrastructure as a Service (IaaS)
Design, integrate, and optimize high-performance, low-latency HPC network architectures (InfiniBand, RoCE, Ethernet). Lead customer-facing solution design, POCs, benchmarking, multi-vendor integration, observability (Prometheus/Grafana), and cross-team collaboration to deliver scalable HPC/AI/ML interconnects.
Top Skills: Ansible, Arista, Bash, BGP, Cilium, Cisco, Ethernet, EVPN, Grafana, InfiniBand, Kubernetes, Linux, Mellanox, MPI, Multus, NCCL, NVIDIA, NVIDIA CNI, OSPF, PowerShell, Prometheus, Python, RoCE, Terraform, VXLAN
2 Days Ago
In-Office
Dallas, TX, USA
Senior level
Artificial Intelligence • Cloud • Machine Learning • Infrastructure as a Service (IaaS)
Design, architect, and deploy GPU-accelerated Kubernetes platforms for HPC and AI/ML workloads. Engage customers from discovery through deployment, build reference architectures, deliver POCs and benchmarking, develop custom operators, integrate compute/storage/network layers, enforce secure multi-tenancy, and drive observability and GitOps practices while collaborating with product and engineering teams.
Top Skills: Apptainer, Argo CD, Ceph, Cilium, containerd, DCGM, DCGM Exporter, Device Plugins, Flux CD, Gatekeeper, Go, GPFS, Grafana, Helm, InfiniBand, Kube-Scheduler Plugins, Kubernetes, Kustomize, Lustre, MIG, Multus, Network Operator, NVIDIA CNI, NVIDIA Container Toolkit, NVIDIA GPU Operator, NVLink, NVML, OPA, OpenTelemetry, Prometheus, Python, RBAC, RDMA, RoCE, Singularity, Slurm, VAST, Volcano
2 Days Ago
In-Office
Dallas, TX, USA
Expert/Leader
Artificial Intelligence • Cloud • Machine Learning • Infrastructure as a Service (IaaS)
Lead global sourcing and category strategy for HPC and AI data center infrastructure. Own $500M-$1B+ spend, manage a team of procurement professionals, drive supplier negotiations, capacity planning, risk mitigation, and stakeholder alignment to enable large-scale HPC deployments.
Top Skills: AI Accelerators, Compute Systems, DPUs, Ethernet, GPUs, InfiniBand, Liquid Cooling, NICs, NVMe, Object Storage, Optical Networking, Parallel File Systems, Rack-Scale Systems, Storage Architectures, Switches
2 Days Ago
In-Office
Dallas, TX, USA
Senior level
Artificial Intelligence • Cloud • Machine Learning • Infrastructure as a Service (IaaS)
Manage day-to-day procurement systems and operations, lead the Procurement Systems Administrator, own intake/prioritization/escalation, drive S2P process improvements, partner with AP/Finance/Legal, and ensure system governance and operational scaling.
Top Skills: Coupa, cXML
2 Days Ago
In-Office
Dallas, TX, USA
Senior level
Artificial Intelligence • Cloud • Machine Learning • Infrastructure as a Service (IaaS)
Design and own the HPC data center backbone, WAN and cloud interconnect reference architectures. Define routing, traffic engineering, failover, telemetry, and multi-tenant secure networking. Align capacity planning, collaborate with solutions architects, and contribute network simulations and digital twin models to ensure scalable, redundant, and observable connectivity for high-bandwidth, low-latency workloads.
Top Skills: Clos, Cloud Service Provider Interconnect, Digital Twin, EVPN, GPU-Accelerated, HPC, Leaf-Spine, Peering, Routing, Telemetry, Traffic Engineering, VXLAN, WAN