Sr. RHEL Systems Engineer, OT/SCADA

Posted Yesterday
Be an Early Applicant
Houston, TX, USA
In-Office
Senior level
Artificial Intelligence • Energy • Renewable Energy
The Role
Own and harden the RHEL-based OS and container platform for on-prem energy SCADA hosts. Build/version RHEL and SCADA container images, run containers via Podman/quadlets, manage host networking and firewall, define patch/lifecycle strategies for offline sites, troubleshoot OS/container/runtime issues, and document runbooks. Maintain Windows image pipeline in steady state.
Summary Generated by Built In

ON.energy is building the power infrastructure that makes the AI era possible. As AI demand surges past what the grid and traditional data centers can support, ON.energy provides a new class of power technology proven at gigawatt scale and trusted by the world’s leading cloud and AI companies. Our systems are already deployed across 2.5 GW of hyper-scale campuses, validated by top U.S. national labs, and certified for grid-safe operation by major utilities. With real products in the field, we’re scaling faster than the grid can, transforming power from a bottleneck into a competitive advantage for the companies building the future.

ROLE SUMMARY

The Senior RHEL engineer will own the OS and container platform behind our on-premises energy-management deployments. You’ll build and harden the tailored RHEL image and the containerized Ignition SCADA workload running on industrial PCs across U.S. data-center sites, and act as our deep RHEL authority and L3 escalation point. This is a platform-engineering and expertise role — not field installation.

You will own our OS architectures, container builds and configuration, and the runtime that hosts Ignition, while the SCADA team owns everything configured inside Ignition (tags, screens, logic). On the network side, the host firewall and OS-level networking (interfaces, routing, DNS, firewalld/nftables) are yours, while the physical and site network beyond the host NIC belongs to Operational Technology (OT). Travel to third-party data-center sites is for troubleshooting and support only. The separate OT team commissions and deploys every machine, so you won’t install systems in the field.

RESPONSIBILITIES

  • Build, version, and maintain a reproducible, turnkey RHEL image tailored to the SCADA host’s needs (time sync, firewall/ports, storage, service ordering, resource tuning).
  • Own the SCADA container image end to end: Containerfile, base image, JVM, persistence, ports, hardening, scanning, and registry.
  • Run containers as systemd services via Podman and quadlets.
  • Harden the OS and images to a recognized baseline (CIS/STIG) and define a patching and lifecycle strategy for OT uptime and restricted/disconnected sites (offline/mirrored registry).
  • Troubleshoot full-stack — OS, container runtime, and application runtime behavior — remotely and on-site, handing off application-level issues to the SCADA team.
  • Own host-level networking and firewall configuration (interfaces, routing, DNS, firewalld/nftables, and port exposure for the SCADA container), and lead connectivity troubleshooting between the host and site infrastructure.
  • Document runbooks for the OT team and help define the connectivity and security posture.
  • Maintain the Windows image pipeline (Packer-based) in steady state once established: periodic patching, hardening updates, and image rebuilds.

MUST-HAVE QUALIFICATIONS:

  • 7+ years of senior, hands-on production RHEL/Linux engineering.
  • Container-native delivery: Podman, quadlets/systemd, building and owning container images (including packaging a JVM-based app such as Ignition), and registry management.
  • Reproducible, turnkey, versioned RHEL image building (bootc, Kickstart, Image Builder/osbuild, or Ansible-driven).
  • OS and image hardening to a recognized baseline (CIS/STIG).
  • Patch and lifecycle management for restricted/disconnected environments, including offline/mirrored content and registries.
  • Automation (Ansible) and version control.
  • Strong Linux networking and diagnostics: firewall administration (firewalld/nftables) and packet/connectivity tooling (Wireshark, tcpdump, ss, ip), plus containerized debug environments (toolbox).
  • Confidence operating as the RHEL authority, with strong documentation habits.

NICE-TO-HAVE QUALIFICATIONS:

  • OT/ICS or SCADA exposure and frameworks (IEC 62443, NERC CIP, NIST 800-82).
  • Familiarity with Ignition specifically.
  • Windows image automation (Packer) and Windows patching/hardening familiarity.
  • RHCSA / RHCE / RHCA certifications.

ADDITIONAL INFORMATION:

  • Based in the Houston, TX area preferred.
  • Available to travel to sites for on-site support (20%), and to provide remote support for production issues.
  • Able to meet third-party data-center access requirements (background check, badging, host-facility protocols) and represent the company professionally on-site.

For US-based roles - What you’ll get:

  • Competitive salary + annual performance-based bonus eligibility
  • Medical, dental, and vision insurance
  • 401(k) with company match
  • Paid time off and company holidays 

For Mexico-based roles - What you’ll get:

  • Competitive salary + annual performance bonus eligibility
  • Christmas Bonus (Aguinaldo): 30 days
  • Major medical expenses and life insurance
  • Paid time off and holidays (per local policy)

For all roles:

  • Professional development and growth opportunities
  • Opportunity to grow with a mission-driven team shaping the future of clean energy
  • Equal Opportunity: ON.energy is committed to equal employment opportunity and to maintaining a work environment free of harassment, discrimination, or retaliation.
  • Accommodations: If you need an accommodation during the application process, email [email protected]
  • Benefits vary by role and location and are subject to change.

Skills Required

  • 7+ years of senior, hands-on production RHEL/Linux engineering experience
  • Container-native delivery: Podman, quadlets/systemd, building and owning container images, packaging JVM-based apps, registry management
  • Reproducible, turnkey, versioned RHEL image building (bootc, Kickstart, Image Builder/osbuild, or Ansible-driven)
  • OS and image hardening to a recognized baseline (CIS/STIG)
  • Patch and lifecycle management for restricted/disconnected environments, including offline/mirrored registries
  • Automation (Ansible) and use of version control
  • Strong Linux networking and diagnostics: firewall administration (firewalld/nftables) and packet/connectivity tooling (Wireshark, tcpdump, ss, ip), plus containerized debug environments (toolbox)
  • Operate as the RHEL authority with strong documentation and runbook authorship
  • Available to travel to sites for on-site support (~20%) and able to meet third-party datacenter access requirements (background check, badging, host protocols)
  • OT/ICS or SCADA exposure and frameworks (IEC 62443, NERC CIP, NIST 800-82)
  • Familiarity with Ignition (SCADA) specifically
  • Windows image automation (Packer) and Windows patching/hardening familiarity
  • RHCSA / RHCE / RHCA certifications
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Miami, Florida
165 Employees

What We Do

ON.energy is building the backbone of energy and AI infrastructure, powering grid-safe data centers and mission-critical facilities. The company supplies and operates hyperscale power systems that solve the toughest resilience challenges, delivering custom solutions for AI data centers, mission-critical facilities, and front-of-the-meter assets. Its track record spans industrial, manufacturing, infrastructure, transportation, and grid-scale storage. With patented technology and proprietary software, ON.energy develops projects worldwide that set new benchmarks for resilience.

Similar Jobs

Sprinter Health Logo Sprinter Health

Patient Support Specialist (EST) -

Artificial Intelligence • Healthtech • Logistics • Social Impact • Software • Telehealth
Remote or Hybrid
United States
500 Employees
In-Office
Waco, TX, USA
3000 Employees

Domino Data Lab Logo Domino Data Lab

Site Reliability Engineer

Artificial Intelligence • Machine Learning
Easy Apply
Remote or Hybrid
US
200 Employees
200K-230K Annually

JumpCloud Logo JumpCloud

Director, Global MSP Sales - United States

Cloud • Information Technology • Security • Software
Easy Apply
In-Office or Remote
6 Locations
800 Employees
220K-290K Annually

Similar Companies Hiring

Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account