High-Performance Computing (HPC) (SA2) (Government)

Posted Yesterday
Be an Early Applicant
Columbia, MD, USA
In-Office
98K-168K Annually
Senior level
Internet of Things • Mobile • Retail
The Role
Provide on-site HPC systems administration and sustainment across two sites: install/configure Linux/Windows HPC clusters, manage parallel file systems and high-speed interconnects, automate tasks with Bash and config management, use observability tooling, support SRE teams, troubleshoot heterogeneous systems, harden and patch OSes, and communicate escalations to agency management.
Summary Generated by Built In

AT&T Global Public Sector is a trusted provider of secure, IP enabled, cloud-based, network solutions and professional services to the Federal Government. We are dedicated to recruiting, developing and empowering a diverse, high-performing workforce that is passionate about what they do, committed to our shared values and dedicated to our customers’ mission.

The scope of this Contract requires specialized expertise in areas such as high-performance computing (HPC), automated processing systems, distributed software design, and secure hosting and networking solutions. The IT infrastructure consists primarily of Linux, with some Windows, and UNIX. The environment includes a variety of network devices, server interconnections, mass storage solutions, and essential supporting infrastructure services.  The services provided under this Contract support areas including HPC, infrastructure maintenance for HPC systems, networking, office automation, and the development of specialized software.

AT&T has an opening for a High-Performance Computing (HPC) Systems Administrator to support a large client-based IT enterprise installation, configuration and networking of Linux and Windows based platforms. This position requires office presence a minimum of 5 days per week and is only located in the location(s) posted. No relocation is offered. Work to be performed at government customer site.

Description of Job Duties/Responsibilities:

The System Administrator provides HPC sustainment support across two geographically dispersed sites, including:

  • Linux-based HPC clusters (e.g., Red Hat/CentOS/Rocky/Ubuntu) with parallel file systems (e.g., Lustre/GPFS) and high-speed interconnects (InfiniBand/Slingshot).
  • Transition of new systems/capabilities into operations (clusters, SMP/MPP, parallel file systems).
  • Support to HPC and ABS (ABUNDANTSHIELD) SRE teams in accordance with Government policies and procedures.

Proficient with the following (as specific position requires):

  • Operate and maintain systems/services: monitoring, incident response, troubleshooting, and routine maintenance.
  • Install/configure Linux OS, file systems, and TCP/IP networking; troubleshoot OS and application issues.
  • Automate/administer via BASH scripting; compile/install software as required.
  • Use common operations and observability tooling: Jira, Confluence, Grafana, Prometheus, Nagios.
  • Support HPC workload and configuration management tooling: Slurm, git, Salt, Ansible.
  • Provide user support and escalation/status communication to agency management and internal customers.
  • Optimize operations through resource utilization and capacity analysis/planning.
  • Apply in-depth troubleshooting skills across heterogeneous systems (no single fixed solution).
  • Provide detailed analysis and feedback to agency management and internal customers for escalated tickets. 
  • Provide support for the dispatch system and hardware problems and remains involved in the resolution process. 
  • Harden, patch, and tune Linux/UNIX/Windows systems; implement OS-level enhancements to improve reliability and performance.

Required Clearance: TS/SCI with polygraph. (#ts/sci) (#polygraph)

Required Qualifications:

B.S. in a technical discipline and 5 years’ experience as a System Administrator in programs and contracts of similar scope, type and complexity or 10 years’ experience in lieu of degree

  • DoD 8570 IAT II level certification required.

Ready to join our team? Apply today!

Our High-Performance Computing (HPC) (SA2) (Government) earns between $98,100 - $167,830 yearly. Not to mention all the other amazing rewards that working at AT&T offers. Individual starting salary within this range may depend on geography, experience, expertise, and education/training.

Joining our team comes with amazing perks and benefits:

  • Medical/Dental/Vision coverage
  • 401(k) plan
  • Tuition reimbursement program
  • Paid Time Off and Holidays (based on date of hire, at least 23 days of vacation each year and 9 company-designated holidays) *Pro-rated when working less than 40 hrs/wk.
  • Paid Parental Leave
  • Paid Caregiver Leave
  • Additional sick leave beyond what state and local law require may be available but is unprotected · Adoption Reimbursement
  • Disability Benefits (short term and long term)
  • Life and Accidental Death Insurance
  • Supplemental benefit programs: critical illness/accident hospital indemnity/group legal
  • Employee Assistance Programs (EAP)
  • Extensive employee wellness programs
  • Employee discounts up to 50% off on eligible AT&T mobility plans and accessories, AT&T internet (and fiber where available) and AT&T phone

.

Weekly Hours:

40

Time Type:

Regular

Location:

Columbia, Maryland

It is the policy of AT&T to provide equal employment opportunity (EEO) to all persons regardless of age, color, national origin, citizenship status, physical or mental disability, race, religion, creed, gender, sex, sexual orientation, gender identity and/or expression, genetic information, marital status, status with regard to public assistance, veteran status, or any other characteristic protected by federal, state or local law. In addition, AT&T will provide reasonable accommodations for qualified individuals with disabilities. AT&T is a fair chance employer and does not initiate a background check until an offer is made.

Skills Required

  • TS/SCI with polygraph
  • B.S. in a technical discipline and 5 years' System Administrator experience (or 10 years' experience in lieu of degree)
  • DoD 8570 IAT II certification
  • On-site presence minimum 5 days per week at government customer site in Columbia, Maryland; no relocation
  • Experience administering Linux-based HPC clusters and parallel file systems (Lustre/GPFS)
  • Experience with high-speed interconnects and HPC networking (InfiniBand, Slingshot, TCP/IP)
  • Proficiency with Bash scripting; compiling and installing software
  • Experience with observability and ops tooling: Jira, Confluence, Grafana, Prometheus, Nagios
  • Experience with HPC workload and configuration management tools: Slurm, git, Salt, Ansible
  • Ability to harden, patch, tune Linux/UNIX/Windows systems and perform in-depth troubleshooting

AT&T Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about AT&T and has not been reviewed or approved by AT&T.

  • Healthcare Strength Health coverage spans medical, dental, vision, and mental health services, plus a personal healthcare team, wellness apps, and supplemental options such as fertility care, cancer support, doula services, and wigs for chemotherapy. These comprehensive offerings are portrayed as supporting a wide range of employee needs.
  • Leave & Time Off Breadth Paid time off includes vacation, holidays, sick days, caregiver time, parental leave, and adoption assistance, with some roles reaching about 23 days of PTO after several years. Community volunteer days and flexible time off options add further support for work-life balance.
  • Wellbeing & Lifestyle Benefits Employees receive sizable service discounts like 50% off most wireless plans and broadband, along with savings on travel, event tickets, and insurance. Additional workplace perks such as hybrid work models and relocation assistance contribute to overall value.

AT&T Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Dallas, TX
150,000 Employees

What We Do

Bring us your biggest career aspirations. Share your boldest dreams. This is a moment to get energized. Through 5G and Fiber, AT&T provides connectivity that leads to smarter homes, safter communities, higher quality health care and more life-changing innovations. With AT&T, Connecting Changes Everything.

Gallery

Gallery

Similar Jobs

PwC Logo PwC

Senior Manager, Internal Investigations

Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Remote or Hybrid
67 Locations
370000 Employees
91K-322K Annually

PwC Logo PwC

US Tech - AI Engineering Senior Associate

Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Remote or Hybrid
68 Locations
370000 Employees
151K-187K Annually

PwC Logo PwC

Engineering Manager

Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Remote or Hybrid
68 Locations
370000 Employees
212K-244K Annually

General Motors Logo General Motors

Sales Manager

Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Remote or Hybrid
United States
165000 Employees
81K-109K Annually

Similar Companies Hiring

Granted Thumbnail
Mobile • Insurance • Healthtech • Financial Services • Artificial Intelligence
New York, New York
23 Employees
Scotch Thumbnail
Artificial Intelligence • eCommerce • Fintech • Payments • Retail • Software • Analytics
US
35 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account