Senior Systems Engineer HPC - R-21841

Gurgaon, Gurugram, Haryana
Hybrid
Expert/Leader
Cloud • Information Technology • Software
The Role
The role involves installing and managing HPC systems, optimizing performance, managing resources, and collaborating with researchers to enhance infrastructure.
Responsibilities:

System Administration & Maintenance: Install, configure, and maintain HPC clusters (hardware, software, operating systems), perform regular updates/patching, manage user accounts and permissions, and troubleshoot/resolve hardware or software issues.

Performance & Optimization: Monitor and analyse system and application performance, identify bottlenecks, implement tuning solutions, and profile workloads to improve efficiency.

Cluster & Resource Management: Manage and optimize job scheduling, resource allocation, and cluster operations using tools such as Slurm, LSF, Bright Cluster Manager / Base Command Manager, OpenHPC, and Warewulf.

Networking & Interconnects: Configure, manage, and tune Linux networking (TCP/IP, DNS, routing) and high-speed HPC interconnects (InfiniBand, Ethernet) to ensure low-latency, high-bandwidth communication.

Storage & Data Management: Implement and maintain large-scale storage and parallel file systems (Lustre, Ceph, GPFS), ensure data integrity, manage backups, and support disaster recovery.

Security & Authentication: Implement security controls, ensure compliance with policies, and manage authentication and directory services such as LDAP and Active Directory.

DevOps & Automation: Use configuration management and DevOps practices (Ansible, Terraform, Jenkins, Git) to automate deployments, application packaging (RPM/DEB), and system configurations.

User Support & Collaboration: Provide technical support, documentation, and training to researchers; collaborate with scientists, HPC architects, and engineers to align infrastructure with research needs.

Planning & Innovation: Contribute to the design and planning of HPC infrastructure upgrades, evaluate and recommend hardware/software solutions, and explore cloud-based HPC solutions where applicable.

Qualifications:

  • Bachelor’s degree in Computer Science, Engineering, or a related field (equivalent experience may substitute for a degree).
  • Minimum of 10 years of systems experience, including at least 5 years working specifically with HPC.
  • Strong knowledge of Linux operating systems (e.g., Rocky Linux, Ubuntu) with a fundamental understanding of Linux internals, system administration, and performance tuning.
  • Experience building and managing RPM and DEB packages.
  • Experience with cluster management tools such as Bright Cluster Manager, OpenHPC stack, or Warewulf.
  • Proficiency with job schedulers and resource managers such as Slurm and LSF.
  • Strong understanding of Linux networking (e.g., TCP/IP, DNS, routing) and HPC interconnects (e.g., InfiniBand, Ethernet) including performance tuning.
  • Knowledge of parallel file systems such as Lustre, Ceph, or GPFS.
  • Working knowledge of Linux authentication and directory services such as LDAP and Active Directory.
  • Proficiency in scripting languages (e.g., Python, Bash, R); familiarity with MPI libraries for parallel and distributed computing is nice to have.
  • Strong experience with DevOps and configuration management tools, including Ansible, Terraform, Jenkins, and Git.
  • Knowledge of HPC in cloud environments (e.g., AWS, Azure, GCP HPC offerings) is a plus.
  • Strong knowledge of Linux security, compliance standards, and data protection best practices.
  • Excellent communication, interpersonal, and problem-solving skills.

Top Skills

Ansible
AWS
Azure
Base Command Manager
Bash
Bright Cluster Manager
Ceph
DNS
Ethernet
GCP
Git
GPFS
HPC
InfiniBand
Jenkins
Linux
LSF
Lustre
OpenHPC
Python
R
Slurm
TCP/IP
Terraform
Warewulf
The Company
HQ: San Antonio, TX
7,509 Employees
Year Founded: 1998

What We Do

At Rackspace Technology, we accelerate the value of the cloud during every phase of digital transformation. By managing apps, data, security and multiple clouds, we are the best choice to help customers get to the cloud, innovate with new technologies and maximize their IT investments. As a recognized Gartner Magic Quadrant leader, we are uniquely positioned to close the gap between the complex reality of today and the promise of tomorrow. Passionate about customer success, we provide unbiased expertise, based on proven results, across all the leading technologies. And across every interaction worldwide, we deliver Fanatical Experience™, the best customer service experience in the industry. Rackspace has been honored by Fortune, Forbes, Glassdoor and others as one of the best places to work.
