Sr Systems Engineer HPC

Job Posted 10 Days Ago Posted 10 Days Ago
Be an Early Applicant
Hiring Remotely in United States
Remote
116K-198K Annually
Expert/Leader
Cloud • Information Technology • Software
The Role
The Sr Systems Engineer HPC is responsible for designing and maintaining HPC infrastructure, optimizing performance, and collaborating with scientists to meet computational needs.
Summary Generated by Built In

Job Summary: Rackspace seeking a highly skilled and motivated HPC System Engineer to join our team. You’ll be responsible for working directly for one of flagship clients and designing, implementing, maintaining, and optimizing their high-performance computing (HPC) infrastructure. You will work closely with researchers, scientists, and other engineers to ensure the efficient and reliable operation of the HPC systems. 


Work Location: 100% Remote. Due to this role supporting a customer in the Seattle area we prefer to hire in either PST or MST time zones.

 

Travel: There may be minimal travel to either San Antonio, TX or Seattle WA. 

Responsibilities:

  • Install, configure, and maintain HPC clusters, including hardware and software components.
  • Monitor system performance, identify bottlenecks, and implement solutions to optimize performance.
  • Manage user accounts, permissions, and resource allocation.
  • Perform regular system maintenance, updates, and patching.
  • Troubleshoot and resolve hardware and software issues in a timely manner.
  • Participate in the design and planning of HPC infrastructure upgrades and expansions.
  • Evaluate and recommend hardware and software solutions to meet evolving computational needs.
  • Implement and manage storage systems, networking infrastructure, and interconnects (e.g., InfiniBand).
  • Optimize system configurations and application performance for HPC workloads.
  • Profile and analyze application performance to identify areas for improvement.
  • Implement and utilize performance monitoring tools and techniques.
  • Provide technical support and training to HPC users.
  • Collaborate with researchers and scientists to understand their computational requirements.
  • Work closely with HPC architects and engineers to ensure that research needs are met.
  • Document system configurations, procedures, and best practices.
  • Assist HPC engineers and architects with day-to-day operations and ticket management.
  • Implement and maintain security measures to protect HPC infrastructure and data.
  • Ensure compliance with relevant security policies and regulations.
  • Manage data backups and disaster recovery procedures.

Qualifications:

  • Bachelor's degree in computer science, engineering, or a related field.  Experience may substitute for the degree.
  • Minimum of 10 yrs experience working with systems; 5yrs specifically with HPC.
  • Strong knowledge of Linux operating systems (e.g., Rocky, Ubuntu).
  • Experience with cluster management tools (e.g., Slurm, PBS).
  • Familiarity with high-speed interconnects (e.g., InfiniBand, Ethernet).
  • Knowledge of parallel file systems (e.g., Lustre, SEPH, GPFS).
  • Proficiency in scripting languages (e.g., R, Python, Bash).
  • Understanding of HPC hardware architectures and technologies (e.g., CPUs, GPUs, memory).
  • Strong demonstrated experience with a major configuration management software (e.g. Terraform, Ansible), including application packaging and installation.
  • Must have strong knowledge of Linux security and Linux shell scripting.
  • Strong communication and interpersonal skills.
  • Knowledge of data transfer protocols and large-scale storage solutions.

The following information is required by pay transparency legislation in the following states: CA, CO, HI, NY, and WA. This information applies only to individuals working in these states.

 

·       The anticipated starting pay range for Colorado is: $116,100 - $170-280.

·       The anticipated starting pay range for the states of Hawaii and New York (not including NYC) is: $123,600 - $181,280.

·       The anticipated starting pay range for California, New York City and Washington is: $135,300 - $198,440.

 

Unless already included in the posted pay range and based on eligibility, the role may include variable compensation in the form of bonus, commissions, or other discretionary payments. These discretionary payments are based on company and/or individual performance and may change at any time. Actual compensation is influenced by a wide array of factors including but not limited to skill set, level of experience, licenses and certifications, and specific work location. #LI-MF1 



About Rackspace Technology

We are the multicloud solutions experts. We combine our expertise with the world’s leading technologies — across applications, data and security — to deliver end-to-end solutions. We have a proven record of advising customers based on their business challenges, designing solutions that scale, building and managing those solutions, and optimizing returns into the future. Named a best place to work, year after year according to Fortune, Forbes and Glassdoor, we attract and develop world-class talent. Join us on our mission to embrace technology, empower customers and deliver the future.

 

 

More on Rackspace Technology

Though we’re all different, Rackers thrive through our connection to a central goal: to be a valued member of a winning team on an inspiring mission. We bring our whole selves to work every day. And we embrace the notion that unique perspectives fuel innovation and enable us to best serve our customers and communities around the globe. We welcome you to apply today and want you to know that we are committed to offering equal employment opportunity without regard to age, color, disability, gender reassignment or identity or expression, genetic information, marital or civil partner status, pregnancy or maternity status, military or veteran status, nationality, ethnic or national origin, race, religion or belief, sexual orientation, or any legally protected characteristic. If you have a disability or special need that requires accommodation, please let us know.

Top Skills

Ansible
Bash
Ethernet
Gpfs
Infiniband
Linux
Lustre
Pbs
Python
R
Rocky
Seph
Slurm
Terraform
Ubuntu
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Antonio, TX
7,509 Employees
On-site Workplace
Year Founded: 1998

What We Do

At Rackspace Technology, we accelerate the value of the cloud during every phase of digital transformation. By managing apps, data, security and multiple clouds, we are the best choice to help customers get to the cloud, innovate with new technologies and maximize their IT investments. As a recognized Gartner Magic Quadrant leader, we are uniquely positioned to close the gap between the complex reality of today and the promise of tomorrow. Passionate about customer success, we provide unbiased expertise, based on proven results, across all the leading technologies. And across every interaction worldwide, we deliver Fanatical Experience TM — the best customer service experience in the industry. Rackspace has been honored by Fortune, Forbes, Glassdoor and others as one of the best places to work.

Similar Jobs

CrowdStrike Logo CrowdStrike

Principal Software Engineer - Platform Architect, CI/CD (Remote)

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote
USA
10000 Employees
155K-270K Annually

ActBlue Logo ActBlue

Front-End Platform Engineer

Fintech • Social Impact
Easy Apply
Remote
USA
296 Employees
174K-211K

Cash App Logo Cash App

Software Engineering Manager, Access

Blockchain • Fintech • Mobile • Payments • Software • Financial Services
Remote
Hybrid
New York, NY, USA
3500 Employees
185K-327K Annually

Cash App Logo Cash App

Software Engineer, Trust (Access)

Blockchain • Fintech • Mobile • Payments • Software • Financial Services
Remote
Hybrid
Seattle, WA, USA
3500 Employees
153K-270K Annually

Similar Companies Hiring

True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees
Caliola Engineering Thumbnail
Software • Machine Learning • Hardware • Defense • Data Privacy • App development • Aerospace
Colorado Springs, CO
53 Employees
Red 6 Thumbnail
Virtual Reality • Software • Hardware • Defense • Aerospace
Orlando, Florida
113 Employees
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account