HPC System Architect - 100% Remote

Sorry, this job was removed at 3:06 p.m. (CST) on Saturday, September 17, 2022

GDIT is currently seeking a High Performance Computing (HPC) System Architect to support our National Oceanic and Atmospheric Administration (NOAA) HPC programs, as well as broadly support GDIT's HPC Center of Excellence (COE). This position is remote within the U.S. with some travel required.

GDIT provides the supercomputing resources at multiple datacenters to support the research, development, and operational requirements essential to NOAA's mission. These resources are used to develop, deploy, and operate sophisticated numerical models used to predict and understand atmospheric and oceanic phenomena for weather and climate, as well as associated handling of very large atmosphere and ocean datasets. Our current NOAA programs include the Weather and Climate Operational Supercomputer System (WCOSS) and Research & Development HPC Systems (RDHPCS). RDHPCS supports the research and development of numerical models, while WCOSS delivers 24/7 computational capability to run these models for NOAA operational weather & climate prediction. These 10-year programs will deliver significant computational capability that will evolve over time to keep pace with NOAA's growing environmental modeling needs. The HPC COE leverages expertise from across all of GDIT's HPC programs to provide mutual support and contribute to GDIT's HPC proposal efforts.

The HPC System Architect will lead system design efforts for existing GDIT HPC programs and for GDIT HPC program proposals. This involves defining and analyzing requirements, assessing alternative solutions, and integrating technologies and components into end-to-end systems meeting customer needs.

Position Duties and Skills requirements:

  • Designs systems architecture for new or existing HPC systems that includes compute, interconnect, and storage components.
  • Performs design trade-offs and analysis of system performance (including benchmarking), reliability/availability, and maintainability to verify that systems meet requirements.
  • Develops test plans and procedures and executes test programs.
  • Maintains understanding of the HPC community and technology trends to enable building next-generation HPC systems.
  • Applies hands-on experience with the operations/system administration of production HPC clusters to meet SLA and operational requirements.
  • Supports procurement of system components by evaluating technologies/sources, preparing specifications, and evaluating proposals.
  • Demonstrates strong writing skills for technical documents, system procedures, user wikis, and FAQs.
  • Provides guidance and mentoring to less experienced Systems Engineers.
  • Works as a team leader, team member, or individual contributor.


Additional desired qualifications:

  • Subject matter expertise with Lustre that includes experience with benchmarking, tuning and troubleshooting complex Lustre issues.


Education and Experience requirements: BA/BS in a related technical field (or equivalent experience), 12+ years of experience.

More Information on General Dynamics Information Technology
General Dynamics Information Technology operates in the Aerospace industry. The company is located in Falls Church, VA. It has 21,625 total employees. It offers perks and benefits such as a Flexible Spending Account (FSA), disability insurance, dental insurance, vision insurance, health insurance, and life insurance.