HPC/ AI MPI Ecosystem Software Engineer

Posted 10 Days Ago
Be an Early Applicant
Fort Collins, CO
143K-328K Annually
Expert/Leader
Artificial Intelligence • Cloud • Information Technology • Consulting
The Role
The HPC/AI MPI Ecosystem Software Engineer will work with the Slingshot Ethernet Fabric team to optimize and support communication libraries and applications used in AI and HPC. Responsibilities include collaborating with the community and vendors, designing and maintaining system software for AI and HPC systems, and aligning software direction with business requirements.
Summary Generated by Built In

HPC/ AI MPI Ecosystem Software Engineer

This role has been designated as ‘Remote/Teleworker’, which means you will primarily work from home.

Who We Are:

Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today’s complex world. Our culture thrives on finding new and better ways to accelerate what’s next. We know diverse backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good. If you are looking to stretch and grow your career our culture will embrace you. Open up opportunities with HPE.

Job Description:

Artificial Intelligence (Generative AI and all of Machine and Deep Learning) and High-Performance Computing are the fastest growing workloads in the industry today. These workloads are pushing the leading edge of networking technology forward at a rapid pace. Come join the Slingshot Ethernet Fabric team, part of HPE's HPC and AI organization, and make an impact on the high-performance fabric business.

We are looking for an experienced Software Engineer to join the Slingshot Ecosystem Development Team to help expand HPE's High Performance Ethernet Fabric product growth through Commercial HPC use cases, AI use cases networking, systems, and application and open-source communities. This includes directly working with the community, customers, vendor/partners and internal stake holders to optimize and support the latest communication libraries, frameworks, MPI distribution, acceleration middleware, and applications used in Artificial Intelligence, Commercial HPC, and Cloud markets and running on the Slingshot Ethernet fabric.

Join the HPE AI Fabric team and be a part of the growth and evolution of Artificial Intelligence (AI), high speed networking fabrics, and the fastest growing and most significant technology revolution since the Internet. 

Responsibilities include, but are not limited to:

  • Engage and work with the Commercial HPC and AI ISV and open-source SW communities to validate, tune, and enable applications on the Slingshot Ethernet fabric.
  • Enable the broad MPI ecosystem (OpenMPI, Intel MPI, Cray MPI, other distributions) by working with application and MPI vendors to target, tune, and ensure market leading performance.
  • Design, implement and maintain system software that enables communication between GPUS, CPUs, and storage in scale out AI and HPC systems. Work with all the leading architectures and vendors in the AI and Data Center markets – Nvidia, AMD, Intel.
  • Work with the OEM, ODM, and VAR channels vendors on bring Slingshot to a broader set of customers. Validate and tune applications driving those engagements. 
  • Develop and own HPE product usage support, upstreaming and community engagements, and internal testing and infrastructure.
  • Work with cross-disciplinary teams to understand business requirements and align software direction to meet those needs.

Qualifications should Include:

  • Bachelor’s/master's degree in computer science, engineering, or related field
  • 10+ years of relevant experience with a background in networking and communications software development and/or architecture in the Data Center, university, government lab, or AI-centric environments.
  • Background in MPI software development with an emphasis on HPC applications development, tuning, and deployment in a scale out compute cluster environment
  • Ability to participate and own pieces of the product release pipeline up to and including package integration and support.
  • Deep understanding of networking architecture and communications including Ethernet and InfiniBand networking technologies
  • Understanding of computer architecture, and familiarity with the fundamentals of GPU architecture. Experience with Nvidia and AMD GPU infrastructure and software stacks.
  • Programming and debug skills in C, C++ and Python. Ability to understand how applications and industry middleware/libraries work in Slingshot enabled systems and identify strategies and ideas for allowing these applications to work to customer expectations.
  • Experience with user-based networking and OFI libfabric software interfaces and APIs.

Additional Skills:

Artificial Intelligence Technologies, Cross Domain Knowledge, Data Engineering, Data Science, Design Thinking, Development Fundamentals, Full Stack Development, IT Performance, Machine Learning Operations, Scalability Testing, Security-First Mindset

Additional Skills:

Cloud Architectures, Cross Domain Knowledge, Design Thinking, Development Fundamentals, DevOps, Distributed Computing, Microservices Fluency, Full Stack Development, Security-First Mindset, Solutions Design, Testing & Automation, User Experience (UX)

What We Can Offer You:

Health & Wellbeing

We strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial and emotional wellbeing.

Personal & Professional Development

We also invest in your career because the better you are, the better we all are. We have specific programs catered to helping you reach any career goals you have — whether you want to become a knowledge expert in your field or apply your skills to another division.

Diversity, Inclusion & Belonging

We are unconditionally inclusive in the way we work and celebrate individual uniqueness. We know diverse backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good.

Let's Stay Connected:

Follow @HPECareers on Instagram to see the latest on people, culture and tech at HPE.

#unitedstates#highperformancecompute

Job:

Engineering

Job Level:

TCP_05

States with Pay Range Requirement

The expected salary/wage range for a U.S.-based hire filling this position is provided below. Actual offer may vary from this range based upon geographic location, work experience, education/training, and/or skill level. If this is a sales role, then the listed salary range reflects combined base salary and target-level sales compensation pay. If this is a non-sales role, then the listed salary range reflects base salary only. Variable incentives may also be offered. Information about employee benefits offered can be found at https://myhperewards.com/main/new-hire-enrollment.html.

USD Annual Salary: $142,500.00 - $327,500.00

Estimated job application period closure is November 2024. While this is the expected application time frame, there are many factors which may result in a change. If this position is still open beyond the anticipated closure time frame, it is likely HPE is still actively recruiting for this role and all qualified and interested candidates are encouraged to apply.

HPE is an Equal Employment Opportunity/ Veterans/Disabled/LGBT and Affirmative Action employer. We are committed to diversity and building a team that represents a variety of backgrounds, perspectives, and skills. We do not discriminate and all decisions we make are made on the basis of qualifications, merit, and business need. Our goal is to be one global diverse team that is representative of our customers, in an inclusive environment where we can continue to innovate and grow together. Please click here: Equal Employment Opportunity.

Hewlett Packard Enterprise is EEO F/M/Protected Veteran/ Individual with Disabilities.

HPE will comply with all applicable laws related to employer use of arrest and conviction records, including laws requiring employers to consider for employment qualified applicants with criminal histories. .

Top Skills

AI
Hpc
Mpi
The Company
HQ: Houston, TX
61,628 Employees
On-site Workplace

What We Do

In 1939, Bill Hewlett and Dave Packard, college friends turned business partners, started the original Silicon Valley startup in the space of a rented Palo Alto garage. Starting with audio oscillators, the friends built the foundation for a company that would grow to become a global leader in enterprise technology.

More than 75 years later, our success is exemplified through our employees’ drive to advance ideas that bring meaningful innovations to life for our customers and partners around the globe. We are guided by our mission to help customers use technology to turn ideas into value, and empower them to transform industries, markets and lives. We simplify Hybrid IT, power the Intelligent Edge and provide the expertise to make it all happen.

Similar Jobs

Snap Inc. Logo Snap Inc.

Component Development Engineer, Optical & AR Display Modules

Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
Hybrid
Boulder, CO, USA
5000 Employees
156K-276K Annually

Snap Inc. Logo Snap Inc.

RF Hardware Engineer

Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
Hybrid
Boulder, CO, USA
5000 Employees
117K-207K Annually

The Aerospace Corporation Logo The Aerospace Corporation

Mission Protection Engineer

Aerospace • Artificial Intelligence • Cloud • Machine Learning • Cybersecurity • Defense
Colorado Springs, CO, USA
4600 Employees
95K-143K Annually

The Aerospace Corporation Logo The Aerospace Corporation

Space Systems Engineer

Aerospace • Artificial Intelligence • Cloud • Machine Learning • Cybersecurity • Defense
Hybrid
Colorado Springs, CO, USA
4600 Employees
95K-143K Annually

Similar Companies Hiring

Silverfort Thumbnail
Security • Sales • Information Technology • Cybersecurity • Automation
GB
357 Employees
Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees
InCommodities Thumbnail
Renewable Energy • Machine Learning • Information Technology • Energy • Automation • Analytics
Austin, TX
234 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account