Senior Software Development Engineer, HPC & AI Networking

Posted 6 Days Ago
Be an Early Applicant
Chicago, IL
113K-260K Annually
Mid level
Artificial Intelligence • Cloud • Information Technology • Consulting
The Role
As a Full-Stack Monitoring & AIOps Engineer, you will manage expectations between HPE and ANL regarding responsibilities, diagnose HPC fabric issues, develop integrated software algorithms for data analysis and monitoring, leverage machine learning for performance improvements, and document installation procedures and performance metrics.
Summary Generated by Built In

Senior Software Development Engineer, HPC & AI Networking

This role has been designed as ‘’Onsite’ with an expectation that you will primarily work from an HPE partner/customer office.

Who We Are:

Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today’s complex world. Our culture thrives on finding new and better ways to accelerate what’s next. We know diverse backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good. If you are looking to stretch and grow your career our culture will embrace you. Open up opportunities with HPE.

Job Description:

Join the cutting-edge HPE Slingshot R&D team at the renowned Argonne National Laboratory in the vibrant Chicago Metropolitan Area! As a pivotal member of the Slingshot AIOps and Monitoring R&D group, you will collaborate closely with top-tier HPE and ANL experts to push the boundaries of supercomputing technology. This unique dual-reporting role to both HPE’s R&D manager and an ANL manager places you at the heart of innovation, working directly on the groundbreaking Aurora supercomputer system. Immerse yourself in a dynamic, on-site environment with state-of-the-art facilities and network access provided by ANL. Your mission? To drive the evolution of diagnostics and monitoring applications for HPC Network, ensuring they meet and exceed the demanding requirements of one of the world's most powerful computing systems. You’ll play a key role in the exchange of cutting-edge ideas, pioneering new features. HPE Slingshot is the backbone of some of the world’s most powerful supercomputers, including systems that top the TOP500 list. Your contributions will directly impact the future of supercomputing. Embrace this opportunity to be at the forefront of technological advancement and make a tangible difference in the world of high-performance computing. Join us and be a part of something extraordinary!

 

Responsibilities:

  • Manage and Communicate Expectations: Coordinate and communicate project responsibilities and commitments with both the HPE manager and ANL manager.
  • Customer Issue Resolution: Track and facilitate the resolution of customer HPC interconnect issues, interfacing with HPE Slingshot R&D as necessary to align resources for advanced issue resolution.
  • Problem Diagnosis and Documentation: Assist in diagnosing fabric-related problems, write documentation, perform Root Cause Analyses (RCAs), drive upgrade planning, and other related tasks.
  • Software Development: Develop and program integrated software to structure, analyze, and leverage structured and unstructured data in monitoring and analytics system applications.
  • Performance and Maintenance: Document installation and maintenance procedures, complete programming tasks, perform testing and debugging, and define and monitor performance metrics.
  • Technical Leadership: Provide technical leadership for significant projects and programs, participating in cross-functional initiatives and mentorship.
  • Innovation and Problem Solving: Apply in-depth professional knowledge and innovative ideas to solve complex problems, contributing to measurable improvements in time-to-market, cost reductions, or customer satisfaction.

 

Qualifications:

  • Experience: 5+ years in software (systems/application) development preferably focused on distributed systems or network programming.
  • Technical Skills:
    • Proficiency in C/C++, Python and a deep understanding of Linux and kernel-level programming.
    • Strong understanding of data structures, algorithms, and operating systems.
    • Experience with distributed systems concepts, including CAP theorem, Consensus, messaging, and High Availability.
    • Expertise in low-latency networking, including HPC network fabric
  • Problem-Solving Skills: Strong ability to troubleshoot complex networking issues.
  • Communication Skills: Excellent organizational, verbal, and written communication skills.
  • Education: Bachelor’s degree in Computer Science, Engineering, or related fields.
  • Mindset: A proactive developer and expert in distributed systems who prioritizes simplicity and scalability, thrives in a collaborative, agile setting, and is eager to continuously learn.
  • #unitedstates #chicago #ml #statisticalmodel #diagnostics #algorithms #testing #debugging #performancemetrics

Additional Skills:

Cloud Architectures, Cross Domain Knowledge, Design Thinking, Development Fundamentals, DevOps, Distributed Computing, Microservices Fluency, Full Stack Development, Security-First Mindset, Solutions Design, Testing & Automation, User Experience (UX)

What We Can Offer You:

Health & Wellbeing

We strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial and emotional wellbeing.

Personal & Professional Development

We also invest in your career because the better you are, the better we all are. We have specific programs catered to helping you reach any career goals you have — whether you want to become a knowledge expert in your field or apply your skills to another division.

Diversity, Inclusion & Belonging

We are unconditionally inclusive in the way we work and celebrate individual uniqueness. We know diverse backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good.

Let's Stay Connected:

Follow @HPECareers on Instagram to see the latest on people, culture and tech at HPE.

#unitedstates#highperformancecompute

Job:

Engineering

Job Level:

TCP_04

States with Pay Range Requirement

The expected salary/wage range for a U.S.-based hire filling this position is provided below. Actual offer may vary from this range based upon geographic location, work experience, education/training, and/or skill level. If this is a sales role, then the listed salary range reflects combined base salary and target-level sales compensation pay. If this is a non-sales role, then the listed salary range reflects base salary only. Variable incentives may also be offered. Information about employee benefits offered can be found at https://myhperewards.com/main/new-hire-enrollment.html.

USD Annual Salary: $117,500.00 - $270,000.00

HPE is an Equal Employment Opportunity/ Veterans/Disabled/LGBT and Affirmative Action employer. We are committed to diversity and building a team that represents a variety of backgrounds, perspectives, and skills. We do not discriminate and all decisions we make are made on the basis of qualifications, merit, and business need. Our goal is to be one global diverse team that is representative of our customers, in an inclusive environment where we can continue to innovate and grow together. Please click here: Equal Employment Opportunity.

Hewlett Packard Enterprise is EEO F/M/Protected Veteran/ Individual with Disabilities.

HPE will comply with all applicable laws related to employer use of arrest and conviction records, including laws requiring employers to consider for employment qualified applicants with criminal histories. .

Top Skills

C++
Java
Python
The Company
HQ: Houston, TX
61,628 Employees
On-site Workplace

What We Do

In 1939, Bill Hewlett and Dave Packard, college friends turned business partners, started the original Silicon Valley startup in the space of a rented Palo Alto garage. Starting with audio oscillators, the friends built the foundation for a company that would grow to become a global leader in enterprise technology.

More than 75 years later, our success is exemplified through our employees’ drive to advance ideas that bring meaningful innovations to life for our customers and partners around the globe. We are guided by our mission to help customers use technology to turn ideas into value, and empower them to transform industries, markets and lives. We simplify Hybrid IT, power the Intelligent Edge and provide the expertise to make it all happen.

Similar Jobs

Caxy Logo Caxy

Senior Full Stack Software Developer

Agency • Artificial Intelligence • Enterprise Web • Mobile • Software
Remote
Hybrid
Chicago, IL, USA
45 Employees

Vibes Logo Vibes

Senior Software Engineer

Marketing Tech • Mobile • Software
Easy Apply
Chicago, IL, USA
115 Employees

Vibes Logo Vibes

Software Engineer

Marketing Tech • Mobile • Software
Easy Apply
Chicago, IL, USA
115 Employees

Vibes Logo Vibes

Senior Software Engineer - Professional Services

Marketing Tech • Mobile • Software
Easy Apply
Chicago, IL, USA
115 Employees

Similar Companies Hiring

Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees
InCommodities Thumbnail
Renewable Energy • Machine Learning • Information Technology • Energy • Automation • Analytics
Austin, TX
234 Employees
RunPod Thumbnail
Software • Infrastructure as a Service (IaaS) • Cloud • Artificial Intelligence
Charlotte, North Carolina
53 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account