Senior Staff Engineer II-PayPal

Posted 3 Days Ago
Be an Early Applicant
Pune, Maharashtra, IND
In-Office
Senior level
Information Technology • Consulting
The Role
Design, develop, and optimize GPU cluster management, observability, scheduling and APIs. Profile and benchmark GPU performance, implement synchronization and concurrency controls, and maintain CI/CD for GPU infrastructure tools.
Summary Generated by Built In

 

Job Description:

As a Software Developer in GPU Infrastructure Automation, you will be responsible for designing, developing, and optimizing software solutions that effectively manage and schedule GPU resources. You will work closely with various software teams to ensure seamless integration and optimal performance of our GPU infrastructure.

 

Key Responsibilities:

·         Design and implement GPU cluster management and observability tools.

·         Develop tools and APIs for other computational layers.

·         Conduct performance profiling and optimization using tools like NVIDIA Nsight.

·         Participate in code reviews, design discussions, and continuous integration/continuous deployment (CI/CD) processes.

·         Validate GPU cluster performance with benchmarking tools like MLPerf.

·         Implement and maintain synchronization mechanisms for managing concurrency and shared resources.

·         Developing infrastructure software tool kit for GPU clustering, capacity and scheduling automation

Required Skills and Qualifications:

·         Bachelor’s or Master’s degree in Computer Science or related field.

·         Strong proficiency in Golang, C/C++, and experience with GPU schedulers like SLURM.

·         Strong proficiency in Kubernetes (K8) technologies

·         Strong proficiency in in one of the public cloud Infrastructure and PaaS technologies (AWS, GCP, Azure)

·         In-depth understanding of GPU architectures and parallel computing principles.

·         Excellent understanding of REST APIs and experience with threading, concurrency, and synchronization mechanisms.

·         Knowledge of Linux operating systems.

·         Familiarity with scheduling algorithms and load balancing techniques.

·         Strong understanding of data structures, algorithms, and numerical methods.

·         Proficient in creating and using well-structured CI/CD pipelines.

·         Excellent problem-solving skills and attention to detail.

·         Strong communication and teamwork abilities.



Skills Required

  • Bachelor's or Master's degree in Computer Science or related field
  • Strong proficiency in Golang
  • Strong proficiency in C/C++
  • Experience with GPU schedulers like SLURM
  • Strong proficiency in Kubernetes
  • Experience with public cloud infrastructure (AWS, GCP, or Azure)
  • In-depth understanding of GPU architectures and parallel computing principles
  • Excellent understanding of REST APIs
  • Experience with threading, concurrency, and synchronization mechanisms
  • Knowledge of Linux operating systems
  • Familiarity with scheduling algorithms and load balancing techniques
  • Strong understanding of data structures, algorithms, and numerical methods
  • Proficient in creating and using CI/CD pipelines
  • Excellent problem-solving, communication, and teamwork abilities
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Edison, New Jersey
267 Employees
Year Founded: 2007

What We Do

SecurView is a cybersecurity solutions company founded in 2007 focusing on the following technology domains: datacenter, cloud, mobility, segmentation/NAC, and 24/7 security operations center. We respond to client-specific security requirements by providing comprehensive services around planning, designing and implementation of effective solutions. Through our proven technology and security methodologies and expertise, we help customers maximize the potential of their investment in information technologies

Similar Jobs

Capco Logo Capco

IRR Testing

Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
Remote or Hybrid
India
6000 Employees

Capco Logo Capco

Product Manager

Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
Remote or Hybrid
India
6000 Employees

Morningstar Logo Morningstar

Servicenow Engineer

Artificial Intelligence • Big Data • Enterprise Web • Fintech • Software • Financial Services
Hybrid
Navi Mumbai, Thane, Maharashtra, IND
11500 Employees
4-4 Annually

Morningstar Logo Morningstar

Analyst - Structured Finance

Artificial Intelligence • Big Data • Enterprise Web • Fintech • Software • Financial Services
Hybrid
Navi Mumbai, Thane, Maharashtra, IND
11500 Employees

Similar Companies Hiring

Amplify Platform Thumbnail
Fintech • Financial Services • Consulting • Cloud • Business Intelligence • Big Data Analytics
Scottsdale, AZ
62 Employees
Standard Template Labs Thumbnail
Artificial Intelligence • Information Technology • Software
New York, NY
25 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account