Senior Engineer - DevOps

Posted 18 Days Ago
Be an Early Applicant
Īnd, Chamba, Himāchal Pradesh
Senior level
Artificial Intelligence • Hardware • Robotics • Software • Metaverse
The Role
Join NVIDIA's Software Infrastructure and Operations team as a Senior Engineer. Design and maintain Kubernetes based environments, develop automation tools, deploy new data center infrastructure, and work on cloud infrastructure. Strong background in Kubernetes, programming, databases, and infrastructure management is required. Bachelor's or Master's degree in CS or related field is preferred.
Summary Generated by Built In

NVIDIA is looking for an outstanding engineer to join its Software Infrastructure and Operations team. The position will be part of a fast-paced crew that develops and maintains sophisticated Kubernetes based development, build and test environments for a multitude of platforms including Windows and Linux.

What you’ll be doing:

  • Design/Architect the scaling operation in our data centers. Deploy and Support end-to-end container management solution with Kubernetes, Docker, containerd. Design solutions with service discovery, networking, monitoring, logging, scheduling in Kubernetes

  • Setup and Manage end to end Jenkins instances - tools, plugins, nodes, user management, back up, restore, monitoring, etc. Design and develop tools needed for automating maintenance of 10000+ hosts with only 10 support engineers.

  • Use your depth in algorithms and system software background!

  • Work in teams to deploy new data center infrastructure.

  • Plan and implement critical metrics tracking using various data analytics mining methods and dashboards.

  • Reuse AI techniques to extract useful signals about machines and jobs from the data generated!

  • Take part in prototyping, crafting and developing cloud infrastructure for NVIDIA.

What we need to see:

  • Strong Kubernetes understanding and background especially on-premises setup and extensive experience with Kubernetes components & subsystems.

  • Experience of maintaining large scale cloud/on-prim infrastructure applications using Kubernetes

  • Proven programming background in python/Golang/java and/or relevant scripting languages

  • Excellent debugging and analytical skills and experience in Databases both SQL (MySQL ) and NoSQL (Elastic Search /MongoDB)

  • Proficient with configuration management tools like Ansible, Chef, Puppet and strong experience with Jenkins and/or other CI systems.

  • Hands-on experience with VMs, Dockers, Kubernetes Cluster.

  • Experience with analytics/visualization tools like Kibana, Grafana, Splunk etc. and experience with monitoring systems such as Zabbix and/or Nagios is nice to have

  • 8+ years of proven experience

  • Bachelors or Master's Degree or equivalent experience in CS, Software Engineering, or related field.

Ways to stand out from the crowd:

  • Previous experience with DevOps teams

  • Thrives in a multi-tasking environment with constantly evolving priorities and documents work well

  • Outstanding collaboration skills across organizational boundaries, experience with using and improving data centers and with computer algorithms and ability to choose the best possible algorithms to meet the scaling challenge

  • Ability to divide complex problems into simple sub problems and then reuse available solutions to implement most of those

  • Ability to design simple systems that can work reliably without needing much support

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and dedicated people in the world working for us and, due to unprecedented growth, our elite engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.

Top Skills

Ansible
Chef
Docker
Go
Java
Kubernetes
NoSQL
Puppet
Python
SQL
The Company
HQ: Santa Clara, CA
21,960 Employees
On-site Workplace
Year Founded: 1993

What We Do

NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, NVIDIA is increasingly known as “the AI computing company.”

Similar Jobs

Īnd, Chamba, Himāchal Pradesh, IND
32000 Employees
Īnd, Chamba, Himāchal Pradesh, IND

Accelya Logo Accelya

Senior Engineer- Datacentre Operations

Aerospace • Software • Transportation
Īnd, Chamba, Himāchal Pradesh, IND
2028 Employees

Accelya Logo Accelya

Senior Engineer - Software Development

Aerospace • Software • Transportation
Īnd, Chamba, Himāchal Pradesh, IND
2028 Employees

Similar Companies Hiring

TrainingPeaks (A Peaksware Company) Thumbnail
Software • Fitness
Louisville, CO
69 Employees
bet365 Thumbnail
Software • Gaming • eSports • Digital Media • Automation
Denver, Colorado
6100 Employees
Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account