Senior Software QA Test Development Engineer

Job Posted 15 Days Ago Posted 15 Days Ago
Santa Clara, CA
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The Role
Responsible for developing and executing test plans for NVIDIA platforms, troubleshooting, automation framework, and collaborating on solutions for test failures.
Summary Generated by Built In

NVIDIA is the world leader in GPU Computing. We are passionate about markets include gaming, automotive, vision, HPC, datacenters and networking in addition to our traditional OEM business. NVIDIA is also well positioned as the ‘AI Computing Company’, and NVIDIA GPUs are the brains powering Deep Learning software frameworks, analytics, data centers, and driving autonomous vehicles. We have some of the most experienced and dedicated people in the world working for us. If you are dedicated, forward-thinking, and hard-working technical people across countries sounds exciting, this job is for you. NVIDIA is looking for an outstanding individual who thrives in a diverse work environment, has outstanding interpersonal skills and possesses a strong sense of engagement and continuous process improvement. This candidate must have enterprise server integration, strong Linux experience, reliability testing with various telemetries, scale out cluster, test plan development, track record in developing AI tools and NLP, DevOps, CI/CD experience to join our platform SWQA team.

What you’ll be doing:

  • Responsible for the development and execution of NVIDIA HGX/DGX/MGX platform test plan on servers, OS, FW and CUDA SW stack from design doc.

  • Installing and testing various systems OS, server firmware and SW stack.

  • Drive support for root cause analysis on reliability and validation test failures to identify root cause(s) and achieve mitigation.

  • Build, develop/debug server and OS level automation front-end and back-end framework and tests

  • Review partner and supplier test results and prescribe additional reliability testing on components, servers, and packaging as needed.

  • Work in an agile software development team with very high production quality standards.

  • Manage bug lifecycle and collaborate with inter-groups to drive for solutions.

What we need to see:

  • Bachelor’s Degree (or equivalent experience) in a STEM (Science, Technology, Engineering, Math or Physics) field

  • 5+ years proven experience; or master’s degree.

  • Proven years of OS and server level automation, CI/CD process and DevOps experience using Python, SHELL, Ansible, Jenkins, C/C++, Java, JavaScript

  • Strong server and Linux(Ubuntu, RedHat, CentOS, SuSE, Fedora and etc…) troubleshooting and debugging experience in a bare-metal and KVM/VMWare/Hyper-V environment.

  • Good knowledge and hands-on experience in model testing, AI tools/frameworks (TensorFlow, Pytorch, Cursor and etc…), NLP and LLM benchmarking

  • Experience in using AI development tools for test plans creation, test cases development and test cases automation

  • Strong experience in FW, BMC/OpenBMC, Network protocol, internal/external enterprise storage devices, PCIe buses and devices, IO sub-devices, CPU and memory, ACPI, UEFI spec, Redfish - huge plus

  • Proven years of experience in GitHub/Gitlab/Gerrit, PXE, SLURM, Stack/Kubernetes/Docker) – huge plus

Ways to stand out from the crowd:

  • AI related tools, LLM and NLP.

  • Experience working with NVIDIA GPU hardware is a strong plus.

  • Good to have solid understanding of virtualization in Linux (KVM, Docker orchestrated with Kubernetes)

  • Background in parallel programming ideally CUDA/OpenCL is a plus

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.

    The base salary range is 136,000 USD - 264,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

    You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

    NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

    Top Skills

    Ansible
    C/C++
    Docker
    Java
    JavaScript
    Jenkins
    Kubernetes
    Python
    PyTorch
    Shell
    TensorFlow
    Am I A Good Fit?
    beta
    Get Personalized Job Insights.
    Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

    The Company
    HQ: Santa Clara, CA
    21,960 Employees
    On-site Workplace
    Year Founded: 1993

    What We Do

    NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, NVIDIA is increasingly known as “the AI computing company.”

    Similar Jobs

    NVIDIA Logo NVIDIA

    Senior Software QA Test Development Engineer

    Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
    Santa Clara, CA, USA
    21960 Employees

    True Anomaly Logo True Anomaly

    Future Engineering Opportunities

    Aerospace • Artificial Intelligence • Hardware • Machine Learning • Software • Defense
    Long Beach, CA, USA
    131 Employees

    True Anomaly Logo True Anomaly

    Propulsion Engineer (Senior or Staff)

    Aerospace • Artificial Intelligence • Hardware • Machine Learning • Software • Defense
    2 Locations
    131 Employees

    Xero Logo Xero

    Senior Software Engineer, Embedded Accounting, Authentication

    Cloud • Fintech • Information Technology • Machine Learning • Software
    Hybrid
    San Mateo, CA, USA
    4700 Employees

    Similar Companies Hiring

    True Anomaly Thumbnail
    Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
    Colorado Springs, CO
    131 Employees
    Caliola Engineering Thumbnail
    Software • Machine Learning • Hardware • Defense • Data Privacy • App development • Aerospace
    Colorado Springs, CO
    53 Employees
    Red 6 Thumbnail
    Virtual Reality • Software • Hardware • Defense • Aerospace
    Orlando, Florida
    113 Employees
    By clicking Apply you agree to share your profile information with the hiring company.

    Sign up now Access later

    Create Free Account

    Please log in or sign up to report this job.

    Create Free Account