Why work at Nebius
Nebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in-house AI/ML teams. Our employees work at the cutting edge of AI cloud infrastructure alongside some of the most experienced and innovative leaders and engineers in the field.
Where we work
Headquartered in Amsterdam and listed on Nasdaq, Nebius has a global footprint with R&D hubs across Europe, North America, and Israel. The team of over 800 employees includes more than 400 highly skilled engineers with deep expertise across hardware and software engineering, as well as an in-house AI R&D team.
About the Role
We are looking for a QA Engineer specializing in servers and server components to ensure the quality, performance, reliability, and compatibility of modern server platforms.
In this role, you will work closely with hardware, firmware, system, services, and application teams to validate server architectures and components across the full product lifecycle - from early prototypes to production deployment.
Responsibilities
System & Component Validation
• Qualify complete server system architecture, including compute, memory, storage, networking, and accelerators
• Test and validate interactions between server components: CPU, DRAM, NVMe devices, network interface cards (NICs), and GPUs / accelerators
• Perform individual component testing to verify performance, stability, and reliability prior to and after system integration
• Execute system-level validation under different configurations, workloads, and use cases
• Evaluate benchmark results against technical specifications, business and application requirements, historical baselines, and previous generations
• Identify performance regressions, instability, and reliability issues under sustained load and stress conditions
• Ensure hardware components and firmware meet business needs, performance goals, and specifications
Firmware, BIOS & BMC Validation
• Test and validate firmware for BMC / BIOS / FPGA
• Validate BMC functionality including IPMI/Redfish, sensors, power, thermal management, and logging
• Perform firmware upgrade, downgrade, and regression testing
• Assess firmware impact on performance, stability, and reliability across different hardware configurations
• Prepare and maintain test cases, checklists, and test reports
• Participate in platform validation for manufacturing and production readiness
Required
• Strong understanding of server system architecture
• In-depth knowledge of server components and their operation
• Experience with component-level and system-level testing
• Understanding of BIOS/UEFI and BMC fundamentals
• Practical experience with Linux (command line, system tools, logs)
• Scripting skills (e.g. Bash, Python) for test and benchmark automation
• Ability to analyze performance data and identify regressions or anomalies
• Strong debugging, documentation, and communication skills
Nice to Have
• Experience with data center or server-grade hardware platforms
• Familiarity with common benchmarking and stress tools
• Understanding of PCIe topology and system interconnects
• Experience with reliability, power, and thermal testing
• Exposure to production, manufacturing, or fleet-scale validation
• Experience working with early hardware prototypes
What we offer
- Competitive salary and comprehensive benefits package.
- Opportunities for professional growth within Nebius.
- Flexible working arrangements.
- A dynamic and collaborative work environment that values initiative and innovation.
We’re growing and expanding our products every day. If you’re up to the challenge and as excited about AI and ML as we are, join us!
What We Do
Cloud platform specifically designed to train AI models








