Software Engineer in Test (Distributed Systems & AI)

Reposted 19 Days Ago
Be an Early Applicant
Bengaluru, Bengaluru Urban, Karnataka, IND
In-Office
Junior
Artificial Intelligence • Big Data • Machine Learning
The Role
As a Software Engineer in Test, design and implement distributed frameworks, build AI-driven tools, and ensure platform resilience through chaos experiments and telemetry pipelines.
Summary Generated by Built In
WEKA is architecting a new approach to the enterprise data stack built for the age of reasoning. NeuralMesh by WEKA sets the standard for agentic AI data infrastructure with a cloud and AI-native software solution that can be deployed anywhere. It transforms legacy data silos into data pipelines that dramatically increase GPU utilization and make AI model training and inference, machine learning, and other compute-intensive workloads run faster, work more efficiently, and consume less energy.

WEKA is a pre-IPO, growth-stage company on a hyper-growth trajectory. We’ve raised $375M in capital with dozens of world-class venture capital and strategic investors. We help the world’s largest and most innovative enterprises and research organizations, including 12 of the Fortune 50, achieve discoveries, insights, and business outcomes faster and more sustainably. We’re passionate about solving our customers’ most complex data challenges to accelerate intelligent innovation and business value. If you share our passion, we invite you to join us on this exciting journey.

The Mission
At WEKA, we are building NeuralMesh™- the world’s first intelligent, adaptive mesh storage system. To ensure our platform is unbreakable for the world’s largest AI and GPU clusters, we don't just "test" our code. We build an adversarial distributed system as complex and sophisticated as the product itself.
We are moving away from traditional QA automation. We are building a high-octane engineering force that treats reliability as a high-end software problem. We need "Quality Hackers" who want to build the technology that proves bugs don't exist.
In this role as a Software Engineer in Test (SET), you are a developer first. You will join a high-impact team of engineers who write production-grade code to build a massive-scale validation ecosystem. Your job is to act as "The Breaker"—designing the infrastructure, chaos experiments, and AI-driven tools that push our platform to its theoretical limits.

What You’ll Build:
  • Adversarial Engineering: Design and implement Python-based distributed frameworks capable of orchestrating millions of concurrent IO operations to hunt down race conditions and memory leaks.
  • AI-Augmented Validation: Be at the forefront of the AI-Native transformation. You will leverage LLMs and Generative AI to automate complex scenario generation, build intelligent agents for root-cause analysis, and multiply your engineering velocity.
  • Simulation & Chaos: Build the "Entropy Engine." You will develop tools that inject real-world failures - latency, packet loss, and hardware crashes - to prove the resilience of our Raft and RDMA implementations.
  • Deep-System Observability: Move beyond "Pass/Fail." You will build telemetry pipelines to track P99 latency and jitter, providing critical architectural feedback to the Core Kernel teams.
  • Collaborative Architecture: You will operate with the same rigorous standards as the Core R&D team: design docs, production-grade code reviews, and high-level architectural planning.
Requirements:
  • Extensive Coding experience: You are a Python expert who understands "under the hood" internals. You are comfortable reading and debugging C++, Rust, or Go to understand how the core system works.
  • Systems Engineering Mindset: You have a background in distributed systems, networking (TCP/IP, RDMA), or storage protocols. You understand the complexities of consistency and metadata at scale.
  • AI Enthusiast: You are an early adopter of AI tools (Copilot, LLMs) and are excited about using them to automate the most tedious parts of the engineering lifecycle.
  • The "SRE" Lens: You approach quality through the lens of Site Reliability Engineering. You care about observability, MTTD (Mean Time to Detection), and building self-healing testing loops.
  • Problem Hunter: You have a "hacker" instinct. You don’t just find a bug; you find the architectural flaw that allowed it to exist.
Why Join This Group?
You will be part of a newly restructured group led by veteran systems architects, moving at the speed of a startup with the impact of a global leader. If you are a backend engineer who wants to solve the hardest problems in computer science—verifying correctness in a massive asynchronous system—this is your home.

Skills Required

  • Strong coding skills in Python
  • Comfortable reading and debugging C++, Rust, or Go
  • Background in distributed systems and networking
  • Familiarity with AI tools and automation for engineering
  • Experience in Site Reliability Engineering principles
  • Ability to identify architectural flaws causing bugs
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Campbell, CA
273 Employees
Year Founded: 2014

What We Do

Weka offers WekaFS, the modern file system that uniquely empowers organizations to solve the newest, biggest problems holding back innovation. Optimized for NVMe and the hybrid cloud, Weka handles the most demanding storage challenges in the most data-intensive technical computing environments, delivering truly epic performance at any scale. Its modern architecture unlocks the full capabilities of today’s data center, allowing businesses to maximize the value of their high-powered IT investments. Weka helps industry leaders reach breakthrough innovations and solve previously unsolvable problems. Try now at https://www.weka.io/

Similar Jobs

Zscaler Logo Zscaler

Senior Staff Software Development Engineering

Cloud • Information Technology • Security • Software • Cybersecurity
Easy Apply
Hybrid
Bangalore, Bengaluru, Karnataka, IND
8697 Employees

Zscaler Logo Zscaler

Development Engineer

Cloud • Information Technology • Security • Software • Cybersecurity
Easy Apply
Hybrid
Bangalore, Bengaluru, Karnataka, IND
8697 Employees

Mondelēz International Logo Mondelēz International

Mgr, Global Demand Insights

Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
Remote or Hybrid
6 Locations
90000 Employees
122K-168K Annually

Boeing Logo Boeing

Lead Software Engineer

Aerospace • Information Technology • Software • Cybersecurity • Design • Defense • Manufacturing
Hybrid
Bengaluru, Bengaluru Urban, Karnataka, IND
170000 Employees

Similar Companies Hiring

Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account