Senior Software Development Engineer in Test (SDET), Chaos Engineering Specialist - (Dublin, CA)

Posted 10 Days Ago
Be an Early Applicant
Dublin, CA
In-Office
Senior level
Artificial Intelligence • Software
The Role
As a Senior SDET specializing in chaos engineering, you will design test automation frameworks, execute chaos experiments, implement monitoring solutions, mentor engineers, and contribute to quality assurance.
Summary Generated by Built In
Company Overview

At Articul8 AI, we're building the next generation of resilient, scalable software systems that help organizations transform their operations. Our commitment to quality and reliability drives our engineering culture, where we continuously test and improve our systems under real-world conditions.

Why Join Articul8 AI?
  • Make an Impact: Shape the resilience and reliability of AI-driven systems at scale.

  • Build with Modern Tech: Leverage cutting-edge tools and platforms (Multi-cloud, AI-first tooling).

  • Ownership & Growth: Take ownership of chaos engineering initiatives and influence engineering culture across teams.

  • Continuous Learning: Collaborate with top engineers, participate in mentoring, and stay ahead in chaos engineering and SRE practices.

Position Summary

We are seeking a Senior SDET specializing in chaos engineering and monitoring to join our Quality Engineering team. You will design and implement sophisticated test automation frameworks, create and run chaos experiments to validate our systems' resilience against real-world failures, while ensuring comprehensive monitoring capabilities that provide actionable insights during both testing and production scenarios.

Key Responsibilities
  • Design, develop, and maintain advanced test automation frameworks that incorporate chaos engineering principles

  • Create and execute chaos experiments that simulate various failure modes and edge cases in our distributed systems

  • Implement monitoring solutions that effectively track system performance, resilience, and failure recovery

  • Establish observability practices that provide deep insights into system behavior during chaos experiments

  • Collaborate with development teams to build resilience into our applications from the ground up

  • Develop metrics and dashboards to visualize system reliability and the impact of chaos experiments

  • Lead post-mortem analyses to identify system weaknesses discovered through chaos testing

  • Integrate chaos testing into CI/CD pipelines to validate system resilience continuously

  • Mentor engineers through code reviews, technical sessions, and hands-on guidance in test automation, chaos engineering, and monitoring best practices.

  • Contribute to the company's overall testing strategy and quality assurance practices

QualificationsRequired
  • Bachelor's degree in Computer Science, Engineering, or related field

  • 5+ years of experience in software testing and quality assurance, with at least 2 years focused on chaos engineering

  • Strong programming skills in languages such as Python, Go, and/or Rust

  • Experience with chaos engineering tools such as Chaos Monkey, Gremlin, or similar frameworks

  • In-depth knowledge of monitoring systems like Prometheus, Grafana, ELK Stack, or similar tools

  • Experience implementing observability practices (metrics, logging, tracing) in distributed systems

  • Familiarity with container orchestration platforms like Kubernetes and related chaos tools

  • Experience with SRE practices and principles

  • Strong understanding of CI/CD pipelines and how to integrate testing workflows

  • Experience with cloud platforms (AWS, GCP, Azure) and their monitoring capabilities

  • Excellent communication skills with the ability to present technical findings to various stakeholders

Preferred
  • Master’s degree in Computer Science, Engineering, or related field

  • Knowledge of statistical analysis for evaluating test results and system performance

  • Experience with distributed systems and microservice architectures

  • Contributions to open-source testing or chaos engineering projects

  • Familiarity with AI/ML systems and their unique testing challenges

  • Relevant certifications in cloud platforms, testing methodologies, or chaos engineering

Ready to shape the future of resilient software systems? Apply now and help drive the reliability of tomorrow’s AI at Articul8 AI!

Top Skills

AWS
Azure
Chaos Monkey
Elk Stack
GCP
Go
Grafana
Gremlin
Kubernetes
Prometheus
Python
Rust
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Dublin, California
58 Employees
Year Founded: 2024

What We Do

Articul8 AI is a technology company whose products transform enterprise data and expertise into powerful engines of growth, value and impact. Our full-stack GenAI platform is revolutionizing how enterprises harness their data and expertise to build expert-level Generative AI applications for their mission-critical challenges. Our products deliver enterprise-scale impact with ROI in hours to weeks. General-purpose GenAI models, while necessary, are not sufficient to deliver enterprise-specific decisioning and actioning. Our platform addresses this gap by making it straightforward for companies to build sophisticated, enterprise-scale and expert-level GenAI applications that encode their domain expertise. Our proprietary technology does the heavy lifting through autonomous decisions and actions, automated data intelligence, improved precision and relevance with industry knowledge encoded into Articul8's library of domain and task-specific models. We are purpose-built for regulated industries and meet the highest standards of compliance, data security, privacy and performance, including traceability and auditability at every step. We are trusted by leading global enterprises like AIAA, Itochu Techno-Solutions Corporation, Uptycs, AWS, NIQ, Intel and Franklin Templeton to transform their mission-critical work.

We are the enterprise GenAI platform that simply works! For more information, please visit www.articul8.ai.

Similar Jobs

ServiceNow Logo ServiceNow

Senior Machine Learning Engineer

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Santa Clara, CA, USA
159K-270K Annually

Atlassian Logo Atlassian

Head of Employee Research & Listening

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
In-Office or Remote
San Francisco, CA, USA
182K-286K Annually

CrowdStrike Logo CrowdStrike

Senior Software Engineer

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
CA, USA
140K-215K Annually

Caterpillar Logo Caterpillar

2026 ITS (Information Technology Solutions) Entry Level Rotation

Artificial Intelligence • Cloud • Internet of Things • Software • Cybersecurity • Industrial
Hybrid
San Diego, CA, USA
83K-104K Annually

Similar Companies Hiring

Axle Health Thumbnail
Logistics • Information Technology • Healthtech • Artificial Intelligence
Santa Monica, CA
15 Employees
Compa Thumbnail
Software • Other • HR Tech • Business Intelligence • Artificial Intelligence
Irvine, CA
48 Employees
Scrunch AI Thumbnail
Software • SEO • Marketing Tech • Information Technology • Artificial Intelligence
Salt Lake City, Utah

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account