- 5+ years of experience as an SRE (or similar role) in a high-scale production environment, with hands-on ownership across the full stack - infrastructure and application layers.
- Business-level reliability experience is a strong advantage.
- Experience in designing, building, and operating cloud-native systems on AWS.
- Hands-on experience with maintaining Node.JS or JVM-based applications running with the following: MongoDB, ElasticSearch, Kafka.
- Strong coding skills and a software engineering mindset - you build your own tools rather than waiting for someone else to.
- Experience with infrastructure-as-code and modern container orchestration platforms.
- Practical experience building or integrating AI-driven solutions (e.g., LLMs, agents, or AI-powered operational tooling).
- A true owner - you take responsibility for systems end-to-end and proactively drive improvements without waiting for direction.
- A problem solver who practices adaptability and flexibility to business needs.
- Competitive equity
- Hybrid work schedule
- Company funded health insurance
- 6 weeks fully paid Parental Leave
- Critical Family Medical Leave
- Financial planning services
- Employee learning & development budget
- Values-based recognition (quarterly and annually)
- Social community & ERG programs
- FreeFit gym package
- Work-life harmony
- Dog friendly office
Skills Required
- 5+ years of experience as an SRE or similar role in a high-scale production environment
- Work from the Tel Aviv office three days per week
- Experience designing, building, and operating cloud-native systems on AWS
- Hands-on experience maintaining Node.JS or JVM-based applications
- Experience with MongoDB, Elasticsearch, and Kafka
- Strong coding skills and a software engineering mindset (build your own tools)
- Experience with infrastructure-as-code
- Experience with modern container orchestration platforms (e.g., Kubernetes)
- Practical experience building or integrating AI-driven solutions (LLMs, agents, AI operational tooling)
- Business-level reliability experience
- Ownership mindset and strong problem-solving/adaptability
What We Do
BigPanda is the only Event Correlation and Automation platform built for domain-agnostic AIOps. We transform how IT teams prevent outages and resolve incidents by turning data into insights and action. Without BigPanda, IT Ops and DevOps teams struggle with manual and reactive incident response capabilities that are badly suited for the scale, complexity and velocity of modern IT environments. This results in painful outages, unhappy customers, growing IT headcount and the inability to focus on innovation. Fortune 500 enterprises such as Intel, Cisco, United, Nike, Marriott and Expedia rely on BigPanda to prevent outages, reduce costs, and give their teams time back for digital transformation. BigPanda helps organizations take a giant step towards Autonomous IT Operations by turning IT noise into insights and manual tasks into automated actions. BigPanda is backed by top-tier investors including Sequoia Capital, Mayfield, Battery Ventures, Greenfield Partners and Insight Partners. Visit www.bigpanda.io for more information.
Gallery









