AI Infrastructure & Reliability Engineer

Reposted 3 Days Ago
Be an Early Applicant
Hiring Remotely in Israel
Remote or Hybrid
Mid level
HR Tech • Information Technology • Professional Services • Sales • Software
Our mission is to create a modern work experience that empowers organizations to be remarkable.
The Role
The AI Infrastructure & Reliability Engineer will manage cloud infrastructure, CI/CD processes, observability practices, and AI operations to ensure platform reliability and performance.
Summary Generated by Built In
Job Description
About UsHiBob helps modern, mid-size businesses transform the way they manage people, giving HR and managers all they need to connect, engage, develop, and retain top talent. Since 2015, we've achieved consecutive triple-digit year-over-year growth, all backed by our amazing team of Bobbers from across the globe, making us the choice HRIS of over ~5500 midsize and multinational companies and over 1 Milion users.
Our HR platform is intuitive, data-driven, and built for the way people work today: globally, remotely, and collaboratively.
What this role is really about
You'll join a 3-person platform team within our Business Technology group -owning the internal infrastructure that our AI platform and its users depend on. This isn't a product engineering role, and it isn't ticket work or babysitting pipelines someone else built. You're building and operating the internal foundation that the company runs on. The work covers the full stack of platform engineering: core cloud infrastructure (AWS, Kubernetes, IaC), CI/CD pipelines, AI-driven infrastructure components, and the SRE and observability practice that keeps it all honest -metrics, alerting, incident response, and reliability standards. As our AI capabilities grow, so does the complexity underneath them, and staying ahead of that is central to the role. If you treat infrastructure as a product -reusable, automated, observable, and built to last -this is your kind of role.
Job Requirements
  • 2-4 years Hands-on DevOps, SRE, or infrastructure engineering in production SaaS environments.
  • Strong AWS experience: multi-account architecture, cross-account IAM, serverless and event-driven services (Lambda, SQS, SNS, EventBridge), and EKS cluster management.
  • Proven Kubernetes experience in production, including cross-account migrations and stateful workload management.
  • Proficiency with Terraform - repository structure design, module architecture, and CI/CD pipeline implementation.
  • Hands-on experience building and maintaining GitHub Actions pipelines for end-to-end CI/CD workflows.
  • Working Python proficiency for scripting, internal tooling, and workflow automation.
  • Practical experience implementing observability stacks from scratch: metrics, logging, distributed tracing, and alerting.
  • Experience owning reliability practices: SLOs, incident response, and postmortem culture.
Nice to have
  • Hands-on experience operating LLM APIs in production: rate-limit and quota management, cost attribution per team/model, latency monitoring, and resilience patterns (retries, fallbacks, circuit breakers).
  • FinOps experience across cloud, AI, and observability spend.
  • Experience introducing self-healing or auto-remediation patterns in production.

Job Responsibilities
  • DevOps & AI-Driven Infrastructure - own CI/CD, deployment processes, and release reliability. Build and operate cloud infrastructure that is automated, intelligent, and continuously self-improving - not just managed.
    • Design and build our Terraform repository and IaC pipeline from scratch -AI-assisted generation, drift detection, and policy enforcement built in.
    • Build AI-driven GitHub Actions pipelines -automated code review, risk assessment, and intelligent deployment decisions.
    • Manage Kubernetes workloads across AWS accounts -zero downtime, fully automated, nothing left behind.
  • Embed AI into the operational layer -proactive drift detection, automated remediation, and intelligent scaling toward a self-healing runtime.
  • Reliability & SRE -improve uptime, resilience, and incident response.
    • Define and enforce SLOs/SLIs, error budgets, and on-call practices.
    • Lead incident response, postmortems, and systemic reliability improvements.
  • Own AI-specific reliability: model latency SLOs, token quota monitoring, rate limit handling, fallback and retry strategies, and cost-per-request alerting.
  • Observability & Telemetry - increase visibility, reduce noise, improve troubleshooting.
  • Establish and continuously evolve the observability stack: metrics, logs, distributed tracing, and alerting tuned for both application and AI workloads.
  • AI / LLM Operations- bringing AI systems to production and operating them at scale, with a focus on reliability, performance, and trust.
    • Own the AI infrastructure layer: rate limits, quota management, latency SLOs, and fallback strategies (retries, circuit breakers).
  • Operate LLM APIs in production with resilience and cost attribution per team/model.
  • FinOps & Cost Optimization - optimize AI, infra, and logging costs at scale.
  • Build cost visibility and guardrails across AWS, LLM usage, and observability pipelines.

Benefits
HiBob is a village filled with amazing people and we're especially proud of that. It's a place where Bobbers can be themselves. We're about fun, dreams, hopes and ambition, just as much as we are about precision, growth, and top performance. Becoming a Bobber means you'll receive competitive Total Reward offer including:
Financial & Equity Incentives
  • Equity Plan: Participation in the Company Share Options Plan
  • Social Contributions and Keren Hishtalmut
  • Employee Referral Program: $2,500 for each successful hire
  • Wolt Benefit (meal card): ₪1,000 per month
Health, Wellness
  • Private Health Insurance: Comprehensive premium medical coverage
  • Sick Leave: Full payment from the first day of illness
  • Wellness Benefits: Annual Headspace subscription and dedicated wellness programs
  • Preventive Screening: Health screenings for employees aged 40+
Work-Life Balance & Leave ⚖️
  • Paid Time Off: Competitive paid time off policy
  • HiBaby: 3 weeks of additional fully paid bounding time for new parents
  • Bob Balance Days: 4 additional company-wide "long weekend" days (one per quarter)
  • Social Impact: 2 paid days per year for volunteering and social contribution
  • Work from Anywhere: Temporary remote work option for up to 2 months (available after 6 months of tenure).
  • Birthday Day Off: Enjoy a day off during your birthday month
Hybrid Work & Office Environment
  • Hybrid Model: A flexible balance between office and home-based work
  • Home Office Allowance: One-time stipend to ensure an ergonomic and productive home setup
  • Transportation: Monthly travel allowance or parking arrangements
  • Pet-Friendly: Dog-friendly office environment to support a stress-free workplace
Culture & Growth
  • Social Events: Regular team-building and company-wide events, both local and global
  • Professional Growth: A culture built on precision, performance, and ambitious career scaling

If this sounds like something you've been looking for, we'd love to have you. Come on, join our village

Skills Required

  • 2-4 years Hands-on DevOps, SRE, or infrastructure engineering in production SaaS environments
  • Strong AWS experience: multi-account architecture, cross-account IAM, serverless services
  • Proven Kubernetes experience in production
  • Proficiency with Terraform
  • Hands-on experience building and maintaining GitHub Actions pipelines
  • Working Python proficiency for scripting
  • Practical experience implementing observability stacks
  • Experience owning reliability practices

What the Team is Saying

Giovanna
Alex
Latisha
Rebecca
Ana
Ashley

HiBob Compensation & Benefits Highlights

  • Leave & Time Off Breadth Paid time off includes about 20 days plus quarterly “Bob Balance Days,” a birthday day off, paid volunteer time, and holidays. Company‑wide recharge long weekends and volunteering time emphasize rest and community engagement.
  • Flexible Benefits Hybrid work is supported with a work‑from‑anywhere option for up to two months after six months and a home‑office stipend. Dog‑friendly offices and structured recharge days reinforce flexibility in how and where work happens.
  • Equity Value & Accessibility Equity grants are provided to all new hires with opportunities for additional grants over time. An expressed commitment to equitable pay accompanies the ownership component in careers materials.

HiBob Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Tel Aviv
1,350 Employees
Year Founded: 2015

What We Do

HiBob helps modern, mid-size businesses transform the way they manage people, giving HR and managers all they need to connect, engage, develop, and retain top talent. Since 2015, we’ve achieved consecutive triple-digit year-over-year growth, all backed by our amazing team of Bobbers from across the globe, making us the choice HRIS of over 4000 midsize and multinational companies. Our HR platform is intuitive, data-driven, and built for the way people work today: globally, remotely, and collaboratively. Fast-growing companies across the globe such as Huel, What3words, Fiverr, and VaynerMedia rely upon Bob to help them create the best work experiences for their people.

Why Work With Us

Being a Bobber is all about being you. We want you to bring all parts of yourself to work, giving you the freedom and confidence to be the best you and do your best work. If that’s bubbly, shy, precise, funny, bold, kind, honest, brilliant, or anything in between, we’re waiting with open arms. Come join us.

Gallery

Gallery
Gallery
Gallery
Gallery
Gallery
Gallery

HiBob Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

We love collaborating and connecting with our team members in person and we hope you do too. Our team spends 2-3 days per week in our NYC office.

Typical time on-site: 2 days a week
HQHiBob Tel Aviv
HiBob Amsterdam
HiBob Berlin
HiBob Lisbon
HiBob London
HiBob New York City
HiBob Sydney
Learn more

Similar Jobs

HiBob Logo HiBob

MIS Developer

HR Tech • Information Technology • Professional Services • Sales • Software
Remote or Hybrid
Israel
1350 Employees

HiBob Logo HiBob

Product Manager

HR Tech • Information Technology • Professional Services • Sales • Software
Remote or Hybrid
Israel
1350 Employees

HiBob Logo HiBob

Global IT & Sec Ops Director

HR Tech • Information Technology • Professional Services • Sales • Software
Remote or Hybrid
Israel
1350 Employees

HiBob Logo HiBob

Product Enablement Manager

HR Tech • Information Technology • Professional Services • Sales • Software
Remote or Hybrid
Israel
1350 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account