WHAT YOU'LL BE DOING
- Design and maintain high-availability, scalable infrastructure across AWS and Azure clouds
- Lead cross-region, multi-cloud migration projects with minimal service disruption
- Implement infrastructure-as-code using Terraform/CloudFormation for reproducible deployments
- Architect cloud-native solutions optimized for performance, reliability, and cost-efficiency
- Establish robust observability practices with comprehensive monitoring and alerting
- Drive continuous improvement through automation and performance optimization
- Respond to incidents with well-documented procedures and conduct blameless post-mortems
YOU ARE A PERFECT FIT IF YOU HAVE
- Bachelor's or Master's Degree in Computer Science or related technical discipline
- 5+ years in Site Reliability Engineering or DevOps with multi-cloud experience
- Demonstrated proficiency in both AWS and Azure environments (e.g. Kubernetes, EC2/VM, Spot Instances, ELB, RDS, Redis, DynamoDB, Cloudfront/ Front Door, Secret Manager/ Key Vault, Route53/ Azure DNS, SQS/ Service Queue, Lambda/ Azure Functions, EFS/Azure Files)
- Strong experience with containerization (Docker) and orchestration (Kubernetes/AKS/EKS)
- Expertise in infrastructure-as-code tools (Terraform, CloudFormation, ARM templates)
- Proficiency in scripting (Bash) and programming languages (Python, Go)
- Solid understanding of networking concepts, DNS management, and SSL/TLS implementation
- Experience implementing CI/CD pipelines and automating deployment workflows
BONUS POINTS FOR
- Experience managing cloud migrations between AWS and Azure platforms
- Strong background in distributed systems design and troubleshooting
- Knowledge of observability tools (Prometheus, Grafana, ELK stack, Azure Monitor)
- Knowledge of Gitops tools (FluxCD, ArgoCD)
- Experience with database replication and high-availability patterns
- Security-focused mindset with experience implementing cloud security best practices
- History of optimizing cloud infrastructure for significant cost savings
- Experience managing DNS at scale with complex routing configurations
WHAT WE OFFER
- The opportunity to be at the forefront of AI-first eCommerce Search & Product Discovery
- The opportunity to architect next-generation cloud solutions while optimizing our existing infrastructure for reliability, performance, and cost-efficiency across multiple cloud providers
- A fast-paced and dynamic work environment with a focus on innovation and growth
- Competitive salary and benefits package
- The chance to work with a talented and passionate team
- Make a real impact on a product that is transforming the ecommerce industry
Top Skills
What We Do
GroupBy's cloud-native SaaS technology powers the world's most relevant and highly converting eCommerce websites. Our composable commerce-based Product Discovery Platform powered by Google Cloud Discovery AI, provides industry-leading features for data enrichment, search, recommendations, navigation, personalization, merchandising and search analytics. GroupBy’s next-generation search and recommendations platform creates seamless eCommerce experiences optimized for your business outcomes, including revenue, margin, and profit. We excel with complex, large-scale B2B configurations and in dynamic, high-volume B2C scenarios. Founded in 2013, GroupBy is headquartered in Toronto, Canada and has offices in Austin, Texas


_1.png)





