Our Tech Stack:
- Docker
- AWS EKS
- Kafka / AutoMQ for asynchronous messaging
- Elixir & Ruby for core services
- gRPC for inter-service communication
- GraphQL for API ingress
- Next.js / TypeScript for frontend
- PostgreSQL (RDS) for persistent storage
- GitHub Actions (primary CI), with Jenkins & Argo Workflows for legacy pipelines
- Redis for key/value storage
- Terraform for infrastructure as code
- Python and GoLang for our Platform tools
- Datadog, Sentry for observability and incident response
What you will be doing:
- Define and evolve infrastructure architecture to support multi-region deployments
- Design systems for resilience, scalability, and operational simplicity
- Explore and apply AI-assisted approaches to reduce operational toil, improve reliability, and support platform decision-making where it creates clear value
- Extend monitoring and observability capabilities across services
- Make operational data easy to access, understand, and act on
- Build tooling for safe deployments, fast rollbacks, and reduced operational toil using software methodologies and approaches
- Experiment with AI-assisted detection, triage, or automation to improve signal quality and reduce manual effort
- Scope, lead, and deliver platform initiatives autonomously
- Drive cross-team projects with clear outcomes and accountability
- Raise the bar through documentation, runbooks, and internal knowledge sharing
- Help teams learn, align, and operate more effectively together
We’d love to hear from you if you:
- Have 5+ years of experience building, operating, and troubleshooting production-grade systems, with a strong focus on reliability, observability, and scalability
- Bring hands-on experience with AI/ML or LLM-based tools in a platform or operational context (e.g. automation, observability, developer experience, or incident response)
- Demonstrate strong programming and automation skills in Python, Go, or Bash, and use Infrastructure as Code (Terraform) to build repeatable, low-risk systems
- Have experience with AWS (EKS, RDS, S3, CloudFront), or equivalent platforms on GCP or Azure
- Are comfortable working close to the system with solid Linux and networking fundamentals (TCP/IP, DNS, firewalls, load balancing, VPNs)
- Have practical experience designing or improving observability (metrics, logs, traces) using tools such as Datadog, Grafana, ELK, Sentry, and OpsGenie
- Take ownership of complex technical problems, form clear opinions backed by data, and drive solutions through implementation and collaboration
- Work effectively across teams, communicate technical concepts clearly, and apply systems thinking to make platforms easier to use, safer to operate, and simpler to scale
Top Skills
What We Do
Fresha is the world's largest and top-rated booking platform for Beauty and Wellness trusted by millions of consumers worldwide. Fresha is used by 70,000+ businesses and 300,000+ professionals worldwide, processing over 20mil appointments per month. Fresha is headquartered in London, United Kingdom with global offices located in New York City, Vancouver, Sydney, Dublin, Amsterdam , Dubai and Warsaw. The company raised $185M in venture capital funding to date from leading institutional investors.
Fresha allows consumers to discover, book and pay for beauty and wellness appointments with local businesses via its marketplace, while beauty and wellness businesses and professionals use an all-in-one platform to manage their entire operations with its intuitive free business software and financial technology solutions. Fresha’s ecosystem gives merchants everything they need to run their business seamlessly by facilitating appointment bookings, point-of-sale, customer records management, marketing automation, loyalty, beauty products inventory and team management. The consumer marketplace unlocks revenue potential for partner businesses by leveraging the power of online bookings and automated marketing through mobile apps and advanced integrations with major tech brands including Instagram, Facebook and Google.








