DevOps Engineer (GCP)

Posted Yesterday
Be an Early Applicant
Hiring Remotely in Greece
Remote
Mid level
Artificial Intelligence • Information Technology • Machine Learning • Software • Analytics
The Role
Own and evolve GCP-based infrastructure for an AI evaluation platform: manage Terraform, GKE, databases, CI/CD, observability, secrets, and cost/reliability. Collaborate with backend, ML, and frontend teams to make deployments repeatable, secure, and reliable.
Summary Generated by Built In

Are you passionate about AI? 🤖

At Satori Analytics, we aim to change the world one algorithm at a time by bringing clarity to global brands through Data & AI. From cloud-based ecosystems for fintech to predictive models for airlines, our cutting-edge solutions cover the entire data lifecycle—from ingestion to AI applications.

As a fast-growing scale-up, our team of 100+ tech specialists—including Data Engineers, Data Scientists, and more—delivers innovative analytics solutions across industries like FMCG, retail, manufacturing and FSI. Join us as we lead the data revolution in South-Eastern Europe and beyond!

Together with a partnering company, we're looking for a a DevOps / Platform Engineer to own and evolve the infrastructure that keeps this platform reliable (AI agent evaluation platform), observable, secure, and fast to ship to. You'll work closely with backend, ML, and frontend engineers to make deploying and operating services boring, repeatable, and safe.

What Your Day Might Look Like:

  • Cloud infrastructure as code: Own and extend our Terraform estate across multiple GCP environments (base, core, obs, dev, test, prod), including GKE clusters, Cloud SQL (Postgres/MySQL), networking, buckets, and IAM. Drive the in-progress "Neo" platform rollout and the cutover/retirement of legacy infrastructure.
  • Kubernetes & containers: Manage workloads on GKE, maintain Dockerfiles and Helm-style application configs for ~10 backend services, and tune autoscaling, resource limits, and pod disruption budgets.
  • Maintain and improve our GitHub Actions pipelines: PR checks (Python/JS lint, type-check, tests), Terraform prechecks, image builds and pushes, auto-deploy, and DB-migration labelling/gating. Reduce build times and flakiness, and make deploys self-service for product teams.
  • Data & messaging infrastructure: Operate Postgres, Redis, and Celery-based async workers; manage Alembic migrations, queue health, and backpressure for long-running simulation jobs.
  • Observability: Own our monitoring stack — Grafana dashboards, ClickHouse, Langfuse (LLM tracing), and Celery queue metrics. Build alerting and SLOs so we catch issues before customers do.
  • Security & secrets: Manage secret distribution, least-privilege IAM, and remediation tracking. Partner with engineering on findings in our security assessment process.
  • Cost & reliability: Keep an eye on cloud and LLM-proxy (LiteLLM) spend, right-size resources, and improve resilience of the simulation and evaluation pipelines.

You'll work with:

  • Cloud: Google Cloud Platform (GKE, Cloud SQL, GCS, IAM); some AWS / IBM footprint
  • IaC: Terraform (>= 1.14), multi-environment root modules
  • Containers/orchestration: Docker, docker compose (local), Kubernetes / GKE
  • CI/CD: GitHub Actions
  • Backend: Python 3.13+ (managed with uv), Celery, FastAPI-style HTTP APIs; Node/Express services
  • Data: PostgreSQL, MySQL, Redis, ClickHouse
  • Observability: Grafana, Langfuse, custom Celery metrics
  • LLM infra: LiteLLM proxy

Requirements

Your Superpowers 🚀

  • 3+ years in DevOps / SRE / Platform Engineering, or strong backend experience with heavy infra ownership.
  • Solid hands-on Terraform (modules, state, multi-environment) and cloud experience (GCP preferred; AWS/Azure transferable).
  • Production Kubernetes experience: deployments, services, autoscaling, debugging pods, rollouts/rollbacks.
  • Strong Docker fundamentals and comfort writing/optimising Dockerfiles.
  • CI/CD pipeline design and maintenance (GitHub Actions, or equivalent like GitLab CI / CircleCI).
  • Comfortable scripting and reading code in Python and/or Bash; able to navigate a polyglot monorepo.
  • Operational experience with relational databases and managed database services (migrations, backups, performance).
  • A reliability mindset: monitoring, alerting, incident response, and writing runbooks.

Bonus points for:

  • Experience operating Celery / distributed task queues and Redis at scale.
  • Familiarity with LLM/AI infrastructure (model proxies, GPU scheduling, token/cost management).
  • Observability tooling depth (Grafana, Prometheus, ClickHouse, OpenTelemetry, Langfuse or similar tracing).
  • Security/compliance experience (IAM hardening, secret management, vulnerability remediation).
  • Cost-optimisation experience for cloud + third-party API spend.
  • Experience supporting a monorepo with multiple language ecosystems and editable/internal package dependencies.

Benefits

Perks on Perks

  • Competitive salary.
  • Training budget to level up your skills from top tech partners like Microsoft, AWS, Salesforce, and Databricks – whether it’s certifications or courses, we’ve got you covered.
  • Private insurance, top-tier tech gear, and the chance to work with a stellar crew.

Ready to create some data magic with us? Hit that apply button and let’s get started. ✨

Skills Required

  • 3+ years in DevOps, SRE, or Platform Engineering (or strong backend experience with heavy infra ownership)
  • Hands-on Terraform (modules, state, multi-environment)
  • Cloud experience (GCP preferred; AWS/Azure transferable)
  • Production Kubernetes experience (deployments, autoscaling, debugging, rollouts/rollbacks)
  • Strong Docker fundamentals and Dockerfile optimisation
  • CI/CD pipeline design and maintenance (GitHub Actions or equivalent)
  • Comfortable scripting and reading code in Python and/or Bash
  • Operational experience with relational databases and managed DB services (migrations, backups, performance)
  • Monitoring, alerting, incident response, and writing runbooks (reliability mindset)
  • Experience operating Celery / distributed task queues and Redis at scale
  • Familiarity with LLM/AI infrastructure (model proxies, GPU scheduling, token/cost management)
  • Observability tooling depth (Grafana, Prometheus, ClickHouse, OpenTelemetry, Langfuse)
  • Security/compliance experience (IAM hardening, secret management, vulnerability remediation)
  • Cost-optimisation experience for cloud and third-party API spend
  • Experience supporting a monorepo with multiple language ecosystems
  • Familiarity with Alembic migrations and DB migration gating/labeling
  • Familiarity with Python 3.13+ and uv (Python app management)
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Athens
94 Employees
Year Founded: 2014

What We Do

Changing the world one algorithm at a time. Satori is a term to describe “the moment of clarity”. We are an Analytics Agency made with one simple vision: To give clarity in decision making, through data and AI. With teams of certified expert architects, analysts, data and AI engineers, we have the depth and experience to deliver simpler and complex data-centric solutions reliably, efficiently and repeatably. Over the past 10 years our people have been delivering innovative solutions to global brands across multiple industries in Financial Services, Retail, FMCG, Energy, Manufacturing, Health and others. Whether it’s a best practices cloud data estate design, a scalable and cost-efficient data warehouse, lake or lakehouse, intuitive and performing BI, optimisation and machine learning, generative (Open)AI and cognitive services, we’ve done it. With a diverse client portfolio we are proud to say we have a >90% retention rate and long standing relationships as a trusted data and AI partner with some of the biggest brands in Europe and beyond. If you want to know more about Satori's services, projects, and clients visit our website and feel free to get in touch directly with our people. If you are a prospective Satorian and want to have a career in building advanced data and AI products for the best companies out there and be part of true innovation, visit our career page and send us your CV!

Similar Jobs

Easy Apply
Remote
37 Locations
55 Employees
140K-178K Annually

CodePath.org Logo CodePath.org

Senior Product Designer

Edtech • Social Impact
Easy Apply
Remote
37 Locations
55 Employees
148K-190K Annually

Pfizer Logo Pfizer

Senior Director, CFC CRM Lifecycle & Value Lead

Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
Remote or Hybrid
32 Locations
121990 Employees
215K-358K Annually

Mondelēz International Logo Mondelēz International

Talent Acquisition Advisor

Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
Remote or Hybrid
2 Locations
90000 Employees

Similar Companies Hiring

Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account