Site Reliability Engineer

Posted 7 Days Ago
Be an Early Applicant
Hiring Remotely in Spain
Remote
Senior level
Information Technology • Consulting
The Role
The Site Reliability Engineer will ensure operational excellence of a Kubernetes platform, focusing on performance optimization, reliability engineering, and the management of hybrid infrastructure.
Summary Generated by Built In

We are looking for an experienced Site Reliability Engineer to ensure the stability, scalability, and operational excellence of a Kubernetes-based platform running in a hybrid environment. 

The project is entering a pivotal phase, with a major go-live planned for mid-February and a target audience of 75,000 users. User onboarding is already underway, with over 5,000 users connected and 15,000–20,000 expected to be active by year-end. While the system is stable, we anticipate increased activity and new challenges in January, February, and after the go-live—making this an exciting opportunity to make a real impact. The role focuses on performance optimization, scaling strategies, observability, and reliability engineering.

Required Skills:

  • 4+ years of experience as SRE / DevOps Engineer
  • Strong hands-on experience with Kubernetes in production
  • Experience working with hybrid infrastructure (on-prem + cloud)
  • Solid knowledge of PostgreSQL performance tuning and scaling
  • Experience with Qdrant or other vector databases
  • Experience with Helm, Kubernetes autoscaling, and resource optimization
  • Familiarity with observability stacks (Prometheus, Grafana, ELK/Loki)
  • Understanding of performance engineering and load testing
  • Experience with Linux systems and networking
  • Strong troubleshooting and incident-management skills

Nice to Have:

  • Experience with STACKIT or other sovereign clouds
  • Experience with PgBouncer
  • Knowledge of SRE practices (SLO/SLI)
  • Experience in regulated or public-sector environments
  • German language skills

Responsibilities:

  • Operate and optimize hybrid infrastructure (on-prem & STACKIT)
  • Manage and scale Kubernetes clusters
  • Optimize Helm charts, resource usage, and autoscaling
  • Conduct performance, load, and stress testing
  • Ensure reliability, availability, and monitoring of production systems
  • Tune and operate PostgreSQL
  • Operate and optimize vector databases (e.g. Qdrant)
  • Implement monitoring, logging, and alerting
  • Support incident response and capacity planning

We offer*:

  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits

*not applicable for freelancers

Top Skills

Elk
Grafana
Helm
Kubernetes
Linux
Postgres
Prometheus
Qdrant
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: North Miami Beach, Florida
2,135 Employees
Year Founded: 2002

What We Do

N-iX is a global software solutions and engineering services company that helps world’s leading organizations turn challenges into lasting business value, operational efficiency, and revenue growth using advanced technology. Whether you need to build a custom solution, modernize your digital product or acquire extra tech expertise - we have the experience and capabilities to ensure your success.

With over 2,000 professionals in 25 countries across Europe and the Americas, N-iX offers expert solutions in cloud, data analytics, embedded software, IoT, AI, machine learning, and other tech domains. Being in business for over two decades, we have worked with dozens of industry-leading enterprises and Fortune 500 companies creating value across a wide variety of sectors, including finance, manufacturing, supply chain, retail, e-commerce, healthcare, and more. Our unique combination of business domain expertise and technical know-how enables us to effectively collaborate with ISVs, tech companies, and enterprises of all sizes. Thanks to the strong tech ecosystem and partnerships with AWS, GCP, Microsoft, SAP, OpenText, Snowflake, and others, we bring extra speed, scale and efficiency to more than 160 organizations across the globe. N-iX is recognized by numerous industry awards, such as CRN Solution Provider 500, Global Outsourcing 100 by IAOP, ISG Provider Lens™, Modern Application Development services providers by Forrester, etc

Similar Jobs

Affirm Logo Affirm

Site Reliability Engineer

Big Data • Fintech • Mobile • Payments • Financial Services
Easy Apply
Remote
Spain
2200 Employees
96K-126K Annually

GitLab Logo GitLab

Site Reliability Engineer

Cloud • Security • Software • Cybersecurity • Automation
Easy Apply
Remote
28 Locations
2500 Employees

Tempo Software Logo Tempo Software

Site Reliability Engineer

Information Technology • Software
Remote
8 Locations
322 Employees

Affirm Logo Affirm

Senior Site Reliability Engineer

Big Data • Fintech • Mobile • Payments • Financial Services
Easy Apply
Remote
Spain
2200 Employees
80K-110K Annually

Similar Companies Hiring

Scrunch AI Thumbnail
Software • SEO • Marketing Tech • Information Technology • Artificial Intelligence
Salt Lake City, Utah
Amplify Platform Thumbnail
Fintech • Financial Services • Consulting • Cloud • Business Intelligence • Big Data Analytics
Scottsdale, AZ
62 Employees
LayerOne Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
15 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account