Site Reliability Engineer

Hiring Remotely in Spain
Remote
Senior level
Information Technology • Consulting
The Role
The Site Reliability Engineer will ensure operational excellence of a Kubernetes platform, focusing on performance optimization, reliability engineering, and the management of hybrid infrastructure.

We are looking for an experienced Site Reliability Engineer to ensure the stability, scalability, and operational excellence of a Kubernetes-based platform running in a hybrid environment. 

The project is entering a pivotal phase, with a major go-live planned for mid-February and a target audience of 75,000 users. User onboarding is already underway, with over 5,000 users connected and 15,000–20,000 expected to be active by year-end. While the system is stable, we anticipate increased activity and new challenges in January, February, and after the go-live, which makes this an exciting opportunity to have a real impact. The role focuses on performance optimization, scaling strategies, observability, and reliability engineering.

Required Skills: 

  • 4+ years of experience as an SRE / DevOps Engineer
  • Strong hands-on experience with Kubernetes in production
  • Experience working with hybrid infrastructure (on-prem + cloud)
  • Solid knowledge of PostgreSQL performance tuning and scaling
  • Experience with Qdrant or other vector databases
  • Experience with CI/CD workflows, Helm, Kubernetes autoscaling, and resource optimization
  • Familiarity with observability stacks (Prometheus, Grafana, ELK/Loki)
  • Understanding of performance engineering and load testing
  • Experience with Linux systems and networking
  • Strong troubleshooting and incident-management skills
  • Strong Python skills; Rust exposure is a plus
  • Strong experience with infrastructure as code (Terraform)

Nice to Have:

  • Experience with STACKIT or other sovereign clouds
  • Experience with PgBouncer
  • Knowledge of SRE practices (SLO/SLI)
  • Experience in regulated or public-sector environments
  • German language skills

Responsibilities:

  • Operate and optimize hybrid infrastructure (on-prem & STACKIT)
  • Manage and scale Kubernetes clusters
  • Optimize Helm charts, resource usage, and autoscaling
  • Conduct performance, load, and stress testing
  • Ensure reliability, availability, and monitoring of production systems
  • Tune and operate PostgreSQL
  • Operate and optimize vector databases (e.g. Qdrant)
  • Implement monitoring, logging, and alerting
  • Support incident response and capacity planning

We offer*:

  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits

*not applicable for freelancers

Top Skills

ELK
Grafana
Helm
Kubernetes
Linux
Postgres
Prometheus
Qdrant

The Company
HQ: North Miami Beach, Florida
2,135 Employees
Year Founded: 2002

What We Do

N-iX is a global software solutions and engineering services company that helps the world’s leading organizations turn challenges into lasting business value, operational efficiency, and revenue growth using advanced technology. Whether you need to build a custom solution, modernize your digital product, or acquire extra tech expertise, we have the experience and capabilities to ensure your success. With over 2,000 professionals in 25 countries across Europe and the Americas, N-iX offers expert solutions in cloud, data analytics, embedded software, IoT, AI, machine learning, and other tech domains. Having been in business for over two decades, we have worked with dozens of industry-leading enterprises and Fortune 500 companies, creating value across a wide variety of sectors, including finance, manufacturing, supply chain, retail, e-commerce, healthcare, and more. Our unique combination of business domain expertise and technical know-how enables us to collaborate effectively with ISVs, tech companies, and enterprises of all sizes. Thanks to a strong tech ecosystem and partnerships with AWS, GCP, Microsoft, SAP, OpenText, Snowflake, and others, we bring extra speed, scale, and efficiency to more than 160 organizations across the globe. N-iX has been recognized with numerous industry awards, such as CRN Solution Provider 500, Global Outsourcing 100 by IAOP, ISG Provider Lens™, and Modern Application Development services providers by Forrester.
