Site Reliability Engineer

Posted 11 Days Ago
Be an Early Applicant
Maia, Porto
In-Office
Mid level
Automotive • Software • Business Intelligence • Semiconductor • Manufacturing
The Role
As a Site Reliability Engineer, you'll enhance system performance and reliability, automate processes, and collaborate on best practices within distributed systems while participating in incident response efforts.
Summary Generated by Built In

Critical Manufacturing is dedicated to empowering high-performance operations to make Industry 4.0 a reality with the most innovative, comprehensive, and modular MES software. We have a global presence, but our headquarters, and the main technical center, are in Porto (Maia), Portugal, where we develop a state-of-the-art solution for Semiconductor, Electronics, Medical Devices, and Industrial Equipment. 

Recognized as a Leader by Gartner, we are part of ASMPT, the world's largest supplier of best-in-class equipment, and technological process partner for the electronics and semiconductor industries.

The Role 

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As an SRE you will be responsible for keeping an ever-watchful eye on our systems capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. 
 
SRE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow. 


Requirements

What You Will Do 

  • Analyze and interpret distributed systems telemetry (metrics, logs, traces) to identify and address potential issues before they affect users
  • Design, build, and maintain monitoring, alerting, and reliability tooling that improves system visibility and operational excellence
  • Collaborate with software and infrastructure teams to improve resilience, scalability, and performance across our platform
  • Participate in incident response and post-mortem analysis to ensure continuous learning and improvement
  • Contribute to automation efforts that reduce toil and increase engineering productivity

 
What Success Looks Like 

Within your first year, you will have: 

  • Improved reliability and observability of key production systems
  • Reduced manual operational work by automating recurring processes
  • Partnered effectively with development teams to embed SRE best practices into the software lifecycle
  • Shaped scalable approaches to telemetry, monitoring, and incident response

 
Why Join Us 

  • Be part of a company shaping the future of manufacturing software
  • Enjoy the freedom to experiment, innovate, and create systems that will last
  • Join a team where storytelling, strategy, and technology meet to make Industry 4.0 real

What You Will Bring 

  • More than 2 years of experience in the role of Site Reliability Engineer
  • A passion for investigation and problem-solving—digging deep until you understand how things work
  • Strong belief that telemetry is essential for system health and continuous improvement
  • Excellent spoken and written English communication skills

What we consider a plus (not mandatory):

  • Experience with cloud infrastructure (e.g., Azure) or container orchestration platforms (e.g., Kubernetes, OpenShift)
  • Familiarity with Docker, Terraform, and reverse proxies (e.g., Traefik)
  • Hands-on experience designing, analyzing, and troubleshooting large-scale distributed systems
  • Ability to debug, optimize performance, and automate repetitive tasks
  • Strong problem-solving mindset

 
 


Diversity, Equity and Inclusion are a source of commitment and innovation 

At Critical Manufacturing, we welcome and encourage applications from individuals of all backgrounds, regardless of disabilities, diverse abilities, identities, or experiences. Our commitment is to create an inclusive environment where everyone has equal opportunities to succeed and thrive.  

If you need accommodation during the recruitment process, please let us know - we're happy to support you. 

Top Skills

Azure
Docker
Kubernetes
Openshift
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Austin, TX
467 Employees
Year Founded: 2009

What We Do

Critical Manufacturing provides the most modern, flexible and configurable manufacturing execution system (MES) available. Critical Manufacturing MES helps manufacturers stay ahead of stringent product traceability and compliance requirements; reduce risk with inherent closed-loop quality; integrate seamlessly with enterprise systems and factory automation; and provide deep intelligence and visibility of global production operations.

As a result, our customers are Industry 4.0 ready. They can compete effectively and profitably by easily adapting their operations to changes in demand, opportunity or requirements, anywhere, at any time.

For more information, visit www.criticalmanufacturing.com

Follow us on:
- Facebook: http://www.facebook.com/CriticalManufacturing
- Twitter: https://twitter.com/#!/CriticalMfg
- Youtube: http://www.youtube.com/CriticalMfg

Similar Jobs

In-Office
Porto, PRT
1463 Employees
In-Office
Porto, PRT
1661 Employees
Easy Apply
In-Office
Porto, PRT
163 Employees
60K-80K Annually

Nebius Logo Nebius

Senior Site Reliability Engineer

Artificial Intelligence • Information Technology • Consulting
In-Office or Remote
33 Locations
473 Employees

Similar Companies Hiring

Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees
Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account