Company Description
We are Software Mind, an awesome team of engineers who are ready to ramp up any top-notch company’s projects! Our aim? To always be one step ahead. Become part of a multicultural company in constant growth with an excellent work environment certified by Great Place To Work!
Job Description
Project - the aim you'll have
We're looking for a skilled Senior SRE Engineer to join a team that works on a complex distributed architecture, spanning physical machines - and virtualizing on-prem host/cloud computing. Our Client develops and deploys systematic financial strategies across a variety of asset classes and global markets, and our teams work collaboratively to drive the production of high-quality predictive signals and financial strategies – the foundation of a sustainable, global investment platform.
If you enjoy working with cutting-edge technologies in a fast-paced environment this opportunity is for you!
Qualifications
Expectations - the experience you need
- 5+ years of proven experience in SRE
- Deep expertise and hands-on experience working with Linux-based systems, with a focus on optimization and troubleshooting.
- Strong skills in Python for scripting, automation, and system management.
- In-depth knowledge of container orchestration technologies such as Kubernetes (K8S). Experience with other cluster management tools like Slurm is a plus.
- Hands-on experience with tools like Helm, Terraform, and Ansible to manage infrastructure in a scalable and automated way.
- Strong working knowledge of Docker, Podman, or other containerization systems to enable efficient and consistent deployment.
- Experience working with CI/CD tools, especially GitLab (preferred), GitHub, or Git, to ensure smooth and rapid delivery cycles.
- Experience with monitoring and logging solutions such as Prometheus, Grafana, and the ELK stack to provide comprehensive insights into system performance and health.
- Understanding of relational databases, their performance tuning, and management in distributed systems.
- Familiarity with Agile development methodologies, with a focus on continuous improvement and collaboration.
- Exposure to cloud technologies such as AWS or Google Cloud (GCP) is a strong plus.
Position - how you'll contribute
- Architecture and Automation: Design and deploy As-A-Service solutions using open-source software to automate system management, scaling, and monitoring.
- System Optimization: Develop tools to streamline deployment, monitoring, and incident management for large-scale, distributed environments.
- Collaboration Across Teams: Work with development and operations teams to design and implement software solutions that enhance the overall reliability of services. Contribute to the ongoing DevOps and Agile transformation.
- Monitoring & Incident Response: Set up, configure, and maintain monitoring and alerting systems to ensure real-time visibility into system performance. Participate in on-call rotations to respond to incidents and mitigate downtime.
- CI/CD & Infrastructure Management: Continuously improve CI/CD pipelines using tools like GitLab, Helm, Terraform, and Ansible, ensuring fast, safe, and reliable deployments.
- Container Orchestration: Leverage container orchestration platforms like Kubernetes (K8S) to manage distributed systems at scale. Experience with Slurm or similar cluster management is a plus.
- Cloud and Automation Tools: Use cloud infrastructure (AWS, GCP, etc.) and Infrastructure as Code (IaC) tools to automate the provisioning and scaling of resources.
Our Benefits
- Educational resources.
- Flexible schedule and Work From Anywhere.
- Referral Program.
- Supportive and chill atmosphere.
We are accepting applications from LATAM countries
Position at: Software Mind LATAM
Additional Information
Top Skills
What We Do
Software Mind is a global digital transformation partner with operations throughout Europe, the US and LATAM. Driven by tech and empowered by people, we provide companies with software engineers and autonomous, cross-functional development teams who manage software life cycles from ideation to release and beyond.
For over 20 years we’ve been enriching organizations with the talent they need to boost scalability, drive dynamic growth and bring disruptive ideas to life. Our top-notch engineering teams combine ownership with leading technologies, including cloud, AI, data science and embedded software to accelerate digital transformations and boost software delivery.
A culture, driven by trust, that embraces openness, craves more and acts with respect enables our experts to create evolutive solutions that support scale-ups, unicorns and enterprise-level companies around the world.