Site Reliability Engineer

Posted 17 Days Ago
Miami, FL
100K-130K Annually
3-5 Years Experience
Computer Vision • Information Technology • Software
The Role
The Site Reliability Engineer will enhance system reliability, optimize performance, and automate deployment processes in both cloud and on-premises environments. Responsibilities include monitoring production systems, improving infrastructure, and participating in capacity planning and incident response.
Summary Generated by Built In

Get to Know Us Better

RT² offers the most flexible cutting-edge Retail Management Solutions that encompass sales,
inventory management, frontline employee management and engagement, payments, business
intelligence, and digital automation tools for the wireless industry. We support Fortune 500
companies, unify their customer experience, and remove pain points across multiple retail touch
points. RT² prides itself on fostering a team-oriented culture and a dynamic work environment,
where team members are set up to make meaningful contributions across the organization.
Site Reliability Engineer - Our team here at RT² is looking for a Site Reliability Engineer to join our team. This SRE will be a key player in maintaining and improving the reliability of our systems. We're seeking SREs with experience in infrastructure tools like Terraform, Bicep, and Ansible, spanning both On-Premise and Cloud environments, including Azure. Your role will involve enhancing system stability, optimizing performance, and automating deployment processes. If you're a proactive problem solver with a passion for infrastructure and continuous improvement, this SRE position offers an exciting opportunity to make a meaningful impact.
Responsibilities:

  • Help maintain and enhance production monitoring and notifications.
  • Improve reliability and quality of production systems.
  • Measure and help optimize system performance.
  • Work with delivery. and other teams to identify points of potential failure and then work to help enhance and improve systems to mitigate.
  • Participate in capacity planning.
  • Create automation to improve deployment speed, testing, and responding to operational issues.
  • Work to meet service level objectives.
  • Help build runbooks, tools, and other supporting tools to improve incident response.
  • Monitor production systems and help manage incident response.
  • Participate in post mortems, document outages, steps to recovery, future mitigation strategies.
  • Work on both on-premises (data center) and cloud-based infrastructure (Azure).
  • Experience working with server operating systems like Windows, Unix, Linux
  • Experience working with monitoring via tools such as ELK stack, Grafana, Azure Application Insights
  • Experience with Git or other distributed source control systems.

Qualifications:

  • Bachelor’s degree (or equivalent) in computer science or related discipline
  • Experience with tools TerraForm, Bicep, Ansible.
  • Experience with both On-Premise and Cloud Providers preferably Azure
  • Experience with Hyper-V and VMWare
  • Experience with CI/CD Pipelines like Azure Pipelines, GitHub Actions, and OctoDeploy
  • Experience with scripting languages like PowerShell, Python and Bash
  • Proactive approach to identifying problems, performance bottlenecks, and areas for improvement.
  • Experience with observability tools like Grafana, UptimeRobot, ELK, PagerDuty
  • Experience working with Agile methodologies

Salary Range: $100,000 - $130,000/Yr. 
Our pay structure takes into account various geographical markets within the United States. The base salary for this role reflects the typical expected earnings. However, the final compensation package is determined by several factors, such as your location, job-specific expertise, skills, experience, and other relevant job-related considerations.
What We Offer:

  • A unique opportunity to shape the journey of RT²
  • Working within a rapidly growing, game-changing business
  • Remote, flexible working options
  • Competitive compensation
  • Generous STI and LTI provisions
  • Health, Dental and Vision Insurance
  • Paid Annual Leave
  • Paid Sick Leave
  • 401K, and more


 

Top Skills

Ansible
Azure
Bash
Bicep
Git
Linux
Powershell
Python
Terraform
Unix
Windows
The Company
Royal Oak, Michigan
8 Employees
On-site Workplace

What We Do

Real Time Technologies Inc is a computer software company based out of 1517 N Main St, Royal Oak, Michigan, United States

Jobs at Similar Companies

bet365 Logo bet365

Trading Assistant

Digital Media • Gaming • Software • eSports • Automation
Denver, CO, USA
6100 Employees
48K-53K Annually

Jobba Trade Technologies, Inc. Logo Jobba Trade Technologies, Inc.

Customer Success Specialist

Cloud • Information Technology • Productivity • Professional Services • Software
Hybrid
Chicago, IL, USA
45 Employees

InCommodities Logo InCommodities

Head of People & Culture - US

Information Technology • Machine Learning • Analytics • Energy • Automation • Renewable Energy
Hybrid
Austin, TX, USA
234 Employees

Similar Companies Hiring

bet365 Thumbnail
Software • Gaming • eSports • Digital Media • Automation
Denver, Colorado
6100 Employees
Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees
InCommodities Thumbnail
Renewable Energy • Machine Learning • Information Technology • Energy • Automation • Analytics
Austin, TX
234 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account