The Role
The Senior Site Reliability Engineer will manage and monitor on-premises and cloud systems, administer Kubernetes platforms, and utilize various DevOps tools. Responsibilities include maintaining technical systems, creating automation scripts, and providing 24x7 on-call support. The role requires a solid grasp of application architecture and strong communication skills.
Location: Chicago, IL
Position Type: Full-time (onsite 3 days a week, Tue, Wed & Thu, or more if needed)
Salary: $125,000 to $140,000 (10% yearly bonus)
Responsibilities:
- Manage and monitor systems and infrastructure hosted on-premises and in the cloud.
- Maintain a good understanding of the different layers of an application and of system design: networking concepts, cloud fundamentals, and microservice architectures.
- Install, configure, test and maintain complex technical systems and architectures.
- Manage and administer the Kubernetes platform, deployments, and services.
- Comfortable with common DevOps tools like Docker, GitHub, Jenkins, Terraform, SonarQube, JFrog, etc.
- Proficient in at least one APM tool, such as Datadog, Dynatrace, Splunk SignalFx, AppDynamics, or Azure Monitor.
- Ability to write simple to moderately complex scripts and programs for automation, tools, frameworks, dashboards, and alarms (preferably in Bash, Python, Groovy, or PowerShell).
- Provide 24x7 on-call, 2nd and 3rd level support as needed to troubleshoot day-to-day issues.
Requirements:
- BS/MS in Computer Science, Information Technology or related disciplines.
- At least 6 years of experience in software engineering environments, with 7 years of cloud and microservices experience.
- Experience in administering Kubernetes resources and associated AKS services.
- Understanding of Azure subscriptions and cost models.
- Good understanding of DevOps and SRE principles and concepts.
- Good verbal and written communication skills are a must.
- Demonstrate self-learning capabilities and take initiative in a fast-paced, quickly changing environment.
- Must work effectively and professionally with cross-functional groups across multiple time zones.
- Preferred certification areas: Azure Cloud Fundamentals, or any industry-recognized Site Reliability Engineering or DevOps certification.
Top Skills
Bash
Groovy
PowerShell
Python
The Company
What We Do
DATAMAXIS takes pride in delivering a wide range of business IT modernization, data analytics, and technology management services. With command of the cutting-edge developments in these fields, our team and consultants are ready to provide you with a robust technology modernization experience that boosts performance capability and operational efficiency.