TCN

Site Reliability Engineer

Reposted 21 Days Ago

St. George, UT, USA

In-Office

Mid level

Cloud

The Role

The Site Reliability Engineer at TCN will design, deploy, and maintain systems for performance, reliability, and security, while managing incidents and collaborating with teams.

Summary Generated by Built In

TCN is looking for a Site Reliability Engineer to join our team in Saint George, Utah. The Site Reliability Engineer works as part of a team to analyze, troubleshoot, deploy, monitor, and maintain TCN’s large production environment with global scale. These significant responsibilities are completed while continually thinking about reliability, scalability, resilience, security, and performance. The Site Reliability Engineer’s responsibilities are critical to the continuity of the services provided to TCN’s clients.

The ideal candidate will have at least three (3) years' experience working in a Linux environment as a System Administrator, Site Reliability Engineer, or a similar role.

Responsibilities

Designs and deploys software/systems - Collaborates with development teams throughout the product life cycle, including but not limited to engaging in the design, development, deployment, and ongoing delivery of services; assists in ensuring the development of software and systems that increase product reliability and organizational efficiency
Manages solutions and ensures resistance to failure - Deploys and manages solutions for platform infrastructure as we continue to grow our global scale; ensures resistance to failure
Troubleshoots - Troubleshoots complicated, cross-platform incidents for OS, networking, and database in a cloud-based SaaS environment; handles live production incidents, debugs and troubleshoots application and infrastructure issues, and follows and implements best practices
Post-incident evaluation - Participates in post-incident evaluations and ensures permanent closure of incidents
Monitors performance and improves application stability - Monitors application performance and takes steps to improve application performance and stability; follows through with implementation
Conducts analysis and development improvements - Conducts system analysis, configuration management, and development improvements for system software performance, availability, and reliability
Identifies application patterns and analytics in support of better service level objectives
Incident response - Participates in 24x7 incident response and on-call rotation
Shares best practices - Shares understanding of Site Reliability Engineering culture across organization; shares knowledge of best practices, approaches, documentation, and code with team members and other teams

Qualifications

Bachelor’s degree in computer science, information technology, or related field of study
Not less than three (3) years’ experience in a Linux environment as a System Administrator, Site Reliability Engineer, or similar role
Demonstrated advanced knowledge of networking protocols, including but not limited to IP routing (static/BGP/OSPF), TCP/UDP fundamentals, security (TLS, IPsec), and common application protocols
Demonstrated advanced knowledge of Linux operating environment including storage, network, and container subsystems
Proven skills in incident management and root cause analysis
Demonstrated experience with Google Cloud Platform (APIs and CLIs)
Experience with configuration management tools
Experience with scripting and automation in commonly used languages, including but not limited to Bash, Ruby, and Python
Familiarity with programming languages used for DevOps/Continuous Delivery, including but not limited to Go, Java, and Node.js
Experience with distributed storage, containers, containerizing applications, and container orchestration (Kubernetes)
Excellent communication skills, both oral and written; ability to adapt message/style to fit audience (e.g., ability to communicate technical concepts to a non-technical audience)
Strong interpersonal skills with the ability to work with all levels of management and employees; ability to gain credibility, provide effective customer service, and foster positive working relationships with internal and external stakeholders
Excellent attention to detail; ability to work accurately and to identify, analyze, prevent, and solve problems

About TCN

TCN is a fast-growing technology company that provides all its services over the internet in a cloud-based software-as-a-service model. TCN's technology stack and culture are positive and forward-thinking. When you join TCN, you are joining a dedicated team of professionals. Employees often describe our culture as friendly, collaborative, flexible, and fast-paced. To learn more, visit our website.

Our benefits include:

Medical Insurance (HDHP with HSA)
Dental Insurance
Vision Insurance
Life Insurance
401k with employer match
Competitive salary
Paid time off
Paid holidays (11 scheduled)
Weekly lunches; free drinks and snacks
Casual dress and flexible work environment

Skills Required

Bachelor's degree in computer science, information technology, or related field
At least three years' experience in a Linux environment
Advanced knowledge of networking protocols
Advanced knowledge of the Linux operating environment
Skills in incident management and root cause analysis
Experience with Google Cloud Platform
Experience with configuration management tools
Experience with scripting in Bash, Ruby, or Python
Familiarity with programming languages used for DevOps/Continuous Delivery
Experience with containers and container orchestration (Kubernetes)

The Company

HQ: Saint George, UT

230 Employees

Year Founded: 1999

What We Do

As a leader in the hosted call center technology industry since 1999, TCN, Inc. is recognized worldwide as the preeminent global provider of cloud-based virtual call center technologies. Our cutting-edge communication technology has rendered expensive hardware, subscription software, and crowded call centers obsolete. We believe that every call center should have the ability to scale affordably. Stop paying for features you don't need. Learn more @ tcn.com