Site Reliability Engineer

Reposted 21 Days Ago
St. George, UT, USA
In-Office
Mid level
Cloud
The Role
The Site Reliability Engineer at TCN will design, deploy, and maintain systems for performance, reliability, and security, while managing incidents and collaborating with teams.
Summary Generated by Built In

TCN is looking for a Site Reliability Engineer to join our team in Saint George, Utah. The Site Reliability Engineer works as part of a team to analyze, troubleshoot, deploy, monitor, and maintain TCN’s large production environment with global scale. These significant responsibilities are completed while continually thinking about reliability, scalability, resilience, security, and performance. The Site Reliability Engineer’s responsibilities are critical to the continuity of the services provided to TCN’s clients.

The ideal candidate will have at least three (3) years' experience working in a Linux environment as a System Administrator, Site Reliability Engineer, or a similar role. 

Responsibilities

  • Designs and deploys software/systems - Collaborates with development teams to throughout the product life cycle, including but not limited to engaging in the design, development, deployment, and ongoing delivery of services; assists in ensuring the development of software and systems that increase product reliability and organizational efficiency
  • Manages solutions and ensures resistance to failure - Deploys and manages solutions to manage platform infrastructure as we continue to grow our global scale; ensuring resistance to failure
  • Troubleshoots - Troubleshoots complicated, cross platform incidents for OS, networking, and database in a cloud-based SaaS environment; ability to handle live production incidents, debug and troubleshoot application and infrastructure issues, and follow and implement best practices 
  • Post-incident evaluation - Participates in post-incident evaluations and ensures permanent closure of incidents
  • Monitors performance | Improves application stability - Monitors application performance and takes steps to improve application performance and stability; follows through with implementation
  • Conducts analysis and development improvements - Conducts system analysis, configuration management, and development improvements for system software performance, availability, and reliability
  • Identifies application patterns and analytics in support of better service level objectives
  • Incident response - Participates in 24x7 incident response and on-call rotation
  • Shares best practices - Shares understanding of Site Reliability Engineering culture across organization; shares knowledge of best practices, approaches, documentation, and code with team members and other teams

Qualifications 

  • Bachelor’s degree in computer science, information technology, or related field of study
  • Not less than three (3) years’ experience in a Linux environment as a System Administrator, Site Reliability Engineer, or similar role
  • Demonstrated advanced knowledge of networking protocols, including but not limited to IP routing (static/BGP/OSPF), TCP/UDP fundamentals, security (TLS, IPSEC), and common application protocols
  • Demonstrated advanced knowledge of Linux operating environment including storage, network, and container subsystems
  • Proven skills in incident management and root cause analysis
  • Demonstrated experience with Google Cloud Platform (APIs and CLIs)
  • Experience with configuration management tools
  • Experience with scripting and automation in commonly used languages, including but not limited to Bash, Ruby, and Python
  • Familiarity with programming languages used for DevOps/Continuous Delivery, including but not limited to Go, Java, and Node.Js 
  • Experience with distributed storage, containers, containerizing applications, and container orchestration (Kubernetes)
  • Excellent communication skills, both oral and written; ability to adapt message/style to fit audience (i.e., ability to communicate technical concepts to a non-technical audience)
  • Strong interpersonal skills with the ability to work with all levels of management and employees; ability to gain credibility, provide effective customer service, and foster positive working relationships with internal and external stakeholders 
  • Excellent attention to detail; ability to work accurately and to identify, analyze, prevent, and solve problems 

About TCN

TCN is a fast-growing technology company and provides all its services over the internet in a cloud-based software-as-a-service model. TCN's technology stack and culture are positive and forward-thinking. When you join TCN, you are joining a dedicated team of professionals. Employees often describe our culture as friendly, collaborative, flexible, and fast-paced. To learn more, visit our website.

Our benefits include:

  • Medical Insurance (HDHP with HSA)
  • Dental Insurance
  • Vision Insurance
  • Life Insurance 
  • 401k with employer match
  • Competitive salary
  • Paid time off 
  • Paid holidays (11 scheduled)
  • Weekly lunches; free drinks and snacks
  • Casual dress and flexible work environment

Skills Required

  • Bachelor's degree in computer science, information technology, or related field
  • At least three years' experience in a Linux environment
  • Advanced knowledge of networking protocols
  • Advanced knowledge of the Linux operating environment
  • Skills in incident management and root cause analysis
  • Experience with Google Cloud Platform
  • Experience with configuration management tools
  • Experience with scripting in Bash, Ruby, or Python
  • Familiarity with programming languages used for DevOps/Continuous Delivery
  • Experience with containers and container orchestration (Kubernetes)
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Saint George, UT
230 Employees
Year Founded: 1999

What We Do

As a leader in the hosted call center technology industry since 1999, TCN, Inc. is recognized worldwide as the preeminent global provider of cloud-based virtual call center technologies. Our cutting-edge communication technology has rendered expensive hardware, subscription software, and crowded call centers obsolete. We believe that every call center should have the ability to scale, affordability. Stop paying for features you don't need. Learn more @ tcn.com

Similar Jobs

DraftKings Logo DraftKings

Site Reliability Engineer

Digital Media • Gaming • Information Technology • Software • Sports • Esports • Big Data Analytics
Remote or Hybrid
United States
6400 Employees
200K-250K Annually

Domino Data Lab Logo Domino Data Lab

Site Reliability Engineer

Artificial Intelligence • Machine Learning
Easy Apply
Remote or Hybrid
US
200 Employees
200K-230K Annually

Akamai Technologies Logo Akamai Technologies

Site Reliability Engineer

Cloud • Security • Software • Cybersecurity
In-Office or Remote
2 Locations
10285 Employees
76K-136K Annually

DigiCert Logo DigiCert

Site Reliability Engineer

Security • Software • Cybersecurity
In-Office
Lehi, UT, USA
1372 Employees
160K-190K Annually

Similar Companies Hiring

Rundoo Thumbnail
Cloud • Information Technology • Internet of Things • Software
Redwood City, CA
50 Employees
NetBox Labs Thumbnail
Cloud • Software
US
125 Employees
Amplify Platform Thumbnail
Fintech • Financial Services • Consulting • Cloud • Business Intelligence • Big Data Analytics
Scottsdale, AZ
62 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account