Site Reliability Engineer III

Posted 16 Days Ago
Hiring Remotely in India
Remote
Senior level
Consumer Web • Marketing Tech • Professional Services • Social Media • Software
Khoros powers the digital customer experience for the world's leading brands.
The Role
The Senior Site Reliability Engineer at Khoros will manage cloud environments and troubleshoot application issues on Linux-based systems. Responsibilities include application patching, system documentation, change management, incident resolution, and collaboration with various technical teams. The role requires strong troubleshooting skills and on-call responsibilities.
Summary Generated by Built In

At Khoros, our passion is to help the world’s best brands create customers for life. We build products we’re proud of, and we’re passionate about customer success. As part of the Vista Equity family, you’ll receive best-in-class development opportunities and the ability to work with global brand customers like Samsung, HP, Sony, and Visa.

We are recruiting a Site Reliability Engineer III for our Mission Critical Support Team to support our infrastructure, production data, and applications. This role, based in Bangalore, provides support to global locations. In this position, you will manage critical Khoros applications, ensuring the reliability, scalability, and performance of our infrastructure and applications. Collaborating closely with development teams, you will design, build, and maintain highly available systems capable of handling increasing user traffic and demand. The role requires close coordination with teams across application development, networks, security, management systems, storage, and databases. This specialized senior-level position demands exceptional technical troubleshooting skills and plays a key role in problem resolution.

Responsibilities:

  • Manage environments in the cloud.
  • Monitor, troubleshoot, and resolve issues related to infrastructure, applications, and services.
  • Monitor availability and keep systems in good health.
  • Implement automation tools and processes to improve efficiency and reliability.
  • Participate in on-call rotation and respond to incidents promptly.
  • Continuously evaluate and improve our systems and processes to enhance reliability and performance.
  • Document runbooks and procedures.
  • Work closely with first-level support groups as well as development groups.
  • Follow departmental change management procedures when defining, planning, and implementing changes, so that service disruption is minimized and Service Level Agreements are met.
  • Perform incident root cause analysis.
  • Drive projects and issues independently, and work well in a team environment.
  • Be a Team Player – work in a collaborative, team-oriented environment, share information, respect diverse ideas, interact with customers, and partner with cross-functional and remote teams.
  • Be Curious & Innovative – continuously update yourself with next-generation technology and development tools, and contribute to process development practices. Evaluate new technologies and software products to determine the feasibility and desirability of incorporating capabilities within the company's products.
  • Be Agile – with a strong sense of urgency and a desire to work in a fast-paced, dynamic environment to deliver solutions against strict timelines.

Requirements:

  • 4+ years of experience as an SRE in fast-paced, high-traffic environments.
  • Experience deploying and maintaining applications in at least one cloud (AWS is a must-have; Azure or GCP is good to have).
  • Working knowledge of Linux and Windows operating systems.
  • Working knowledge of at least one scripting language: Shell, Bash, Python, or PowerShell.
  • Understanding of containerization and orchestration technologies (e.g., Docker, Kubernetes).
  • Working knowledge of Jenkins, Ansible, Terraform, and ArgoCD (good to have).
  • Experience administering databases (MS SQL, MongoDB, etc.).
  • Extensive experience with monitoring, logging, and observability tools (Sumo Logic, Datadog, AWS CloudWatch, AWS X-Ray, New Relic, Splunk, etc.).
  • Ability to debug issues and solve problems.
  • Excellent problem-solving and communication skills.
  • Ability to work independently and collaborate effectively in a team environment.
  • Familiarity with agile development methodologies is a plus.


About Khoros

The Khoros platform connects every facet of customer engagement, including digital contact centers, messaging, chat, online brand communities, CX analytics, and social media management, so brands can listen, respond, and act on customer conversations, creating deep relationships and fostering brand loyalty and advocacy.

Khoros offers a great working environment and competitive compensation and benefits packages. We're looking for fast-thinking, innovative, passionate team players who enjoy brainstorming new ideas and working with the best and brightest in the social media software industry.

Our Core Values

Accountability - We embrace an ownership mentality
Customer-Centricity - We are obsessed with achieving customer value
Agility - We move with urgency and purpose

Top Skills

AWS
Linux
The Company
HQ: Austin, TX
950 Employees
On-site Workplace
Year Founded: 2001

What We Do

Khoros is a global leader in digital-first customer engagement software and services. We build enterprise software and offer expert services for digital customer service, messaging, chat, online brand communities, and social media management. Our platform is used by over 2,000 of the world's biggest and best brands to help them create customers for life.

Why Work With Us

At Khoros, we’ve worked hard to create a culture where every employee is valued. We’re made up of many backgrounds and perspectives, and we strive to celebrate our differences to create a collaborative and respectful workplace. We hire for potential, offer learning opportunities, and encourage growth within the organization to advance your career.
