Site Reliability Engineer

Posted 2 Days Ago
Be an Early Applicant
Bangalore, Bengaluru Urban, Karnataka
Senior level
Machine Learning • Cybersecurity
The Role
As a Site Reliability Engineer at Skyhigh Security, you'll maintain the availability of cloud infrastructure, manage incidents, implement automation, analyze system metrics, and collaborate with engineering teams to enhance production services. You'll innovate with new technologies and ensure a high-quality service in a 24x7 environment.
Summary Generated by Built In

Job Title:

Site Reliability Engineer

About Skyhigh Security:

Skyhigh Security is a dynamic, fast-paced, cloud company that is a leader in the security industry. Our mission is to protect the world’s data, and because of this, we live and breathe security. We value learning at our core, underpinned by openness and transparency. 

Since 2011, organizations have trusted us to provide them with a complete, market-leading security platform built on a modern cloud stack. Our industry-leading suite of products radically simplifies data security through easy-to-use, cloud-based, Zero Trust solutions that are managed in a single dashboard, powered by hundreds of employees across the world. With offices in Santa Clara, Aylesbury, Paderborn, Bengaluru, Sydney, Tokyo and more, our employees are the heart and soul of our company. 

Skyhigh Security Is more than a company; here, when you invest your career with us, we commit to investing in you. We embrace a hybrid work model, creating the flexibility and freedom you need from your work environment to reach your potential. From our employee recognition program, to our ‘Blast Talks' learning series, and team celebrations (we love to have fun!), we strive to be an interactive and engaging place where you can be your authentic self. 

We are on these too! Follow us on LinkedIn and Twitter@SkyhighSecurity.

Role Overview:

At Skyhigh Security, we’re focused on innovation. We secure the world’s data, and this means making the world a safer place. With plenty of learning and growth opportunities, exciting challenges and talented teams, you’ll have everything you need to see your future in a whole new way. Come make a difference with us. For more information, visit www.skyhighsecurity.com

About the role:

  • Perform Incident Management and Change Management to maintain the continuous availability of all Cloud Infrastructure services
  • Ensure all SRE and operating procedures are maintained and executed.
  • Maintain a 24x7 production environment with a high level of service availability and Perform quality reviews, manage operational issues.
  • Explore and innovate new cloud technologies, features, and tools to improve the platform and automate using Bash, Python or Perl, etc...
  • Implement automation and orchestration for manual processes required to operate and deploy cloud services, be at the heart of developing new ideas into internal tools by working closely with teams.
  • Analyze alarms and dashboards to identify problem areas, report incidents, troubleshoot, and escalate as required.Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding.
  • Perform ticket review and updates through the JIRA ticketing tool.
  • Manage and Maintain Runbooks / Standard Operating procedures
  • Manage, coordinate, and document all types of maintenances / outage events.
  • Must take initiative and be proactive.
  • Must take on the responsibility to learn new products and procedures.
  • Implementation of proactive monitoring, alerting, trend analysis, and self-healing systems.
  • Understand the existing architecture and work with various Engineering teams to develop and execute strategies to provide a high-quality Global production service.
  • You are responsible to debug and identify the cause of the problem/outage.
  • You will work flexible to work in a 24X7 environment (rotational shifts).

 

About you:

  • You will have 8+ years of production applications and systems support
  • System admin experience on Linux environments.
  • Ability to understand networking and its components
  • Good experience with Public Cloud Technology AWS
  • Experience with identifying the thresholds and monitoring setup for infra and application
  • Experience with Grafana, ELK, Cloud watch, OpsGenie, Pager duty, etc.
  • Strong communication and analytical/problem-solving skills.
  • Network knowledge (TCP/IP, UDP, DNS, Load balancing) and prior network administration experience is a big plus•
  • Experience in writing Root Cause Analysis documents
  • Experience with source control tools such as Github, SVN, or Perforc
  • Systematic approach and to drive problems to resolution
  • Experience configuring and managing web servers (Apache, Tomcat, Nginx)
  • Ability to script/program with one or more high level languages, such as Python, Go, etc.
  • Good to have experience/knowledge of GCP, Azure
  • Experience with deployment tools Jenkins, Team city, Harness etc.
  • Experience with any configuration management tools like Salt, Puppet, Ansible,etc.
  • Experience in Security domain will be added advantage
  • Experience with continuous integration and deployment automation tools such as Jenkins, Harness, AWS CloudFormation, Salt, or Puppet, Chef, Ansible
  • Experience with SQL (MySQL) NoSQL databases (Redis, CouchBase, Cassandra, Crate)

 

Experience with open-source technologies (Kafka, Memcached, Redis, Hadoop, HBase, Zookeeper, Oozie)

Company Benefits and Perks:

We work hard to embrace diversity and inclusion and encourage everyone to bring their authentic selves to work every day. We offer a variety of social programs, flexible work hours and family-friendly benefits to all of our employees.

  • Retirement Plans
  • Medical, Dental and Vision Coverage
  • Paid Time Off
  • Paid Parental Leave
  • Support for Community Involvement

We're serious about our commitment to diversity which is why we prohibit discrimination based on race, color, religion, gender, national origin, age, disability, veteran status, marital status, pregnancy, gender expression or identity, sexual orientation or any other legally protected status.

Top Skills

AWS
Azure
Bash
Cassandra
Couchbase
Crate
GCP
Go
Graphana
Hadoop
Hbase
Kafka
Memcached
MySQL
Perl
Python
Redis
SQL
The Company
HQ: Plano, Texas
3,118 Employees
On-site Workplace
Year Founded: 2022

What We Do

Trellix is a global company redefining the future of cybersecurity. The company’s open and native extended detection and response (XDR) platform helps organizations confronted by today’s most advanced threats gain confidence in the protection and resilience of their operations. Trellix’s security experts, along with an extensive partner ecosystem, accelerate technology innovation through machine learning and automation to empower over 40,000 business and government customers.

Similar Jobs

Zeta Global Logo Zeta Global

Lead DevOps/ SRE Engineer

AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
Easy Apply
Bangalore, Bengaluru, Karnataka, IND
2194 Employees

Warner Bros. Discovery Logo Warner Bros. Discovery

Software Engineer II- Site Reliability Engineering (SRE Team)Bangalore

Artificial Intelligence • Digital Media • Gaming • Machine Learning • News + Entertainment • Software
Hybrid
Bangalore, Bengaluru, Karnataka, IND
40000 Employees
Hybrid
Bengaluru, Karnataka, IND
289097 Employees

Take-Two Interactive Software Logo Take-Two Interactive Software

SRE I

Gaming • Information Technology • Mobile • Software
Bengaluru, Karnataka, IND
6500 Employees

Similar Companies Hiring

Halter Thumbnail
Software • Machine Learning • Internet of Things • Hardware • Greentech • Business Intelligence • Agriculture
Auckland City, NZ
150 Employees
Silverfort Thumbnail
Security • Sales • Information Technology • Cybersecurity • Automation
GB
357 Employees
InCommodities Thumbnail
Renewable Energy • Machine Learning • Information Technology • Energy • Automation • Analytics
Austin, TX
234 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account