Principal Site Reliability Engineer

Posted 18 Hours Ago
Be an Early Applicant
2 Locations
Remote
7+ Years Experience
Other
The Role
Looking for a Principal Site Reliability Engineer with experience in building secure cloud technologies, automation mindset, and expertise in SRE discipline. Responsibilities include enhancing SaaS infrastructure security, designing secure systems, automating cloud native technologies, resolving operational issues, and leading incident response. Must be a U.S. Citizen with 7-10 years of site reliability engineering experience and a Bachelor's or Master's degree in Computer Science or related field.
Summary Generated by Built In

*This position can be remote within the United States*



Who we are... 

In a world of constant change, we're leading the charge towards truly autonomous enterprises. Our cutting-edge platform harnesses the power of automation and generative AI to revolutionize how businesses manage and optimize their IT operations.

We're not just adapting to digital transformation—we're accelerating it. Our solutions bring business and operations leaders together, unlocking new levels of innovation, efficiency, and scalability. We empower organizations to deliver superior customer experiences and drive revenue growth in an always-on, always-mobile world.

At ScienceLogic, we're building the foundation for Autonomic IT—a future where IT operations are self-healing, self-optimizing, and aligned perfectly with business objectives. Our team of visionaries is reshaping the $18+ billion IT operations market, creating cost-optimized, efficient, and next-level capabilities for enterprises worldwide.



What we’re looking for…

We are looking for a Principal Site Reliability Engineer who is well versed in building cloud technologies in a secure manner, has an automation mindset and is an ardent follower of the SRE discipline. If this sounds like you, then our team will benefit from your skillset!


Who we are…

ScienceLogic is going through a product transformation and the Site Reliability Engineering (SRE) team is at the forefront of it. We are responsible for the design, deployment, and maintenance of the Cloud Infrastructure used for running company’s revenue generating go-forward SaaS product line. Overall, we’re passionate about automation and solving complex business and technology challenges. Our team combines SRE, DevOps, Software Development and Information Security knowledge to help make Cloud operations agile, elastic inside the security and governance framework boundaries.


What you’ll be doing…

  •  Enhance the company’s SaaS infrastructure security protocols.
  • Collaborate across the organization to design, build and operationalize SaaS services conforming to various security standards like FedRAMP, SOC2, ISO etc.
  • Participate in architecture, security, and operations reviews.
  • Lead design reviews and buildout of secure systems for delivering various SaaS services with 99.99% uptime.
  • Design, automate, test, and monitor the use of cloud native technologies as a foundation for a service platform.
  • Investigate and resolve customer and operational issues with the mentality of fixing and not just mitigating issues.
  • Identify and automate measurement of operations SLAs and SLOs. 
  • Triage incident response, document SOPs, Runbooks, and train NOC team members
  • Writing automation that can be easily supported and extended by others.
  • Work on special projects as assigned.


Qualities you possess…

Here at Site Reliability, we believe that if you are hungry for learning, passionate for technology and like building tools then you are a good fit. Having experience with the skills is an added plus:

  • Must be a U.S. Citizen.
  • 7-10 years of site reliability engineering or cloud operations experience or equivalent experience.
  •  Proven track record of operating production SaaS environments within security standards like FedRAMP, SOC2, ISO, PCI.
  • Bachelors or Master's degree in Computer Science, Information Systems or similar field.
  • Skilled at problem solving, algorithms, and data structures conforming to the modern SaaS security requirements.
  • Building tools and scripting frameworks from scratch.
  • Working with Cloud Automation tools like CloudFormation, Terraform, CDK, aws-cli.
  • Scripting languages like Python, Groovy, PowerShell, Bash, Perl etc.
  • Exposure to Windows and Linux administration skills.
  • Familiarity with basic networking, security and cloud engineering concepts.
  • Highly collaborative with effective written and verbal communication skills.
  • Ability to work against tight deadlines and occasionally after-hours, part of on-call scheduling.
  • Occasionally work during off-hours and participate in weekly on-call schedule.
  • Take full responsibility for the availability and performance of the platform.




Benefits & Perks

  • A remote-first culture - work from home or come into the office, it's totally up to you.
  • Comprehensive medical, dental and vision plans.
  • 401(k) plan with employer match.
  • Flexible Paid Time Off (FTO) so that you can take the time that you need to re-energize.
  • Volunteer Time Off (VTO) - take two days off per calendar year to volunteer with your preferred charitable organization.
  • 5-year Service Milestone Sabbatical.
  • Paid parental leave.
  • Generous employee referral bonus program.
  • Pet insurance.
  • HQ Office centrally located in Reston Town Center featuring a well-stocked kitchen with rotating snacks and beverages, and catered lunch on Thursdays.
  • Regular virtual company-wide events, including cooking classes, yoga, meditation and more.
  • The opportunity to learn and develop from some of the best and brightest minds in the industry!

Don’t meet every single requirement? Studies have shown that women and people of color are less likely to apply to jobs unless they meet every single qualification. At ScienceLogic, we are dedicated to building a diverse, inclusive and authentic workplace, so if you’re excited about this role but your past experience doesn’t align perfectly with every qualification in the job description, we encourage you to apply anyway. You may be just the right candidate for this or other roles.

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or any other applicable legally protected characteristics in the location in which you are applying.



About ScienceLogic

ScienceLogic empowers intelligent, automated IT operations, freeing up time and resources, and driving business outcomes with actionable insights. ScienceLogic’s AIOps platform sees broadly across clouds and on-premises, enabling business service visibility with relationship mapping, and workflow automation to eliminate manual tasks. Trusted by thousands of organizations across the globe, ScienceLogic’s technology has been proven for scale by the world’s largest service providers, enterprises and government agencies.


www.sciencelogic.com


All ScienceLogic employees have the responsibility to protect information assets, adhere to access controls, report suspicious activity, and comply with security and privacy policies.


#LI-Remote


Top Skills

AWS
Azure
GCP
Python
The Company
Reston, VA
488 Employees
On-site Workplace
Year Founded: 2003

What We Do

ScienceLogic is a leader in IT Operations Management, providing modern IT operations with actionable insights to predict and resolve problems faster in a digital, ephemeral world. Its IT infrastructure monitoring and AIOps solution sees everything across cloud and distributed architectures, contextualizes data through relationship mapping, and acts on this insight through integration and automation.

Jobs at Similar Companies

MyBambu Logo MyBambu

Consumer Compliance Specialist

Fintech • Mobile • Other • Payments • Social Impact • Financial Services • App development
West Palm Beach, FL, USA
120 Employees

Artlist Logo Artlist

Agency Account Manager

Digital Media • Music • Other • Social Media
Hybrid
Tel Aviv-Yafo, ISR
450 Employees

Voltage Park Logo Voltage Park

Product Designer

Artificial Intelligence • Cloud • Hardware • Machine Learning • Other • Software • Infrastructure as a Service (IaaS)
San Francisco, CA, USA
45 Employees
115K-150K Annually

Similar Companies Hiring

Voltage Park Thumbnail
Software • Other • Machine Learning • Infrastructure as a Service (IaaS) • Hardware • Cloud • Artificial Intelligence
Berkeley, CA
45 Employees
MyBambu Thumbnail
Social Impact • Payments • Other • Mobile • Fintech • Financial Services • App development
West Palm Beach, Florida
120 Employees
Artlist Thumbnail
Social Media • Other • Music • Digital Media
Tel Aviv, IL
450 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account