Site Reliability Engineer, Senior

Posted 17 Days Ago
New York, NY
160K-215K Annually
Mid level
Software
The Role
As a Site Reliability Engineer, you'll optimize platform uptime and reliability, manage deployment pipelines, enhance observability, support local development environments, and contribute to security measures. You'll automate operational tasks and troubleshoot issues to ensure seamless operations of AI products.
Summary Generated by Built In

About Hebbia

The user interface for AGI – Hebbia is AI that works the way you work.  

Designed to be generally capable– it can tackle even the most complex tasks, citing answers over any amount of sources.   By showing its work, Hebbia empowers users to collaborate with AI on each step and validate responses instead of blindly trusting them.  Our mission is to put capable AI in the hands of 1 billion people by 2030.

Job Description

As a highly skilled Site Reliability Engineer (SRE), you will contribute to building systems that optimize the uptime and reliability of our platform, and support the management and optimization of our DevOps and infrastructure operations. You will be responsible for owning our deployment pipelines, building and maintaining our continuous integration and continuous deployment (CI/CD) systems, ensuring the reliability and performance of our services, enhancing our observability, supporting our local development environments, and bolstering our security posture.  Your technical expertise and problem-solving skills will contribute to the success of our AI products and shape the future of our technology stack.

Our team works in person 5 days a week at our SoHo office in NYC. 

Responsibilities

  • Assist in managing deployment pipelines to facilitate smooth and efficient software releases.
  • Help implement and maintain observability solutions for monitoring system performance and reliability.
  • Support local development environments to optimize developer workflows.
  • Work with development teams to ensure infrastructure aligns with project requirements.
  • Contribute to improving the security of our infrastructure by assisting with proactive measures and audits.
  • Assist in developing and maintaining automation scripts and tools to enhance operational efficiency.
  • Help troubleshoot and resolve infrastructure and application issues to minimize downtime and maintain smooth operations.
  • Participate in evaluating and integrating new technologies to enhance the scalability, reliability, and security of our infrastructure.

Who You Are

  • Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
  • 4+ years software development experience at a venture-backed startup or top technology firm.
  • Proven experience as a Site Reliability Engineer, DevOps Engineer, or similar role.
  • Strong expertise in managing CI/CD pipelines and deployment automation.
  • Proficiency in cloud platforms such as AWS, Azure, or Google Cloud (we are an AWS shop).
  • Solid understanding of containerization and orchestration technologies such as Docker and Kubernetes.
  • Experience with monitoring and observability tools such as Datadog, Prometheus, Grafana, or similar.
  • Knowledge of infrastructure-as-code (IaC) tools such as Terraform or CloudFormation.
  • Familiarity with security best practices and tools for infrastructure and application security.
  • Excellent problem-solving skills and the ability to troubleshoot complex issues.
  • Strong communication skills and the ability to work effectively in a collaborative environment.
  • A proactive and self-motivated approach to learning and adopting new technologies.
  • Passion for continuous improvement and operational excellence.


Compensation

In consideration of market analysis and relevant factors, the salary range for this position is set between $160,000 and $215,000. However, adjustments outside of this range may be considered for candidates whose qualifications significantly differ from those outlined in the job description. Additionally, this role is eligible to participate in our equity plan and benefits program. Benefits include, but not limited to: Comprehensive health, dental and vision coverage, retirement benefits, daily catered lunch, and unlimited PTO.


#LI-Onsite

Top Skills

AWS
Azure
Docker
GCP
Kubernetes
The Company
HQ: New York, NY
14 Employees
On-site Workplace
Year Founded: 2020

What We Do

Neural search for the enterprise.

We've raised $30M in funding from Peter Thiel, Index Ventures (via Mike Volpi), Jerry Yang (founder of Yahoo), Ram Sriram (one of the first investors in Google), and others, and have built the fastest-moving team in the world.

We want to work with people who share our core values:
- Your work is a craft you want to master.
- You thrive in ambiguity and approach new challenges with optimism.
- You appreciate diverse perspectives and uncommon ideas.

Similar Jobs

Formation Bio Logo Formation Bio

Senior Site Reliability Engineer

Artificial Intelligence • Big Data • Healthtech • Biotech • Pharmaceutical
Easy Apply
Hybrid
New York, NY, USA
140 Employees

Citadel Logo Citadel

Site Reliability Engineer

Information Technology • Software • Financial Services • Big Data Analytics
New York, NY, USA
4000 Employees
2 Locations
300 Employees
New York, NY, USA
14 Employees
180K-235K Annually

Similar Companies Hiring

TrainingPeaks (A Peaksware Company) Thumbnail
Software • Fitness
Louisville, CO
69 Employees
bet365 Thumbnail
Software • Gaming • eSports • Digital Media • Automation
Denver, Colorado
6100 Employees
Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account