Senior Site Reliability Engineer

Posted 18 Days Ago
Latham, NY
Senior level
Information Technology • Mobile • Database
The Role
As a Senior Site Reliability Engineer, you will ensure the reliability and performance of our infrastructure, manage incident resolution, optimize the SaaS platform, and collaborate with stakeholders to drive improvements and maintain operational excellence.

Launchpad, a people-first technology company, is a leader in North America's rapidly growing tech sector. Through two solutions, Launchpad supports its clients with digital transformation:

  • Paasport™, our iPaaS solution, streamlines software integration and automates workflows.
  • Nearshore Staff Augmentation, our managed IT staffing service, connects top IT talent across various geographical regions, bringing industry expertise to leading clients.

Based in Vancouver, Canada, our operational footprint spans North and South America, with a second headquarters in Santiago, Chile.

In 2023, our unwavering dedication to innovation garnered recognition as a Deloitte Technology Fast 50™ Program Company. Our clientele boasts industry leaders such as Walmart, GM, TIME Magazine, Salesforce, Tableau, Splunk, Bolt.com, Freedom House, and more.

At Launchpad, we genuinely care about our people as individuals. If you are looking for a team that values growth, drive, and passion for your craft, if you’re seeking a place to achieve your goals and dreams with fairness and integrity, then we’d love to hear from you.

About the Role

We are seeking a Senior Site Reliability Engineer (SRE) to play a pivotal role in ensuring the reliability, scalability, and performance of our infrastructure. This is a mission-critical role, requiring someone who can address both external product reliability and internal platform demands while contributing strategically to organizational objectives.

You will balance hands-on technical work with leadership in reliability initiatives, driving improvements across our platform and collaborating with stakeholders at all levels. This position is crucial to maintaining operational excellence as we navigate complex compliance standards and evolving business needs.

Responsibilities

  • Develop, maintain, and improve our automated deployment, certification, and validation pipelines.
  • Define, implement, and monitor service level objectives (SLOs), service level agreements (SLAs), and service level indicators (SLIs).
  • Lead efforts to optimize, improve, and maintain the reliability and performance of the SaaS platform.
  • Manage third-party services and technologies used to support the SRE discipline.
  • Collaborate with senior management and the engineering team to lead SRE initiatives and provide updates.
  • Define and implement an observability framework to provide insights into system performance and behavior.
  • Implement proactive monitors and alerts to ensure system reliability and performance meet customer expectations.
  • Own operational incident management, providing support to related teams and individuals during incident resolution.
  • Identify and implement best practices for system reliability, security, scalability, and performance.
  • Participate in on-call rotations for system support, troubleshooting, and resolution.
  • Conduct post-mortem reviews of incidents, identify root causes, and implement remediation steps.
  • Develop and maintain documentation for systems, processes, and procedures.


Requirements

  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent work experience.
  • Minimum of 5 years of experience in Site Reliability Engineering, DevOps, or similar roles.
  • Familiarity with monitoring tools and systems.
  • Proficient in scripting languages such as Python, Bash, or Ruby.
  • Experience with infrastructure automation tools such as Terraform, Ansible, or Chef.
  • Familiarity with containerization technologies like Docker and orchestration tools like Kubernetes.
  • Strong knowledge of cloud platforms such as AWS, GCP, or Azure.
  • Excellent troubleshooting and analytical skills.
  • Strong communication skills and the ability to work effectively within a team.


Nice to Have

  • Certifications in AWS, GCP, or Azure.
  • Experience with CI/CD tools like Jenkins, GitLab CI, or CircleCI.
  • Familiarity with database technologies, both SQL and NoSQL.

Why work for Launchpad?

  • 100% remote
  • People-first culture
  • Excellent compensation in US Dollars
  • Hardware setup for working from home
  • Work with global teams and prominent brands based in North America, Europe, and Asia
  • Training allowances
  • Personal time off (PTO) for vacations, study leave, personal time, etc.
  • ...and more!

At Launchpad, we genuinely care about our people as individuals. If you are looking for a team that values growth, drive, and passion for your craft, if you’re seeking a place to achieve your goals and dreams with fairness and integrity, then you are the future of Launchpad. Launchpad is committed to fostering a diverse and representative workforce and an inclusive work environment where all employees are respected and treated equally.

Are you ready to elevate your career at Launchpad? We want to hear your story! Contact us today.


Top Skills

Bash
Python
Ruby
The Company
Vancouver, British Columbia
40 Employees
On-site Workplace
Year Founded: 2018

What We Do

Your Digital Transformation Partner

Launchpad is your technology partner, committed to driving seamless digital transformation. Specializing in app, data, and people integration, we supercharge your journey. Our iPaaS platform, Paasport™, simplifies software integration and workflow automation. We also have handpicked IT experts in your time zone, helping you launch faster, streamline your projects, and cut costs. Your digital future starts here. Learn more: https://www.golaunchpad.io
