Site Reliability Engineer

USA | Remote

Roadie, a UPS Company, is a logistics management and crowdsourced delivery platform. Founded in 2014, Roadie offers businesses fast, flexible and asset-light logistics solutions for last-mile delivery. Roadie enables local delivery to more than 95% of U.S. households by providing access to more than 200,000 independent drivers nationwide – allowing businesses to offer their customers delivery optionality for almost any industry, from airlines to artisans.

Roadie is seeking a Site Reliability Engineer to join our growing Technical Operations Team. We are looking for a candidate who has experience implementing site reliability principles, as well as production-level Kubernetes experience. The ideal candidate is a skilled problem solver with intimate knowledge of site reliability practices, standard DevOps principles, AWS, scripting languages, and Kubernetes.

What You'll Do

  • Maintain, support, and engineer production and non-production Kubernetes clusters
  • Deploy and maintain monitoring and logging solutions based on Prometheus, Thanos and Loki
  • Work directly with Development teams to foster site reliability principles
  • Define and manage SLOs, SLIs, and error budgets
  • Build automation and tooling to “eliminate toil”
  • Handle capacity planning and cost optimization
  • Debug production/non-production issues
  • Take part in a 24/7 on-call rotation

Technology We're Using Now

  • Python, Ruby on Rails, Golang
  • Postgres, Redshift, Redis, Kafka
  • AWS, GCP
  • Docker/Kubernetes
  • Prometheus/Thanos/Loki/Grafana
  • Istio, Karpenter, KEDA
  • Git/CircleCI
  • ArgoCD
  • Terraform/Crossplane

What You Bring

  • 2+ years in various SRE roles
  • 4+ years in various DevOps/Systems Engineering roles
  • 2+ years of experience building and managing production Kubernetes infrastructure with emphasis on the use of *nix and cloud vendor Kubernetes solutions (EKS, GKE)
  • 4+ years of experience with popular scripting languages (Python, Ruby, Bash)
  • Experience with infrastructure-as-code tools such as Terraform
  • Experience with CI/CD tools (CircleCI)
  • Experience with GitOps tools (ArgoCD)
  • Experience using a broad range of AWS technologies (RDS, Elasticsearch, VPC, EKS, S3, CloudFront, MSK, ElastiCache, CloudWatch)
  • Must be able to work independently, be self-motivated and handle multiple priorities
  • Comfortable working in a fast-paced agile environment
  • Finally, a willingness to admit what you don’t know, and learn what you need to learn quickly

Why Roadie? 

  • Competitive compensation packages 
  • 100% covered health insurance premiums for yourself
  • 401k with company match
  • Tuition and student loan repayment assistance (that’s right - Roadie will contribute directly to your existing student loans!) 
  • Flexible work schedule with unlimited PTO 
  • Monthly 3-day weekends
  • Monthly WFH stipend 
  • Paid sabbatical leave: tenured team members are given time to rest, relax, and explore
  • The technology you need to get the job done
More Information on Roadie
Roadie operates in the Automotive industry. The company is located in Atlanta, GA. Roadie was founded in 2014. It has 260 total employees. It offers perks and benefits such as Dental insurance, Vision insurance, Health insurance, Life insurance, 401(K) and Employee stock purchase plan.