Site Reliability Engineer (SRE/DevOps) - Engineering Productivity

Posted 3 Days Ago
Hiring Remotely in Dublin, IRL
In-Office or Remote
Mid level
Cloud • Software • Analytics
The Role
Operate, scale, and automate engineering productivity infrastructure in a hybrid cloud environment. Build reliable, observable production systems, automate toil, create incident runbooks and postmortems, coordinate maintenance with vendors, and improve developer workflows and platform reliability.
Summary Generated by Built In
Company Description

Arista Networks is an industry leader in data-driven, client-to-cloud networking for large data center, campus and routing environments. What sets us apart is our relentless pursuit of innovation. We leverage the latest advancements in cloud computing, artificial intelligence, and software-defined networking to provide our clients with a competitive edge in an increasingly interconnected world. Our solutions are designed to not only meet the current demands of the digital landscape but to also anticipate and adapt to future challenges.

At Arista, we value the diversity of thought and perspectives that each employee brings to the table. We believe that fostering an inclusive environment, where individuals from various backgrounds and experiences feel welcome, is essential for driving creativity and innovation.

Our commitment to excellence has earned us several prestigious awards, such as Best Engineering Team and Best Company for Diversity, Compensation, and Work-Life Balance. At Arista, we take pride in our track record of success and strive to maintain the highest standards of quality and performance in everything we do.

Job Description

Who You'll Work With

Arista Networks is looking for a skilled professional for our Engineering Productivity (EngProd) team to help maintain and support our rapidly expanding infrastructure and internal user base. The ideal candidate is someone who can wear many hats, is versatile and is enthusiastic about learning new technologies. As a part of the software engineering team, you will work with other team members to design, build and administer secure, scalable and fault-tolerant tools and infrastructure in a hybrid cloud environment.

Working in the EngProd group, you will collaborate with other engineers to design, build, scale, and operate the systems used by Arista's product development teams. These systems are based on industry standards, including Ansible, Artifactory, Gerrit, Jenkins, Kubernetes, Grafana, Spinnaker, MySQL, Elasticsearch, Google Cloud, Varnish, and Perforce, along with third-party storage appliances and internal systems developed from the ground up to automate CI/CD, testing, analysis, and visualization.

What You'll Do

  • Build, safely and incrementally deploy, and operate critical production systems with a focus on scalability, reliability, observability, performance, and security.

  • Build automation to remove toil and proactively monitor, respond to, and enhance alerts with automated handling.

  • Create and maintain incident response runbooks, triage platform and infrastructural issues, and write postmortem documents to prevent recurring incidents.

  • Plan and communicate maintenance windows on production systems while engaging with third-party vendor support as needed.

  • Work with Arista's product development teams to identify infrastructural bottlenecks and design solutions to enhance developer experience and workflow efficiency.

  • Survey and adopt best practices around infrastructure and platform design to maintain secure, scalable and fault-tolerant systems, including studying OSS system implementations for better triage and resolution.

#Linux #UNIX #Go #Python #Shell Scripting #Ansible #Docker #Kubernetes

Qualifications

Essential Skills

  • At least BSc Computer Science or Engineering + 3 years’ experience, MS Computer Science or Engineering + 3 years’ experience, or equivalent work experience.

  • Knowledge of one or more of Go, Python, or shell scripting, sufficient to implement medium-complexity automation workflows.

  • Knowledge of Linux (or UNIX) from an administration and debugging perspective.

  • Hands-on experience in operating software systems (infrastructure, complex applications, etc.) at scale.

  • Experience in server provisioning, especially from a storage and networking perspective.

  • Strong problem-solving and software troubleshooting skills.

  • Experience with infrastructure-as-code.

Desired Skills

  • Experience managing databases such as MariaDB, PostgreSQL, and MongoDB

  • Experience with Docker and virtualization technologies such as KVM, QEMU, and Kata Containers

  • Experience managing a monitoring stack: Prometheus, Loki, Tempo, InfluxDB, Grafana, Thanos, etc.

  • Experience managing Elasticsearch clusters

  • Experience managing Artifactory, Docker registries, etc.

  • Experience managing CI/CD systems like ArgoCD and Spinnaker

  • Experience managing version control systems like Perforce and Gerrit

  • Experience with infrastructure-as-code frameworks like Ansible

  • Experience managing large Java applications

  • Experience in storage infrastructure management, e.g. NAS, SAN, and Ceph


Skills Required

  • BSc Computer Science/Engineering + 3 years, MS Computer Science/Engineering + 3 years, or equivalent experience
  • Knowledge of one or more of Go, Python, shell scripting to implement medium complexity automation
  • Knowledge of Linux or UNIX administration and debugging
  • Hands-on experience operating software systems or infrastructure at scale
  • Experience in server provisioning (including storage and networking perspectives)
  • Strong problem solving and software troubleshooting skills
  • Experience with infrastructure-as-code
  • Experience managing databases (MariaDB, Postgres, MongoDB)
  • Experience with containerization and virtualization (Docker, KVM, QEMU, kata-containers)
  • Experience managing monitoring stacks (Prometheus, Loki, Tempo, InfluxDB, Grafana, Thanos)
  • Experience managing Elasticsearch clusters
  • Experience managing Artifactory, Docker registries, and CI/CD systems (ArgoCD, Spinnaker, Jenkins)
  • Experience with version control systems like Perforce and Gerrit
  • Experience managing large Java applications
  • Experience in storage infrastructure management (NAS, SAN, Ceph)
  • Familiarity with Google Cloud

Arista Networks Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Arista Networks and has not been reviewed or approved by Arista Networks.

  • Leave & Time Off Breadth: Time away is positioned as generous, including unlimited PTO, paid holidays, and flexible hours with hybrid options. Parental leave is also included, supporting time off needs beyond standard vacation.
  • Equity Value & Accessibility: Equity participation is a notable component of rewards through RSUs and an employee stock purchase plan with a discount. This structure can materially increase total compensation when stock performance is favorable.
  • Wellbeing & Lifestyle Benefits: Everyday perks and wellness supports are broad, including on-site gym/showers, secured bike storage, stocked break rooms, discounted lunches, wellness webinars, and social events. Family-planning benefits add to lifestyle and wellbeing coverage.


The Company
HQ: Santa Clara, CA
2,867 Employees
Year Founded: 2004

What We Do

Arista Networks was founded to pioneer and deliver software-driven cloud networking solutions for large datacenter storage and computing environments. Arista’s award-winning platforms, ranging in Ethernet speeds from 10 to 400 gigabits per second, redefine scalability, agility, and resilience. Arista has shipped more than 20 million cloud networking ports worldwide with CloudVision and EOS, an advanced network operating system. Committed to open standards, Arista is a founding member of the 25/50GbE consortium. Arista Networks products are available worldwide directly and through partners.
