Lead Systems Engineer

Dublin, OH, USA
In-Office
Senior level
Information Technology • Professional Services • Social Impact • Software
The Role
Lead and coordinate technical projects to improve reliability, scalability, and automation of the OCLC Wise platform. Build and maintain Ansible provisioning, lead environment preparation, manage patches/upgrades, improve monitoring, triage production incidents, and coordinate cross-team changes and stakeholder communication.
Summary Generated by Built In
Together we make breakthroughs possible. 

At OCLC, we build technology with a purpose: to connect libraries and make knowledge accessible worldwide, because we believe that what is known must be shared. Our teams work with complex global datasets, AI and machine learning, hybrid cloud solutions, and other technologies that connect people and organizations to the information they need. We value the power of unique perspectives and experiences to unlock innovation. At OCLC, your ideas matter, whether you have two years of experience or 20. You’ll learn, create, and problem-solve with technologists, product developers, librarians, researchers, marketing pros, and support teams around the world. 

Why join OCLC? 

OCLC is consistently recognized as a best place to work by several independent programs. We recognize and reward people and results with a comprehensive Total Rewards package. This means competitive compensation that reflects your unique contributions—performance, experience, and skills—along with exceptional benefits, including best-in-class health coverage, retirement plans with generous company contributions, and a commitment to your overall well-being.

  • We know the best ideas don’t always happen at a desk. Take a walking meeting around our 100-acre campus or enjoy lunch on the patio. We’re committed to your success, both personally and professionally.

  • Hybrid work environment: for many roles, three days a week on-site, with occasional additional days based on business needs.

  • Free use of our on-site fitness center, gym sports, group exercise classes, and game room 

  • Onsite catering and cafeteria subsidized by OCLC 

  • Health and wellness events 

  • Work environments with individual and team spaces and the latest technology tools 

  • Paid parental leave and adoption assistance 

  • Tuition reimbursement and Public Service Loan Forgiveness eligibility 

  • Company-subsidized pricing on local tickets and memberships 

Join us in transforming how people everywhere access information and be part of a mission-driven team that makes a global impact. 

The job details are as follows:

We’re hiring a Lead DevOps Engineer to raise the standard of how we build, test, deploy, and operate software. This is a hands-on role with strong technical ownership and a developer enablement mindset: you’ll reduce deployment friction, improve environment reliability and quality, strengthen observability, and lead incident resolution through to completion.
You’ll lead through standards and influence—building reusable automation, mentoring others, and driving improvements across teams (including coordination with EMEA DevOps counterparts).

What you’ll do

  • Lead automation initiatives that eliminate repetitive tasks and reduce operational toil.
  • Build and maintain Ansible automation to provision new environments and keep existing environments up to date.
  • Propose and lead platform improvement projects using tools such as Ansible, Rundeck, and CI/CD systems.
  • Design and improve CI/CD pipelines and deployment automation with safe rollout/rollback strategies and clear environment promotion.
  • Enable developers through reusable “paved road” tooling: templates, golden pipelines, self-service workflows, and guardrails that reduce manual work and tribal knowledge.
  • Partner with engineering teams to improve delivery quality through:
    • automated integration and regression testing,
    • deployment validation and smoke testing,
    • reliable and repeatable test/pre-production environments,
    • quality gates that catch issues earlier.
  • Improve observability across services and infrastructure (monitoring, logging, alerting, tracing), including visibility into deployment outcomes and failures.
  • Lead analysis and resolution of production incidents across infrastructure, application, database, and network layers; drive RCAs and prevention work.
  • Oversee platform patching and upgrades; plan, schedule, and monitor maintenance tasks.
  • Coordinate and implement server/platform changes required by customers and internal teams.
  • Document systems and processes, transfer knowledge, and mentor engineers to raise technical standards across the organization.
  • Communicate proactively with stakeholders, manage multiple requests, and prioritize work effectively.

What success looks like

  • Manual operational work is automated or removed; fewer repetitive tasks and fewer “only one person knows” processes.
  • Faster, safer releases with stronger validation, clearer rollback paths, and improved release confidence.
  • More reliable environments and improved readiness for new customer onboarding.
  • Better visibility into platform health and incidents: higher signal, less alert noise, faster diagnosis and recovery.
  • Clear standards and reusable tooling adopted across teams; improved developer experience and reduced deployment friction.

Required qualifications

  • Bachelor’s degree in Computer Science or an equivalent field, or equivalent professional experience.
  • 6+ years of RedHat Linux server administration experience, including production troubleshooting and log triage.
  • 6+ years of extensive Ansible scripting and automation experience (or equivalent configuration management).
  • 6+ years of scripting experience in Bash and Python.
  • Experience building and operating CI/CD pipelines and deployment automation.
  • Strong troubleshooting skills across distributed systems (infrastructure, application, database, and network layers).
  • Strong working knowledge of MySQL (query language required); Postgres experience is a plus.
  • Excellent communication skills and the ability to lead through planning, prioritization, and influence.
  • Proven ability to context switch, manage multiple stakeholder requests, and deliver reliably under deadlines.

Preferred qualifications

  • Container infrastructure design and implementation; experience with Docker, Kubernetes, or Helm is a plus.
  • Experience with Rundeck, Jenkins, SOLR, and/or ETCD.
  • Experience with monitoring/logging/alerting and modern observability practices (SLOs/SLIs, change correlation, incident reduction).
  • Networking fundamentals (DNS, routing, connectivity troubleshooting).
  • Familiarity with standard change management practices (e.g., ITIL).
  • General knowledge of programming concepts and program structure; Java familiarity is a plus.
  • Experience with progressive delivery (canary/blue-green), feature flags, IaC (Terraform/CloudFormation), and secrets management (Vault or equivalent).
  • Interest in extending observability to automated workflows and AI/agent activity (execution tracing, failures, permissions, cost visibility).

Working style

This role requires strong ownership, organization, attention to detail, proactive stakeholder communication, and a bias toward automation and repeatability. You’ll be expected to take ambiguous, high-impact problems through to resolution and leave the platform better than you found it.

The Wise Lead Systems Engineer is an embedded subject matter expert and technical project leader within the OCLC Wise development team. OCLC follows a hybrid work model: at least 3 days a week in the office and up to 2 days remote.

Skills Required

  • Bachelor's degree in Computer Science or equivalent experience
  • 6+ years RedHat Linux server administration
  • 6+ years Ansible scripting and automation
  • 6+ years scripting experience (Bash, Python)
  • Proficient MySQL query language knowledge
  • Extensive Linux system administration and command-line skills
  • Container infrastructure design and implementation
  • Excellent communication skills across technical levels
  • Leadership, planning, prioritization, and stakeholder coordination
  • Outstanding troubleshooting and incident analysis skills
  • Postgres experience
  • Knowledge of Java
  • Knowledge of Kubernetes, Helm, Docker
  • Experience with Rundeck, Jenkins, SOLR and/or ETCD
  • Experience with networking, DNS, and routing
  • Familiarity with standard change management practices (ITIL)
The Company
1,400 Employees
Year Founded: 1967