At OCLC, we build technology with a purpose: to connect libraries and make knowledge accessible worldwide, because we believe that what is known must be shared. Our teams work with complex global datasets, AI and machine learning, hybrid cloud solutions, and other technologies that connect people and organizations to the information they need. We value the power of unique perspectives and experiences to unlock innovation. At OCLC, your ideas matter, whether you have two years of experience or 20. You’ll learn, create, and problem-solve with technologists, product developers, librarians, researchers, marketing pros, and support teams around the world.
OCLC is consistently recognized as a best place to work by several independent programs. We recognize and reward people and results with a comprehensive Total Rewards package. This means competitive compensation that reflects your unique contributions—performance, experience, and skills—along with exceptional benefits, including best-in-class health coverage, retirement plans with generous company contributions, and a commitment to your overall well-being.
We know the best ideas don’t always happen at a desk. Take a walking meeting around our 100-acre campus or enjoy lunch on the patio. We’re committed to your success—both personally and professionally. Hybrid work environment: For many roles, three days a week on-site, with occasional additional days based on business needs.
Free use of our on-site fitness center, gym sports, group exercise classes, and game room
Onsite catering and cafeteria subsidized by OCLC
Health and wellness events
Work environments with individual and team spaces and the latest technology tools
Paid parental leave and adoption assistance
Tuition reimbursement and Public Service Loan Forgiveness eligibility
Company-subsidized pricing on local tickets and memberships
Join us in transforming how people everywhere access information and be part of a mission-driven team that makes a global impact.
The job details are as follows:We’re hiring a Lead DevOps Engineer to raise the standard of how we build, test, deploy, and operate software. This is a hands-on role with strong technical ownership and a developer enablement mindset: you’ll reduce deployment friction, improve environment reliability and quality, strengthen observability, and lead incident resolution through to completion.You’ll lead through standards and influence—building reusable automation, mentoring others, and driving improvements across teams (including coordination with EMEA DevOps counterparts).
What you’ll do
- Lead automation initiatives that eliminate repetitive tasks and reduce operational toil.
- Build and maintain Ansible automation to provision new environments and keep existing environments up to date.
- Propose and lead platform improvement projects using tools such as Ansible, Rundeck, and CI/CD systems.
- Design and improve CI/CD pipelines and deployment automation with safe rollout/rollback strategies and clear environment promotion.
- Enable developers through reusable “paved road” tooling: templates, golden pipelines, self-service workflows, and guardrails that reduce manual work and tribal knowledge.
- Partner with engineering teams to improve delivery quality through:
- automated integration and regression testing,
- deployment validation and smoke testing,
- reliable and repeatable test/pre-production environments,
- quality gates that catch issues earlier.
- Improve observability across services and infrastructure (monitoring, logging, alerting, tracing), including visibility into deployment outcomes and failures.
- Lead analysis and resolution of production incidents across infrastructure, application, database, and network layers; drive RCAs and prevention work.
- Oversee platform patching and upgrades; plan, schedule, and monitor maintenance tasks.
- Coordinate and implement server/platform changes required by customers and internal teams.
- Document systems and processes, transfer knowledge, and mentor engineers to raise technical standards across the organization.
- Communicate proactively with stakeholders, manage multiple requests, and prioritize work effectively.
What success looks like
- Manual operational work is automated or removed; fewer repetitive tasks and fewer “only one person knows” processes.
- Faster, safer releases with stronger validation, clearer rollback paths, and improved release confidence.
- More reliable environments and improved readiness for new customer onboarding.
- Better visibility into platform health and incidents: higher signal, less alert noise, faster diagnosis and recovery.
- Clear standards and reusable tooling adopted across teams; improved developer experience and reduced deployment friction.
Required qualifications
- Bachelor’s degree in Computer Science (or equivalent) or equivalent professional experience.
- 6+ years of RedHat Linux server administration experience, including production troubleshooting and log triage.
- 6+ years of extensive Ansible scripting and automation experience (or equivalent configuration management).
- 6+ years of scripting experience in Bash and Python.
- Experience building and operating CI/CD pipelines and deployment automation.
- Strong troubleshooting skills across distributed systems (infrastructure, application, database, and network layers).
- Strong working knowledge of MySQL (query language required); Postgres experience is a plus.
- Excellent communication skills and the ability to lead through planning, prioritization, and influence.
- Proven ability to context switch, manage multiple stakeholder requests, and deliver reliably under deadlines.
Preferred qualifications
- Container infrastructure design and implementation; experience with Docker; Kubernetes/Helm a plus.
- Experience with Rundeck, Jenkins, SOLR, and/or ETCD.
- Experience with monitoring/logging/alerting and modern observability practices (SLOs/SLIs, change correlation, incident reduction).
- Networking fundamentals (DNS, routing, connectivity troubleshooting).
- Familiarity with standard change management practices (e.g., ITIL).
- General programming knowledge/structure; Java familiarity is a plus.
- Experience with progressive delivery (canary/blue-green), feature flags, IaC (Terraform/CloudFormation), and secrets management (Vault or equivalent).
- Interest in extending observability to automated workflows and AI/agent activity (execution tracing, failures, permissions, cost visibility).
Working style
This role requires strong ownership, organization, attention to detail, proactive stakeholder communication, and a bias toward automation and repeatability. You’ll be expected to take ambiguous, high-impact problems through to resolution and leave the platform better than you found it.
The Wise Lead System Engineer functions as an embedded subject matter expert and technical project leader working from within the OCLC Wise development team. OCLC practices a hybrid work location model allowing at least 3 days a week in the office and 2 days remote.
Skills Required
- Bachelor's degree in Computer Science or equivalent experience
- 6+ years RedHat Linux server administration
- 6+ years Ansible scripting and automation
- 6+ years scripting experience (Bash, Python)
- Proficient MySQL query language knowledge
- Extensive Linux system administration and command-line skills
- Container infrastructure design and implementation
- Excellent communication skills across technical levels
- Leadership, planning, prioritization, and stakeholder coordination
- Outstanding troubleshooting and incident analysis skills
- Postgres experience
- Knowledge of Java
- Knowledge of Kubernetes, Helm, Docker
- Experience with Rundeck, Jenkins, SOLR and/or ETCD
- Experience with networking, DNS, and routing
- Familiarity with standard change management practices (ITIL)









