Role Summary
Serve as the engineering owner for New York Life's enterprise workload automation ecosystem. You'll operate and harden scheduling platforms and calendars, design resilient restart/rerun patterns, and standardize job definitions, logging, and audit evidence across environments. Your work will ensure critical batch chains run predictably, meet SLAs, and support a consistent, automation-first operating model.
What You'll Do:
Run & Harden the Platform
- Operate and maintain scheduling controllers and agents across environments.
- Manage calendars and holiday tables; configure SLA jeopardy thresholds, alerting, and escalation paths.
- Implement platform upgrades, patches, and configuration changes in line with standards and change governance.
Engineer Reliability & Resilience
- Design restart/rerun patterns (checkpointing, idempotent wrappers) and failure-handling flows for critical batches.
- Model dependencies and schedules as code (job-as-code) in version control with CI/CD-based promotion.
- Reduce single points of failure and improve consistency across job chains and environments.
Standardize & Govern
- Define and maintain standard naming conventions, templates, parameters, and calendars across schedulers.
- Engineer common audit-evidence and log schemas to support internal and external reviews.
- Ensure data retention, traceability, and segregation of duties align with policies and regulatory requirements.
Guardrails, Health & Service Readiness
- Implement pre/post checks, synthetic probes, and health validations for batch workflows.
- Define and maintain SLIs/SLOs for batch completion, success rates, and recovery times.
- Build safeguards that detect anomalies and misconfigurations before they impact downstream processes.
Observability & Operational Excellence
- Integrate schedulers with observability tools (logs, metrics, dashboards) to improve visibility.
- Tune job concurrency, execution windows, and resource usage for performance and cost efficiency.
- Reduce noisy alerts and improve the signal-to-noise ratio for incident responders.
Change, Incident & Release Coordination
- Align scheduler changes, maintenance, and releases with APSO/Change Management processes.
- Lead incident triage and resolution for batch failures, including rapid root-cause analysis and safe restarts/reruns.
- Contribute to post-incident reviews and drive remediation actions into platform and pattern improvements.
Partner & Influence Across Teams
- Collaborate with Application Owners/Developers, DBAs/Data teams, SRE/Observability, Security, and Vendors to keep batch chains healthy and compliant.
- Provide guidance on best practices for job design, scheduling windows, dependencies, and error handling.
- Document patterns, playbooks, and standards; mentor peers and junior engineers in workload automation.
What You'll Bring:
- 5-8+ years of experience in enterprise workload automation, SRE, or production operations supporting mission-critical batch processing.
- Hands-on experience with Stonebranch or at least one major enterprise scheduler (e.g., ESP, Control-M, AutoSys, IBM Workload Scheduler/TWS, Redwood) including:
- Operating controllers/agents across environments.
- Managing calendars/holiday tables and SLA jeopardy configurations.
- Strong scripting and automation skills in PowerShell, Bash, or Python, plus familiarity with YAML/JSON and REST APIs.
- Experience with Git-based workflows and CI/CD pipelines for job-as-code and configuration promotion.
- Proven design and implementation of restart/rerun patterns, dependency modeling, and idempotent batch frameworks.
- Experience integrating schedulers with observability platforms (logs/metrics/dashboards) and defining SLIs/SLOs.
- Excellent coordination skills across incident and change processes, with clear, concise communication to technical and non-technical stakeholders.
Nice to Have
- Experience in financial services or other highly regulated industries.
- Background standardizing multiple schedulers and creating common audit schemas and evidence-capture patterns.
- Relevant certifications such as ITIL, cloud architect/operations, DR/BC (e.g., DRII/BCI), or security (e.g., CISSP).
How Success Will Be Measured
- Reduction in SLA jeopardy and breaches; lower mean time to recover (MTTR) from failed jobs.
- Percentage of batch chains using standardized templates, restart/rerun patterns, and automated pre/post checks.
- Completeness, consistency, and time-to-produce logs and evidence for audits and reviews.
- Reduction in manual interventions and alert noise; improved rate of on-time, successful batch completion.
Working Model
Hybrid role based in New York, NY with periodic on-site participation for key release and batch events. Participation in an on-call rotation for critical batch windows is expected. You'll work within clear governance, established change processes, and close cross-technology collaboration to keep job automation reliable, consistent, and audit-ready.
Pay Transparency
Salary Range: $90,000-$128,500
Overtime eligible: Exempt
Discretionary bonus eligible: Yes
Sales bonus eligible: No
Actual base salary will be determined based on several factors but not limited to individual's experience, skills, qualifications, and job location. Additionally, employees are eligible for an annual discretionary bonus. In addition to base salary, employees may also be eligible to participate in an incentive program.
Company Overview
At New York Life, our 180-year legacy of purpose and integrity fuels our future. As we evolve into a more technology-, data-, and AI-enabled organization, we remain grounded in the values that drive lasting impact.
Our diverse business portfolio creates opportunities to make a difference across industries and communities-inviting bold thinking, collaborative problem-solving, and purpose-driven innovation. Here, you'll find the rare balance of long-standing stability and forward momentum, supported by an inclusive team that honors tradition while embracing progress.
As a Fortune 100 mutual company, we offer a place to grow your skills, contribute to meaningful work, and deliver solutions that matter. Your ideas drive what's next, and your growth powers it.
Our Benefits
We provide a full package of benefits for employees - and have unique offerings for a modern workforce, including leave programs, adoption assistance, and student loan repayment programs. Based on feedback from our employees, we continue to refine and add benefits to our offering, so that you can flourish both inside and outside of work.Click hereto discover more about our comprehensive benefit options or visit our NYL Benefits Site.
Our Commitment to Inclusion
At New York Life, fostering an inclusive workplace is fundamental to who we are and how we serve our communities. We have a longstanding commitment to creating an environment where individuals can contribute their best and succeed together. This foundation is rooted in our core values of humanity and integrity, ensuring that every employee feels valued and supported. By embracing a broad range of perspectives and experiences, we achieve greater success and fulfill our promise of providing financial security and peace of mind to families across all communities. Click here to learn more about New York Life's leadership in this space.
Recognized as one of Fortune's World's Most Admired Companies, New York Life is committed to improving local communities through a culture of employee giving and volunteerism, supported by the Foundation. We're proud that due to our mutuality, we operate in the best interests of our policy owners. To learn more about career opportunities at New York Life, please visit the Careers page of www.NewYorkLife.com.
Visit our LinkedIn to see how our employees and agents are leading the industry and impacting communities.
Visit our Newsroom to learn more about how our company is constantly evolving to meet our clients' and employees' needs.
Job Requisition ID: 93700
Top Skills
What We Do
At New York Life, our 180-year legacy of integrity, mutuality, and financial strength fuels a future defined by bold transformation. As the largest mutual life insurance company in the U.S., we operate on behalf of our policy owners—not shareholders. That structure allows us to take a long-term view, investing in people, purpose, and innovation that endures. Guided by a clear enterprise vision to become a technology-, data-, and AI-powered company, we’re modernizing our platforms, rearchitecting experiences, and embedding intelligence across our products and services. Our mission has always been about helping people through life’s most meaningful moments. Today, technology is amplifying that mission—enabling us to serve clients, advisors, and communities in more personalized, proactive ways. With a diversified business portfolio spanning insurance, investments, retirement, group benefits, and direct-to-consumer offerings, New York Life delivers the stability of a Fortune 100 company with the agility of one that’s continuously evolving. We’re powered by a values-led culture, inclusive teams, and a shared belief that when our people thrive, so does our company. Here, tradition fuels momentum—and your ideas, energy, and growth power what’s next.
Why Work With Us
New York Life is transforming from the inside out—blending 180 years of trust with the velocity of innovation. What makes us different is our culture: grounded in integrity, humanity, and shared success—values that show up in how we work, lead, and grow. If you want a place where innovation has purpose—build what's next with us.
Gallery
New York Life Insurance Company Teams
New York Life Insurance Company Offices
Hybrid Workspace
Employees engage in a combination of remote and on-site work.









