About this role
About the RoleBlackRock is one of the world’s leading providers of investment, advisory, and risk management solutions, powered by Aladdin, our integrated investment and risk management technology platform. Aladdin unifies data, analytics, and workflows across public and private markets, enabling scale, insights, and transformation for BlackRock and our clients.
As part of Aladdin Engineering, you will join the AI Platform Engineering team, which is building the next-generation AI infrastructure and services that power Aladdin and other firm-wide applications. This team sits at the intersection of backend systems, AI engineering, AI infrastructure, and platform reliability, enabling advanced AI capabilities at scale.
We are looking for a senior leader who thrives on solving complex engineering challenges, shaping AI reliability and automation strategy, and building robust, scalable platforms. You will lead teams responsible for ensuring operational excellence, reliability, and automation across AI workloads, influencing the AI ecosystem across the firm.
What You’ll Do- Define and execute the SRE and DevOps strategy for AI platforms, ensuring high availability, scalability, and security.
- Architect and oversee cloud-native infrastructure across AWS, GCP, and Azure for AI workloads.
- Drive Kubernetes-based orchestration for AI models, including GPU scheduling and resource optimization.
- Establish CI/CD pipelines for AI platform and AI model lifecycle management (training, testing, deployment) with enterprise-grade security and compliance.
- Implement observability frameworks and reliability standards (SLIs, SLOs, SLAs) for distributed AI systems.
- Lead incident management, root cause analysis, and performance optimization across compute, storage, and network layers.
- Collaborate cross-functionally to translate business and functional requirements into resilient technical designs.
- Stay ahead of trends in SRE, DevOps, MLOps, and AI infrastructure to drive innovation and operational excellence.
- Education: B.S./M.S. in Computer Science, Engineering, or related field.
- Experience: 8+ years in platform engineering, SRE, DevOps or AIOps roles.
- Technical Expertise:
- Proficiency in Python, Bash/Shell for automation, orchestration, and AI workflows.
- Familiarity with Rust build and dependency management frameworks.
- Hands-on expertise with CI/CD tools (e.g., Azure DevOps, Jenkins, GitHub Actions etc.).
- Proven ability to design and scale fault-tolerant, cloud-native systems for AI workloads.
- Deep proficiency in Kubernetes (Helm, Kustomize, CRDs) and containerization (Docker, containerd).
- Hands-on experience with AWS, GCP, Azure, and IaC tools (Terraform, CloudFormation).
- Strong knowledge of observability tools (Prometheus, Grafana, ELK) and performance tuning.
- Leadership Skills: Ability to build and lead high-performing teams, drive cross-functional collaboration, and influence technical strategy.
- Mindset: Strategic thinker with strong problem-solving skills, operational rigor, and adaptability.
- Experience with GPU orchestration and performance optimization in Kubernetes clusters.
- Exposure to secure model deployment practices and compliance frameworks for regulated industries.
- Practical experience with end-to-end ML lifecycle management and automated pipelines for large models.
- Familiarity with ML frameworks (PyTorch, JAX) and MLOps concepts.
- Hands-on expertise with CI/CD tools (e.g., Azure DevOps, Jenkins, GitHub Actions etc.).
Our benefits
To help you stay energized, engaged and inspired, we offer a wide range of benefits including a strong retirement plan, tuition reimbursement, comprehensive healthcare, support for working parents and Flexible Time Off (FTO) so you can relax, recharge and be there for the people you care about.
Our hybrid work model
BlackRock’s hybrid work model is designed to enable a culture of collaboration and apprenticeship that enriches the experience of our employees, while supporting flexibility for all. Employees are currently required to work at least 4 days in the office per week, with the flexibility to work from home 1 day a week. Some business groups may require more time in the office due to their roles and responsibilities. We remain focused on increasing the impactful moments that arise when we work together in person – aligned with our commitment to performance and innovation. As a new joiner, you can count on this hybrid model to accelerate your learning and onboarding experience here at BlackRock.
About BlackRock
At BlackRock, we are all connected by one mission: to help more and more people experience financial well-being. Our clients, and the people they serve, are saving for retirement, paying for their children’s educations, buying homes and starting businesses. Their investments also help to strengthen the global economy: support businesses small and large; finance infrastructure projects that connect and power cities; and facilitate innovations that drive progress.
This mission would not be possible without our smartest investment – the one we make in our employees. It’s why we’re dedicated to creating an environment where our colleagues feel welcomed, valued and supported with networks, benefits and development opportunities to help them thrive.
For additional information on BlackRock, please visit @blackrock | Twitter: @blackrock | LinkedIn: www.linkedin.com/company/blackrock
BlackRock is proud to be an Equal Opportunity Employer. We evaluate qualified applicants without regard to age, disability, family status, gender identity, race, religion, sex, sexual orientation and other protected attributes at law.
Top Skills
What We Do
As the world’s largest asset manager, BlackRock partners with investors around the globe to help them (and those on whose behalf they invest) plan for life’s most important goals – like retirement, home ownership and their children’s education. Our clients range from governments, foundations and other large institutions to those investing on behalf of individuals, including firefighters, nurses, teachers and factory workers.
BlackRock was founded with the idea of creating a better asset management firm — one that was purpose-driven, focused on clients and risk management, and propelled by data and technology. Our breakthrough Aladdin® platform is BlackRock’s technological backbone, helping investors see and manage their whole portfolios in one place – from constructing investments to monitoring risk and executing trades. Used by hundreds of external institutions around the world, Aladdin combines powerful analytics and a common language to help investment teams make faster, more informed decisions across public and private markets. It’s a key part of our business and one of the reasons we’re trusted to manage more assets than any other investment manager today.
At BlackRock, we challenge conventions and raise the bar for what’s possible. We harness technology to unlock new solutions, simplify complexity, and deliver investment strategies that meet people where they are. Whether it’s retirement planning, wealth building or navigating market shifts, we’re here to help clients invest more easily, more affordably and with more choice as we chart a path toward financial well-being together.
Learn more: Careers.BlackRock.com
Why Work With Us
Without our people, technology is irrelevant. When we combine the power of people with the power of technology, we amplify our ability to create better outcomes for our employees, clients, shareholders and society alike.
Gallery
BlackRock Teams
BlackRock Offices
Hybrid Workspace
Employees engage in a combination of remote and on-site work.
BlackRock has 25,000 employees across more than 100 offices in over 40 countries around the world.






