Assoc. Dir. DDIT IES Cloud Engineering

Reposted 22 Days Ago
Be an Early Applicant
3 Locations
In-Office or Remote
Senior level
Biotech • Pharmaceutical
The Role
Responsible for architecting and developing advanced AI infrastructure on AWS for pharmaceutical research, ensuring data management, security, and collaboration with scientists.
Summary Generated by Built In

Job Description Summary

Responsible for designing, building, and managing a cutting-edge AI and Generative AI infrastructure based on NVIDIA SuperPOD NV72 system, tailored for pharmaceutical business use cases. The platform will enable Biomedical Research Scientists and other business users to accelerate early molecule development and research activities by providing robust, scalable, and secure GPU computing resources.


 

Job Description

Major Accountabilities:
  • Architect and Design: Lead the design and architecture of an NVIDIA SuperPOD-based AI infrastructure platform supporting Generative AI workloads and advanced analytics for pharma use cases like BioNeMo, AlphaFold, ESMFold, OpenFold, ProtGPT2, and NVIDIA Clara suite.
  • Platform Development: Implement ML/Ops solutions (Run:AI) on Kubernetes clusters optimized for NVIDIA GPUs.
  • Data Management: Design and implement high-performance data pipelines for large-scale genomics and chemical compound datasets.
  • Security and Compliance: Ensure robust security measures and compliance for HPC and multi-cloud environments.
  • Performance Optimization: Optimize GPU cluster performance, networking, and storage for cost-efficiency and scalability.
  • Innovation: Stay updated with NVIDIA AI infrastructure advancements and HPC trends.
Technical Expertise:
  • Expertise in deploying and managing GBX00 GPU-based clusters.
  • 8+ years of experience in GPU-based AI infrastructure and HPC systems.
  • Understanding of advanced interconnect technologies for GB-series GPUs.
  • Performance tuning for multi-node GBX00 workloads using NCCL, CUDA NVLink, NVSwitch, Storage and Inband High-Speed Ethernet Fabric, RDMA tuning, QoS policies, Out of Band Management.
  • Redundant power and cooling systems for HPC reliability.
  • Cluster Management: NVIDIA Base Command Manager, Slurm, Kubernetes for GPU scheduling.
  • Firmware & Driver Management: CUDA, NCCL, InfiniBand drivers, GPU firmware updates.
  • EFA, NVLink and InfiniBand switches for ultra-low latency GPU cluster communication.
  • Separate Ethernet-based management network for orchestration and monitoring.
  • Parallel File Systems: Spectrum Scale (GPFS) or Lustre for high-performance distributed storage.
  • Multi-petabyte capacity with NVMe SSD tiers for scratch space and HDD tiers for archival.
  • Integration with object storage for AI datasets.
  • Monitoring & Troubleshooting: DCGM, Prometheus, Grafana for telemetry and health checks.
  • Security & Compliance: RBAC, encryption, secure multi-tenant configurations.
  • Al/ML Workflow optimization, troubleshooting and job scheduling

Why consider Novartis?

Our purpose is to reimagine medicine to improve and extend people’s lives and our vision is to become the most valued and trusted medicines company in the world. How can we achieve this? With our people. It is our associates that drive us each day to reach our ambitions. Be a part of this mission and join us!
Learn more here:
https://www.novartis.com/about/strategy/people-and-culture
Commitment to Diversity and Inclusion:
Novartis is committed to building an outstanding, inclusive work environment and diverse teams' representative of the patients and communities we serve.
 

Join our Novartis Network: If this role is not suitable to your experience or career goals but you wish to stay connected to hear more about Novartis and our career opportunities, join the Novartis Network here:
https://talentnetwork.novartis.com/network


 

Skills Desired

Agile Project Management, Business Partnering, Change Management, IT Service Delivery, Performance Management

Top Skills

Amazon Q
AWS
Aws Bedrock
Ci/Cd
Docker
Ecs
Eks
Gpu
Kubernetes
Ml/Ops
Nvidia
Sagemaker
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Basel
110,000 Employees
Year Founded: 1996

What We Do

Novartis is an innovative medicines company. Every day, working to reimagine medicine to improve and extend people’s lives so that patients, healthcare professionals and societies are empowered in the face of serious disease. Our medicines reach more than 250 million people worldwide.

Similar Jobs

In-Office or Remote
3 Locations
110000 Employees

Rapid7 Logo Rapid7

Program Coordinator

Artificial Intelligence • Cloud • Information Technology • Sales • Security • Software • Cybersecurity
Remote or Hybrid
Prague, CZE
2400 Employees

Rapid7 Logo Rapid7

Senior Product Manager

Artificial Intelligence • Cloud • Information Technology • Sales • Security • Software • Cybersecurity
Remote or Hybrid
Prague, CZE
2400 Employees

Rapid7 Logo Rapid7

Remediation analyst

Artificial Intelligence • Cloud • Information Technology • Sales • Security • Software • Cybersecurity
Remote or Hybrid
Prague, CZE
2400 Employees

Similar Companies Hiring

SOPHiA GENETICS Thumbnail
Software • Healthtech • Biotech • Big Data • Artificial Intelligence
Boston, MA
450 Employees
Pfizer Thumbnail
Pharmaceutical • Natural Language Processing • Machine Learning • Healthtech • Biotech • Artificial Intelligence
New York, NY
121990 Employees
Cencora Thumbnail
Pharmaceutical • Logistics • Healthtech
Conshohocken, PA
51000 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account