AI Platform & Cloud Engineer

Posted 6 Days Ago
Easy Apply
Rockville, MD
In-Office
125K-135K Annually
Mid level
Information Technology • Consulting
The Role
Build and maintain the hybrid-cloud AI platform and internal developer platform: containerize research code, define IaC specs, deploy workflow orchestration, manage vector/graph databases, deploy model serving and cloud AI agents, and collaborate with IT on Kubernetes-based compute and security integrations.
Summary Generated by Built In
(ID: 2025-0914)

Axle is a bioscience and information technology company that offers advancements in translational research, biomedical informatics, and data science applications to research centers and healthcare organizations nationally and abroad. With experts in biomedical science, software engineering, and program management, we focus on developing and applying research tools and techniques to empower decision-making and accelerate research discoveries. We work with some of the top research organizations and facilities in the country including multiple institutes at the National Institutes of Health (NIH).


Benefits We Offer:

  • 100% Medical, Dental & Vision Coverage for Employees
  • Paid Time Off and Paid Holidays
  • 401K match up to 5%
  • Educational Benefits for Career Growth
  • Employee Referral Bonus
  • Flexible Spending Accounts:
    • Healthcare (FSA)
    • Parking Reimbursement Account (PRK)
    • Dependent Care Assistant Program (DCAP)
    • Transportation Reimbursement Account (TRN)

Overview

The AI Platform & Cloud Engineer will help sustain the hybrid cloud production environment for the SOM Center’s data ecosystem. This role serves as the technical interface between Data Science and IT, focusing on Platform Engineering: building the internal developer platform (IDP) that utilizes the IT-managed Kubernetes infrastructure and cloud resource to scale resources for workflow orchestration, knowledge graph data pipelines, and distributed model inference.

 

Responsibilities:

  • IT Collaboration & K8s Support: Collaborate closely with the dedicated IT team to define compute requirements and orchestrate workloads on the new Kubernetes cluster. The engineer will not manage the cluster directly but will ensure data science applications are correctly containerized and configured to run efficiently on the infrastructure provided by IT.
  • Infrastructure Strategy: Define the Infrastructure as Code (IaC) specifications for application-level resources, working with IT to ensure on-premises GPU clusters and public cloud environments (GCP/AWS) are utilized effectively.
  • Refactoring & Model Serving: Transform experimental code (Jupyter Notebooks, R scripts) developed by NLP and Omics researchers into robust, containerized software packages. Deploy and optimize model inference servers (e.g., vLLM, Triton Inference Server) to expose AI models as reliable internal APIs.
  • Workflow Orchestration: Deploy and maintain the Workflow Orchestration platform (e.g., Apache Airflow, Prefect, or Dagster) to manage dependencies between data ingestion, model inference, and state updates, serving as the central execution controller for distributed processes.
  • AI-Assisted Development: Actively utilize AI-assisted coding tools (e.g., GitHub Copilot) to accelerate code generation, documentation, and refactoring processes to increase overall productivity.
  • Data Foundation: Administer the Data Foundation infrastructure, including supporting Graph Databases (e.g., Neo4j), Vector Databases (e.g., Milvus, pgvector) for RAG implementations, and ETL pipelines to ingest massive public datasets (e.g., Human Cell Atlas) into the Data Lake.
  • Cloud Agent Architecture: Architect and deploy managed Cloud AI Agents (e.g., via Vertex AI) to orchestrate complex reasoning workflows, including and not limited to parsing scientific literature, querying omics databases, and validating experimental protocols against Knowledge Graphs.
  • Security Implementation: Collaborate with data scientists to implement Workload Identity federation and secrets management (e.g., Vault), ensuring automated workflows securely authenticate against enterprise resources managed by IT.

 

Required Qualifications:

  • Bachelor’s or master’s degree in computer science or engineering with experience in Cloud Engineering, MLOps, or SRE.
  • Proficiency in Python and Infrastructure as Code concepts, with experience in major cloud platforms (GCP preferred, or AWS).
  • AI Productivity: Demonstrated ability to leverage AI-driven coding assistants and LLMs to increase development velocity and code quality.
  • Experience utilizing Hybrid Cloud architectures and configuring workloads for burst computing (Spot instances, Autoscaling groups).
  • Experience refactoring research-grade code into production-grade services (Docker/Kubernetes).
  • Experience with Workflow Orchestration tools (Airflow, Prefect, or Dagster) and Vector Database administration.

Preferred Qualifications:

  • Experience deploying applications to Kubernetes (GKE/EKS) and using GitOps workflows (ArgoCD/Flux).
  • Knowledge of Graph Database administration (Neo4j) and object storage architectures.
  • Familiarity with Serverless event processing (Cloud Functions) and ML Engineering concepts (quantization, distillation, serving via Triton/vLLM).

Disclaimer: The above description is meant to illustrate the general nature of work and level of effort being performed by individuals assigned to this position or job description. This is not restricted as a complete list of all skills, responsibilities, duties, and/or assignments required. Individuals may be required to perform duties outside of their position, job description or responsibilities as needed.

The diversity of Axle’s employees is a tremendous asset. We are firmly committed to providing equal opportunity in all aspects of employment and will not tolerate any illegal discrimination or harassment based on age, race, gender, religion, national origin, disability, marital status, covered veteran status, sexual orientation, status with respect to public assistance, and other characteristics protected under state, federal, or local law and to deter those who aid, abet, or induce discrimination or coerce others to discriminate.

Accessibility: If you need an accommodation as part of the employment process please contact: [email protected]

This role has a market-competitive salary with an anticipated base compensation range listed below. Actual salaries will vary depending on a candidate’s experience, qualifications, skills, and location.

#IND

Salary Range
$125,000$135,000 USD

Top Skills

Apache Airflow
Argocd
Autoscaling Groups
AWS
Cloud Functions
Dagster
Docker
Eks
Flux
GCP
Github Copilot
Gitops
Gke
Infrastructure As Code
Jupyter
Kubernetes
Milvus
Neo4J
Object Storage
Pgvector
Prefect
Python
R
Spot Instances
Triton Inference Server
Vault
Vertex Ai
Vllm
Workload Identity Federation
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Rockville, MD
191 Employees
Year Founded: 2002

What We Do

Axle Informatics is a bioscience and information technology company that offers advancements in translational research, health informatics, and data science applications to research centers and healthcare organizations around the globe. With experts in biomedical science, software engineering, and program management, we develop and apply research tools and techniques that empower decision-making and accelerate research discoveries. We work with some of the top research organizations and facilities in the country including multiple institutes at the National Institutes of Health (NIH) by offering the responsiveness of a small business coupled with the experience, breadth, and depth of a large organization.

Similar Jobs

CrowdStrike Logo CrowdStrike

Architect

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
USA
10000 Employees
135K-205K Annually

CrowdStrike Logo CrowdStrike

Technical Account Manager

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
USA
10000 Employees
110K-160K Annually

Applied Systems Logo Applied Systems

Associate Customer Project Lead

Cloud • Insurance • Payments • Software • Business Intelligence • App development • Big Data Analytics
Remote or Hybrid
United States
3040 Employees
60K-90K Annually

General Motors Logo General Motors

Strategy & Operations Lead

Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Remote or Hybrid
United States
165000 Employees
150K-200K Annually

Similar Companies Hiring

Scrunch AI Thumbnail
Software • SEO • Marketing Tech • Information Technology • Artificial Intelligence
Salt Lake City, Utah
Amplify Platform Thumbnail
Fintech • Financial Services • Consulting • Cloud • Business Intelligence • Big Data Analytics
Scottsdale, AZ
62 Employees
Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
15 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account