Principal AI Engineer

Posted Yesterday
Hiring Remotely in US
Remote
Senior level
Edtech
The Role
Lead architecture and technical direction for a cloud-native, multi-tenant Student Journey Platform. Design and enforce standards for Kubernetes-based AI services (AKS), event streaming, ML model serving, autoscaling, observability, and compliance. Ensure scalable, reliable, cost-efficient infrastructure, resolve architectural risks and production issues, and align platform decisions with business growth and SLAs.
Summary Generated by Built In

Risepoint is an education technology company that provides world-class support and trusted expertise to more than 100 universities and colleges. We primarily work with regional universities, helping them develop and grow their high-ROI, workforce-focused online degree programs in critical areas such as nursing, teaching, business, and public service. Risepoint is dedicated to increasing access to affordable education so that more students, especially working adults, can improve their careers and meet employer and community needs.

The Impact You Will Make

Risepoint is building our Student Journey Platform, a multi-component AI platform spanning real-time orchestration, machine learning model serving, event-driven workflow execution, and a student intelligence layer operating across 10+ university partner environments. The role will lead the technical direction for platform architecture within the engineering team: a Cadence Engine supporting durable stateful student engagement workflows; real-time AI-mediated communication endpoints across web chat, phone AI, and SMS AI with sub-second latency requirements; ML model serving infrastructure for propensity scoring and lead prioritization at scale; multi-tenant Kubernetes cluster architecture across partner deployments with distinct compliance and data isolation requirements; and a speech analytics pipeline processing call transcript data at volume. This is an architect-level role in scope and accountability charged with setting technical direction, defining the standards engineering teams build against, and keeping the platform ahead of Risepoint’s growth curve.

How You Will Bring Our Mission to Life

What You Will Do

  • Lead the architecture and evolution of cloud-native infrastructure for the Student Journey Platform, including all services and its integrated platforms (Salesforce, DBX, Marketing sites, Azure AI Foundry), setting technical direction across Kubernetes-based AI services deployed on Azure (AKS), with accountability for platform-wide scalability, reliability, and cost efficiency
  • Establish and promote architecture standards within the platform scope, design patterns, and deployment best practices that all engineering teams build against including service mesh configuration, autoscaling policy design, resource governance, API contracts, and container orchestration strategy.
  • Lead architecture design and implementation across the platform’s components for SJP, Student Success Team, Marketing Technology, and Enterprise Data Platform implementing a variety of resources (Kafka, Azure Event Hubs, Azure Service Bus, AI Foundry), ensuring asynchronous AI workloads are resilient, observable, and operate without bottlenecks, vulnerabilities, or data loss under production conditions.
  • Align platform architecture to business growth needs and scalability requirements, partnering with Product, Engineering, and business stakeholders to ensure infrastructure decisions stay ahead of adoption curves not reactive to them.
  • Present sound architecture proposals to the Architecture Review Board (ARB) for approval of products intended for release into production, representing the Student Journey Platform’s technical strategy as well as its integrated products and services, ensuring alignment with Risepoint’s enterprise standards for SLAs, security, compliance, and scalability.
  • Identify and resolve architectural risk early, before it compounds, working across engineering teams to close gaps in design, security posture, or operational readiness.
  • Debug and resolve production-level issues where infrastructure or architecture is a contributing factor, driving root cause resolution rather than symptomatic fixes.
  • Implement and manage event streaming and real-time processing pipelines (e.g., Kafka, Azure Event Hubs, Pub/Sub, Kinesis) at production scale, supporting high-volume asynchronous AI workloads.
  • Design and manage multi-tenant cloud infrastructure across university partner deployments, each with potentially distinct compliance, data isolation, and availability requirements.

What Success Looks Like

  • Engineering teams across the Student Journey Platform build with confidence against clear, documented architecture standards without requiring repeated escalation or one-off guidance on foundational decisions.
  • Kubernetes-based deployments are stable, observable, and horizontally scalable, supporting resilient operation under production load with well-defined SLAs for availability, latency, and throughput.
  • Infrastructure decisions made today hold up 12–18 months from now, as the platform scales across additional university partners and AI-mediated workloads.

How Impact Will be Measured

  • Kubernetes workloads demonstrate effective horizontal scaling and resource utilization, with cloud spend aligned to performance targets and capacity forecasts.
  • Event-driven and queue-based systems maintain consistent throughput and processing times under load, supporting business adoption targets without degradation or data loss.
  • Platform services meet defined SLAs/SLOs as measured through production monitoring tools (New Relic, Azure Monitor, Prometheus/Grafana), with alerting frameworks in place before issues surface in production. Architecture standards are adopted across engineering teams, measurable by reduction in ARB revision cycles, RCA reports, and consistency of implementation patterns across services.

What You’ll Bring to the Team

Experience That Matters Most

  • 8+ years of software engineering experience with demonstrated progression into architecture ownership including hands-on experience with Kubernetes (AKS preferred), containerization (Docker), and distributed system design at production scale.
  • A track record of setting technical direction across engineering teams, not just executing within one: including defining standards others build against and influencing architectural decisions in cross-functional environments.
  • Deep experience with autoscaling policy design, resource governance, and cost management in cloud environments (Azure preferred; AWS or GCP acceptable), managed through infrastructure as code.
  • Experience translating business and product requirements into infrastructure architecture including capacity planning, SLA definition, and trade-off communication to non-technical stakeholders. Proficiency in Python, C#, Java, or a comparable language used in production systems, with strong fundamentals in object-oriented programming and design patterns.

Experience That’s Great to Have

  • Architecture or deployment experience with AI/ML systems in cloud environments. Managed integrations with Databricks model serving endpoints and vector stores a plus.
  • Implementation experience with Azure AI Foundry and realtime models.
  • Experience designing APIs and backend systems supporting high concurrency and real-time interactions.
  • Familiarity with RAG systems, vector stores, and MCP server architecture.

Risepoint is an equal-opportunity employer and supports a diverse and inclusive workforce.

Skills Required

  • 8+ years software engineering experience with progression into architecture ownership
  • Hands-on Kubernetes experience (AKS preferred) and containerization (Docker)
  • Distributed system design and production-scale architecture experience
  • Experience defining technical direction, architecture standards, and influencing cross-team adoption
  • Autoscaling policy design, resource governance, and cloud cost management experience (Azure preferred; AWS/GCP acceptable)
  • Experience with infrastructure as code and capacity planning, SLA definition, and trade-off communication
  • Proficiency in Python, C#, Java, or comparable production language
  • Experience implementing and managing event streaming and real-time processing pipelines (Kafka, Azure Event Hubs, Pub/Sub, Kinesis)
  • Experience designing and managing multi-tenant cloud infrastructure with compliance and data isolation requirements
  • Experience with ML model serving infrastructure and real-time AI-mediated endpoints
  • Experience with Databricks model serving endpoints and vector stores
  • Experience with Azure AI Foundry and realtime model deployments
  • Familiarity with RAG systems, vector stores, MCP server architecture, and speech analytics pipelines
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
737 Employees
Year Founded: 2007

What We Do

Risepoint is a global education technology company partnering with more than 100 not-for-profit universities to launch and grow affordable, workforce relevant online programs for working adults. Founded in 2007, Risepoint provides the technology, expertise, and capital that help regional universities innovate and grow through online offerings in areas such as nursing, healthcare, teaching, business, and technology. Risepoint employs more than 1,400 professionals across the U.S., the United Kingdom, and APAC.

Similar Jobs

Dynatrace Logo Dynatrace

Principal Engineer

Artificial Intelligence • Big Data • Cloud • Information Technology • Software • Big Data Analytics • Automation
Remote or Hybrid
Boston, MA, USA
5600 Employees
74K-112K Annually

Block Logo Block

Principal Engineer

Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
In-Office or Remote
8 Locations
12000 Employees
319K-479K Annually

CrowdStrike Logo CrowdStrike

Data Engineer

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
7 Locations
10000 Employees
195K-320K Annually

eClinical Solutions Logo eClinical Solutions

Artificial Intelligence Engineer

Cloud • Healthtech • Professional Services • Software • Pharmaceutical
Easy Apply
Remote or Hybrid
Mansfield, MA, USA
400 Employees
190K-210K Annually

Similar Companies Hiring

ReUp Education Thumbnail
Social Impact • Edtech
Austin, TX
180 Employees
Learneo Thumbnail
Software • Machine Learning • Edtech • Artificial Intelligence
NL
397 Employees
CodePath.org Thumbnail
Edtech • Social Impact
US
55 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account