AI Solutions Architect - FS or CI Polygraph Required

Posted 4 Days Ago
Be an Early Applicant
Hiring Remotely in Virginia, USA
Remote
Senior level
Big Data • Software • Analytics
The Role
Design, build, and deploy secure, production AI solutions on Cloudera Data Platform for government agencies: select and fine-tune models, build ETL/vector/RAG pipelines, optimize inference and MLOps, enable GPU orchestration, and advise on governance and FedRAMP/IL5 compliance while collaborating with agency Centers of Excellence.
Summary Generated by Built In

Business Area:

Professional Services

Seniority Level:

Mid-Senior level

Job Description: 

At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry.  Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.

As an AI Solutions Engineer within Cloudera’s Public Sector Consulting team, you will be the technical architect and execution lead for agencies moving from "data chaos" to "agentic autonomy." You will work directly with government organizations to design, build, and deploy mission-critical AI applications on the Cloudera Data Platform (CDP).

This is not a "theoretical" role. You will be on the front lines of Phase 2 and Phase 3 adoption journeys—helping customers clean legacy data silos, select the right model architectures, and industrialize MLOps pipelines in highly secure, often air-gapped or hybrid-cloud environments.

As the AI Solutions Engineer you will: 

1. AI Model Strategy, Selection and Implementation

  • Evaluate and select optimal model architectures (LLMs, SLMs, or traditional ML) based on mission requirements, considering tradeoffs between accuracy, latency, and cost.

  • Guide customers on "Build vs. Buy vs. Fine-tune" decisions, prioritizing open-source models (Llama, Mistral, Falcon) that can run securely within a sovereign data perimeter.

  • Experience building Agentic Workflows (AI agents that can execute API calls and multi-step tasks).

2. End-to-End Data Engineering

  • Design and implement robust data pipelines within CDP to transform "messy" legacy data into AI-ready formats.

  • Develop and optimize Vector Databases and Retrieval-Augmented Generation (RAG) architectures to ground AI responses in verified agency facts.

  • Build Data pipelines with Spark, Nifi, Kafka or other ETL tools.

3. Optimization & Performance Tuning

  • Optimize model inference for production environments using quantization, pruning, and hardware acceleration (NVIDIA GPU orchestration).

  • Implement LLMOps to monitor model performance, detect hallucination rates, and manage model versioning and drift.

4. Public Sector Advisory & Governance

  • Collaborate with the customer’s AI Center of Excellence (CoE) to establish automated guardrails for ethics, bias mitigation, and FedRAMP/IL5 compliance.

  • Translate complex technical AI concepts into mission-value briefings for GS-level stakeholders and agency leadership.

We’re excited about you if you have: (Minimum Qualifications): 

  • Experience: 5+ years in Data Engineering, Machine Learning, or Software Engineering, with at least 2 years focused on Generative AI or Deep Learning.

  • Technical Stack: Expertise in Python and deep learning frameworks (PyTorch, TensorFlow, Hugging Face).

    • Hands-on experience with Cloudera (CDP), Spark, or similar big data ecosystems.

    • Proficiency in orchestration tools like LangChain, LlamaIndex, or Haystack.

    • Experience developing visual data representations and dashboards (Django, React, or Angular)  

    • Experience using a compiled programming language, preferably one that runs on the JVM (Java, Scala, etc)

  • Data Expertise: Proven ability to build ETL/ELT pipelines and work with both SQL and NoSQL/Vector databases (e.g., Pinecone, Milvus, or PGVector).

  • Public Sector Knowledge: Understanding of government security frameworks (NIST AI RMF, FedRAMP, SRGs, STIGs).

  • Active Top Secret Security Clearance

You may also have: (Preferred Qualifications)

  • Experience fine-tuning of foundational models using techniques such as PEFT (Parameter-Efficient Fine-Tuning) and LoRA to adapt AI to domain-specific government nomenclature.

  • Experience training of specialized models on proprietary datasets while ensuring strict adherence to data privacy and sensitivity labels.

  • Experience installing and operating Cloudera Data Platform 

  • Experience installing and operating Kubernetes

  • Experience in Air-Gapped deployments and managing AI workloads in disconnected environments.

  • Advanced degree (MS or PhD) in Computer Science, Data Science, or a related field.

  • Active Counterintelligence (CI) or Full Scope (FS) Poly is required.

This role is not eligible for immigration sponsorship.

What you can expect from us:

  • Generous PTO Policy 

  • Support work life balance with Unplugged Days

  • Flexible WFH Policy 

  • Mental & Physical Wellness programs 

  • Phone and Internet Reimbursement program 

  • Access to Continued Career Development 

  • Comprehensive Benefits and Competitive Packages 

  • Paid Volunteer Time

  • Employee Resource Groups

EEO/VEVRAA

#LI-Remote

#LI-ND3

Skills Required

  • 5+ years in Data Engineering, Machine Learning, or Software Engineering, with at least 2 years focused on Generative AI or Deep Learning.
  • Expertise in Python
  • Experience with deep learning frameworks: PyTorch, TensorFlow, Hugging Face
  • Hands-on experience with Cloudera (CDP), Spark, or similar big data ecosystems
  • Proficiency in orchestration tools like LangChain, LlamaIndex, or Haystack
  • Experience developing visual data representations and dashboards (Django, React, or Angular)
  • Experience using a compiled JVM language (Java, Scala)
  • Proven ability to build ETL/ELT pipelines and work with SQL and NoSQL/Vector databases (Pinecone, Milvus, PGVector)
  • Understanding of government security frameworks (NIST AI RMF, FedRAMP, SRGs, STIGs)
  • Active Top Secret Security Clearance
  • Active Counterintelligence (CI) or Full Scope (FS) Polygraph
  • Experience fine-tuning foundational models using PEFT and LoRA
  • Experience training specialized models on proprietary datasets with strict data privacy/sensitivity controls
  • Experience installing and operating Cloudera Data Platform
  • Experience installing and operating Kubernetes
  • Experience in air-gapped deployments and managing AI workloads in disconnected environments
  • Advanced degree (MS or PhD) in Computer Science, Data Science, or a related field

Cloudera Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Cloudera and has not been reviewed or approved by Cloudera.

  • Leave & Time Off Breadth Time off includes generous PTO and holidays plus recurring company‑wide Unplugged Days that provide regular recharge time. Volunteer time off and flexible scheduling options further expand usable leave.
  • Healthcare Strength Health coverage spans comprehensive medical, dental, and vision alongside EAP, wellness sessions, and U.S. gym reimbursement. These elements position healthcare as a strong anchor within the package.
  • Strong & Reliable Incentives Compensation often includes variable incentives and long‑term incentive programs with annual bonuses commonly offered. Sales and other revenue roles show competitive on‑target earnings when goals are met, reinforcing the incentive structure.

Cloudera Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Palo Alot, CA
3,092 Employees
Year Founded: 2008

What We Do

At Cloudera, we believe that data can make what is impossible today, possible tomorrow. We empower people to transform complex data into clear and actionable insights. Cloudera delivers an enterprise data cloud for any data, anywhere, from the Edge to AI. Powered by the relentless innovation of the open source community,

Similar Jobs

Deepgram Logo Deepgram

Technical Recruiter

Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Conversational AI
Remote
USA
150 Employees
120K-180K Annually

Inspiren Logo Inspiren

Platform Engineer

Artificial Intelligence • Hardware • Healthtech • Software
Easy Apply
In-Office or Remote
3 Locations
150 Employees
180K-200K Annually

Vannevar Logo Vannevar

InfoSec Engineer - Compliance (ATO)

Artificial Intelligence • Machine Learning • Software • Defense
Remote
USA
225 Employees

Vannevar Logo Vannevar

Mission Systems Director

Artificial Intelligence • Machine Learning • Software • Defense
Remote
USA
225 Employees
250K-500K Annually

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account