Senior Data Engineer

Posted 4 Days Ago
Hiring Remotely in United States
Remote
Senior level
Artificial Intelligence • Software • Big Data Analytics
Drive Growth and Revenue: AI-Powered RFP Automation and Knowledge Management.
The Role
As a Senior Data Engineer, you'll build and optimize data pipelines, manage Azure-based data infrastructure, and ensure compliance while collaborating with ML and product teams.
Summary Generated by Built In

Are you passionate about pushing the boundaries of technology in the Gen AI space? Rohirrim is seeking a Senior Data Engineer to mentor engineers, provide technical direction, and drive the development of cutting-edge applications. If you thrive in a fast-paced environment and enjoy leading by example while staying hands-on with coding, we want to hear from you!

Why Join Rohirrim?

At Rohirrim, we're at the forefront of innovation in the Gen AI space. Joining our team means being part of a dynamic environment where your leadership and expertise make a tangible impact on our products and team growth.


About the Role

As a Data Engineer at Rohirrim, you’ll design, build, and optimize the data pipelines and infrastructure that fuel our AI products. You’ll work closely with our AI/ML teams, product teams, customer success managers,and security/compliance partners to transform complex enterprise datasets into clean, reliable, structured foundations for Rohan deployments — especially in controlled, secure, or GovTech environments.

You’ll help us scale:

  • ingestion pipelines
  • vector stores
  • embedding workflows
  • metadata & document-processing frameworks
  • Azure-native data services

…in a way that is fast, compliant, and deeply reliable.



What You’ll Do
  • Blend capabilities in software engineering, data engineering and devops to build and maintain scalable data ingestion pipelines for structured/unstructured data (documents, PDFs, knowledge bases, enterprise systems, APIs, etc.).
  • Develop and operate ETL/ELT workflows that ensure data integrity, security, and lineage.
  • Implement and optimize vector database systems and embeddings pipelines supporting RAG and AI features.
  • Collaborate with ML engineers to support model training, evaluation, and feature engineering pipelines.
  • Architect and manage Azure-based data infrastructure (e.g., Azure Functions, Azure Storage, Azure SQL, Azure Kubernetes Service, Azure OpenAI integrations).
  • Build internal tools for metadata extraction, OCR/document parsing, text normalization, and validation.
  • Ensure pipelines meet compliance, auditability, and security requirements (SOC2, FedRAMP, etc.).
  • Support customer-specific data onboarding workflows for government + enterprise deployments.
  • Monitor and improve pipeline performance, reliability, and scalability.



What Makes You a Great Fit
  • 10+ years in Data Engineering, Software Engineering, or ML/Data Infrastructure roles.
  • Strong experience with Python, SQL, and modern data engineering tools (Airflow, Dagster, dbt, Prefect, etc.).
  • Experience building large-scale document extraction ETL pipelines (OCR, PDF parsing, metadata extraction, NLP preprocessing).
  • Proficiency with Kubernetes, Docker, and containerized data pipelines deployed on Azure, AWS and/or Google Cloud
  • Hands-on experience with relational databases (Postgres, SQL Server, MySQL) and non-relational systems such as Elasticsearch, Redis, and graph databases
  • Experience with document-heavy or text-heavy data processing (OCR, parsing, NLP preprocessing).
  • Strong data quality, governance, lineage, and validation mindset.
  • Excellent communicator who can align with ML, engineering, and product teams.



Bonus Skills
  • Experience building or supporting GenAI / LLM / RAG pipelines.
  • Experience with Azure OpenAI Service.
  • Experience with min.io
  • Background with knowledge graphs, semantic search, or indexing at scale.
  • Familiarity with CI/CD pipelines in Azure DevOps, GitHub Actions, or similar.

Top Skills

Airflow
Azure
Dagster
Dbt
Docker
Elasticsearch
Kubernetes
MySQL
Postgres
Prefect
Python
Redis
SQL
SQL Server
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Reston, Virginia
59 Employees
Year Founded: 2022

What We Do

Domain-Aware Generative AI Solution for RFP Automation and Knowledge Management:

Rohan uses your data to provide knowledge-driven responses. It eliminates the need to sift through unstructured data for RFP information and ensures quick, accurate responses in your company's voice. It also increases team productivity while lowering burnout rates.

Secure and Trustworthy Platform Designed for Fortune 1000 and Aerospace/Defense:

Our platform is purpose-built for Aerospace/Defense and Fortune 1000 companies that require secured deployments. It is hosted exclusively as a dedicated PaaS – not a multi-tenant SaaS. You can be confident that your data is under your control and that there is no risk of leakage. Our platform has low risk, simpler governance, and higher compliance.

Purpose-Built for Complex Proposal Workflows:

Our platform is purpose-built for GovCon RFP workflows and provides the depth and detail required to submit high-quality and compliant bids. It reduces training costs and time to full productivity. There's no retrofitting involved or the need to identify new processes. Our platform and flexible deployment options minimize technology acquisition roadblocks.

Enterprise-Grade Scale and Support

Our Customer Success and Training Teams were designed to meet the unique challenges and requirements of each large enterprise. Our comprehensive Generative AI training programs and Proposal Sprints ensure teams are proficient in maximizing the potential of Rohan. This bespoke approach underscores our commitment to empowering enterprises with the transformative power of our platform for their sustained growth and success.

Eliminate Costs and Increase Revenue Simultaneously!

Similar Jobs

Atlassian Logo Atlassian

Senior Data Engineer

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
In-Office or Remote
San Francisco, CA, USA
11000 Employees
146K-229K Annually

People Inc. Logo People Inc.

Senior Data Engineer

AdTech • Consumer Web • Digital Media • eCommerce • Marketing Tech
Remote or Hybrid
2 Locations
3500 Employees
160K-180K Annually

CrowdStrike Logo CrowdStrike

Senior Data Engineer

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
2 Locations
10000 Employees
125K-180K Annually

Rain Logo Rain

Senior Data Engineer

Blockchain • Fintech • Payments • Financial Services • Cryptocurrency • Web3 • Infrastructure as a Service (IaaS)
In-Office or Remote
2 Locations
40 Employees

Similar Companies Hiring

PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees
Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account