Senior Data Engineer

Posted 5 Days Ago
Hiring Remotely in United States
Remote
Senior level
Artificial Intelligence • Information Technology • Consulting
The Role
Design, build, and maintain secure, scalable cloud-native data platforms (lakehouse/data mesh) supporting batch, streaming, ML, GIS, and operational workloads. Lead ingestion/transformation pipelines, implement distributed/event-driven processing (Spark/Kafka), enforce governance, observability, and SLAs, and mentor engineers while partnering with data scientists and stakeholders.
About Sand

Sand Technologies is a fast-growing enterprise AI company that solves real-world problems for large blue-chip companies and governments worldwide.

We’re pioneers of meaningful AI: our solutions go far beyond chatbots. We are using data and AI to solve the world’s biggest issues in telecommunications, sustainable water management, energy, healthcare, climate change, smart cities, and other areas that have a real impact on the world. For example, our AI systems help to manage the water supply for the entire city of London. We created the AI algorithms that enabled the 7th largest telecommunications company in the world to plan its network in 300 cities in record time. And we built a digital healthcare system that enables 30m people in a country to get world-class healthcare despite a shortage of doctors.

We’ve grown our revenues by over 500% in the last 12 months while winning prestigious scientific and industry awards for our cutting-edge technology. We’re underpinned by over 300 engineers and scientists working across Africa, Europe, the UK and the US. 

About the role

Sand Technologies builds data-intensive systems that enable insight, intelligence, and informed decision-making. We typically work with hybrid data architectures: centralised lakehouses or data warehouses with distributed data products on top. Our stack includes tools such as Databricks, dbt, Docker, Python, SQL, and PySpark. We primarily work in cloud-native environments across AWS, Azure, and GCP, while occasionally supporting self-hosted open-source deployments.
A Senior Data Engineer is responsible for designing, building, and maintaining the scalable data architecture that underpins our decision-support applications. These applications range from traditional analytics (data warehouse) to machine learning and digital twins, and on occasion include serving LLMs and agentic workflows, so your data architecture should support a variety of use cases. You will work closely with cross-functional teams and contribute to the strategic direction of our data initiatives.
We operate with a strong code-first, “data as a product” mindset, where testing, reliability, observability, and performance are non-negotiable.


Specific Responsibilities
  • Architect and build a secure, scalable urban data platform integrating multi-agency and infrastructure datasets at scale.
  • Design resilient cloud-native architectures supporting batch, streaming, and near-real-time operational workloads.
  • Lead development of high-performance ingestion and transformation pipelines across legacy systems, APIs, IoT/telemetry, and structured data sources.
  • Implement distributed and event-driven processing systems (e.g., Spark, Kafka or equivalent) for large-scale analytical and operational use cases.
  • Establish platform reliability standards, including observability, automated data quality validation, lineage, monitoring, and defined SLAs/SLOs.
  • Design and enforce strong data governance and access control frameworks, including RBAC, encryption, auditability, and secure data handling practices.
  • Build modern lakehouse or equivalent architectures that enable advanced analytics, GIS, and production-grade machine learning.
  • Partner closely with data scientists, ML engineers, and senior stakeholders to operationalize AI and analytics at scale.
  • Optimize platform performance, scalability, and cost efficiency as adoption grows.
  • Contribute to long-term architectural direction and mentor engineering team members.
Requirements - Essential
  • 6+ years designing and operating large-scale semi-distributed data platforms (hybrid centralised and distributed) in cloud or hybrid environments.
  • Proven experience architecting modern data systems (lakehouse, data mesh, or equivalent) supporting both analytical (descriptive and predictive) and operational workloads.
  • Deep hands-on expertise with distributed processing frameworks (e.g., Spark) and streaming/event systems (e.g., Kafka or similar).
  • Strong experience building secure, governed data environments with robust access controls, encryption, lineage, and audit capabilities.
  • Experience designing secure data platforms in regulated or government environments, with strong understanding of compliance, auditability, and data protection standards.
  • Experience integrating heterogeneous data sources, including legacy systems, APIs, telemetry/IoT systems, and relational databases.
  • Demonstrated ability to design highly available, observable, production-grade data systems.
  • Experience enabling machine learning and advanced analytics through robust data infrastructure and feature pipelines.
  • Strong proficiency in Python, SQL, and ideally dbt, with a track record of writing clean, production-quality code.
  • Experience deploying and operating solutions in AWS, Azure, or GCP, including CI/CD and infrastructure-as-code, is beneficial.
  • Ability to operate effectively in complex, multi-stakeholder environments.
  • Strong systems-thinking mindset with a focus on scalability, modularity, and long-term platform evolution.
  • Experience designing data platforms in U.S. public sector or highly regulated environments, with working knowledge of applicable federal and state data privacy and security requirements (e.g., HIPAA, CJIS, FERPA, state-level privacy acts), and the ability to embed compliance, auditability, and data governance principles into architectural design.
Location

We are looking to hire in specific areas in the US. These include:

  • DMV (DC, Maryland, Virginia)
  • Austin
  • New York
  • the Midwest
Personal Attributes
  • Client Centricity & Integrity: We let Our Clients Run the Company, Surf Like Yvon to stay true to our values, and Play the Long Game with integrity.
  • Collaboration and Inclusion: We live by Each One, Teach Ten and ensure Everybody is Welcome.
  • Operational Excellence and Simplicity: We K.I.S.S. by keeping things simple while always striving to Raise the Bar.
  • Action, Ownership, and Execution: We Decide, Get Stuff Done, and Do Hard Things with accountability.
  • Growth, Innovation, and Resilience: We Choose Growth, Pioneer boldly, and remember There is No Failure.
 

Because the role involves a considerable amount of virtual work and interaction with colleagues and customers in different locations internationally, the successful applicant must have the drive and work ethic to succeed both in small in-person teams and in larger virtual efforts. Self-drive to communicate constantly using web collaboration and video conferencing is essential.

Top Skills

AWS
Azure
CI/CD
Data Mesh
Databricks
dbt
Docker
Encryption
Feature Pipelines
GCP
GIS
Infrastructure as Code
IoT
Kafka
Lakehouse
PySpark
Python
RBAC
Spark
SQL
Telemetry

The Company
684 Employees

What We Do

Sand Technologies is a global AI solutions company that solves enterprise- and city-wide challenges with advanced AI and data. For the past 10 years, we have designed and deployed AI, data, software and IoT projects in the telecom, utilities, healthcare and insurance industries. Global enterprises trust Sand Technologies to provide the resources they need to close the gap between their current reality and digital future.
