Data Engineer, Marketing Technology

Alpharetta, GA, USA
Hybrid
Senior level
Software
The Role
Build, operate, and monitor ETL/ELT pipelines that move data from marketing, sales, and operational systems into Databricks. Implement dimensional models, identity resolution, data quality checks, API integrations, and scheduled/event-driven syncs to support segmentation, attribution, and campaign activation. Maintain documentation, alerting, and runbooks.
Summary Generated by Built In

About Us:

Foxit is remaking the way the world interacts with documents through advanced PDF technology and tools. We are a leading global software provider of fast, affordable, and secure PDF solutions that are used by millions of people worldwide. Winner of numerous awards, Foxit has customers in more than 200 countries and operates globally. We have a complete product line and an exciting and aggressive development schedule. Our proven PDF technology is disrupting the status quo and has accelerated our company growth. We are proud to count Google and Amazon among our customers, and with your skills and help, we plan to add many more. Foxit has offices all over the world, including locations in the US, Asia, Europe, and Australia.
 
For more information, visit us at www.foxit.com

 

About the Role

We are looking for an experienced Data Engineer to own the data pipelines that power our go-to-market systems. While this role is aligned to the marketing department's priorities, you will work day-to-day within our Business Applications & Data Analytics team, following the team's established development standards, architecture patterns, and code review processes. This ensures the pipelines you build are consistent with our broader data platform and maintainable by the wider engineering team.

 

Your primary focus will be marketing-related data needs - working closely with demand gen, product marketing, sales operations, and digital teams to understand their requirements, then building and maintaining the pipelines and integrations that deliver on them.

 

This is a hands-on execution role. You will build and operate the data infrastructure that connects our marketing automation platform (HubSpot), CRM (Salesforce), data warehouse (Databricks), licensing system, payment platform, and other source systems. Your work will directly support marketing's ability to segment audiences, measure attribution, and run data-driven campaigns at scale.

 

What You'll Do

Data Pipeline Development & Operations

• Design, build, and maintain ETL/ELT pipelines, building upon and further optimizing our existing medallion architecture (Bronze → Silver → Gold) to move data between source systems (Salesforce CRM, HubSpot, NetSuite, Stripe, DealHub, LMS) and our Databricks data warehouse.

• Build pipelines using PySpark and SQL in Databricks notebooks, following established development standards for naming, project structure, and layer-appropriate transformations.

• Own the data sync layer between Databricks and HubSpot — enrichment flows inbound to HubSpot (license status, renewal dates, subscription state, firmographic data) and marketing engagement data flowing back to Databricks (email events, workflow enrollment, lifecycle changes).

• Build and maintain Exchange layer pipelines that curate data for external system consumption, formatting and validating data to meet target system requirements.

• Build and maintain scheduled batch jobs and event-driven integrations using APIs (REST, webhooks, OAuth).

• Monitor pipeline health, set up alerting for failures and data quality degradation, and own incident response when syncs break.

• Maintain documentation of data flows, integration architecture, and troubleshooting runbooks.

 

Data Modeling & Quality

• Build and maintain dimensional models in Databricks (fact tables, dimension tables, bridge tables) following our data warehouse object type definitions and naming standards.

• Collaborate with stakeholders and data analysts to build curated, business-ready tables and datamarts that apply business logic, KPI calculations, and aggregations optimized for analytics and campaign activation.

• Implement identity resolution and deduplication logic to produce unified customer profiles from multiple source systems.

• Establish data validation rules, quality checks, and monitoring to ensure accuracy and freshness of data flowing into marketing systems.

• Normalize disparate data sources into clean centralized schemas with proper type enforcement, deduplication, and null handling.

 

Marketing Data & Segmentation Support

• Ensure the data infrastructure supports audience segmentation, including firmographic, behavioral, and engagement signals.

• Build the data layer that powers lifecycle marketing - triggered campaigns, dynamic journey branching, and personalization based on enriched customer profiles.

• Support marketing and demand gen teams with reliable, accessible data for building audience targets in HubSpot.

• Maintain data flows for email deliverability, subscription management, and suppression list synchronization.

 

Integration Development

• Build and maintain API integrations between marketing, sales, and operational systems using Python and SQL.

• Implement field-level transformation logic, sync orchestration, and error handling for system-to-system data flows.

• Support website form and lead capture data flows - ensuring clean handoff from web properties into HubSpot and Databricks.

• Work with third-party enrichment providers (firmographic, intent, technographic) to integrate enrichment data into automated workflows.

 

Reporting & Attribution

• Build and maintain the data infrastructure that supports campaign attribution, channel performance analysis, and funnel reporting.

• Ensure accurate data for conversion analytics, lead source tracking, and marketing ROI measurement.

• Support centralized reporting by routing marketing engagement data back into Databricks for cross-functional analysis.

What You Bring

 

Required:

• 5+ years of experience in data engineering, with hands-on pipeline development and production operations.

• Strong proficiency in SQL and Python/PySpark for data pipeline development.

• Experience building and maintaining ETL/ELT pipelines using Databricks, dbt, Airflow, Azure Data Factory, or equivalent.

• Hands-on experience with cloud data platforms - Databricks, Snowflake, BigQuery, or Redshift.

• Solid understanding of dimensional data modeling - fact tables, dimension tables, schema design, and data warehouse concepts.

• Experience with medallion or layered data architectures (raw → cleansed → business-ready), Kimball-style star schemas, and one-big-table approaches to data modeling.

• Working knowledge of API integration patterns - REST, webhooks, OAuth, batch sync architectures.

• Experience with CRM platforms (Salesforce, HubSpot, or similar), marketing automation systems, and CPQ/quoting tools (DealHub or similar).

• Bachelor's degree in Computer Science, Information Systems, or equivalent industry experience.

 

Preferred:

• Experience with Databricks (Delta Lake, PySpark, Unity Catalog).

• Familiarity with HubSpot APIs and data model.

• Experience with identity resolution and customer data deduplication across multiple source systems.

• Exposure to marketing data concepts — lead scoring, audience segmentation, campaign attribution, lifecycle stages.

• Experience with Azure cloud services (Azure Functions, Azure DevOps, Azure Data Factory).

• Knowledge of data security and privacy practices, particularly regarding PII handling.

• Experience with code review processes and development standards compliance in a collaborative data engineering team.

 

What Sets You Apart

• You've built and operated production data pipelines that marketing teams depend on daily - you understand the impact of data freshness and accuracy on campaign execution.

• You're comfortable working within a marketing department and can translate data requests from non-technical stakeholders into pipeline requirements.

• You take ownership of pipeline reliability - building monitoring and alerting proactively rather than waiting for someone to report a problem.

• You've worked with multiple data sources and know how to handle the messiness of real-world identity resolution and deduplication.

 

Why Join Us

• High-impact work. Your pipelines will directly power how our marketing engine operates and scales.

• Modern stack. Databricks, Delta Lake, PySpark, HubSpot, Python, SQL - you'll work with current tools, not legacy systems.

• Room to build. We're investing in our data infrastructure as part of a major platform migration, and you'll shape how it's built.

• Collaborative environment. You'll work closely with marketing, sales, and IT teams - visible, cross-functional work without being siloed.

 

Foxit is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.


The Company
HQ: Fremont, CA
460 Employees
Year Founded: 2001

What We Do

Foxit is a leading software provider of fast, affordable, and secure PDF solutions. Businesses and consumers increase productivity by using Foxit's cost-effective products to securely work with PDF documents and forms. Foxit is the #1 pre-installed PDF software, shipped on one-third of all new Windows PCs, including those from HP®, Acer, and ASUS®. Foxit's Software Development Kits (SDKs) help developers reduce costs and improve time to market by easily integrating industry-leading PDF technology into application workflows. This is the same underlying technology that powers Google's open-source PDFium project. Winner of numerous awards, Foxit has over 650 million users and has sold to over 425,000 customers, ranging from SMBs to global enterprises, located in more than 200 countries. Since Foxit products are ISO 32000-1/PDF 1.7 standard compliant, they are compatible with your existing PDF documents and forms.

Foxit's Mission: Enabling people to create, collaborate, share, and use documents on any device.

Foxit's Vision: Foxit on every device.

To learn more about Foxit:

• Official Website: https://www.foxit.com
• Facebook: https://www.facebook.com/foxitsoftware
• Twitter: https://twitter.com/foxitsoftware
