Senior Data Engineer – AI-Driven Data Pipeline Automation

Reposted 19 Days Ago
Hiring Remotely in United States of America
Remote
128K-267K Annually
Senior level
AdTech • Digital Media • Information Technology • Other
The Role
The Senior Data Engineer leads the design and optimization of scalable data pipelines for AI-driven analytics while collaborating with cross-functional teams. Responsibilities include ensuring data integrity, automation, and compliance in cloud environments, particularly on Google Cloud Platform, to enhance decision-making processes.
Summary Generated by Built In
Yahoo serves as a trusted guide for hundreds of millions of people globally, helping them achieve their goals online through our portfolio of iconic products. For advertisers, Yahoo Advertising offers omnichannel solutions and powerful data to engage with our brands and deliver results.

A Little About Us: 

The Yahoo! Consumer Data Team manages a petabyte warehouse to glean insights on Yahoo Media products and to improve the experience for its massive user base. The team interacts and works across multiple organizations at yahoo to grow user engagement and user experience across yahoo's product portfolio. Your work will directly influence product changes and you will work with some of the brightest engineers you have known to improve the user experience on yahoo properties and contribute to company growth. Along the way, you will solve problems for an Internet Pioneer that is hard to match in the industry.
 

Summary:

The ideal candidate will have strong AI/ML experience to design, build, and optimize scalable data pipelines and infrastructure that power advanced analytic solutions. In this role, you will collaborate closely with software engineers and business stakeholders to prepare and transform large datasets, support end-to-end model development and deployment, and ensure robust, efficient, and secure data flows. You will leverage your expertise in cloud platforms, big data tools, and machine learning frameworks to drive innovation and deliver actionable insights that advance our organization’s AI initiatives and business objectives.

Responsibilities:

  • Design, build, and maintain scalable data pipelines and ETL processes to support machine learning and AI initiatives on Google Cloud Platform (GCP).

  • Implement and optimize data storage solutions using GCP services such as BigQuery, Cloud Storage, and Dataflow.

  • Ensure data quality, integrity, and security throughout the data lifecycle.

  • Collaborate with analysts and business stakeholders to understand data requirements and deliver actionable insights.

  • Monitor, troubleshoot, and maintain the health and performance of cloud-based data infrastructure.

  • Automate manual processes and repetitive tasks to improve efficiency and reduce errors.

  • Apply data governance and compliance best practices to protect sensitive information and meet regulatory standards.

  • Stay current with new GCP features, tools, and best practices to continuously enhance data management capabilities.

  • Document solutions, processes, and architectural decisions to facilitate knowledge sharing and maintainability.

Qualifications:

  • BS or MS in Computer Science or a related major, or equivalent experience

  • 7+ years of software engineering experience, with a strong emphasis on system design and backend development.

  • 2+ years hands-on experience with Google Cloud Platform ecosystem (BigQuery, Dataproc, Composer, Dataflow, Data Catalog, Observability) or AWS equivalent.

  • Exposure to AI-assisted development tools such as Claude, GitHub Copilot, Cursor, or similar is highly desirable.

  • Proven ability to design, build, and maintain data pipelines that support machine learning and AI model development, training, and deployment.

  • Fluency with at least one object-oriented programming language from Java, Python, or Scala is highly desirable, as these skills are critical for developing robust applications and managing data workflows effectively. SQL proficiency is also valued for database operations.

  • Experience with Google Analytics 360 is a plus.

  • Familiarity with data security, compliance, and governance best practices.

  • Strong problem-solving skills, attention to detail, and ability to work collaboratively with cross-functional teams.

  • Excellent communication skills and ability to tell insightful stories using data and also manage communication within internal teams and stakeholders.

The material job duties and responsibilities of this role include those listed above as well as adhering to Yahoo policies; exercising sound judgment; working effectively, safely and inclusively with others; exhibiting trustworthiness and meeting expectations; and safeguarding business operations and brand integrity.

At Yahoo, we offer flexible hybrid work options that our employees love! While most roles don’t require regular office attendance, you may occasionally be asked to attend in-person events or team sessions. You’ll always get notice to make arrangements. Your recruiter will let you know if a specific job requires regular attendance at a Yahoo office or facility. If you have any questions about how this applies to the role, just ask the recruiter!

Yahoo is proud to be an equal opportunity workplace. All qualified applicants will receive consideration for employment without regard to, and will not be discriminated against based on age, race, gender, color, religion, national origin, sexual orientation, gender identity, veteran status, disability or any other protected category. Yahoo will consider for employment qualified applicants with criminal histories in a manner consistent with applicable law. Yahoo is dedicated to providing an accessible environment for all candidates during the application process and for employees during their employment. If you need accessibility assistance and/or a reasonable accommodation due to a disability, please submit a request via the Accommodation Request Form (www.yahooinc.com/careers/contact-us.html) or call +1.866.772.3182. Requests and calls received for non-disability related issues, such as following up on an application, will not receive a response.

We believe that a diverse and inclusive workplace strengthens Yahoo and deepens our relationships. When you support everyone to be their best selves, they spark discovery, innovation and creativity. Among other efforts, our 11 employee resource groups (ERGs) enhance a culture of belonging with programs, events and fellowship that help educate, support and create a workplace where all feel welcome.

The compensation for this position ranges from $128,250.00 - $266,875.00/yr and will vary depending on factors such as your location, skills and experience.The compensation package may also include incentive compensation opportunities in the form of discretionary annual bonus or commissions. Our comprehensive benefits include healthcare, a great 401k, backup childcare, education stipends and much (much) more.

Currently work for Yahoo? Please apply on our internal career site.

Skills Required

  • 7+ years of software engineering experience
  • 2+ years hands-on experience with Google Cloud Platform
  • BS or MS in Computer Science or related major
  • Fluency with an object-oriented programming language (Java, Python, Scala)
  • Proven ability to design and maintain data pipelines
  • Familiarity with data security and governance best practices
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Sunnyvale, CA
10,001 Employees

What We Do

Yahoo is a global media and tech company that connects people to their passions. We reach nearly 900 million people around the world, bringing them closer to what they love—from finance and sports, to shopping, gaming and news—with the trusted products, content and tech that fuel their day. For partners, we provide a full-stack platform for businesses to amplify growth and drive more meaningful connections across advertising, search and media.

Similar Jobs

Easy Apply
Remote
United States
350 Employees
146K-164K Annually
In-Office or Remote
Chicago, IL, USA
1805 Employees
62K-85K Annually
Easy Apply
Remote
United States
900 Employees
110K-122K Annually

Agero Logo Agero

Data Analyst

Automotive • Big Data • Insurance • Software • Transportation
Easy Apply
Remote or Hybrid
14 Locations
1600 Employees
110K-135K Annually

Similar Companies Hiring

ClickMint Thumbnail
AdTech • eCommerce • Marketing Tech • Generative AI
Malibu, CA
9 Employees
Standard Template Labs Thumbnail
Artificial Intelligence • Information Technology • Software
New York, NY
25 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account