AWS Data Architect

Hiring Remotely in United States
Remote
Senior level
Information Technology • Database • Consulting
The Role
Design and implement AWS-based data architectures and lakehouse solutions, lead data engineering teams, build ETL and real-time pipelines (Kafka/Kinesis), use DBT and Airbyte for ingestion and transformation, orchestrate workflows with Airflow/MWAA, enforce security/compliance, and guide infra automation with Terraform/CloudFormation and CI/CD.

Title – AWS Data Architect

Job Summary

We are seeking a skilled AWS Data Architect for one of our clients, a leading international sports league. The Architect will play a crucial role in designing, implementing, and maintaining data and technology solutions that align with the client's business goals and objectives. This role requires a deep understanding of AWS data services, a good understanding of AWS infrastructure and operations services, and the ability to translate business requirements into scalable and efficient solutions.

Responsibilities

Key Responsibilities

Data Architecture and Cloud Strategy:

  1. Develop and maintain a comprehensive data architecture and cloud strategy that aligns with the organization's goals and needs.
  2. Design, implement, and manage cloud-based data infrastructure on AWS, ensuring scalability, reliability, and cost-efficiency.
  3. Utilize AWS services (S3, Glue, EMR, Redshift, Lambda, Kinesis, MWAA, etc.) to build and optimize data pipelines and storage solutions.
  4. Champion the use of data lakehouse architecture and optimize its performance for analytical and operational workloads.
  5. Identify gaps and opportunities in the current system, and suggest and implement changes to optimise processes and costs.

Data Engineering:

  1. Lead and guide data engineering teams to develop, maintain, and optimize ETL processes for data ingestion, transformation, and loading.
  2. Implement real-time data processing solutions using technologies such as Apache Kafka and AWS Kinesis.
  3. Collaborate with data scientists, business stakeholders and analysts to ensure data availability and quality, enabling effective analytics and reporting.
  4. Leverage DBT for data modelling and transformation to support self-service analytics and data governance.

Data Ingestion & Integration:

  1. Architect and implement data integration solutions for API ingestion, enabling data from diverse sources to be captured, transformed, and ingested into our data lakehouse.
  2. Utilize Airbyte and custom APIs to ensure efficient, reliable, and secure data transfers.
  3. Manage data integration pipelines to support real-time and batch data processing.

Workflow Orchestration:

  1. Design, configure, and maintain workflow orchestration using Apache Airflow to automate ETL processes and data pipeline executions.
  2. Monitor and optimize job scheduling, error handling, and performance of data workflows.

Security and Compliance:

  1. Implement data security protocols, access controls, and encryption to safeguard sensitive data, especially PII.
  2. Ensure compliance with data privacy regulations and industry standards.

Collaboration and Documentation:

  1. Collaborate with cross-functional teams to understand data requirements and provide data solutions to meet their needs.
  2. Maintain comprehensive documentation for data engineering and data architecture processes and solutions.

Infra & Operations:

  1. Guide the team in setting up cloud infrastructure and automating it with tools such as Terraform, CloudFormation, and Jenkins.
  2. Guide the operations team in setting up automated monitoring and alerting mechanisms.

Qualifications

Relevant Qualifications

  • Bachelor's or higher degree in a relevant field.
  • 6+ years of proven experience in data engineering, cloud architecture, and AWS services.
  • Extensive knowledge of data lakehouse technologies, Hudi, DBT, Airbyte, Redshift, Glue, Kinesis and Apache Airflow.
  • Strong expertise in programming languages such as SQL and Python, and in processing frameworks such as PySpark.
  • Strong expertise in real-time data processing.
  • Excellent problem-solving and analytical skills.
  • Strong communication and teamwork abilities.
  • Passion for Sports/Gaming/Entertainment is preferred.

Skills Required

  • Bachelor's or higher degree in a relevant field
  • 6+ years of experience in data engineering, cloud architecture, and AWS services
  • Design and manage AWS data infrastructure (S3, Glue, EMR, Redshift, Lambda, Kinesis, MWAA)
  • Extensive knowledge of data lakehouse technologies and Hudi
  • Experience with DBT for data modelling and transformation
  • Experience with Airbyte and custom API ingestion solutions
  • Experience building real-time data processing solutions (Apache Kafka, AWS Kinesis)
  • Design and maintain workflow orchestration using Apache Airflow/MWAA
  • Strong SQL and Python skills and experience with PySpark
  • Implement data security, access controls, and compliance for PII
  • Guide cloud infrastructure automation using Terraform, CloudFormation, Jenkins
  • Excellent problem-solving, analytical, communication, and teamwork skills
  • Passion for Sports/Gaming/Entertainment

The Company
30,246 Employees
