Data Engineer

Posted 12 Days Ago
Hiring Remotely in México
Remote
Entry level
Artificial Intelligence • Cloud • Information Technology • Software
The Role
Design, build, and operate scalable AWS data platforms and pipelines (batch/streaming) for analytics and ML. Implement data ingestion, transformation, governance, knowledge graphs, monitoring, and secure access. Optimize architectures, build tooling and dashboards, integrate enterprise systems, and collaborate cross-functionally to ensure data quality, lineage, and maintainability.
Summary Generated by Built In

This position is open to candidates based in Mexico only. Applications from outside Mexico will not be considered.

Company Overview

At Avahi, we’re redefining what it means to be a premier cloud-first consulting company, recognized for our people, culture, and innovative solutions. With expertise in Managed Services, Reselling, Staffing, and Professional Services, we are dedicated to delivering exceptional value and putting customers first.

As a remote-first, global team spanning North America, Europe, and Southeast Asia, we foster a collaborative and diverse environment where professional growth, creativity, and mutual respect thrive. Guided by our values—Customer-Centricity, Collaboration, Agility, Innovation, Integrity, and Diversity & Inclusion—we empower businesses to embrace the full potential of a cloud-first approach.

Key Responsibilities

  • Design, build, and maintain scalable AWS data platforms supporting batch and streaming pipelines, analytics, and AI/ML workloads, aligned with AWS Well-Architected best practices.
  • Build and operate data ingestion, transformation, and enrichment pipelines from internal systems and external APIs, handling structured, semi-structured, unstructured, and graph data.
  • Implement data normalization workflows to ensure consistent schemas, high data quality, and reliable analytics, BI, and ML use cases.
  • Design and enforce data governance including cataloging, lineage, access control, and auditability.
  • Build and maintain knowledge graphs to model relationships across core business entities, enabling advanced analytics and inference.
  • Identify data gaps, inconsistencies, and missing relationships using strong analytical and inference skills.
  • Integrate data from enterprise platforms such as CRM and ERP systems (Salesforce, HubSpot, SAP, NetSuite, Dynamics 365, Workday).
  • Design secure data access layers for analytics, BI, ML, and downstream applications.
  • Implement monitoring, observability, and data quality checks for freshness, completeness, and pipeline health.
  • Optimize data architectures for performance and cost efficiency using partitioning, indexing, compression, and storage tiering.
  • Build internal tooling, dashboards, and standardized scaffolding to improve visibility, maintainability, and onboarding.
  • Collaborate with cross-functional teams to deliver high-impact data solutions and share best practices, documentation, and technical guidance.

Required Skills & Qualifications

  • Strong experience designing and operating AWS data platforms, including S3, Glue, Lake Formation, Athena, Redshift, EMR, Kinesis/MSK, DynamoDB, OpenSearch, and Neptune.
  • Strong Python skills for data engineering, focused on modular, testable, and maintainable code.
  • Solid understanding of distributed data systems, including batch and streaming pipelines, fault tolerance, idempotency, and event-driven architectures.
  • Experience with data warehouse and lakehouse architectures, ETL/ELT pipelines, and analytical query engines.
  • Hands-on experience with Spark, Hadoop, Hive, or Flink.
  • Strong data modeling skills, including normalized, denormalized, and graph-based models, with safe schema evolution.
  • Advanced SQL skills for analytics and data engineering, including window functions, CTEs, and query optimization.
  • Experience integrating external APIs and enterprise systems, especially CRM and ERP platforms.
  • Knowledge of data governance, security, and compliance, including encryption, access control, and audit logging.
  • Experience implementing monitoring, observability, and data quality checks using CloudWatch and CloudTrail. 
  • Comfort with Infrastructure as Code using CloudFormation or Terraform.
  • Strong end-to-end ownership mindset, with a focus on scalability, reliability, and long-term maintainability.
  • Professional-level English communication skills, able to explain data architectures and trade-offs to technical and non-technical stakeholders.

Why Work Here

  1. Remote-First Flexibility: 
    Enjoy work-life harmony in a remote-first environment that allows you to work from anywhere. 
  2. Innovative Culture: 
    We embrace a startup mindset, encouraging creativity, agility, and growth. Be part of a team that explores cutting-edge technology and drives impactful solutions. 
  3. Career Development: 
    Avahi is committed to your growth, offering mentorship and opportunities to advance your career.
  4. Purpose-Driven Mission: 
    Join us in making a difference. Avahi is dedicated to championing diversity, supporting women in tech, and fostering sustainable practices. 
  5. Global Collaboration: 
    Work alongside a diverse, talented team, sharing insights and collaborating to create innovative solutions that make a real impact. 

Join Avahi and make an impact in a fast-paced, customer-focused environment with abundant opportunities for growth.

Accessibility and Inclusivity Statement

At Avahi, we are committed to fostering a workplace that celebrates diversity and inclusivity. We welcome applicants from all backgrounds, experiences, and perspectives, including those from underrepresented communities.

We are proud to be an equal opportunity employer, providing a fair and accessible recruitment process for all candidates. If you require accommodations at any stage of the application or interview process, please let us know, and we will work to meet your needs.

Top Skills

S3, Glue, Lake Formation, Athena, Redshift, EMR, Kinesis, MSK, DynamoDB, OpenSearch, Neptune, Python, Spark, Hadoop, Hive, Flink, SQL, CloudWatch, CloudTrail, CloudFormation, Terraform, Salesforce, HubSpot, SAP, NetSuite, Dynamics 365, Workday

The Company
HQ: San Francisco, California
65 Employees
Year Founded: 2020

What We Do

Avahi is a trusted AWS Premier Partner dedicated to helping Small and Medium Businesses (SMBs) transform their operations and achieve significant growth through the strategic implementation of Artificial Intelligence.

While our team excels in architecting and operating the secure, automated, and scalable AWS cloud environments essential for modern business, our primary focus now is leveraging that foundation to make AI accessible and impactful for you.

We accelerate your AI adoption with the Avahi AI Platform, built specifically for rapid deployment on AWS. This allows your SMB to quickly integrate powerful, efficiency-boosting AI tools, including:

- AI Assistants to enhance productivity
- Facial Recognition for specific applications
- Automated Summarizers to distill information
- Custom AI-Powered Analytics Dashboards for deeper insights
- Data Masking for enhanced privacy and compliance
- Structured Data Extraction to unlock value from documents

Forget the complexity often associated with AI. Consider us an integral extension of your existing team. We work closely with you – walking hand in hand – to understand your unique business needs and goals. From there, we develop and implement a custom AI strategy leveraging our platform on AWS. Our deep expertise in cloud adoption, application modernization, DevOps, and security ensures your AI solutions are not only powerful but also robust and reliable.

We are committed to helping you achieve your business goals with practical, impactful AI. Connect with Avahi today to explore how our tailored approach, powerful platform, and AWS expertise can empower your SMB's growth.
