Data Engineer (LLM Applications)

Job Posted 25 Days Ago Posted 25 Days Ago
Be an Early Applicant
Mexico, Cuauhtémoc, Ciudad de México
Senior level
Sharing Economy
The Role
The Data Engineer will develop and implement LLM-based applications, manage data pipelines, integrate models, and collaborate with teams on new features.
Summary Generated by Built In

About Fusemachines
Fusemachines is a leading provider of AI strategy, talent, and education services. Founded by Dr. Sameer Maskey, an Adjunct Associate Professor at Columbia University, our mission is to democratize AI. With a presence in four countries—Nepal, the United States, Canada, and the Dominican Republic—and a team of over 350 full-time employees, we leverage our global AI expertise to drive innovation and transformation for businesses worldwide.
This is a hybrid role that requires on-site presence for 2-3 days each week OR remote from other cities in Mexico.
About the role

We are looking for a skilled Data Engineer with a background supporting LLM applications to join our team. You will work closely with data scientists and be responsible for developing and implementing large language model (LLM)-based applications. This includes working with both proprietary and open-source models and leveraging frameworks such as LangChain to ensure seamless integration and deployment.

Responsibilities

  • Develop and implement applications that interact with LLM models.
  • Build RAG-based applications.
  • Work with vector databases for LLM-based applications.
  • Integrate models with existing systems and APIs.
  • Develop and maintain production-quality data pipelines and ETL processes.
  • Preprocess and manage data for training and deployment.
  • Collaborate with cross-functional teams to define, design, and deploy new features.
  • Write clean, maintainable, and efficient code.
  • Document development processes, code, and APIs.

Requirements

  • 5+ years of experience in data engineering, with strong expertise in Python, AWS and APIs. 
  • Proven experience in developing and deploying machine learning APIs.
  • Experience in building scalable applications capable of handling large volumes of data.
  • Strong knowledge of API integration (RESTful, GraphQL).
  • Experience with data preprocessing, SQL, and NoSQL databases, as well as vector stores (e.g., Postgres, MySQL, Solr, Elasticsearch, OpenSearch).
  • Familiarity with deployment tools (Docker, Kubernetes).
  • Experience with DevOps tools such as Jenkins, Terraform, or Cloud Formation templates is a plus.
  • Strong problem-solving and communication skills.
  • Experience with distributed computing technologies such as Spark, Hadoop, or EMR is preferred.
  • Ability to work effectively in an agile team environment.

Preferred Qualifications

  • Degree in Computer Science, Data Science, or a related field.
  • Certifications in machine learning, data science, or cloud computing.


Equal Opportunity Employer: Fusemachines is committed to fostering a diverse and inclusive workplace. We welcome applications from all qualified individuals regardless of race, color, religion, sex, sexual orientation, gender identity, national origin, age, genetic information, disability, protected veteran status, or any other legally protected status.
 

Top Skills

APIs
AWS
Cloud Formation
Docker
Emr
GraphQL
Hadoop
Jenkins
Kubernetes
NoSQL
Python
Restful
Spark
SQL
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: New York City, NY
428 Employees
On-site Workplace
Year Founded: 2013

What We Do

A 10+ year old AI company offering cutting-edge AI products and solutions across industries.

With over a decade of experience, we help companies in their AI Transformation journey with our suite of AI Products and AI Solutions supported by our global AI Talent from underserved communities.

On a mission to #DemocratizeAI, we aim to bridge the gap between AI advancement and global impact, bringing the most advanced technology solutions to the world.

Similar Jobs

JumpCloud Logo JumpCloud

Data Engineer, Business Analytics- Mexico

Cloud • Information Technology • Security • Software
Easy Apply
Remote
4 Locations
800 Employees

Takeda Logo Takeda

Senior Data Engineer - LIMS

Healthtech • Software • Analytics • Biotech • Pharmaceutical • Manufacturing
Hybrid
Delegación Cuajimalpa de Morelos, Cuajimalpa de Morelos, Ciudad de México, MEX
50000 Employees

Takeda Logo Takeda

Database Engineer

Healthtech • Software • Analytics • Biotech • Pharmaceutical • Manufacturing
Hybrid
Delegación Cuajimalpa de Morelos, Cuajimalpa de Morelos, Ciudad de México, MEX
50000 Employees

Takeda Logo Takeda

Senior Data Engineer

Healthtech • Software • Analytics • Biotech • Pharmaceutical • Manufacturing
Hybrid
Delegación Cuajimalpa de Morelos, Cuajimalpa de Morelos, Ciudad de México, MEX
50000 Employees

Similar Companies Hiring

Cargill Thumbnail
Transportation • Sharing Economy • Logistics • Industrial • Greentech • Food • Agriculture
Wayzata, MN
155000 Employees
Taskrabbit Thumbnail
Software • Sharing Economy • Information Technology • eCommerce
IT
450 Employees
Federal Reserve Bank of Chicago Thumbnail
Social Impact • Sharing Economy • Payments • Fintech • Agency
Chicago, IL
1515 Employees
Not Eligible
Save
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account