Somnio Software

Lead AI Engineer (Generative AI & LLMOps)

Reposted 20 Hours Ago

Be an Early Applicant

Hiring Remotely in Buenos Aires, Ciudad Autónoma de Buenos Aires, ARG

Remote

Senior level

Software • App development

Digital Product Agency | Top Flutter Development Company | We Design, Build & Scale Products Across Platforms

The Role

The Lead AI Engineer will architect and implement generative intelligence, designing RAG architectures and overseeing the integration of AI services. They will mentor teams and ensure AI output safety and performance.

Summary Generated by Built In

We are looking for a visionary Lead AI Engineer to architect and implement the generative intelligence core of our upcoming project. This is not a traditional research role; we need a "Builder" who understands how to turn raw model capabilities into reliable, scalable, and cost-effective product features.

As the Lead GenAI Engineer, you will design the RAG (Retrieval-Augmented Generation) architectures, select the appropriate model stacks, and ensure that our AI outputs are grounded, safe, and performant. You will work in lockstep with the Technical Leader to integrate AI services into the broader application ecosystem and mentor the team on AI engineering best practices.

MUST

8+ years of professional experience in Software Engineering, with at least 2 years of focused experience building and deploying GenAI-powered applications.
LLM Orchestration Mastery: Deep expertise in frameworks like LangChain, LlamaIndex, or Haystack for building complex chains and agents.
RAG Architecture: Proven experience implementing Retrieval-Augmented Generation, including chunking strategies, embedding models, and vector database management (e.g., Pinecone, or pgvector).
Advanced Prompt Engineering: Expertise in systematic prompt optimization, few-shot prompting, and Chain-of-Thought techniques to minimize hallucinations.
Model Integration & Selection: Deep understanding of the trade-offs between proprietary models (OpenAI, Anthropic, Gemini) and open-source models (Llama 3, Mistral) including hosting via Hugging Face or vLLM.
Python Proficiency: Expert-level Python skills, including asynchronous programming and performance optimization for data-heavy workloads.
Evaluation & Observability: Experience setting up AI evaluation frameworks (e.g., RAGAS, TruLens, or LangSmith) to measure accuracy, latency, and cost.
API & Backend Integration: Ability to design robust APIs (FastAPI/Flask) that handle the non-deterministic nature of LLMs, including streaming responses and graceful error handling.
English C1: Ability to explain complex AI concepts (like temperature, top-p, or context windows) to stakeholders and non-technical clients.

Nice to have

Fine-tuning Experience: Practical experience fine-tuning open-source models (PEFT, LoRA, QLoRA) for specific domains or style-matching.
LLMOps & Deployment: Experience with automated deployment of AI models using tools like BentoML, Modal, or AWS SageMaker.
AI Security: Knowledge of LLM-specific vulnerabilities (Prompt Injection, data leakage) and mitigation strategies.
Multi-modal AI: Experience working with Vision-Language models or Audio-to-Text/Text-to-Audio pipelines.
Product Thinking: A strong sense of "AI UX"—understanding when a feature should be an agentic workflow versus a simple deterministic function.

Skills Required

8+ years of professional experience in Software Engineering
2 years of focused experience building and deploying GenAI-powered applications
Deep expertise in frameworks like LangChain, LlamaIndex, or Haystack
Proven experience implementing Retrieval-Augmented Generation
Expertise in systematic prompt optimization and Chain-of-Thought techniques
Deep understanding of proprietary vs. open-source models
Expert-level Python skills, including asynchronous programming
Experience with AI evaluation frameworks
Ability to design robust APIs handling LLMs
Ability to explain complex AI concepts to non-technical clients

View all jobs at Somnio Software

View Somnio Software Profile

Report Job

Am I A Good Fit?

beta

Get Personalized Job Insights.

Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company

San Francisco, California

63 Employees

Year Founded: 2019

What We Do

We’re a Digital Product Agency you can grow with, globally recognized as a Top Flutter development company. We design, build, and scale digital products by combining strong product strategy, thoughtful design, and solid engineering. With 95+ experts, we build products for any screen: mobile, web, desktop, and embedded devices, always focusing on scalability, performance, and long-term maintainability. As a company focused on custom digital products, we understand that every business has unique needs. We have experience across multiple industries, including Fintech, Healthcare, Media & Entertainment, Fashion & Beauty, Retail, Gastronomy, and Hospitality, among others. Our services include: ► Full Product Development - From concept to reality, our team of experts combines technical prowess with a keen eye for design, ensuring that your cross-platform app stands out amidst a sea of competitors. ► Product Discovery - Navigate the market with confidence. We’ll guide you through product discovery, unlocking valuable insights and shaping products that resonate with your target audience. ► Staff Augmentation - Our expert team of Flutter developers will seamlessly integrate with your team to help you achieve your development goals and meet your deadlines. Whether you want to create a product from scratch or you need an addition to your in-house team, we are your trusted tech partner. 📩 Contact us at [email protected] and let's get started! 👉 Check out some of our success cases here https://somniosoftware.com/our-work