The Role
Design, fine-tune, and deploy LLMs for business applications; build low-latency inference pipelines; evaluate and integrate open-source models or APIs; collaborate on production deployments; run evaluation frameworks and monitor models for continuous improvement.
Summary Generated by Built In
At Scope Merge, we connect top Tunisian engineers with leading European companies. We offer long-term roles, international exposure, and above-market compensation. You’ll work on exciting international projects while we handle employment, payroll, and benefits.
We are hiring a Senior AI Engineer with deep expertise in Large Language Models (LLMs) and client facing to join a fast-growing European startup applying cutting-edge AI to solve real-world problems. You’ll work on the development, fine-tuning, and deployment of LLMs in production.
Tasks
* Design, train, and fine-tune LLMs for specific business use cases.
* Build and optimize inference pipelines for LLM-based applications, ensuring low-latency and scalability.
* Evaluate and integrate open-source LLMs (e.g., LLaMA, Mistral, Falcon) or APIs (e.g., OpenAI, Anthropic) depending on use case and cost constraints.
* Collaborate with backend engineers to deploy models efficiently using tools like Triton, vLLM, or ONNX Runtime.
* Design and run evaluation frameworks (e.g., prompt quality, hallucination detection, latency).
* Monitor models in production and implement mechanisms for feedback loops and continuous improvement.
* Stay up to date with advances in generative AI, open-source LLM tooling, and fine-tuning strategies.
Requirements
* 4+ years of experience in applied ML or NLP, with at least 1–2 years focused on LLMs.
* Strong knowledge of transformer architectures and experience working with model libraries like Hugging Face Transformers, LangChain, or LLM orchestration tools.
* Proven experience deploying LLMs into production (custom or API-based) and optimizing them for inference.
* Familiarity with techniques such as LoRA, QLoRA, PEFT, RAG, or prompt engineering.
* Solid Python skills, especially in ML stack (e.g., PyTorch, TensorFlow, FastAPI for serving).
* Experience working with cloud infrastructure (AWS/GCP) and containerized deployments (Docker, Kubernetes).
* Bonus: Experience with data pipelines, vector databases (e.g., Weaviate, Pinecone, FAISS), or hybrid search.
Benefits
* Work on international projects with top startups and tech companies.
* Collaborate with global teams and gain cross-border experience.
* Grow your skills through hands-on challenges and real-world impact.Modern offices in Lac 2, Tunis
* Supportive sick leave policy that respects your health and well-being
* Receive above-market salary and financial stability
CV in English.
We are hiring a Senior AI Engineer with deep expertise in Large Language Models (LLMs) and client facing to join a fast-growing European startup applying cutting-edge AI to solve real-world problems. You’ll work on the development, fine-tuning, and deployment of LLMs in production.
Tasks
* Design, train, and fine-tune LLMs for specific business use cases.
* Build and optimize inference pipelines for LLM-based applications, ensuring low-latency and scalability.
* Evaluate and integrate open-source LLMs (e.g., LLaMA, Mistral, Falcon) or APIs (e.g., OpenAI, Anthropic) depending on use case and cost constraints.
* Collaborate with backend engineers to deploy models efficiently using tools like Triton, vLLM, or ONNX Runtime.
* Design and run evaluation frameworks (e.g., prompt quality, hallucination detection, latency).
* Monitor models in production and implement mechanisms for feedback loops and continuous improvement.
* Stay up to date with advances in generative AI, open-source LLM tooling, and fine-tuning strategies.
Requirements
* 4+ years of experience in applied ML or NLP, with at least 1–2 years focused on LLMs.
* Strong knowledge of transformer architectures and experience working with model libraries like Hugging Face Transformers, LangChain, or LLM orchestration tools.
* Proven experience deploying LLMs into production (custom or API-based) and optimizing them for inference.
* Familiarity with techniques such as LoRA, QLoRA, PEFT, RAG, or prompt engineering.
* Solid Python skills, especially in ML stack (e.g., PyTorch, TensorFlow, FastAPI for serving).
* Experience working with cloud infrastructure (AWS/GCP) and containerized deployments (Docker, Kubernetes).
* Bonus: Experience with data pipelines, vector databases (e.g., Weaviate, Pinecone, FAISS), or hybrid search.
Benefits
* Work on international projects with top startups and tech companies.
* Collaborate with global teams and gain cross-border experience.
* Grow your skills through hands-on challenges and real-world impact.Modern offices in Lac 2, Tunis
* Supportive sick leave policy that respects your health and well-being
* Receive above-market salary and financial stability
CV in English.
Skills Required
- 4+ years of experience in applied ML or NLP, with at least 1-2 years focused on LLMs
- Client-facing experience
- Strong knowledge of transformer architectures and experience with Hugging Face Transformers, LangChain, or LLM orchestration tools
- Proven experience deploying LLMs into production (custom or API-based) and optimizing inference
- Familiarity with fine-tuning and adaptation techniques such as LoRA, QLoRA, PEFT, RAG, and prompt engineering
- Solid Python skills and experience with ML stack (PyTorch, TensorFlow) and serving (FastAPI)
- Experience with cloud infrastructure (AWS or GCP) and containerized deployments (Docker, Kubernetes)
- CV in English
- Experience with data pipelines, vector databases (Weaviate, Pinecone, FAISS), or hybrid search
Am I A Good Fit?
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.
Success! Refresh the page to see how your skills align with this role.
The Company
What We Do
Scope Merge is a dedicated engineering team provider that connects top Tunisian engineers with European companies. The company helps European industrial firms solve engineering capacity problems by building integrated teams in Tunisia, managing local recruiting, employment, payroll, and HR. Their expertise spans software development, AI/ML, and enterprise systems, acting as a hub for specialized talent acquisition and outsourcing solutions.








