About Centific
Centific is a frontier AI data foundry that curates diverse, high-quality data, using our purpose-built technology platforms to empower the Magnificent Seven and our enterprise clients with safe, scalable AI deployment. Our team includes more than 150 PhDs and data scientists, along with more than 4,000 AI practitioners and engineers. We harness the power of an integrated solution ecosystem—comprising industry-leading partnerships and 1.8 million vertical domain experts in more than 230 markets—to create contextual, multilingual, pre-trained datasets; fine-tuned, industry-specific LLMs; and RAG pipelines supported by vector databases. Our zero-distance innovation™ solutions for GenAI can reduce GenAI costs by up to 80% and bring solutions to market 50% faster.
Our mission is to bridge the gap between AI creators and industry leaders by bringing best practices in GenAI to unicorn innovators and enterprise customers. We aim to help these organizations unlock significant business value by deploying GenAI at scale, helping to ensure they stay at the forefront of technological advancement and maintain a competitive edge in their respective markets.
About Job
About Centific:
Centific is a frontier AI data foundry that curates diverse, high-quality data, using our purpose-built technology platforms to empower the Magnificent Seven and our enterprise clients with safe, scalable AI deployment. Our team includes more than 150 PhDs and data scientists, along with more than 4,000 AI practitioners and engineers. We harness the power of an integrated solution ecosystem—comprising industry-leading partnerships and 1.8 million vertical domain experts in more than 230 markets—to create contextual, multilingual, pre-trained datasets; fine-tuned, industry-specific LLMs; and RAG pipelines supported by vector databases. Our zero-distance innovation™ solutions for GenAI can reduce GenAI costs by up to 80% and bring solutions to market 50% faster.
Our mission is to bridge the gap between AI creators and industry leaders by bringing best practices in GenAI to unicorn innovators and enterprise customers. We aim to help these organizations unlock significant business value by deploying GenAI at scale, helping to ensure they stay at the forefront of technological advancement and maintain a competitive edge in their respective markets.
Role Overview
As a Multilingual AI - AI Engineer, you will design and optimize multilingual NLP/LLM models, build high-performance RAG pipelines, and transform enterprise data into structured, multilingual AI-ready assets. You will work closely with our data science, MLOps, and product teams to create production-grade multilingual AI capabilities that operate across complex datasets, languages, and business domains.
Key Responsibilities:
- Develop, fine-tune, and evaluate multilingual LLMs and transformer-based models
- Build scalable multilingual RAG systems (chunking, embeddings, retrieval optimization, evaluation)
- Architect multilingual pipelines for data preparation, alignment, labeling, and quality assurance
- Implement model monitoring, bias detection, and evaluation frameworks across multiple languages
- Collaborate with linguists, domain experts, and product teams to align AI solutions with enterprise needs
- Integrate vector databases into multilingual AI pipelines
- Optimize inference and deployment via MLOps workflows
- Conduct benchmarking and comparative analysis across languages and model families
- Support multilingual data curation at scale (synthetic data generation, augmentation, filtering)
Required Qualifications:
- Strong experience with NLP and LLMs (HuggingFace, OpenAI, Anthropic, Qwen, Llama, Mistral, etc.)
- Experience fine-tuning transformer models using PEFT, LoRA, or full-fine-tuning
- Proficiency with Python and ML frameworks (PyTorch, TensorFlow or JAX)
- Hands-on experience building RAG pipelines and working with vector databases
- Experience with multilingual embeddings and cross-lingual NLP
- Familiarity with MLOps practices (Docker, CI/CD, model deployment, monitoring)
- Ability to work with datasets across multiple languages and handle linguistic variation
- Strong problem-solving skills and ability to work in fast-paced AI environments
Preferred Qualifications:
- Experience with multilingual evaluation frameworks and benchmark datasets
- Background in linguistics, computational linguistics, or cross-cultural NLP
- Experience building enterprise-grade AI solutions (healthcare, finance, logistics, retail, etc.)
- Knowledge of prompt-engineering best practices and instruction-tuning
- Familiarity with Azure/AWS/GCP AI ecosystems
- Experience with data annotation workflows, synthetic data generation, or quality assurance automation
What we offer:
- Attractive remuneration package
- Private Health Care Insurance
- Growth and learning opportunities
- Work in a well-established multinational company
- Hybrid work setup
Centific is an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, ancestry, citizenship status, age, mental or physical disability, medical condition, sex (including pregnancy), gender identity or expression, sexual orientation, marital status, familial status, veteran status, or any other characteristic protected by applicable law. We consider qualified applicants regardless of criminal histories, consistent with legal requirements.
Centific is an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, ancestry, citizenship status, age, mental or physical disability, medical condition, sex (including pregnancy), gender identity or expression, sexual orientation, marital status, familial status, veteran status, or any other characteristic protected by applicable law. We consider qualified applicants regardless of criminal histories, consistent with legal requirements.Similar Jobs
What We Do
Zero distance innovation for GenAI creators and industries Expertly engineering platforms and curating multimodal, multilingual data, we empower the ‘Magnificent Seven’ and enterprise clients with safe, scalable AI deployment We a team of over 150 PhDs and data scientists, along with more than 4,000 AI practitioners and engineers. We bring platforms, partners and 1.8 million vertical domain experts to create high-quality pre-trained datasets, fine-tuned industry-specific LLMs, and RAG pipelines supported by vector databases. These innovations can reduce GenAI costs by up to 80% and bring GenAI solutions to market 50% faster in 230 locales.








