The Role
The role involves research, solution design, and developing production-grade AI systems, focusing on NLP and large language models. Responsibilities include prototyping LLM applications, conducting experiments, and optimizing model performance.
Summary Generated by Built In
About the Company:
Netomi is the leading agentic AI platform for enterprise customer experience. We work with the largest global brands like Delta Airlines, MetLife, MGM, United, and others to enable agentic automation at scale across the entire customer journey. Our no-code platform delivers the fastest time to market, lowest total cost of ownership, and simple, scalable management of AI agents for any CX use case. Backed by WndrCo, Y Combinator, and Index Ventures, we help enterprises drive efficiency, lower costs, and deliver higher quality customer experiences.
Want to be part of the AI revolution and transform how the world’s largest global brands do business? Join us!
Job Description:
Do you believe in the missions Intelligence agencies? Are you interested in solving complex programmatic and technical issues?
If you are interested in working on some of the most challenging technical and programmatic issues , we are interested in talking to you about Netomi work and career opportunities.
We are seeking a Senior Data Scientist with deep expertise in Large Language Models (LLMs) and modern AI systems to join our team. This role combines cutting-edge research, rapid prototyping, and production-grade implementation to deliver innovative AI-powered solutions. You will drive NLP and machine learning projects from conception through deployment, working as both a direct contributor and key technical advisor.
Job Responsibilities
- Research & Innovation - Stay current with the latest LLM research, architectures, and advancements in the field including real-time models and multimodal systems. Evaluate emerging techniques and methodologies for potential application to business problems. Monitor developments in transformer architectures, fine-tuning approaches, model optimization, and real-time inference. Research and assess new LLM capabilities, frameworks, and API features as they emerge
- Solution Design & Prototyping - Identify and define approaches for complex AI challenges leveraging state-of-the-art LLMs. Design and build proof-of-concept solutions to validate technical feasibility. Rapidly prototype LLM-based applications using modern frameworks and orchestration tools. Conduct rigorous experiments to evaluate different approaches and methodologies. Work collaboratively in multi-disciplinary team environments and establish professional networks with subject matter experts
- Production Development & Software Engineering - Write clean, maintainable, production-quality code following software engineering best practices and design patterns. Develop robust, scalable agentic workflows using orchestration frameworks (such as LangGraph, CrewAI, or similar). Implement advanced LLM features, including tool calling, function calling, structured outputs, and multi-turn conversations. Build production-grade systems utilizing Model Context Protocol (MCP) and other emerging standards. Design and implement scalable, fault-tolerant architectures for real-time LLM-powered applications. Conduct thorough code reviews and maintain high code quality standards. Optimize code for performance, memory efficiency, and cost-effectiveness in production environments
- Experimentation & Optimization - Design rigorous experiments to test hypotheses and validate model performance. Develop evaluation frameworks for LLM outputs, system performance, and user experience. Optimize prompt engineering strategies, fine-tuning approaches, and inference efficiency. Conduct A/B tests, performance benchmarking, and statistical analysis
Requirements
- 3-5 years of experience in data science, machine learning, and AI development with strong focus on NLP and LLM applications
- Bachelor's/Master's or higher degree in Computer Science, Machine Learning, Statistics, or related technical field
- Proven track record of building and deploying production ML/AI systems from research to deployment
- Mastery of Python with strong software engineering fundamentals (OOP, design patterns, testing)
- Deep hands-on experience with LLM frameworks and APIs (OpenAI, Anthropic, or similar)
- Strong experience with at least one deep learning framework (PyTorch or TensorFlow)
- Proficiency with modern ML orchestration and agentic frameworks (LangGraph, CrewAI, LangChain, or similar)
- Solid understanding of NLP techniques: embeddings, information extraction, semantic search, classification
- Experience with diverse ML models: neural networks, transformers, SVM, Random Forest, clustering, Bayesian models
- Hands-on experience with advanced LLM features: tool calling, function calling, multi-turn conversations, structured outputs
- Strong knowledge of software development practices: version control (Git), testing (pytest)
- Experience with REST APIs, async programming, and building scalable backend services
- Familiarity with vector databases and embedding systems (Pinecone, Weaviate, FAISS, or similar)
- Knowledge of distributed computing, cloud platforms (AWS, GCP, or Azure), and containerization (Docker)
- Strong experimental design skills with ability to formulate hypotheses and conduct rigorous analysis
- Excellent problem-solving abilities and intellectual curiosity to stay current with AI research
- Self-motivated with proven ability to work collaboratively in multi-disciplinary teams
Bonus
- Experience with voice/speech models and real-time audio processing (OpenAI Realtime API or similar)
- Knowledge of Model Context Protocol (MCP) and emerging LLM standards
- Experience with MLOps tools and practices (model monitoring, versioning, A/B testing)
- Contributions to open-source ML/AI projects or published research papers
- Familiarity with streaming architectures and event-driven systems (Kafka, RabbitMQ)
Netomi is an equal opportunity employer committed to diversity in the workplace. We evaluate qualified applicants without regard to race, color, religion, sex, sexual orientation, disability, veteran status, and other protected characteristics.
Top Skills
Anthropic
AWS
Azure
Crewai
Docker
Faiss
GCP
Langchain
Langgraph
Openai
Pinecone
Python
PyTorch
Rest Apis
TensorFlow
Weaviate
Am I A Good Fit?
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.
Success! Refresh the page to see how your skills align with this role.
The Company
What We Do
Netomi is an AI-first customer service platform that enables companies to deliver the highest quality customer experiences while significantly reducing cost. Netomi's Relationship Operating System automatically resolves up to 80% of routine customer service inquiries, decreasing resolution time, and increasing customer satisfaction and support quality. The patented, no-code platform works across messaging, chat, email and voice, and understands 100+ languages. Netomi is based in San Francisco and has offices in New York and India.







