LLM or GenAI Application Engineer

Posted 18 Days Ago
Be an Early Applicant
Mountain View, CA
In-Office
110-120
Senior level
Marketing Tech • Business Intelligence
The Role
The LLM or GenAI Application Engineer will design and develop LLM applications, research NLP techniques, optimize model architecture, and deploy LLMs into production, ensuring model performance and alignment with ethical standards.
Summary Generated by Built In

FocusKPI is looking for an LLM or GenAI Application Engineer to join one of our clients, a high-tech SaaS company.

An LLM or GenAI Application Engineer or LLM Research Engineer role is mainly focused on the consumer-facing applications. The role will be a part of the core GenAI AI development team within the client. It primarily contributes to LLM-based application development, evaluation, and testing of new features, as well as core technology, such as an agent framework, utilizing the latest technology. stack, LLM technology.
Work Location: Mountain View, CA
Duration: 12-month contract; Hybrid role (4 days per week onsite)
Pay Range: $110/hr to $120/hr
**No C2C resumes are considered**
Role & Responsibilities:

  • Design, train, and fine-tune large language models (e.g., GPT, LLaMA, PaLM) for various applications.
  • Research cutting-edge techniques in natural language processing (NLP) and machine learning to improve model performance.
  • Explore advancements in transformer architectures, multi-modal models, and emergent AI behaviors.
  • Collect, clean, and preprocess large-scale text datasets from diverse sources.
  • Develop and implement data augmentation techniques to improve training data quality.
  • Ensure data is free from bias and aligned with ethical AI standards.
  • Optimize model architecture to improve accuracy, efficiency, and scalability.
  • Implement techniques to reduce latency, memory footprint, and inference time for real-time applications.
  • Collaborate with MLOps teams to deploy LLMs into production environments using Docker, Kubernetes, and cloud.
  • Develop robust evaluation pipelines to measure model performance using key metrics like accuracy, perplexity, BLEU, and F1 score.
  • Continuously test for bias, fairness, and robustness of language models across diverse datasets.
  • Conduct A/B testing to evaluate model improvements in real-world applications.
  • Stay updated with the latest advancements in generative AI, transformers, and NLP research.
  • Contribute to research papers, patents, and open-source projects—present findings and insights at conferences and internal knowledge-sharing sessions. 
Qualifications:
  • Required to have 5-7 years of industrial work experience along with research/academic experience.
  • Advanced degree in Computer Science, Artificial Intelligence, Data Science, or a related field.
  • Strong programming skills.
  • Expertise with LLM and GenAI application development.
  • Experience with deep learning frameworks such as TensorFlow, PyTorch, or JAX.
  • Hands-on experience with transformer-based models (e.g., GPT, BERT, RoBERTa, LLaMA).
  • Expertise in natural language processing (NLP) and sequence-to-sequence models.
  • Familiarity with Hugging Face libraries and OpenAI APIs.
  • Experience with MLOps tools like Docker, Kubernetes, and CI/CD pipelines.
  • Strong understanding of distributed computing and GPU acceleration using CUDA.
  • Knowledge of reinforcement learning and RLHF (Reinforcement Learning with Human Feedback).

Top 3 skills (must have):

  • The candidate must have a real (actual) product experience, application development, and GenAI-based application shipping.
  • Actual or Industrial LLM or GenAI application experience of at least 2-3 years

**No C2C resumes are considered**
 

Thank you!

FocusKPI Hiring Team

Founded in 2010, FocusKPI, Inc. (FocusKPI) is a data science and technology firm specializing in predictive analytics practice and methodologies. FocusKPI is a US company headquartered in Silicon Valley, California, with an East Coast office in Boston, Massachusetts.

Top Skills

Cuda
Docker
Genai
Hugging Face
Jax
Kubernetes
Llm
Openai
PyTorch
TensorFlow
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Santa Clara, CA
31 Employees
Year Founded: 2010

What We Do

FocusKPI brings deep domain experience in business and marketing analytics to enable our clients to unlock growth-driving insights from data. We help our clients develop action-oriented analytics and data science products that are customized to company-specific needs and integrated into their platforms for ongoing use. Our Accelerators, a toolbox of frameworks and models built over 10+ years, fast-track projects by capitalizing on our experience.

Capabilities:
Predictive Analytics
AI / Machine Learning
Measurement
Text Analysis

Key Industries Served:
Retail Media
B2B & B2C Sales, Marketing, and Merchandising
Software & Applications

Similar Jobs

Finch Logo Finch

Staff Software Engineer

Fintech • HR Tech • Software
Hybrid
2 Locations
69 Employees
200K-200K

Wells Fargo Logo Wells Fargo

Sales Coordinator

Fintech • Financial Services
Hybrid
2 Locations
213000 Employees
24-30

Wells Fargo Logo Wells Fargo

Product Manager

Fintech • Financial Services
Hybrid
3 Locations
213000 Employees
119K-224K Annually
Hybrid
6 Locations
213000 Employees
23-31

Similar Companies Hiring

Amplify Platform Thumbnail
Fintech • Financial Services • Consulting • Cloud • Business Intelligence • Big Data Analytics
Scottsdale, AZ
62 Employees
ClickMint Thumbnail
Marketing Tech • Generative AI • eCommerce • AdTech
Malibu, CA
7 Employees
PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account