LLM Architect

Sorry, this job was removed at 04:09 a.m. (CST) on Thursday, Jun 26, 2025
Hiring Remotely in United States
Remote
Information Technology • Professional Services
The Role
Description

Location: US / Canada (Eastern Time) - Home-based

Job Type: Full-time, Permanent 

About AllCloud

AllCloud is a global professional services company providing organizations with cloud enablement and transformation tools. As an AWS Premier Consulting Partner and audited MSP, a Salesforce Platinum Partner, and a Snowflake Premier Partner, AllCloud helps clients connect their front and back offices by building a new operating model to harness the benefits of cloud technology as well as data and analytics.

Job Summary

We are looking for an innovative LLM Architect to lead the design and development of custom language models at AllCloud. This role will be responsible for architecting, training, and optimizing large language models based on modified transformer architectures. The ideal candidate will have deep expertise in NLP, transformer model design, and efficient training methodologies. You'll work alongside GPU Engineers and ML Engineers to create state-of-the-art language models that meet our customers' specific requirements, pushing the boundaries of what's possible with generative AI.

Responsibilities

  • Design custom transformer-based language model architectures tailored to specific use cases
  • Develop and implement modifications to transformer architectures to enhance performance, efficiency, or capabilities
  • Create and execute model pre-training, fine-tuning, and evaluation strategies
  • Implement techniques like quantization, pruning, and knowledge distillation to optimize model size and performance
  • Design and implement training data pipelines, including data selection, cleaning, and augmentation
  • Establish rigorous evaluation frameworks to assess model performance, fairness, and safety
  • Research and implement state-of-the-art techniques in LLM development
  • Create detailed documentation on model architectures, training methodologies, and performance characteristics
  • Collaborate with GPU Engineers to implement efficient training strategies across distributed systems
  • Work with customers to understand their unique requirements and translate them into model design decisions

Requirements

Summary of Key Requirements

  • 4+ years of experience in deep learning research or development with a focus on NLP and transformer models
  • Strong understanding of transformer architecture and its variants (GPT, BERT, T5, etc.)
  • Experience designing and training large language models from scratch
  • Expertise in PyTorch or TensorFlow for implementing custom model architectures
  • Knowledge of distributed training approaches for large models (DeepSpeed, Megatron, etc.)
  • Experience with model compression techniques (quantization, pruning, knowledge distillation)
  • Strong background in mathematics, particularly linear algebra, differential equations, probability, and statistics
  • Familiarity with current research in LLM development, including attention mechanisms, mixture of experts, and efficient training methods
  • Master's or PhD in Computer Science, Machine Learning, or related field
  • Publication record in NLP, LLMs, or transformer architecture (strongly preferred)

Certifications

  • AWS Machine Learning Specialty (Strongly Preferred)
  • NVIDIA-Certified Associate - Generative AI Multimodal (Preferred)

Why work for us? 

Our team inspires progress in each other and in our customers through our relentless pursuit of excellence; you will work with leaders who promote learning and personal development.


AllCloud is an Equal Opportunity Employer and considers applicants for employment without regard to race, color, religion, sex, orientation, national origin, age, disability, genetics or any other basis forbidden under federal, provincial, or local law.


The Company
HQ: Denver, Colorado
474 Employees
Year Founded: 2014

What We Do

AllCloud is a global professional services company providing organizations with the tools for cloud enablement and transformation. Through a unique combination of expertise and agility, AllCloud accelerates cloud innovation and helps organizations fully unlock the value of cloud technology as well as data and analytics. As an AWS Premier Consulting Partner, a Salesforce Platinum Partner, and a Snowflake Premier Partner, AllCloud helps clients connect their front office and back office by building a new operating model that allows them to harness the benefits of cloud technology as well as data and analytics. AllCloud is supported by a robust ecosystem of technology partners, proven methodologies, and well-documented best practices, which help customers achieve operational excellence on the cloud, within a secure environment, at every milestone of the journey to becoming cloud first. With years of experience and a portfolio of thousands of successful cloud deployments, AllCloud serves clients across the globe and has offices in Israel, Europe, and North America. www.allcloud.io
