AI Research Engineer

Reposted 24 Days Ago
Be an Early Applicant
2 Locations
Hybrid
Expert/Leader
Artificial Intelligence • Software • Business Intelligence • Generative AI • Big Data Analytics
CSquared Labs helps companies understand their customers.
The Role
The AI Research Engineer will implement advanced agentic systems, automate optimization processes, and design evaluation frameworks to improve performance on real-world AI tasks.
Summary Generated by Built In

About this role

We need a researcher who builds. You don't just read Arxiv; you ship it. You are steeped in the nuances of LLMs and agentic AI, bridging the gap between training models and applying them to messy, real-world problems.

You are a pragmatic innovator who knows that "State of the Art" means nothing if it doesn't solve the business problem. You quickly grok academic papers, but can cut through the noise to deliver a working MVP. You obsess over evaluation, not just principles.

What You’ll Do

  • Advanced Agentic Systems: Architect and implement complex agent behaviors, leveraging multi-agent hierarchies, multi-trajectory inference, and RL environments to solve open-ended tasks.
  • Automated Optimization: Move beyond manual prompt engineering. Build and deploy automated optimization pipelines (like DSPy or custom optimizers) to systematically improve agent performance.
  • Rigorous Evaluation: Design and own the evaluation stack. Implement LLM-as-a-judge frameworks and domain-specific metrics to ensure our agents are reliable, safe, and improving over time.
  • Applied Research: Translate the latest papers on RAG techniques and cognitive architectures into production-ready code that drives our product forward.

Examples of Every-Day Work

  • From Paper to Prototype: Read a new paper on tree-of-thought reasoning in the morning and have a working implementation testing against our benchmarks by the afternoon.
  • Optimize the "Brain": Diagnose why an agent is hallucinating in a specific edge case and deploy a targeted fix via context engineering or fine-tuning, validating it with regression tests.
  • Architect the Swarm: Design a communication protocol for a hierarchy of agents where a "Manager" agent effectively delegates sub-tasks to specialized "Worker" agents without getting stuck in loops.
  • Cut the Fat: Reject a complex research proposal because a simpler, heuristic-based approach solves 90% of the user's problem with 10% of the compute.

What We’re Looking For

  • Deep AI Fluency: You are steeped in the latest LLM research. You understand the internals of Transformers, but more importantly, the emergent behaviors of agentic systems.
  • Research agility: You can quickly digest papers on RAG, multi-agent collaboration, and RLHF, and judge their applicability to our stack.
  • Eval Obsession: You know best practices for evaluation like the back of your hand. You don't trust a prompt until you have the metrics to back it up.
  • Product Intuition: You grok business concepts. You propose creative technical solutions that align with commercial goals, not just academic curiosity.
  • MVP Mindset: You move fast. You prioritize speed of iteration and learning over theoretical perfection.

Skills Required

  • Deep AI Fluency in LLM research and agentic systems.
  • Ability to quickly digest academic papers and apply them in practice.
  • Experience in designing rigorous evaluation frameworks and metrics.
  • Strong intuition for aligning technical solutions with product goals.
  • Proven ability to rapidly prototype and iterate on new ideas.
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Menlo Park, CA
7 Employees
Year Founded: 2025

What We Do

CSquared Labs is an agentic AI company whose founding team comes from Google, Meta, Uber, Palanti, and the like. We've raised a large Seed round from top Silicon Valley VCs with the mission to help companies understand their customers.

Why Work With Us

We are learning-first culture, bridging the gap between AI/ML research and domain expert knowledge.

Similar Jobs

Félix (felixpago) Logo Félix (felixpago)

Lead AI Research Engineer

Fintech • Financial Services
In-Office or Remote
5 Locations
343 Employees
100K-150K Annually

Databricks Logo Databricks

Staff Software Engineer

Big Data • Machine Learning • Software • Analytics • Big Data Analytics
In-Office
2 Locations
2200 Employees
199K-270K Annually

Cogent Security Logo Cogent Security

AI Research Engineer

Artificial Intelligence • Machine Learning • Software • Cybersecurity
In-Office
2 Locations
100K-300K Annually

Databricks Logo Databricks

Staff Software Engineer

Big Data • Machine Learning • Software • Analytics • Big Data Analytics
In-Office
2 Locations
2200 Employees
190K-270K Annually

Similar Companies Hiring

Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
LTX Thumbnail
Conversational AI • Generative AI
Jerusalem, Israel
360 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account