AI Researcher

Reposted 21 Days Ago
Be an Early Applicant
Melbourne, Victoria, AUS
In-Office
Mid level
Artificial Intelligence • Information Technology • Software
The Role
The AI Researcher will design architectures, run experiments, evaluate large language models, and collaborate with ML engineers for model training and analysis.
Summary Generated by Built In
About the role

Maincode builds foundation models from first principles on Australian infrastructure. We design architectures, run our own compute, shape the training process, and operate the systems that serve our models.

We have built Matilda, the first large language model built and trained from scratch in Australia. Our new compute cluster is live; we are scaling the next version of Matilda and deploying and serving it live for public access.

We are looking for AI researchers who want to work on the core architecture, training, and evaluation of large-scale language models that power Matilda.

This role is not focused on incremental benchmarking or paper output. You will work directly with the engineers running large-scale training systems and help design models that learn efficiently and behave reliably in production.

What you would actually do

You will work across the model development loop, from research questions to training runs to evaluation.

This includes:

  • Designing and testing architecture changes and training regimes for large language models

  • Running controlled experiments at scale and isolating causal effects

  • Studying failure modes in reasoning, generalisation, robustness, and representation

  • Shaping objectives, data mixtures, and optimisation choices that influence model behaviour

  • Building and refining evaluations that measure capability and reliability, not just scores

  • Analysing training dynamics using logs, metrics, and model outputs

  • Collaborating with ML systems engineers on distributed training and training operations

  • Writing clear internal notes that turn experimental results into design decisions

You will spend substantial time in code, training runs, logs, and evaluation outputs. The goal is clarity about what improves the model and why.

What we are looking for

We care about depth of reasoning, experimental discipline, and the ability to make progress under ambiguity.

We expect:

  • Hands-on experience writing and running production-grade ML or research code

  • Strong Python and experience with PyTorch or JAX

  • Solid understanding of transformer-based language models and the basics of pre-training and evaluation

  • Ability to design experiments, interpret results, and communicate tradeoffs clearly

  • Comfort working close to infrastructure, performance constraints, and operational reality

  • Interest and exposure to reasoning-oriented architectures and training methods beyond standard approaches, and beyond standard LLMs


Nice to have
  • Experience with distributed training concepts and tooling (data parallel, tensor parallel, sharding, checkpointing)

  • Experience running training across multiple nodes and managing long training cycles

  • Familiarity with large-model training stacks and frameworks (for example Megatron-style systems, DeepSpeed-like tooling, FDSP or similar)

  • Comfort across the full workflow: training, evaluation, and deployment constraints

  • Experience working in ROCm-based environments

How you would work

This is hands-on research. You will use code as a primary tool for thinking.

You will be expected to:

  • Move between theory and implementation quickly and precisely

  • Prefer controlled experiments over broad sweeps

  • Use logs, metrics, and model behaviour to guide decisions

  • Work closely with engineering counterparts to scale and validate ideas

What this role is not
  • It is not a product research role

  • It is not prompt engineering

  • It is not fine-tuning someone else’s model and shipping wrappers around external APIs

You will work on Matilda, trained from scratch on our infrastructure, and pushed until its behaviour is understood and improved.

Why Maincode

Maincode builds and operates the full stack: training infrastructure, model code, evaluation systems, and deployment. We run one of the largest private AI compute environments in Australia, built for the sole purpose of training and deploying large scale models.

If you want to work directly on training and evaluating a large language model built from scratch, this is the only role in Australia that will put you inside that work.

Note

This is a full time role based in Melbourne, working closely with our in person team. At this time we are not able to offer visa sponsorship, so applicants must have existing and unrestricted work rights in Australia.

Skills Required

  • Hands-on experience writing and running production-grade ML or research code
  • Strong Python and experience with PyTorch or JAX
  • Solid understanding of transformer-based language models
  • Ability to design experiments and interpret results
  • Experience with distributed training concepts
  • Familiarity with large-model training stacks and frameworks
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Melbourne, VIC
13 Employees

What We Do

We create intelligent systems that understand context, anticipate needs, and turn ideas into action, unlocking entirely new ways for people to work and create. The future isn’t just about software that stores information. It’s about technology that thinks, adapts, and acts. We are pioneering the next generation of AI-powered, action-driven systems that amplify human capability, accelerate workflows, and make work feel effortless. We believe AI should do more than assist, it should empower. If you're passionate about building the next era of intelligent software, join us.

Similar Jobs

LexisNexis Logo LexisNexis

Legal AI Researcher/Analyst – Regulatory Compliance

Information Technology • Legal Tech • Professional Services • Analytics • Business Intelligence
In-Office
3 Locations
10001 Employees

ServiceNow Logo ServiceNow

Senior CRM Account Exec

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Melbourne, Victoria, AUS
29000 Employees

ServiceNow Logo ServiceNow

Senior Manager, Inbound Product Management

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Melbourne, Victoria, AUS
29000 Employees

Xero Logo Xero

Engineering Manager

Cloud • Fintech • Information Technology • Machine Learning • Software
Hybrid
2 Locations
4500 Employees

Similar Companies Hiring

Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account