Data Scientist - Feature Engineering

Posted 2 Days Ago
2 Locations
Remote or Hybrid
150K-190K Annually
Junior
Financial Services
The Role
As a Data Scientist in Feature Engineering at Voleon, you will curate complex datasets, develop predictive features, validate signals, build data pipelines, and leverage AI tools for analysis.
Summary Generated by Built In
 
Voleon is a technology company that applies state-of-the-art AI and machine learning techniques to real-world problems in finance. For nearly two decades, we have led our industry and worked at the frontier of applying AI/ML to investment management. We have become a multibillion-dollar asset manager, and we have ambitious goals for the future.
 
Your colleagues will include internationally recognized experts in artificial intelligence and machine learning research as well as highly experienced finance and technology professionals.
 
In addition to our enriching and collegial working environment, we offer highly competitive compensation and benefits packages, technology talks by our experts, a beautiful modern office, daily catered lunches, and more.

The Voleon Group is growing its Feature Engineering team — a small, high-impact group responsible for turning the world's messy, complex datasets into predictive signals that power our machine learning models. As a Data Scientist on this team, you'll dig into raw data from diverse domains and, in collaboration with Research, assist in design and implementation of features that capture what is actually happening in the world. This is data storytelling at its most consequential: every feature you build has a direct path to our investment process.

We're looking for deeply curious people who get genuine satisfaction from wrestling with an unfamiliar dataset, understanding its structure and quirks, and emerging with something that encodes real information. You'll own the full arc from data sourcing and curation through feature construction, statistical validation, and integration into our production systems. We also expect you to actively experiment with AI-powered tools — LLM-based coding assistants, agentic workflows, and whatever comes next — to accelerate your day-to-day work and push the boundaries of what a small team can accomplish.

Responsibilities

  • Explore, profile, and curate complex and often messy datasets from third-party vendors and internal sources, developing a deep understanding of what each dataset can and cannot tell us
  • Harness financial intuition, academic research, and statistical rigor to inform design and implementation of predictive features in collaborative setting
  • Validate features through a disciplined, test-driven framework — including cross-sectional analysis, stationarity testing, and point-in-time correctness — to ensure signals are real and not artifacts of data issues
  • Build and maintain data pipelines that bring features from prototype to production, with monitoring for data health and correctness along the way
  • Communicate your findings clearly — both the signal you've found and the story of how the data produces it — to researchers and leadership
  • Proactively investigate anomalies in data feeds and production behavior, performing root-cause analysis and surfacing issues to relevant stakeholders
  • Leverage AI tools to accelerate exploration, coding, and analysis — and share what you learn about effective workflows with the team

Requirements

  • 2 years of applied industry experience (including internships) working end-to-end with complex datasets: curation, querying, aggregation, exploratory analysis, and visualization
  • Experience using statistical methods to analyze data, identify patterns, conduct root-cause analysis, and translate findings into actionable insights
  • Ability to frame and answer questions mathematically
  • Ability to infer useful forward-looking directions from the results of retrospective analysis
  • Fluency in managing, processing, and visualizing tabular data using SQL and Python (Pandas or Polars)
  • Basic software development skills and experience with bash, Linux/Unix, and git
  • Ability to refine ambiguous requests into well-scoped analyses and communicate results with clarity and precision
  • Bachelor's degree in a quantitative discipline (statistics, data science, computer science, economics, physics, or a related field)

Preferred

  • Master's degree in a quantitative discipline
  • Prior industry experience or demonstrated interest in finance — academic projects, coursework in financial engineering, or industry internships
  • Familiarity with financial datasets such as Compustat, IBES, or similar vendor data
  • Experience developing in a production-facing environment with standard tooling (CI/CD, git, workflow orchestration)
  • Hands-on experience with AI coding assistants or LLM-based tools in a data science or engineering workflow
  • A track record of curiosity-driven exploration — side projects, Kaggle competitions, research papers, or anything that shows you can't leave an interesting dataset alone

Compensation
The base salary range for this position is $150,000 to $190,000 in the location(s) of this posting. Individual salaries are determined through a variety of factors, including, but not limited to, education, experience, knowledge, skills, and geography. Base salary does not include other forms of total compensation, such as bonus compensation and other benefits. Our benefits package includes medical, dental, and vision coverage, life and AD&D insurance, 20 days of paid time off, 9 sick days, and a 401(k) plan with a company match.
 
“Friends of Voleon” Candidate Referral Program
If you have a great candidate in mind for this role and would like to have the potential to earn $7,500 if your referred candidate is successfully hired and employed by The Voleon Group, please use this form to submit your referral. For more details regarding eligibility, terms, and conditions, please make sure to review the Voleon Referral Bonus Program.
 
Equal Opportunity Employer
The Voleon Group is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.
 
#LI-MD1

Top Skills

Ai Tools
Bash
Git
Linux
Llm-Based Tools
Pandas
Polars
Python
SQL
Unix
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Berkeley, CA
169 Employees
Year Founded: 2007

What We Do

Founded in 2007 by two machine learning scientists, The Voleon Group is a quantitative hedge fund headquartered in Berkeley, CA. We are committed to solving large-scale financial prediction problems with statistical machine learning. The Voleon Group combines an academic research culture with an emphasis on scalable architectures to deliver technology at the forefront of investment management. Many of our employees hold doctorates in statistics, computer science, and mathematics, among other quantitative disciplines. Voleon's CEO holds a Ph.D. in Computer Science from Stanford and previously founded and led a successful technology startup. Our Chief Investment Officer and Head of Research is Statistics faculty at UC Berkeley, where he earned his Ph.D. Voleon prides itself on cultivating an office environment that fosters creativity, collaboration, and open thinking. We are committed to excellence in all aspects of our research and operations, while maintaining a culture of intellectual curiosity and flexibility. The Voleon Group is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

Similar Jobs

SailPoint Logo SailPoint

Sales Representative

Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
Remote or Hybrid
2 Locations
2461 Employees
43K-90K Annually

Cox Enterprises Logo Cox Enterprises

Senior Manager, Client Relationship Management (Cox Fleet)

Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
Remote or Hybrid
United States
50000 Employees
112K-186K Annually

Cox Enterprises Logo Cox Enterprises

Manager, Client Relationship Management (Cox Fleet)

Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
Remote or Hybrid
United States
50000 Employees
92K-154K Annually

Zscaler Logo Zscaler

User Experience Designer

Cloud • Information Technology • Security • Software • Cybersecurity
Easy Apply
Remote or Hybrid
USA
8697 Employees
116K-165K Annually

Similar Companies Hiring

Amplify Platform Thumbnail
Fintech • Financial Services • Consulting • Cloud • Business Intelligence • Big Data Analytics
Scottsdale, AZ
62 Employees
Rain Thumbnail
Blockchain • Fintech • Payments • Financial Services • Cryptocurrency • Web3 • Infrastructure as a Service (IaaS)
New York, NY
100 Employees
Granted Thumbnail
Mobile • Insurance • Healthtech • Financial Services • Artificial Intelligence
New York, New York
23 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account