Sr Data Scientist

Posted 24 Days Ago
Santa Clara, CA, USA
In-Office
210K-275K Annually
Senior level
Artificial Intelligence • Software
The Role
The role involves managing data for machine learning, deploying computer vision systems, ensuring data quality, and leading internal training initiatives.
Summary Generated by Built In

Summary

We’re Blue River, a team of innovators driven to create intelligent machinery that solves monumental problems for our customers. We empower our customers – farmers, construction crews, and foresters – to implement safer and more sustainable solutions, driving increased profitability with less reliance on scarce labor. We believe that focusing on the small stuff – pixel-by-pixel and task-by-task – leads to big gains.

Blue River Technology is based in Santa Clara, CA. 

Job Responsibilities

  • Define, curate, and manage datasets of images, sensor data, and scenarios that are designed to increase the trust and safety of autonomy.
  • Work closely with data engineers and field data capture technicians to mine fleet data and identify open needs.
  • Define frameworks for cataloging and searching scenario-based data to serve multiple stakeholders, including computer vision and robotics teams.
  • Monitor, investigate, and fix data ingestion issues related to dataset curation for training and testing computer vision algorithms.
  • Investigate data quality and actively participate in conceptualizing and developing short and long-term solutions.
  • Provide data and infrastructure support to internal teams.
  • Provide guidance to improve the stability, security, efficiency, and scalability of image data pipelines.
  • Improve code quality through writing unit tests, automation, and performing code reviews.
  • Examine the correlation between customer experience and virtual performance in similar scenarios, and adjust as needed. Ensure that defined safety and productivity test cases are adequately covered with curated scenarios.

Requirements

  • Master’s degree in Math, Physics, Data Science, or related field plus 5 years of related experience.
  • Required skills:
    • Implement and deploy computer vision and machine learning-based data pipeline systems using semantic segmentation, image and video classification, object detection, and supervised and unsupervised learning (5 yrs).
    • Experience working with data engineers, data scientists, software engineers, and field staff through the lifecycle of developing and deploying a machine learning system (4 yrs).
    • Perform non-parametric statistical tests and analysis on large image-based data sets using sklearn, scikit-image, scipy, and OpenCV (3 yrs).
    • Write technical documentation, tutorials, and summaries to train data collection teams and conduct on-site training (3 yrs).
    • Deploy scalable cloud-based solutions to mine, preprocess, resize, crop, rectify, and filter image-based data sets (5 yrs).
    • Implement code using Python libraries, including NumPy, SciPy, OpenCV, Pandas, Seaborn, Matplotlib, CUDA, PyTorch, and TensorFlow (5 yrs).
    • Design, implement, debug, and deploy stereo image-based data pipelines using TeamCity, AWS Airflow, Redis, Google AppSheet, Databricks data tables, Celery, and advanced search solutions on Labelbox with open-source models such as CLIP and BLIP (6 mos).
    • Design, build, and debug custom Python pipelines using functools for processing large image datasets, and deploy these pipelines using Docker and Docker Compose (1 yr).
    • Use statistical sampling algorithms to design efficient data collection methods for large stereo camera-based image datasets and coordinate data collection (6 mos).
    • 10% domestic travel required. Position is remote, but domestic travel to test/training sites is required, along with regular in-office time (about once a week) to interact with local workstations and participate in in-person meetings.

The US annual base salary range for this position is $209,862 - $275,000, along with eligibility for Blue River’s bonus and benefit programs.

Skills Required

  • Master's degree in Math, Physics, Data Science, or related field
  • 5 years of experience in implementing and deploying computer vision systems
  • 4 years of experience collaborating with engineers through the ML system lifecycle
  • 3 years of experience performing statistical tests on large image datasets
  • 3 years of experience writing technical documentation and training
  • 5 years of experience deploying cloud-based solutions for image datasets
  • 5 years of Python coding experience with various libraries
  • 6 months of experience designing stereo image-based data pipelines
  • 1 year of experience building custom Python processing pipelines
  • 6 months of experience using statistical sampling for data collection
The Company
Sunnyvale, CA
182 Employees
Year Founded: 2011

What We Do

We’re Blue River, a team of innovators driven to radically change agriculture by creating intelligent machinery. We empower our customers – farmers – to implement more sustainable solutions: optimizing chemical usage, reimagining routine processes, and improving farming yields year after year. We believe that focusing on the small stuff – pixel-by-pixel and plant-by-plant – leads to big gains. By partnering with John Deere, we are innovating in computer vision, machine learning, robotics, and product management to solve monumental challenges for our customers. Our people are at the heart of what we do. Through cross-discipline collaboration, this mission-driven and daring team is eager to define the new frontier of agricultural robotics. We are always asking hard questions, rapidly iterating, and getting our boots in the field to figure it out. We won’t give up until we’ve made a tangible and positive impact on agriculture.
