Environmental Data Engineer / Machine Learning Engineer

Posted 13 Days Ago
San Antonio, TX
In-Office
Mid level
Greentech
The Role
The role involves developing data pipelines and machine learning models for environmental metrics. Responsibilities include pipeline optimization, model training, and collaboration with cross-functional teams.
Summary Generated by Built In

Position: Environmental Data Engineer / Machine Learning Engineer

Position Type: Full-Time

Reports to: Jay Weeks, Director of Data & Soil Science 

Role Overview

As a Data Engineer / Machine Learning Engineer, you will play a pivotal role in bridging the gap between experimental prototypes and scalable, production-ready systems. You'll spend approximately 50% of your time optimizing and deploying data pipelines to support long-term business needs, and the other 50% developing advanced machine learning models for environmental mapping and predicting changes in environmental metrics (e.g., soil organic carbon stocks) using time series data. This is a hands-on position in a fast-paced startup environment, where you'll collaborate with cross-functional teams to deliver impactful, reliable solutions.

Key Responsibilities
  • Production Pipeline Development (50% of time):
    • Evaluate and refactor prototype code from R&D phases into efficient, maintainable production pipelines.
    • Design, implement, and maintain scalable data ingestion, processing, and ETL (Extract, Transform, Load) workflows using cloud-based infrastructure (e.g., AWS, GCP, or Azure).
    • Ensure pipelines are robust, fault-tolerant, and optimized for performance, security, and cost-efficiency.
    • Integrate monitoring, logging, and alerting systems to support ongoing operations and quick issue resolution.
    • Collaborate with software engineers, scientists, data scientists, and other stakeholders to align pipelines with business objectives, enabling long-term scalability and reliability.
  • Model Development (50% of time):
    • Build, train, and deploy machine learning models for environmental quantification (e.g., digital soil mapping and predicting soil organic carbon stock changes).
    • Work with time series data from various sources (e.g., satellite imagery, sensor data, historical records) to develop predictive models using techniques like time-series forecasting, geospatial analysis, and deep learning.
    • Perform feature engineering, model evaluation, hyperparameter tuning, and validation to ensure accuracy and generalizability.
    • Integrate ML models into production environments, including API development for real-time predictions and batch processing.
    • Stay abreast of advancements in ML for geospatial and environmental applications, experimenting with new algorithms and tools to improve model performance.
  • General Duties:
    • Conduct code reviews, write documentation, and mentor junior team members on best practices in data engineering and ML.
    • Troubleshoot and debug issues in both data pipelines and ML systems.
Required Qualifications
  • Bachelor's or Master's degree in Computer Science, Data Science, Machine Learning, Environmental Science, or a related field.
  • 3+ years of experience in data engineering, with a proven track record of productionizing prototype code in a startup or fast-paced environment.
  • Strong proficiency in programming languages such as Python (with libraries like Pandas, NumPy, Scikit-learn) and SQL.
  • Experience with ML frameworks (e.g., TensorFlow, PyTorch, XGBoost) and time-series analysis (e.g., Prophet, LSTM networks, PINNs).
  • Hands-on experience with cloud platforms, containerization, and CI/CD pipelines.
  • Familiarity with geospatial data processing and environmental modeling concepts, particularly in soil science or agriculture.
  • Excellent problem-solving skills, with the ability to handle ambiguous requirements and deliver under tight deadlines.
Preferred Skills
  • Experience in digital soil mapping, carbon stock prediction models, advanced statistics, and/or Bayesian model calibration/inference.
  • Knowledge of big data technologies (e.g., Spark, Kafka) for handling large-scale time-series datasets.
  • Background in DevOps practices and infrastructure as code (e.g., Terraform).
  • Passion for sustainability and environmental impact.

Benefits:
  • Health Insurance plan with $0 deductible and $0 co-pay
  • Dental and vision insurance plans
  • Flexible spending account option
  • Open Paid Time Off Policy plus 9 paid holidays per year as listed in our Company Handbook
  • Participation in our 401(k) savings plan
  • Company-paid Life and AD&D coverage
  • Educational materials and expenses to support continuing education opportunities
About Grassroots Carbon

Grassroots Carbon is the leading grasslands restoration and soil carbon storage company that partners with landowners to implement and scale regenerative land management practices. In addition to enhancing soil health, promoting biodiversity, and improving water quality, these regenerative practices have tremendous potential to combat climate change by drawing down large quantities of atmospheric CO2 into the soil. Grassroots Carbon is proud to have partnered with ranchers across 1.6 million acres in 21 states to implement practices that restore grasslands, improve bird habitats, build soil health, and drive nature-based soil organic carbon drawdown through healthy soils. Built on a foundation of scientific rigor, quality, and transparency, Grassroots Carbon has built strong partnerships with Audubon Conservation Ranching, Texas Agricultural Land Trust, Understand Ag, and Colorado State University’s Soil Carbon Solutions Center while generating high-quality soil carbon drawdown credits for leading corporations, including Nestle, Microsoft, Shopify, Marathon Oil, H-E-B, Olipop, and Urban Villages, to offset their carbon impact and reach their sustainability goals.

*Grassroots Carbon is proud to be a portfolio company of Soilworks Natural Capital*

About Soilworks Natural Capital:

Grassroots Carbon is proud to be a portfolio company of Soilworks Natural Capital, which provides shared services to our fast-growing company. Soilworks is a private equity fund that invests in, incubates, and acquires companies to help accelerate the regenerative agriculture movement and is on a mission to prove regenerative grazing is the most profitable way to ranch. Soilworks' principles include better and healthier food, restoring plant and animal diversity, regenerating soil to store water and carbon, and creating more profitable family farms. Soilworks was launched by the co-founders of Scaleworks, a technology venture equity fund based in San Antonio, TX.

We are proud to foster a workplace free from discrimination. We strongly believe that diversity of experience, perspectives, and background leads to a better environment for our employees and a better experience for our users and our customers. We are an equal-opportunity employer and do not discriminate on the basis of protected characteristics. All candidates will be given the same consideration.

*No visa sponsorship is available for this position* 

Top Skills

AWS
Azure
GCP
Python
PyTorch
SQL
TensorFlow
XGBoost

The Company
HQ: San Antonio, Texas
24 Employees
Year Founded: 2021

What We Do

Powered by nature to achieve impacts far beyond carbon removal.

Grassroots Carbon built the leading grasslands soil carbon company focused on empowering ranchers to implement regenerative land management practices on U.S. grasslands.

Based in San Antonio, Texas, Grassroots Carbon created a foundation of scientific rigor, quality, and trust that allows leading companies like Microsoft, Marathon Oil, H-E-B, Shopify, and Nestle to reach their carbon reduction goals. Learn more about our company's mission, vision, and values at https://grassrootscarbon.com/
