Data Engineer

San Juan, PRI
Hybrid
Mid level
Information Technology • Software
The Role
The Data Engineer will build data infrastructure for predictive analytics, create data pipelines, and maintain ML datasets with a focus on data quality and validation.

What sets INVID apart is our collaborative and flexible work environment. We encourage our team to raise the bar in everything they do while maintaining a healthy work-life balance. With our hybrid work model, team members thrive both in the office and remotely. We foster a culture of mutual respect, autonomy, and accountability, where your voice matters and your growth is supported. From structured career paths and paid professional development to access to industry events, we’re committed to your success.

Join us at INVID, where innovation meets support, and together we deliver excellence.

Job Description

We are hiring a Data Engineer to build the data infrastructure powering our predictive analytics initiative. You will create the pipelines that turn raw vessel tracking data into training datasets for ML models. The core challenge: we have rich behavioral data (vessel positions, AIS gaps, ship-to-ship transfers, spoofing events) but limited labeled outcomes (confirmed violations, detentions, seizures). You will build pipelines that create usable training data through proxy labels, data joins, and outcome correlation. 

Responsibilities
• Build labeling pipelines that join behavioral events to outcome data (sanctions designations, flag changes, detentions)
• Implement proxy labeling strategies that create training signal from observable outcomes
• Build weak supervision infrastructure to combine multiple noisy labeling rules
• Create and maintain ML training datasets at scale
• Build data validation and quality monitoring systems
• Implement versioning for reproducible model training
• Integrate LRIT position data for prediction validation
• Build pipelines that compare predicted locations against actual LRIT reports
• Create feedback loops that improve model accuracy over time
• Scale data infrastructure as models and data sources grow

Required Skills
• 4+ years of data engineering experience
• Strong SQL skills, including complex joins across large datasets
• Experience with Spark, Airflow, or equivalent distributed processing and workflow orchestration frameworks
• Python for data processing and pipeline orchestration
• AWS experience
• Understanding of ML training data requirements

Education/Certifications
• Bachelor's Degree in Computer Science, Engineering, or related field

Desired Skills (Not Required)
• Experience with geospatial data (PostGIS, H3, spatial joins)
• Maritime, defense, or intelligence domain experience
• Experience with data labeling infrastructure or weak supervision
• Familiarity with real-time streaming data systems

Important:

Must be a U.S. citizen and a U.S. resident

This is a hybrid position based in San Juan, Puerto Rico

Must have a valid driver's license

EEO

Top Skills

Airflow
AWS
Python
Spark
SQL

The Company
HQ: San Juan
73 Employees
Year Founded: 2003

What We Do

With over 20 years of experience, INVID provides scalable, functional, and high-impact software that saves time and money for our customers. Our solutions engage employees, improve processes, and foster collaboration. Certifications: GSA, SBA 8(a), DBE, MBE, NSDC. Three years in a row on the Inc. 5000 list. Microsoft Gold Partner.
