Principal Data Engineer

Posted 4 Hours Ago
United States
Senior level
Biotech
The Role
The Principal Data Engineer will design and implement end-to-end data workflows for ML modeling, improve model performance, and enhance data processing frameworks. Responsibilities include creating standards and best practices, ensuring data accuracy, and collaborating with data scientists to meet data needs.
Summary Generated by Built In

As a Principal Data Engineer, you will work with ML research scientists across IDEXX R&D to enable our product related to be leveraged for ML modeling. You will design and implement end-to-end data workflows for ML model building and monitoring for complex products, including imaging and clinical and operational solutions data. Additional data engineering work will impact the search, retrieval, processing, tagging, and publishing of curated data sets, with a focus on improving ML model performance over time.

Unfortunately, we are unable to provide sponsorship for this role.

Department:

IDEXX Data and AI Centre of Excellence develops and delivers data and AI assets and solutions to enhance IDEXX R&D, products, software, services, internal operations, and business practices.

You can find out more about the latest product developed in collaboration with AI/ML CoE here - https://www.idexx.com/en/veterinary/analyzers/invue-dx-analyzer/

Our tech stack:

AWS, Python, SQL

Also, Snowflake, Hadoop, Databricks, Spark, R

In this role:

  • You will design and implement scalable, reliable distributed data processing frameworks and analytical infrastructure using multiple technologies, including data sets or data warehouses, data virtualization and services, and repositories of semi-structured data sets.

  • You will design automated software deployment functionality that efficiently manages applications across distributed platforms.

  • You will monitor structural performance and utilization, identify problems, and implement solutions.

  • You will lead the creation of standards, best practices, and new processes for the operational integration of new technology solutions.

  • You will ensure environments are compliant with defined standards and operational procedures.

  • You will implement measures to ensure data accuracy and accessibility, constantly monitoring and refining the performance of data management systems.

  • You will understand structural requirements and define standards for storing, consuming, integrating, and managing data for machine learning applications.

  • You will collaborate with data scientists and analysts to understand their data needs and develop solutions to meet those needs.

  • You will develop and maintain data systems, processes, and procedures documentation.

  • You will complete problem tickets, including bug fixes, design modifications, and enhancement based on customer requirements.

What you need to succeed:

  • You have 5 or more years of experience working in machine learning and have delivered solutions into production in a professional setting.

  • You have 5 or more years of experience using Python, SQL, Spark

  • Nice to have: Experience with Databricks, Spark, with Big Data.

It would be helpful if you:

  • Your technical background is in Artificial Intelligence (AI) and Machine Learning (ML).

  • You have experience owning a technology product and assuming a technical lead role.

  • You understand structural requirements and can define standards for storing, consuming, integrating, and managing data.

  • You are proficient in coding and programming languages such as Structured Query Language (SQL) and Python. Familiarity with R will be an advantage.

  • You are familiar with cloud platforms such as Amazon Web Services (AWS).

  • You have experience or a good understanding of:

  • - Hadoop-based technologies like MapReduce and Spark

  • - SQL-based technologies like Oracle, PostgreSQL and MySQL

  • - Data processing tools including DBT

  • - Cloud-based data platforms, including Databricks and Snowflake

  • - data warehousing solutions and relational database theory

  • - industry-standard software APIs

  • You have good verbal and written communication skills and can translate technical subject matter to non-technical audiences.

  • You take the initiative in resolving problems and can balance conflicting requirements in partnership with others.

  • You excel at customer service and building relationships.

  • You have experience building distributed and cloud-based data pipelines.

Why IDEXX:

We’re proud of the work we do because our work matters. An innovation expert in every industry we serve, we follow our Purpose and Core Values to help pet owners worldwide keep their companion animals healthy and happy, to ensure safe drinking water for billions, and to help farmers protect livestock and poultry from disease. We have customers in over 175 countries and over 10,000 talented employees globally.

So, what does that mean for you? We enrich the livelihoods of our employees with a positive and respectful work culture that encourages learning and discovery. At IDEXX, you will be motivated by generous compensation, incentives, and benefits while enjoying purposeful work that drives improvement.

Let’s pursue what matters together.

IDEXX values diversity and encourages women, people of color, LGBTQ persons, people with disabilities, members of ethnic minorities, foreign-born residents, and veterans to apply.

IDEXX is an equal-opportunity employer. Applicants will not be discriminated against because of race, color, creed, sex, sexual orientation, gender identity or expression, age, religion, national origin, citizenship status, disability, ancestry, marital status, veteran status, medical condition, or any protected category prohibited by local, state, or federal laws.

#LI-KS1

Top Skills

Python
SQL
The Company
HQ: Westbrook, ME
6,764 Employees
On-site Workplace
Year Founded: 1983

What We Do

10,000 people, one global focus - enhancing the health and well-being of pets, people, and livestock

We are passionate about what we do at IDEXX – and why wouldn’t we be? When you’re working to raise the standard of care for pets, make drinking water safe for billions and keep our livestock population around the globe healthy and free of disease, it’s no wonder that what we do each day is more than just a job. There’s an energy across IDEXX that is contagious – where caring and committed people come together to make things better.

IDEXX Laboratories, Inc. (NASDAQ: IDXX), a member of the S&P 500, is a leader in pet healthcare innovation, serving practicing veterinarians around the world with a broad range of diagnostic and information technology-based products and services. Headquartered in southern Maine, we conduct operations through more than 70 locations around the world and serve customers in over 175 countries.

Our primary business focuses on pet health, a growing market around the world. Our products —in-clinic diagnostic tests and instrumentation, reference laboratory and telemedicine consultation services, and practice management software—enhance the ability of veterinarians to provide advanced medical care, improve staff efficiency and to build more economically successful practices.

We also develop and manufacture diagnostic tests and information for the global production animal industry, including poultry and livestock, as well as tests for the quality and safety of water and milk.

Please visit our website, IDEXX.com/careers, for further information and to view all of our job opportunities.

Jobs at Similar Companies

Pfizer Logo Pfizer

[SC] Manager, Business Development Japan

Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
Hybrid
Tokyo, JPN
121990 Employees

Takeda Logo Takeda

Platform Engineer - Application Virtualization Administrator

Healthtech • Software • Analytics • Biotech • Pharmaceutical • Manufacturing
Remote
Hybrid
Santa Fé, San Felipe, Guanajuato, MEX
50000 Employees

SOPHiA GENETICS Logo SOPHiA GENETICS

Platform Software Development Intern

Artificial Intelligence • Big Data • Healthtech • Software • Biotech
Hybrid
Bidart, Pyrénées-Atlantiques, Nouvelle-Aquitaine, FRA
450 Employees

Similar Companies Hiring

SOPHiA GENETICS Thumbnail
Software • Healthtech • Biotech • Big Data • Artificial Intelligence
Boston, MA
450 Employees
Pfizer Thumbnail
Pharmaceutical • Natural Language Processing • Machine Learning • Healthtech • Biotech • Artificial Intelligence
New York, NY
121990 Employees
Takeda Thumbnail
Software • Pharmaceutical • Manufacturing • Healthtech • Biotech • Analytics
Cambridge, MA
50000 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account