Overview:
We are looking for a Machine Learning Engineer who will be responsible for improving and maintaining our machine learning models/pipelines, developing scalable ETL pipelines, and optimizing the end-to-end data processing workflow. This role requires strong experience in computer vision, OCR, and deep learning, as well as the ability to deploy models in a production environment.
Responsibilities:
Design, develop, and maintain data pipelines to manage TIFF images, extracted fields, CSV, PDF, XML, and structured outputs.
Build and optimize ETL workflows for preprocessing (resizing, rotation, denoising) and ML pipeline integration.
Develop, train, evaluate, deploy, and monitor deep learning models in a production environment.
Develop API endpoints and integrate structured data into the existing data keying application.
Implement logging, monitoring, and error-handling mechanisms for model performance and data consistency.
Work with Dockerized deployments to streamline ML and data workflows.
Collaborate with ML engineers, software developers, and quality test engineers to ensure seamless integration.
Qualifications:
Bachelor's Degree holder.
3+ years of experience in machine learning, ML engineering, or a hybrid role.
Proficiency in Python for data manipulation, API development, and ML integration.
Ability to write efficient ETL scripts and manage large datasets.
Strong knowledge of database management (SQL, NoSQL, PostgreSQL, or similar).
Familiarity with YOLO-based object detection and OCR processing.
Knowledge of image processing and computer vision tools such as Pillow, OpenCV, pdf2image, etc.
Experience with training and testing machine learning models using Tensorflow, Pyspark, and Scikit-learn.
Hands-on experience with Docker, Kubernetes, and containerized ML workflows.
Experience working with Azure services (Data Factory, Blob Storage, ML Studio, and other compute resources).
Experience with Flask, FastAPI, or other API frameworks.
Knowledge of MLOps best practices for deploying and monitoring ML models in production.
Hands-on experience using Git and Github/Gitlab, and Github Actions.
Exposure to DevOps practices for CI/CD pipelines in ML projects.
Ability to quickly learn and apply enterprise AI tools and technologies to support technical workflows and business objectives.
Preferred Skills (Nice to Have):
Experience working in insurance, document processing, or OCR-related applications.
Knowledge of distributed data processing (Spark, Dask, or similar).
Familiarity with NLP and LLM-based models.
We know your well-being and happiness are key to a long and successful career. We are delighted to offer country specific benefits. Click here to access benefits specific to your location.
We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form or please contact 1-855-833-5120.
Criminals may pose as recruiters asking for money or personal information. We never request money or banking details from job applicants. Learn more about spotting and avoiding scams here.
Please read our Candidate Privacy Policy.
We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law.
USA Job Seekers:
EEO Know Your Rights.
Skills Required
- Bachelor's Degree
- 3+ years experience in machine learning, ML engineering, or hybrid role
- Proficiency in Python for data manipulation, API development, and ML integration
- Ability to write efficient ETL scripts and manage large datasets
- Database management knowledge (SQL, NoSQL, PostgreSQL or similar)
- Experience with computer vision, OCR, and deep learning
- Familiarity with YOLO-based object detection and OCR processing
- Knowledge of image processing tools such as Pillow, OpenCV, pdf2image
- Experience training and testing ML models using TensorFlow, PySpark, and scikit-learn
- Hands-on experience with Docker and Kubernetes
- Experience with Azure services (Data Factory, Blob Storage, ML Studio, compute resources)
- Experience developing APIs with Flask, FastAPI, or similar frameworks
- Knowledge of MLOps best practices for deploying and monitoring production models
- Hands-on experience using Git and GitHub/GitLab and GitHub Actions
- Exposure to DevOps practices for CI/CD pipelines in ML projects
- Ability to quickly learn and apply enterprise AI tools and technologies
- Experience in insurance, document processing, or OCR-related applications
- Knowledge of distributed data processing (Spark, Dask, or similar)
- Familiarity with NLP and LLM-based models
RELX Compensation & Benefits Highlights
The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about RELX and has not been reviewed or approved by RELX.
-
Retirement Support — Retirement support is positioned as a meaningful part of total rewards through a 401(k) plan with matching contributions, alongside other financial protections such as life and disability coverage. Tuition reimbursement and share purchase access further broaden the financial value of the package beyond base salary.
-
Leave & Time Off Breadth — Leave and time off breadth appears strong, with generous vacation allowances, mental health days, and options like sabbaticals and tiered PTO by tenure. Parental and caregiving leaves are described in detail, reinforcing time-away benefits as a standout component of the overall package.
-
Wellbeing & Lifestyle Benefits — Wellbeing and lifestyle benefits are supported by offerings such as mental health support (e.g., app access), EAP resources, gym-related perks, and wellness incentives. Flexible working hours and related work-life supports add to the perceived day-to-day value of benefits.
RELX Insights
What We Do
RELX is a global provider of information-based analytics for professional and business customers across industries. We help scientists make new discoveries, doctors and nurses improve the lives of patients and lawyers win cases. We prevent online fraud and money laundering, and help insurance companies evaluate and predict risk. Our events enable customers to learn about markets, source products and complete transactions. In short, we enable our customers to make better decisions, get better results and be more productive. We do this by leveraging a deep understanding of our customers to create innovative solutions which combine content and data with analytics and technology in global platforms. RELX serves customers in more than 180 countries and has offices in about 40 countries. It employs approximately 30,000 people of whom almost half are in North America. We operate in four major market segments: Scientific, Technical & Medical; Risk & Business Analytics; Legal; and Exhibitions.





.jpg)


