Senior Data Engineer (MLOps)

Posted 4 Days Ago
Hiring Remotely in Poland, IN
Remote
Senior level
Information Technology • Consulting
The Role
This role involves designing and implementing robust data systems, creating scalable data architectures, overseeing ETL processes with Apache Airflow, and building web scraping solutions. The Data Engineer will optimize cloud data solutions and work with MLOps to operationalize machine learning models.

Who We Are

Massive Rocket is a high-growth Braze & Snowflake agency that has made significant strides in connecting digital marketing teams with product and engineering units. Founded just 5 years ago, we have experienced swift growth and are now at a crucial juncture, aspiring to reach $100M in revenue. Our focus is on delivering human experiences at scale, leveraging the latest in web, mobile, cloud, data, and AI technologies. We pride ourselves on innovation and the delivery of cutting-edge digital solutions.


Every role at Massive Rocket is entrepreneurial. Successful people here think beyond their own role: they understand the roles around them and their goals, and they contribute to the success and growth of their team, customers, and partners.


What We Offer

🚀 Fast-moving environment – you will never stop learning and growing

❤️ Supportive and positive work culture with an emphasis on our values

🌍 International presence – work with team members in Europe, the US, and around the globe

🪐 100% remote forever

🌴 Flexible Vacation Policy

🧗🏼‍♂️ Career progression paths and opportunities for promotion/advancement

🍕 Organised team events and outings


What we’re looking for

Massive Rocket, a global Martech agency specializing in Braze and Snowflake, is looking for a talented Data Engineer to join our growing team. We work with clients across the U.S., U.K., and European Union, delivering cutting-edge marketing technology solutions.

We are seeking a highly skilled and motivated Data Engineer to join our growing team. As a key member of our engineering organization, you will be designing and implementing robust, scalable, and efficient data systems that power analytics, machine learning models, and business insights. You will work closely with our engineering and product teams to deliver cutting-edge AI and data solutions.


Responsibilities


i) Data Architecture & Development: 

- Design and implement scalable, secure, and high-performance data lake and data warehouse solutions.

- Leverage best practices in schema design, partitioning, and optimisation for efficient storage and retrieval.

- Build and maintain data models to support analytics and machine learning workflows.

ii) Pipeline Orchestration: 

- Develop, monitor, and optimize ETL/ELT workflows using Apache Airflow.

- Ensure data pipelines are robust, error-tolerant, and scalable for real-time and batch processing.

iii) Data Scraping & Unstructured Data Processing:

- Develop and maintain scalable web scraping solutions to collect data from diverse sources, including APIs, websites, and other unstructured data sources.

- Extract, clean, and transform unstructured data such as text, images, and log files into structured formats suitable for analysis.

- Use tools and frameworks like BeautifulSoup, Scrapy, or Selenium for web scraping, and natural language processing (NLP) techniques for text processing.

iv) Cloud Integration:

- Design and implement cloud-native data solutions with Microsoft Azure.

- Optimize costs and performance of cloud-based data solutions.

v) Infrastructure as Code (IaC):

- Use Terraform to automate the provisioning and management of cloud infrastructure.

- Define reusable and modular Terraform configurations to support scalable deployment of resources.

vi) MLOps:

- Collaborate with data scientists and machine learning engineers to operationalise machine learning models.

- Implement CI/CD pipelines for machine learning workflows, ensuring efficient model deployment and monitoring.

vii) Containerisation and Orchestration:

- Utilize Kubernetes and containerisation technologies (e.g., Docker) to deploy scalable, fault-tolerant data processing systems.

- Manage infrastructure and resource allocation for containerised data applications.

viii) Collaboration:

- Collaborate effectively with developers and other stakeholders to understand their needs and provide appropriate platform solutions.

ix) Documentation:

- Maintain comprehensive documentation of platform architecture, processes, and procedures.


Required Skills and Qualifications:

- 5+ years of experience in data engineering or a related field.

- Strong expertise in data pipeline orchestration tools such as Apache Airflow.

- Proven track record of designing and implementing data lakes and warehouses (experience with Azure is a plus).

- Solid understanding of MLOps practices, including model training, deployment, and monitoring.

- Proficiency in programming languages such as Python & SQL.

- Experience with distributed computing frameworks such as Spark.

- Familiarity with version control systems (e.g., Git) and CI/CD pipelines.

- Collaboration and Communication: Effective communicator and team player, comfortable working with cross-functional teams to deliver high-quality solutions.

- Agency experience: Experience working in an agency setting with clients.

- English C1 Level: Strong communication skills with a professional level of proficiency in English.


Bonus Skills and Experiences:

- Demonstrated experience with Terraform for infrastructure provisioning and management.

- Hands-on experience with Kubernetes and containerised environments.

- Experience in the healthcare or medical industry.

- Familiarity with compliance standards like HIPAA.


Desired Qualities:

- Innovative Problem-Solver: A creative thinker who can efficiently solve complex problems and adapt to new technologies and changing product requirements.

- Quality Advocate: Passion for quality and a dedication to understanding the user’s perspective and how it impacts the product’s overall experience.

- Effective Communicator: Strong interpersonal and communication skills, with the ability to articulate issues, solutions, and concepts to technical and non-technical stakeholders alike.

- Leadership Potential: While direct leadership experience is not mandatory, the aptitude to mentor others and lead by example in software engineering practices is highly valued.


During the process, please be ready to provide:

• Valid work visa - Massive Rocket does not provide sponsorship at the moment.

• Proof of identification: ID card, passport, utility bill (gas, water, electricity)

• 2 references - Name, Relationship, Contact details (Email, Mobile)

• Contractors Only: proof of incorporation and insurance


Note: Please ensure that your qualifications closely match the criteria outlined in the job description. Applications not meeting the specified criteria may not be processed or considered for this position.

Top Skills

Apache Airflow
Natural Language Processing
The Company
London
71 Employees
On-site Workplace
Year Founded: 2018

What We Do

Global Braze Agency, Massive Rocket, offers full-service solutions for businesses (Data, Engineering & CRM). We help our customers use data to understand their customers and automate communications across channels. We Grow Customer Lifetime Value. Our Global Delivery Centres support the most sophisticated customers in the world across the US, EMEA and APAC regions.

Automate your end-to-end customer experience with our key services:

- Marketing Technology Stack Design

- Customer Engagement (CRM)

- Data Warehouse Management (DW)

- Customer Data Management (CDP)

We extend your team with specialists:

Strategy | Planning | Setup | Integration | Execution

As a consultancy, we deliver solutions that don’t just help digital marketing teams achieve their goals but also generate predictable growth. We adopt and implement new-age solutions that fulfil the needs of the new-age customer-centric companies.

We are proud to be the technology partners of the leading technology solution providers in the industry:

1. Braze (One of the 4 Certified Level 4 “Orbit” Braze Partners)

2. mParticle (Solutions Partner of the Year)

3. Snowflake

4. Segment

Get in touch to know more.
