Apify's mission is to make the web more programmable. Why? We believe that humans have better things to do than filling forms, compiling huge Excel sheets of copy-pasted data, or watching thousands of videos to determine whether they include a specific brand logo. Those are jobs for a machine. And just as in the real world we have tools that make our lives easier, we should have tools to make our lives easier on the web.
Web automation has profound impact. Our users automate service unsubscriptions to save people money and they collect billions of rows of data from all over the world for AI training, protection of consumers, or finding lost children. Fortune 500 companies and small startups alike need this power to make great business decisions. We help our clients improve their products, marketing, and business decisions using structured data from the web or automating repetitive tasks. We also participate in creating domestic non-profit projects such as Hlídač Shopů.
We’re looking for a Senior Data Engineer/Analyst to join us and help take our analytics and BI to the next level.
What you’ll be working on?
You will join our existing data team, which currently consists of four data analysts / analytical engineers (one mid level, and three senior level analysts) and team lead. You will work closely with three product teams and help navigate their teamwork for the higher success of the company. As a Senior Data Engineer, you will continue to build our data-driven approach, design and implement ETL (Extract, Transform, Load) solutions for transferring data between various sources and platforms. You will mostly work on ET part of the ETL process with emphasis on MongoDB data extraction (our production database).
The role will also bring interesting challenges. We’re not an e-commerce company with a straightforward ‘customer - item - order - payment - delivery’ data model. The Apify platform is open to many use cases involving diverse datasets (usage-based billing, multiple pricing models, in-house vs. community Actors, and more). This is a great playground for a true puzzle solver! We extract data from MongoDB, Google Analytics, Mixpanel, AWS, Hubspot, use Keboola as our ETL tool on top of Snowflake ❄️ (we manage it by ourselves), and utilize multiple reporting tools ( Redash, Tableau, Google Analytics for web traffic, Mixpanel for user events, and HighTouch for reverse-ETL, HubSpot for CRM).
Your main responsibilities will be:
- Work closely with three product engineers on maintaining and improving our data pipelines.
- Identify new data sources that could be used to enrich current datasets.
- Proactively identify gaps and errors in data pipelines, and work towards improving those areas.
- Own data architecture/modeling and data definition across company
- Guarantee data quality and integrity, and also drive critical thinking to challenge the status quo.
- Help other team members in the data team to remove road blocks regarding data extraction.
- Provide ongoing support to data team members and other company stakeholders to ensure the efficient adoption of new pipelines and datasets.
- Someone who can break down data engineering tasks into concise and simple communications with Data Team, and navigate the business context.
- An experienced data engineer with strong knowledge of NoSQL databases.
- Advanced knowledge of MongoDB (data extraction), SQL and Keboola.
- Experience with reporting tools (ideally Tableau).
- Experience with data architecture design, including data storage, ETL processes, and data pipelines.
- Attention to detail and strong organizational skills.
- Excellent communication skills (in English).
By the end of the first month, we expect:
- You get to know the data team, three product teams and the work they do, and what are the tasks that may come your way.
- You meet with other teams and stakeholders you may work with. This will provide you with an overview of the company, business, and the scope of our team’s work.
- You go through our data architecture and projects in Keboola, and you understand what data we work with and what added value we provide to our product and business teams. By the end of the first month, you understand the high-level data architecture and can answer simple data questions about the company.
By the end of the first 3 months, we expect:
- You dive deeper into one domain of the business (e.g., acquisition) and Apify’s product personas ( creators, integrators), discuss needs with the teams and work on designing and creating solutions for them. That could be extracting new data points from MongoDB, AWS S3 buckets, creating new pipelines in Keboola, updating current ones, or preparing data automatization.
- At the end of the 3 months, you have an overview of the whole data architecture and more extensive knowledge of a specific business domain.
- You pick two product teams you will work closely with.
- You attend their planning meetings and provide valuable insights that help navigate the future directions of the teams’ work.
- You understand how the company and analytics/BI operate and are ready to help Apify reach new heights.
- You drive higher adoption of data-informed decisions across the teams you collaborate with.
Why should you work for Apify?
- Space, support, and autonomy for personal growth, with a direct impact on our success
- Full-time position in Prague at our amazing office located in Lucerna Palace, Prague, Czech Republic 🏰
- Option to work remotely 🛋️
- Flexible working hours (perfect for both night owls 🦉 and early birds 🐥)
- Zero bullshit policy – No Jira, no Okta, no Confluence, no Certified™ScrumMasters®, no quarterly estimates. Just GitHub, GSuite, Notion, Slack, and shipping stuff to production
- Nobody counts holidays as long as the work gets done 💪
- Stock options and profit sharing 💰
- We welcome pets, kids, and bikes at the office 🐕👨👧🚲
- Epic team buildings and offsites 🚢 with biking, canoeing, and other adventures 🪂
- Solid education and training budget, conference tickets, internal “Eat & Learn” sessions, and the possibility to work across teams 👩🏼💻👨🏽💻
- Generous hardware budget 💻
- Free lunches every day 🌮🍱🍜🍕🥡
- Unlimited supply of ☕ & 🍺 and snacks
- Free entry to the wonderful Prague Zoo 🐘
- Ping-pong, chess, PS5, lightsabers, foosball league after lunch, free Multisport card, and so on
Hiring process
-
Intro call ☎️ → Home assignment 🧑🏼💻 → Interview ✍️ → Meeting with the team 🍕 → Offer 📄
Top Skills
What We Do
Apify is a full-stack web scraping and browser automation platform that lets you extract data from websites and automate workflows on the web.
With Apify, you can turn any website into an API!