Full Stack Data Discovery Engineer

3 Locations
In-Office
90K-138K Annually
Mid level
Productivity • Software • App development • Automation
We build unrivaled Document Processing Technology for Developers.

The Role:

We are hiring a Full-Stack Data Discovery Engineer to design and ship end-to-end systems that uncover technology usage across public and private ecosystems. You will build innovative backend pipelines (APIs, crawlers, document fingerprinters, package-registry miners, etc.) and frontend dashboards or analyses that transform raw signals into actionable insights. You will combine engineering skill with investigative creativity to discover patterns across these ecosystems.

 

Responsibilities:

  • Own the full stack: Design, build and optimize scalable data pipelines to discover OSINT and software usage across a wide public ecosystem.
  • Pipeline development: Develop APIs, microservices, crawlers, and document fingerprinting tools to gather data securely and efficiently. Implement backoff and caching, normalize data, and persist results to SQL/NoSQL indexes.
  • Data Discovery: Conduct systematic searches across the web, public databases, developer ecosystems and other platforms to identify potential external data repositories relevant to organizational objectives.
  • Metadata and Attribution Analysis: Programmatically uncover and analyze metadata associated with identified data sources to understand data structure, content, quality, and potential use cases.
  • Signals & scoring: Develop heuristics and ML-lite ranking to identify relevant artifacts, deduplicate them, and assign confidence scores.
  • Data Governance: Ensure data quality, security, compliance and governance.
  • Productize discovery: Build internal tools that let non-engineers run searches, review candidates, and export leads quickly and safely.
  • Documentation and Reporting: Document data structures, origins (data lineage), and quality issues. Create clear, concise reports and presentations to communicate findings and recommendations to technical and non-technical stakeholders.
  • Collaboration: Work closely with data stewards, data architects, and internal business units to define data requirements and facilitate the integration of new data sources.
  • Innovation and Scale: Continuously explore new data sources, improve attribution logic, and propose ML-based enhancements for finding and classifying data.

 

Requirements:

  • Education: Bachelor's degree in Computer Science, Engineering, Library Science, Information Systems, Data Management, or a related field (Master's degree preferred).
  • Experience: 1-5 years of proven experience as a full-stack developer and data engineer, including originating a project from its initial idea.
  • Technical Skills:
    • Back-end: Python, SQL, Java and Node.js
    • Front-end: Modern JS/TS + React, component libraries, auth patterns, state management.
    • Data & search: schema design, dedup/near‑dup logic, Elasticsearch/OpenSearch; building usable search/triage UIs.
    • Acquisition: Scrapy/Playwright/Puppeteer; API design with rate‑limit/backoff; ethical crawling.
    • Experience with cloud-native architecture and containerization.
    • Familiarity with metadata standards (e.g., Dublin Core, XML) and data management tools.

Assets:

  • Knowledge of data visualization tools (e.g., Power BI, Tableau) to present findings.
  • Experience building internal platforms/tools used by end users or GTM teams.

Soft Skills:

  • Exceptional attention to detail and strong analytical thinking skills.
  • Excellent written and verbal communication skills, with the ability to translate technical findings into business insights.
  • Strong problem-solving aptitude and the ability to work independently and collaboratively in a fast-paced environment.

 

Benefits:

  • Competitive salary commensurate with experience and qualifications.
  • A comprehensive extended benefits package including health, dental and vision for you and your family, with company-paid offerings.
  • 401(k) savings program with company match.
  • Generous paid time off (PTO) to support your ability to rest and recharge.
  • A great team environment and resources, supporting you to do the best work of your life and providing unlimited career growth potential.
  • Highly autonomous and entrepreneurial environment.
  • Annual recurring WFH allowance for you to purchase items you need for your home office.
  • Ongoing support for learning and development so you can master your craft.
  • Work with the hardware you're most comfortable with (Windows or Mac).
  • Diverse and inclusive workplace where we all learn from each other.
  • Excellent work-life balance with a flexible remote work environment.


Company Description

As the industry-leading provider of document software development kit (SDK) technology, powering everything from traditional desktop software to innovative web and mobile applications, Apryse is committed to delivering cutting-edge technology solutions that empower our clients to achieve their goals. With a broad international portfolio of combined companies, products, and leading technologies, we are actively changing the way the world works with documents to make work better and life simpler.

Customers like IBM, Autodesk, DocuSign, Boeing, Microsoft (and many more!) come to us, the #1-ranked commercial document SDK of choice for companies worldwide, to realize their web and mobile strategies for document management, editing, and collaboration. As a result, you can find our document technology in thousands of solutions, including those of household names, used by millions across virtually every industry. Our Xodo app alone has 25M unique installs and counting, along with the highest ratings among PDF productivity apps on the largest online app marketplaces.

Internally, we foster an atmosphere of opportunity, growth, and success for every individual amidst an exciting and challenging entrepreneurial culture. Career progression is based on merit, not tenure. Every member of our vibrant team is empowered to be a contributor, innovator, and successful leader.

Ready to join our team?

If you are interested in helping Apryse deliver on its commitments and taking your career to the next level, we invite you to apply online now. Additionally, we view the above section as a guide, not a checklist. We welcome diverse and non-traditional backgrounds and encourage you to apply even if you do not have every requirement listed.

The compensation for this position is commensurate with experience, with a range of $90,000.00-$138,000.00 USD in on-target earnings.

We are committed to a work environment that is inclusive to all and free of discrimination. It is our policy to be an equal opportunity employer without regard to race, color, religion, sex, age, national origin, disability, sexual orientation, gender identity or expression, genetic predisposition or carrier status, veteran status, citizenship status or any other factors prohibited by law. Apryse will provide reasonable accommodations for qualified individuals.

Top Skills

Elasticsearch
Java
Modern JS/TS
Node.js
OpenSearch
Playwright
Puppeteer
Python
React
Scrapy
SQL


The Company
HQ: Denver, CO
665 Employees
Year Founded: 1998

What We Do

Apryse, previously known as PDFTron, takes document solutions to the next level, making work better and life simpler.

As a global leader in document processing technology, Apryse gives developers, enterprise customers, and small businesses the tools they need to reach their document goals faster and more easily.

Apryse’s market-leading SDK drives digital transformation and powers next-generation software applications with dynamic document viewing, annotation, processing, and conversion capabilities, as well as advanced features such as document understanding, data extraction, and redaction.

Apryse technology supports all major platforms and dozens of unique file types, including support for PDF, MS Office, and CAD formats. It’s an easier and faster way to build document functionality, making your developers more productive and your users happier.

Our product portfolio includes the Apryse developer suite with server, mobile, and web SDKs, iText’s PDF SDK, and low-code integrations. Xodo and eversign serve small and mid-sized businesses.

Why Work With Us

Here at Apryse, we live by four core values: Win Together, Always Learning, Quality First, and Strength in our Differences. Brought together with a common goal, every team member is a crucial piece of the puzzle, and our collective success is a direct outcome of the dedication and passion of our people.


Apryse Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Like our customers, we believe that talent transcends borders. This is why we are a remote-first company with offices around the globe. Employees near an office can come in as often or as seldom as they like.

Typical time on-site: Flexible
Denver, Colorado HQ
Singapore Office
Boston, Massachusetts Office
Vancouver, Canada Office
Ghent, Belgium Office