Responsibilities
- Build ML operations analytics tracking pipeline health, costs, latency, and quality metrics across all services
- Lead LLM evaluation initiatives using frameworks like Braintrust or Langfuse, designing custom metrics, quality monitoring and LLM observability in general
- Collaborate with our data annotation team on prompt engineering and evaluation dataset creation
- Aggregate and operationalize user feedback for model improvement
- Create actionable dashboards and insights for ML teams and leadership
- Help define SLAs, cost optimization and failure prevention strategies for our ML pipelines
Qualifications
- 3+ years in data science, ML operations, or analytics engineering
- Experience with LLM evaluation frameworks and quality metrics design
- Strong Python, SQL, and data visualization skills
- Knowledge of prompt optimization techniques
- Excellent communication skills in English to translate technical metrics into business insights
- Collaborative mindset with ability to work across machine learning teams
- Ability to meet periodically in our Prague office as we value in-person collaboration
Top Skills
What We Do
Filevine is case management software built for and inspired by real attorneys. As a fully-featured suite of tools, it comes ready to manage every part of a moving case. Assign tasks, upload files or images, monitor staff productivity, and communicate with your client directly from within their case file.
Our software is built on the truth that every law firm functions differently. That’s why Filevine is so customizable. Build new case-type templates, design automatic workflows, and receive customized reports on a schedule that fits your needs.
Accessing your information is never a problem, because Filevine is hosted on The Cloud. To ensure security, your law firm’s data is protected through state-of-the-art encryption on redundant servers. All you need to get started is an internet connection and your favorite web browser.
Learn more at filevine.com.
Gallery







