Transcription Software Engineer

Reposted Yesterday
Hiring Remotely in Austin, TX
In-Office or Remote
140K-170K Annually
Senior level
Information Technology
The Role
Responsible for improving transcription quality using ASR models, building integration pipelines, conducting data analysis, and deploying services to production.
Summary Generated by Built In
At LeoTech, we are passionate about building software that solves real-world problems in the Public Safety sector. Our software has been used to help the fight against continuing criminal enterprises, drug trafficking organizations, identifying financial fraud, disrupting sex and human trafficking rings and focusing on mental health matters to name a few.

Role

  • This is a remote, WFH role.
  • We are seeking a highly skilled Transcription Engineer to join our Platform Team. This role is core to our mission of extracting intelligence from audio in some of the most challenging environments. You will be responsible for advancing transcription quality across our workflows—experimenting with ASR models, integrating third-party services, and building tooling that ensures accuracy, reliability, and scalability. The ideal candidate has a strong software engineering background with expertise in Python, audio processing, and applied machine learning techniques for speech. A mix of hands-on engineering, data science, and DevOps skills is essential, as the role involves both experimentation with ASR models and deploying services into production at scale. This is a challenging and rewarding role for someone who is passionate about audio, language, and building high-quality systems that power real-world intelligence use cases.

Core Responsibilities

  • Lead efforts to improve transcription quality by evaluating, testing, and fine-tuning ASR models (both commercial APIs and open-source).
  • Build pipelines that handle speaker identification, diarization, multi-language support, and noise-robust transcription in difficult audio environments.
  • Develop and maintain services that integrate multiple ASR providers, ensuring resilience and flexibility across transcription workflows.
  • Collaborate with platform engineers to ensure seamless ingestion and persistence of transcription outputs in data pipelines.
  • Use data wrangling and exploratory analysis to deeply understand transcription accuracy and error patterns. - Explore and apply audio engineering techniques (denoising, voice isolation, codecs, signal processing) to improve speech clarity.
  • Deploy and maintain transcription-related services with basic DevOps practices, ensuring scalability and reliability.
  • Participate in all stages of the development lifecycle: ideation, design, prototyping, implementation, deployment, and iteration.

What We Value

  • Strong software engineering background in fields such as Computer Science, Software Engineering, or related disciplines.
  • 5+ years of professional development experience, with significant focus on speech processing, NLP, or transcription systems.
  • Proficiency in Python and comfort with system-level programming when needed.
  • Experience with ASR frameworks (e.g., Whisper, Kaldi, Vosk, NVIDIA NeMo, or similar).
  • Familiarity with audio engineering tools (e.g., ffmpeg, Sox) and denoising/voice enhancement techniques.
  • Knowledge of speaker diarization, speaker recognition, and multi-language ASR challenges.
  • Experience with data analysis and wrangling (e.g., Pandas, NumPy, Jupyter) to evaluate model performance.
  • Understanding of cloud deployment and DevOps basics (e.g., Docker, Kubernetes, serverless workloads).
  • Comfort working in a fast-paced environment with dynamic objectives and quick iteration cycles.
  • Demonstrated ability to work independently, make tradeoffs, and deliver results with minimal supervision.
  • Bonus Points
  • Hands-on experience fine-tuning ASR models on domain-specific datasets.
  • Familiarity with real-time streaming pipelines for audio ingestion and transcription.
  • Exposure to search and retrieval systems (e.g., Elasticsearch) for indexing transcribed text.
  • Prior experience in audio forensics or noisy-channel speech analysis.
  • Experience with applying heuristics to improve transcription results.

Technologies We Use

  • We are hosted on AWS Cloud and use numerous AWS services. 
  • Our backend languages primarily consist of Elixir, NodeJS and some Python. 
  • TypeScript and React are central to our front-end development. 
  • Terraform, CloudFormation, Ansible are leveraged for our Infrastructure deployment and automation. 
  • Industry-standard build tooling and CI/CD using AWS CodePipeline and GitHub Actions. 
  • A low-code test automation framework for end-to-end testing.  
  • A mix of open-source and proprietary technologies that are tailored to the problems at hand.

What You Can Expect

  • Work from home opportunity
  • Enjoy great team camaraderie.
  • Thrive on the fast pace and challenging problems to solve.  
  • Modern technologies and tools.
  • Continuous learning environment.
  • Opportunity to communicate and work with people of all technical levels in a team environment.
  • Grow as you are given feedback and incorporate it into your work.
  • Be part of a self-managing team that enjoys support and direction when required.  
  • 3 weeks of paid vacation – out the gate!!  
  • Competitive Salary.
  • Generous medical, dental, and vision plans.
  • Sick, and paid holidays are offered.

LeoTech is an equal opportunity employer and does not discriminate on the basis of any legally protected status.

Top Skills

Ansible
Asr Frameworks
Aws Cloud
CloudFormation
Docker
Elixir
Kubernetes
Node.js
Numpy
Pandas
Python
React
Terraform
Typescript
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Los Angeles, California
90 Employees

What We Do

LeoTech is leading the effort to assist public safety efforts around the nation. We bring industry expertise and innovative technology to address real-world problems public safety agencies face every day.

Our search and analytics platform, Verus, has been used to assist in homeland security, provide assistance for mental health, and increase safety and awareness of public safety.

Everything we do and build is aligned with our mission and the goals of our public safety partners. We work hand-in-hand to build technology solutions that drive the future of public safety

Similar Jobs

Dandy Logo Dandy

Staff Software Engineer

Computer Vision • Healthtech • Information Technology • Logistics • Machine Learning • Software • Manufacturing
Remote
USA
1800 Employees
221K-268K Annually

Headway Logo Headway

Senior Manager, Strategy & Operations (Claims Pod)

Consumer Web • Healthtech • Professional Services • Social Impact • Software
Easy Apply
Remote
USA
819 Employees
146K-215K Annually

Dropbox Logo Dropbox

Director, Virtual First

Artificial Intelligence • Cloud • Consumer Web • Productivity • Software • App development • Data Privacy
Remote
United States
2500 Employees
205K-277K Annually

HopSkipDrive Logo HopSkipDrive

Director of Litigation and Regulatory Affairs

Automotive • Edtech • Kids + Family • Mobile • Social Impact • Transportation
Easy Apply
Remote
United States
450 Employees
180K-200K Annually

Similar Companies Hiring

Axle Health Thumbnail
Logistics • Information Technology • Healthtech • Artificial Intelligence
Santa Monica, CA
17 Employees
Scrunch AI Thumbnail
Software • SEO • Marketing Tech • Information Technology • Artificial Intelligence
Salt Lake City, Utah
Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account