Data Scientist - NLP

Posted 2 Days Ago
Washington, DC, USA
In-Office
Mid level
Information Technology • Database • Consulting
The Role
Support federal client engagements by preprocessing and engineering NLP features, building and validating classification and deep-learning models (transformers), and presenting model results and visualizations to inform public-sector decisions. Work includes production-ready NLP development, model validation, and collaboration on user stories.
Summary Generated by Built In

Analytica is seeking a Data Scientist to support long-term federal client engagements in the DC Metro area. The role will apply statistical programming, modeling, visualization, data mining, and forecasting skills to analyze challenging public sector problems.
This position is fully remote.
Analytica has been recognized by Inc. for three consecutive years as one of the 250 fastest-growing businesses. We offer competitive compensation with opportunities for bonuses, employer-paid health care, training and development funds, and a 401(k) match.
Responsibilities include:

  • Pre-processing - Demonstrate the skills and experience to collect, clean, and prepare data sets for input into a computational model using Python. Strong candidates will explain the methods they have applied using common pre-processing functions such as stop word removal, stemming, lemmatization, and tokenization.
  • Feature Engineering and Attribute Evaluation - Candidates must demonstrate experience with NLP feature engineering methods such as TF-IDF, word2vec, GloVe, and FastText, with identifying the key determinants for modeling that exist in the business process and within existing data sets, and with selecting evaluation protocols (model techniques).
  • Modeling - Candidates will have practiced skills and experience selecting classification modeling techniques to fit the business problem. Examples include supervised and unsupervised machine learning (ML), regression, neural networks and deep learning, and natural language processing.
  • Validation - Strong candidates will describe their experience with investigating, reporting, and justifying model results.
  • Visualization - Strong candidates will have experience presenting the results of their modeling activities, depicting the insights realized, and explaining the relevance of their results to the organization’s business challenges.
Qualifications:
  • Master's degree required, PhD preferred, in Statistics, Mathematics, Computer Science, or a similar field
  • High degree of experience utilizing SAS, R, or Python to support NLP use cases such as Document Summarization, Named Entity Recognition, Sentiment Analysis, and/or Topic Modeling
  • At least four years of experience developing scalable, production-ready NLP solutions using scikit-learn, Keras, TensorFlow, PyTorch, and Spark NLP
  • Experience using Git/GitHub to version control source code
  • Experience leveraging transformer architecture to develop NLP models
  • Experience with open-source NLP packages such as Gensim, SpaCy, or NLTK
  • Experience with BERT, GPT-J, RoBERTa, T5 or other transformers
  • Experience with GenAI and Prompt Engineering is a plus
  • Experience with Databricks and MLflow is a plus
  • Experience with machine translation and transcription of foreign language documents using Microsoft Azure translation services is a plus
  • Experience working in an AWS cloud environment and with related AWS services such as Bedrock and Textract
  • Experience coordinating and maintaining user stories
  • Must be a US citizen
  • Must be able to obtain and maintain a Public Trust security clearance

About ANALYTICA: Analytica is a leading consulting and information technology solutions provider to public sector organizations supporting health, civilian, and national security missions. Founded in 2009 and headquartered in Bethesda, MD, the company is an established SBA small business that has been recognized by Inc. Magazine each of the past three years as one of the 250 fastest-growing companies in the U.S.  Analytica specializes in providing software and systems engineering, information management, analytics & visualization, agile project management, and management consulting services. The company is appraised by the Software Engineering Institute (SEI) at CMMI® Maturity Level 3 and is an ISO 9001:2008 certified provider. 

Skills Required

  • Master's degree in Statistics, Mathematics, Computer Science, or similar
  • PhD in relevant field
  • Experience using SAS, R, or Python for NLP use cases (Document Summarization, NER, Sentiment, Topic Modeling)
  • At least four years developing scalable, production-ready NLP solutions using scikit-learn, Keras, TensorFlow, PyTorch, Spark NLP
  • Experience using Git/GitHub for version control
  • Experience leveraging transformer architecture (e.g., BERT, GPT-J, RoBERTa, T5)
  • Experience with open-source NLP packages such as Gensim, SpaCy, or NLTK
  • Experience coordinating and maintaining user stories
  • Must be a US citizen
  • Must be able to obtain and maintain a Public Trust security clearance
  • Experience with GenAI and Prompt Engineering
  • Experience with Databricks and MLflow
  • Experience with machine translation and transcription using Microsoft Azure translation services
  • Experience working in an AWS cloud environment and with services such as Bedrock and Textract
The Company
HQ: Washington, DC
110 Employees
Year Founded: 2009

What We Do

Analytica is an award-winning consulting and technology services provider that supports public-sector health, civilian, and national security missions. We specialize in data-driven solutions, which have been recognized by organizations such as NYU’s Governance Lab for driving public sector modernization and innovation. Analytica is an SBA-certified 8(a) and HUBZone business that has been honored by Inc. as one of the 250 fastest-growing businesses in the U.S. for three consecutive years. For information on the company, visit www.analytica.net. For career opportunities, visit careers.analytica.net.
