Software Engineer III -Gen AI Inferencing

Posted 5 Days Ago
Be an Early Applicant
4 Locations
In-Office
Senior level
Big Data • Fintech • Mobile • Payments • Financial Services • Data Privacy
The Role
Develop and deliver AI and Gen AI solutions, mentor other engineers, automate processes, and ensure compliance in an agile environment.
Summary Generated by Built In

Job Description:

At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. We do this by driving Responsible Growth and delivering for our clients, teammates, communities and shareholders every day.
Being a Great Place to Work is core to how we drive Responsible Growth. This includes our commitment to being an inclusive workplace, attracting and developing exceptional talent, supporting our teammates’ physical, emotional, and financial wellness, recognizing and rewarding performance, and how we make an impact in the communities we serve.
Bank of America is committed to an in-office culture with specific requirements for office-based attendance and which allows for an appropriate level of flexibility for our teammates and businesses based on role-specific considerations.
At Bank of America, you can build a successful career with opportunities to learn, grow, and make an impact. Join us!
 

Position Summary

Join a groundbreaking team at Bank of America, at the forefront of innovation in AI.  We are building the next generation of Gen AI platform, empowering new AI initiatives across Consumer, Small Business, Global Banking, and Wealth organizations. This is a unique opportunity to contribute to a critical platform that will enable secure, scalable, and high-performance AI capabilities across the organization. We value curiosity, collaboration, and a passion for pushing the boundaries of what’s possible with AI.

This position is focused on design, build, and operate of reusable toolkits for Gen AI RAG capabilities.

This job is responsible for developing and delivering complex requirements to accomplish business goals. Key responsibilities of the job include ensuring that software is developed to meet functional, non-functional and compliance requirements, and solutions are well designed with maintainability/ease of integration and testing built-in from the outset. Job expectations include a strong knowledge of development and testing practices common to the industry and design and architectural patterns.

Responsibilities:

  • Codes solutions and unit test to deliver a requirement/story per the defined acceptance criteria and compliance requirements
  • Designs, develops, and modifies architecture components, application interfaces, and solution enablers while ensuring principal architecture integrity is maintained
  • Mentors other software engineers and coach team on Continuous Integration and Continuous Development (CI-CD) practices and automating tool stack
  • Executes story refinement, definition of requirements, and estimating work necessary to realize a story through the delivery lifecycle
  • Performs spike/proof of concept as necessary to mitigate risk or implement new ideas
  • Automates manual release activities
  • Designs, develops, and maintains automated test suites (integration, regression, performance)
  • Utilizes multiple architectural components (across data, application, business) in design and development of client requirements
  • Manage multiple priorities, and simultaneously engage with multiple teams.
  • Participates in estimating work necessary to realize a story/requirement through the delivery lifecycle.
  • Be vocal and actively participate in all session with business stakeholders and agile teams.
  • Collaborate with product teams, data analysts and data scientists to design and build solutions.

Required qualifications:

  • 5+ years OOP in Python/Scala/Java programming experience with expert level development skills
  • Experience with AI/ML/GenAI Lifecycle Management and Development and its Ecosystem. Hands on experience building frameworks using MLOps, Fine – Tuning techniques, Inference Frameworks
  • Experience with deploying models using vLLM/Triton Inference Server in containers in production with automation. Performs Continuous Integration and Continuous Development (CI-CD) activities. Performance Tuning those models and deployment to provide higher throughput.
  • Track record of maintaining large scale Python/Unix based systems.
  • Hands on experience and knowledge generative AI RAG process for various use cases, including chunking, embedding, retrieval, reranking and summarization.
  • Hands-on experience in application development in one or more areas MongoDB, Redis, Angular/React Frameworks, Containerization, Building API based application leveraging FAST API services, JWT Integration, API Gateway
  • Develop efficient utilities, automation frameworks, data science platforms that can be utilized across multiple Data Science teams for AI/ML and GenAI work.
  • Working in large sized teams that collaboratively develop on a shared multi-repo codebase using IDEs (e.g. VS Code rather than Jupyter Notebooks), Continuous Integration (CI), Continuous Deployment (CD) and Continuous Testing
  • Strong automation, scripting, and Python development skills. Hands-on DevOps experience with one or more of the following enterprise development tools: Version Control (GIT/Bitbucket), Build Orchestration (Jenkins), Code Quality (SonarQube and pytest Unit Testing), Artifact Management (Artifactory) and Deployment (Ansible)

Desired Qualifications

  • Experience building & deploying Gen AI inferencing platform with open-source toolsets, building inferencing & servicing capabilities (AI Gateway, Policy store, Observability) for RAG/ MCP use cases etc.
  • Hands on experience on driving and maintaining a culture of quality, innovation, and experimentation.
  • Research on new tools and capabilities for better UI and UX for advanced analytics platform, quick prototype and demonstrate the features and capabilities, and participate on various user forums.

    Skills:

    • Application Development
    • Automation
    • Influence
    • Solution Design
    • Technical Strategy Development
    • Architecture
    • Business Acumen
    • DevOps Practices
    • Result Orientation
    • Solution Delivery Process
    • Analytical Thinking
    • Collaboration
    • Data Management
    • Risk Management
    • Test Engineering

    Shift:

    1st shift (United States of America)

    Hours Per Week: 

    40

    Top Skills

    Angular
    Ansible
    Artifactory
    Bitbucket
    Fast Api
    Git
    Java
    Jenkins
    Jwt
    Mlops
    MongoDB
    Pytest
    Python
    React
    Redis
    Scala
    Sonarqube
    Triton Inference Server
    Vllm
    Am I A Good Fit?
    beta
    Get Personalized Job Insights.
    Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

    The Company
    HQ: Charlotte, NC
    208,000 Employees
    Year Founded: 1784

    What We Do

    We make financial lives better for our clients and our communities through the power of every connection. Our employees are at the heart of this purpose, and are key to driving responsible growth.

    Every day, across the globe, our employees bring a commitment to our purpose and to driving responsible growth by living our values: deliver together, act responsibly, realize the power of our people and trust the team. A key aspect of driving responsible growth is doing so in a sustainable manner, a critical pillar of which is being a great place to work for our teammates.

    Gallery

    Gallery

    Similar Jobs

    Bank of America Logo Bank of America

    Software Engineer

    Big Data • Fintech • Mobile • Payments • Financial Services • Data Privacy
    In-Office
    4 Locations
    208000 Employees

    CoreWeave Logo CoreWeave

    Sr. Analyst, Tax US & Canada

    Cloud • Information Technology • Machine Learning
    In-Office
    4 Locations
    1450 Employees
    108K-143K Annually

    CoreWeave Logo CoreWeave

    Senior Engineer

    Cloud • Information Technology • Machine Learning
    In-Office
    4 Locations
    1450 Employees
    139K-204K Annually

    BlackRock Logo BlackRock

    Business Analyst

    Big Data • Cloud • Fintech • Financial Services • Conversational AI
    In-Office
    Princeton, NJ, USA
    21000 Employees
    95K-128K Annually

    Similar Companies Hiring

    Camber Thumbnail
    Social Impact • Healthtech • Fintech
    New York, NY
    53 Employees
    Rain Thumbnail
    Web3 • Payments • Infrastructure as a Service (IaaS) • Fintech • Financial Services • Cryptocurrency • Blockchain
    New York, NY
    40 Employees
    Scotch Thumbnail
    Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
    US
    25 Employees

    Sign up now Access later

    Create Free Account

    Please log in or sign up to report this job.

    Create Free Account