Sr. Staff AI Engineer - On-Prem AI Infrastructure & Agentic Systems

Posted 6 Days Ago
Be an Early Applicant
San Jose, CA, USA
In-Office
140K-165K Annually
Senior level
Information Technology • Semiconductor • Industrial
The Role
Design, deploy, and maintain on-prem AI infrastructure and agentic systems. Build GPU clusters, model serving, vector DB-backed RAG pipelines, fine-tune and deploy models, implement governance (MCP), and automate CI/CD and monitoring to integrate AI capabilities into enterprise workflows.
Summary Generated by Built In

About the Company:

At SK Hynix Memory Solution, we're at the forefront of semiconductor innovation, developing advanced memory solutions that power everything from smartphones to data centers. As a global leader in DRAM and NAND flash technologies, we drive the evolution of advancing mobile technology, empowering cloud computing, and pioneering future technologies. Our cutting-edge memory technologies are essential in today's most advanced electronic devices and IT infrastructure, enabling enhanced performance and user experiences across the digital landscape.

We're looking for innovative minds to join our mission of shaping the future of technology. At SK Hynix Memory, you'll be part of a team that's pioneering breakthrough memory solutions while maintaining a strong commitment to sustainability. We're not just adapting to technological change – we're driving it, with significant investments in artificial intelligence, machine learning, and eco-friendly solutions and operational practices. As we continue to expand our market presence and push the boundaries of what's possible in semiconductor technology, we invite you to be part of our journey to creating the next generation of memory solutions that will define the future of computing.


Why Join Us?

  • Build foundational AI infrastructure that powers next-gen enterprise systems.
  • Work on cutting-edge agentic AI — not just chatbots, but autonomous systems that reason, plan, and act.
  • Opportunity to influence AI strategy, deployment, and governance in a high-impact environment.

About the Role:

We are seeking a hands-on AI Engineer to design, deploy, and maintain on-prem AI infrastructure and build agentic AI systems that drive real-world automation. You’ll be responsible for setting up scalable AI environments, implementing RAG pipelines, fine-tuning embedded models, and architecting AI agents that operate autonomously in enterprise settings. This role sits at the intersection of AI systems engineering and applied ML — you’ll bridge infrastructure, model deployment, and agent logic.


Responsibilities:

  • Design and deploy on-prem AI infrastructure — including GPU clusters, model serving (e.g., vLLM, TGI, Triton), vector DBs (e.g., Milvus, Qdrant, FAISS), and orchestration (Kubernetes, Helm, Docker).
  • Build and optimize RAG pipelines — including document chunking, retrieval strategies (hybrid, re-ranking), and evaluation of retrieval accuracy and latency.
  • Develop agentic AI systems — design stateful agents with memory, tool use, and planning capabilities (e.g., using LangGraph, AutoGen, or custom frameworks).
  • Fine-tune and deploy embedded models — work with LoRA, QLoRA, or full fine-tuning for domain-specific tasks; optimize for edge/on-device inference.
  • Implement Model Control Protocols (MCP) — ensure model governance, versioning, access control, and monitoring for production AI systems.
  • Collaborate with product and engineering teams to integrate AI capabilities into enterprise workflows — especially in storage, QA, or systems engineering contexts.
  • Automate and monitor AI pipelines — build CI/CD for model deployment, logging, and performance tracking.

Minimum Qualifications:

  • 2+ years of experience in AI/ML engineering, with hands-on deployment of AI systems on-prem or private cloud.
  • Proven experience building agentic AI systems — including state management, tool integration, and multi-step reasoning.
  • Strong working knowledge of RAG architectures — chunking, retrieval, re-ranking, evaluation metrics.
  • Experience with model fine-tuning (LoRA, QLoRA, full fine-tuning) and embedding models for retrieval.
  • Familiarity with Model Control Protocols (MCP) or similar governance frameworks (model versioning, access control, audit trails).
  • Proficiency in Python, Linux, Docker/Kubernetes, and vector databases (e.g., Milvus, Qdrant, Pinecone).
  • Experience with AI serving frameworks (vLLM, TGI, Triton, Ollama, etc.).

Preferred Qualifications:

  • Experience deploying AI in enterprise storage or hardware-adjacent environments.
  • Background in systems engineering or QA automation — bonus if you’ve used AI to automate testing or validation.
  • Familiarity with embedded AI or edge inference (ONNX, TensorRT, GGUF, etc.).
  • Experience with AI agent frameworks (LangGraph, AutoGen, BabyAGI, etc.).
  • Knowledge of AI observability tools (LangSmith, Weights & Biases, Prometheus/Grafana for AI).
  • As a Storage company, knowledge of storage area/NVMe is a PLUS.

Education Requirement:

  • Bachelor of Science in CS, EE, ME, or other applicable Engineering field.

COMPENSATION$140,000/yr - $165,000/yr


REGARDING COMPENSATION:

SK hynix memory solutions America Inc. offers you the opportunity to apply your skills to exciting projects while working with innovative teams. Our compensation package is complimented by a generous benefits package including medical, dental, vision, life insurance and a company 401(k) match, as well as cafeteria, onsite gym and much more. If you are motivated by technical challenges, we offer a collaborative work environment that encourages career growth.

The salary offered to a selected candidate will be tailored based on several factors, including the location, job grade, relevant knowledge, skills, and experience. We also take into account the internal equity among our current team members to ensure fairness and competitiveness

Skills Required

  • 2+ years of experience in AI/ML engineering with hands-on deployment of AI systems on-prem or private cloud.
  • Proven experience building agentic AI systems including state management, tool integration, and multi-step reasoning.
  • Strong working knowledge of RAG architectures including chunking, retrieval, re-ranking, and evaluation metrics.
  • Experience with model fine-tuning (LoRA, QLoRA, full fine-tuning) and embedding models for retrieval.
  • Familiarity with Model Control Protocols (MCP) or similar governance frameworks (model versioning, access control, audit trails).
  • Proficiency in Python, Linux, Docker/Kubernetes, and vector databases (Milvus, Qdrant, Pinecone).
  • Experience with AI serving frameworks (vLLM, TGI, Triton, Ollama, etc.).
  • Bachelor of Science in Computer Science, Electrical Engineering, Mechanical Engineering, or related engineering field.
  • Experience deploying AI in enterprise storage or hardware-adjacent environments.
  • Background in systems engineering or QA automation; experience using AI to automate testing or validation.
  • Familiarity with embedded AI or edge inference (ONNX, TensorRT, GGUF).
  • Experience with AI agent frameworks (LangGraph, AutoGen, BabyAGI, etc.).
  • Knowledge of AI observability tools (LangSmith, Weights & Biases, Prometheus/Grafana).
  • Knowledge of storage area/NVMe.

SK hynix Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about SK hynix and has not been reviewed or approved by SK hynix.

  • Strong & Reliable Incentives Profit-sharing linked to operating results and the removal of a bonus cap in Korea have produced very large payouts and materially boosted total compensation there. Reporting indicates these incentives have been especially favorable during the AI/HBM upcycle.
  • Healthcare Strength Company materials highlight medical expense support and robust health coverage, and U.S. packages commonly include comprehensive medical, dental, and vision plans. This strengthens the appeal of core benefits alongside cash compensation.
  • Parental & Family Support Programs in Korea include fertility support, extended childcare leave, and education grants for children through university. These family-forward policies expand benefits value beyond salary and bonuses.

SK hynix Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Icheongeo-ri
328 Employees
Year Founded: 1983

What We Do

Semiconductors are essential to all IT products, and its performance often determines the performance of the final products. SK hynix is a global leader in producing semiconductor, such as DRAM, NAND Flash and CMOS Image Sensors. With these technology driven semiconductor products, SK hynix has consistently led the industry and is now the second largest memory chip maker worldwide. IT devices become more pervasive as new imaginative and innovative IT products continue to grab imagination and desires of consumers. SK hynix has enhanced its competency with the best level of technology and a wide range of business portfolios in order to satisfy all those demand from customers. As a member of SK Group*, SK hynix is aiming at becoming the world’s best semiconductor company. SK hynix America Inc. operates as a subsidiary of SK Hynix Inc. *SK Group is one of South Korea's top five industrial conglomerates. It has about 40 affiliated companies, ranging from energy, telecommunications, finance, to construction.

Similar Jobs

Boeing Logo Boeing

Propulsion Engineer (Propulsion Analysis - Air)

Aerospace • Information Technology • Software • Cybersecurity • Design • Defense • Manufacturing
In-Office
Seal Beach, CA, USA
170000 Employees
127K-228K Annually
Hybrid
Oceanside, CA, USA
205000 Employees
38K-67K Hourly
Hybrid
Vista, CA, USA
205000 Employees
38K-67K Hourly
Hybrid
Merced, CA, USA
205000 Employees
35K-63K Hourly

Similar Companies Hiring

Standard Template Labs Thumbnail
Artificial Intelligence • Information Technology • Software
New York, NY
25 Employees
Amalgamated Sugar Thumbnail
Food • Greentech • Agriculture • Industrial • Manufacturing
Boise, Idaho
768 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account