Running LLM inference on Kubernetes: A Reliable Path From Cluster To First Prompt (EKS)

Details

Platform and operations teams are increasingly asked to host large language model inference alongside existing applications. Kubernetes on AWS (EKS) can be a strong fit: it provides shared capacity, familiar deploy and rollout patterns, and integration with GPU instance types. However, reliability, cost, and unclear ownership ("who owns what") are common friction points that can stall projects.

This 30–45 minute session is aimed at platform engineers, SREs, and engineering leaders who already know Kubernetes basics. We'll stay semi-agnostic on stacks and favor open-source patterns you can adapt, while grounding the examples in EKS and realistic CPU and GPU considerations.

We will apply a practical reliability lens: what an inference workload needs from the cluster (scheduling, resources, networking, and scaling signals), where failure modes show up, and how to keep the path to production understandable for both platform and application teams. The session includes a live demo with a simple but concrete end state: a running inference endpoint on the cluster and a successful call that returns a model response. You see the full loop from deployment to a usable LLM, not just a pod in the Running state.
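As a rough illustration of the difference between "pod Running" and "usable LLM", the check below treats an endpoint as healthy only when a call returns non-empty model text. This is a minimal sketch, not the presenters' demo: the function name `is_usable_response` is hypothetical, and the response shape (`choices[0].message.content`, in the style of OpenAI-compatible chat APIs) is an assumption about the serving stack.

```python
import json


def is_usable_response(status_code: int, body: str) -> bool:
    """Return True only if the call produced non-empty model text.

    Assumes (hypothetically) an OpenAI-style chat response body:
    {"choices": [{"message": {"content": "..."}}]}
    """
    # A non-200 status means the endpoint did not serve the request.
    if status_code != 200:
        return False

    # The body must be valid JSON.
    try:
        payload = json.loads(body)
    except json.JSONDecodeError:
        return False

    # The assumed field path must exist; any other shape counts as a failure.
    try:
        text = payload["choices"][0]["message"]["content"]
    except (KeyError, IndexError, TypeError):
        return False

    # Whitespace-only output is not a usable model response.
    return isinstance(text, str) and bool(text.strip())
```

A smoke test after a rollout could pass the status code and body of one real request to this function and fail the rollout if it returns False, regardless of what the pod status reports.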

You will leave with a concise checklist for the next inference initiative on EKS and clearer language for scoping platform work versus model and application ownership.

Presenters:

Andy Suderman

CTO, Fairwinds

Stevie Caldwell

Senior Tech Lead, Fairwinds

