Skip to main contentAWS Startups

    AIBrix & vLLM - A new open source toolkit for scaling GenAI inference at scale

    AWS GenAI Loft | San Francisco

    Day:

    -

    Time:

    -

    Type:

    IN PERSON

    Language:

    English

    Level(s):

    300 - Advanced

    We're excited to invite you to the SF vLLM meetup, hosted by the AWS SF GenAI Loft on June 18th, 2025, at 525 Market St, San Francisco, CA 94105.

    ​Look forward to hearing from speakers from Bytedance and AWS Neuron and EKS teams. This is your chance to learn and connect with a growing community of vLLM users, developers, maintainers, and engineers.

    ​Together, we'll dive deep into technical talks, share insights, and discuss our journey in optimizing LLM inference for performance and efficiency.

    ​Agenda:

    • ​5:00pm: Doors Open
    • ​5:30pm: Intro to AIBrix
    • ​6:00pm: Intro to Trainium and Inferentia & Project Update
    • ​6:30pm: Production Deployment of vLLM, AIBrix, Inferentia on EKS
    • ​7:00pm: Q&A and Open Discussion
    • ​7:30pm: Pizza and Networking 🍕 🤝