10 September 2025
The Cyprus Institute
Europe/Athens timezone

Hands-On: Model Deployment through vLLM, Communication and Creation of RAG Pipelines

10 Sept 2025, 12:30
Duration: 2h
Andreas Mouskos Seminar Room (The Cyprus Institute)

Speaker

Mr Marios Constantinou (CaSToRC CyI)

Description

In this hands-on session, participants will deploy large language models (LLMs) on Cyclone, the National High Performance Computing (HPC) infrastructure, using tools such as vLLM for efficient inference and Haystack for building retrieval-augmented generation (RAG) pipelines. The session guides attendees through the end-to-end process: setting up model environments, running local inference, and integrating retrieval components so that applications can ground their responses in external data. By working directly on HPC resources, participants will gain practical experience in managing compute workloads, operating model-serving pipelines, and building systems that combine LLM outputs with relevant external knowledge.
