A Practical Guide to LLM Deployment and RAG Systems

Name: A Practical Guide to LLM Deployment and RAG Systems
Start: 2025-09-10T09:30:00+03:00
End: 2025-09-10T14:30:00+03:00
Location: The Cyprus Institute

10 September 2025

The Cyprus Institute

Europe/Athens timezone

Hands-On: Model Deployment through vLLM, Communication and Creation of RAG Pipelines

10 Sept 2025, 12:30

Andreas Mouskos Seminar Room (The Cyprus Institute)

Andreas Mouskos Seminar Room

The Cyprus Institute

Mr Marios Constantinou (CaSToRC CyI)

In this hands-on session, participants will deploy large language models on Cyclone, the National High Performance Computing (HPC) infrastructure, using tools like vLLM for efficient inference and Haystack for building retrieval-augmented generation (RAG) pipelines. The session will guide attendees through the end-to-end process of setting up model environments, running local inference, and integrating retrieval components to create responsive, data-aware applications. By working directly on HPC resources, participants will gain practical experience in managing compute workloads, handling model-serving pipelines, and building systems that combine LLM outputs with relevant external knowledge.

There are no materials yet.

A Practical Guide to LLM Deployment and RAG Systems

Hands-On: Model Deployment through vLLM, Communication and Creation of RAG Pipelines

Andreas Mouskos Seminar Room

The Cyprus Institute

Speaker

Description

Presentation materials

Choose timezone

A Practical Guide to LLM Deployment and RAG Systems

Speaker

Description

Presentation materials