PROPOSAL

Profiling Infrastructure for DAPHNE


Supervisors: Pınar Tözün
Semester: Fall 2022
Tags: integrated data analysis pipelines, profiling big data systems

DAPHNE is an EU project that aims at building a data system targeting integrated data analysis pipelines across data management and processing, high-performance computing (HPC), and machine learning (ML) training and scoring. The project had its first code release back in March. This project aims at adding a profiling infrastructure for DAPHNE codebase. If you are interested in learning about different profiling tools on CPUs (VTune, perf, etc.) or GPUs (nvidia-smi, dcgm, etc.) and analyzing the bottlenecks and characteristics of a complex codebase using such profiling tools, please contact us.

The project is in many ways open and can be done as a regular semester project, a BSc thesis, or an MSc thesis. Based on the interests and time of the student, we can adjust the tools and hardware devices to focus on.