Query optimization lies at the core of database systems, and learning-based query optimization is attracting growing attention because AI techniques are expected to open new opportunities for improving it. Data preparation is generally believed to play an important role in machine learning tasks, and this may also apply to learning-based query optimization. However, it is noted that …
Supervisors:
Xiao Li, Zoi Kaoudi
Semester: Spring 2026
Tags: data preparation, query optimization, machine learning, database
Query optimization is crucial for any data management system to achieve good performance. Recent advancements in AI have led academia and industry to investigate learning-based techniques in query optimization. In particular, many works propose replacing the cost model used during plan enumeration with a machine learning model (typically a regression model) that estimates the runtime of a query …
Supervisors:
Zoi Kaoudi
Semester: Fall 2025
Tags: machine learning, database, query optimization, ranking
Are you interested in working with an open-source big data project?
You are welcome to conduct your thesis/project in Apache Wayang. Apache Wayang is the first cross-platform framework that lets users specify their task/query in a system-agnostic manner; Wayang then determines which system(s) are best suited to execute the task, with the goal of optimizing performance. For a general overview …
Supervisors:
Zoi Kaoudi
Semester: Fall 2025
Tags: big data, database, cross-platform data processing, open source, Apache
Are you interested in working with an open-source big data project and helping the environment?
You are welcome to conduct your thesis/project in Apache Wayang. Apache Wayang is the first cross-platform framework that lets users specify their task/query in a system-agnostic manner; Wayang then determines which system(s) are best suited to execute the task, with the goal of optimizing performance. …
Supervisors:
Zoi Kaoudi
Semester: Fall 2025
Tags: big data, database, cross-platform data processing, open source, Apache
Do you like open-source systems? Would you like to experience working with an open-source system? Do you want to learn about big data research in practice? Then, this project is for you!
We have a number of thesis/project topics under the umbrella of Apache Wayang. Wayang is the first cross-platform framework that allows users to specify their task/query in a system-agnostic manner and Wayang will …
Supervisors:
Jorge Quiané
Semester: Fall 2022
Tags: big data, database, cross-platform data processing, open source, Apache