Towards High-Performance and Reliable Systems Software

Event details
Date | 21.02.2023 |
Hour | 14:00 - 15:00 |
Location | |
Category | Conferences - Seminars |
Event Language | English |
The EDIC program is happy to invite you to a public talk by Diyu Zhou, postdoctoral researcher at EPFL, hosted by Prof. Kashyap.
The aim of the talk is to present his achievements to a broad audience in preparation for upcoming hiring interviews. You are warmly invited to attend the talk and to participate in the Q&A session at the end of the presentation.
Abstract
Exponential growth in users, requests, and data places ever-increasing demands on the performance of today's data centers. This challenge has resulted in two major trends. First, data centers scale out computation by leveraging multicore architectures and deploying more servers. Second, ultra-fast storage devices are being developed to keep up with the exponential growth in data. Unfortunately, traditional systems software is a poor fit for these trends, leaving applications unable to realize the potential of these developments. In this talk, I will present my work on designing modern systems software that exploits these computing trends by supporting three critical application requirements: I/O efficiency, multicore scalability, and practical reliability. I will first present OdinFS, a high-performance and scalable file system for emerging non-volatile memory (NVM). By taking into account the unique characteristics of NVM, OdinFS scales to hundreds of cores and achieves tens to hundreds of times better performance than the prior state of the art. I will next present RRC, an application-transparent replication system for commercial off-the-shelf containers. RRC incurs latency overhead up to 75x lower than competitive schemes while also achieving significantly lower throughput overhead, thus enabling practical deployment for critical server applications.
Bio
Diyu Zhou is a postdoctoral researcher at EPFL. He completed his Ph.D. at UCLA, advised by Yuval Tamir. His research focuses on building high-performance, scalable, and reliable computer systems. Specifically, he has developed I/O stacks to support modern storage devices, devised frameworks and algorithms that allow synchronization primitives to scale to massive multicore machines, found and fixed concurrency bugs, and designed practical fault tolerance mechanisms for modern systems.
Practical information
- General public
- Free