Keynote 1: Benefits and Challenges of Decentralization in Data Systems: Opportunities for Data Management Research
Ruben Mayer
Abstract: Decentralization promises to overcome many of the shortcomings of centralized data systems: It puts clients in control of their own data and the processing that they perform on it, without requiring them to trust a central entity. However, decentralization faces many challenges. While the problems of efficiency and security have been in the focus of the data management community, in this keynote talk, I emphasize the need for research on other challenges of decentralized data systems, namely, governance and responsibility. Based on examples from blockchain and federated learning systems, I argue that these challenges require more attention from the data management community and open many new research opportunities.
Bio: Ruben Mayer is a professor of computer science at the University of Bayreuth, heading the Chair of Data Systems since March 2023. Simultaneously, He is also a guest professor at Technical University of Munich (TUM). Previously, he was an interim professor at TUM, leading the Chair for Data Engineering at the TUM Campus in Heilbronn. Before that, he was leading the Chair for Application and Middleware Systems at TUM as an interim professor, after working there as a post-doctoral researcher together with Prof. Hans-Arno Jacobsen. Before that, he was a postdoctoral researcher at the University of Stuttgart in the Distributed Systems research group (Prof. Kurt Rothermel), where he was also awarded his PhD.
His research is centered on distributed/decentralized information systems and data management. In particular, he is interested in improving the efficiency of machine learning systems. The focus areas of his team are federated learning and graph neural networks. Lately, he has also embarked on investigating interdisciplinary aspects of machine learning and law.
Keynote 2: Data Management and AI for Blockchain Data Analysis
Arijit Khan
Abstract: A blockchain platform is generally cohabited by human users, autonomous agents, cryptocurrencies, other digital assets, and decentralized protocols. As an example, consider the Ethereum ecosystem - currently the most actively used and the second-largest cryptocurrency network by market capitalization after Bitcoin. Ether is the native cryptocurrency of Ethereum that is transferred between accounts. Ethereum accounts are of two types: Externally owned accounts are controlled by users, whereas a contract account is controlled by a smart contract, which is an autonomous agent and can execute complex code across a decentralized network. For instance, smart contracts can define tokens that are digital assets in the blockchain platform. Decentralized applications (dApps) such as exchanges, wallets, and DeFi may combine multiple smart contracts and their protocols constitute a collection of rules that govern dApps in a decentralized blockchain platform. Complex interactions across various actors in blockchains generate massive-scale, dynamic, heterogeneous, and multi-modal data that are often publicly accessible and can be considered big data – an emerging trend since the past decade. Analysis of blockchain data using the latest data management and AI techniques is critical for the improvement of the blockchain technology, such as detecting and predicting trends, anomalies, ecrimes, and key actors. In the first part of the talk, I shall discuss our recent work on blockchain data extraction and graph construction, graph mining, topological data analysis, and machine learning methods for various target applications such as detecting market manipulators in the blockchain world including the collapse of the stablecoin LunaTerra, Ethereum’s switch from Proof-of-Work (PoW) to Proof-of-Stake (PoS), and the stablecoin USDC’s temporary peg loss. In the second part, I shall showcase the contributions of blockchain technology in the growing ecosystem of data management and AI – in the form of diverse datasets, tools, novel challenges, and algorithms. I shall conclude by emphasizing future research directions such as cross-chain data analysis, combining signals from external sources, e.g., tweets and social media data about blockchains for holistic pre- dictions, higher-order and multi-modal network analysis, designing of temporal machine learning and machine unlearning algorithms.
Bio: Arijit Khan is an IEEE senior member, an ACM distinguished speaker, and an associate professor in the Department of Computer Science, Aalborg University, Denmark. He earned his PhD from the Department of Computer Science, University of California, Santa Barbara, USA, and did a postdoc in the Systems group at ETH Zurich, Switzerland. He has been an assistant professor in the School of Computer Science and Engineering, Nanyang Technological University, Singapore. Arijit is the recipient of the prestigious IBM PhD Fellowship in 2012-13, a VLDB Distinguished Reviewer award (2022), and a SIGMOD Distinguished PC award (2024). He published more than 80 papers in premier databases and data mining conferences and journals including ACM SIGMOD, VLDB, IEEE TKDE, IEEE ICDE, SIAM SDM, USENIX ATC, EDBT, The Web Conference (WWW), ACM WSDM, ACM CIKM, ACM TKDD, and ACM SIGMOD Record. Arijit served as the co-chair of Big-O(Q) workshop co-located with VLDB 2015, LLM+KG workshop co-located with VLDB 2024, and KG for responsible AI workshop co-located with CIKM 2024, wrote a book on uncertain graphs in Morgan & Claypool’s Synthesis Lectures on Data Management. Dr Khan is serving as an associate editor of IEEE TKDE 2019-2024 and ACM TKDD 2023-present, IEEE ICDE TKDE poster track co-chair 2023, ACM CIKM short paper track program co-chair 2024, and IEEE ICDE demonstration track program co-chair 2025.