Ryoma Sato is Assistant Professor at the National Institute of Informatics, Japan, specializing in graph neural networks, optimal transport, and efficient deep learning. He is the author of 'Theory and Algorithms of Optimal Transport' (2023) and 'Graph Neural Networks' (2024). He is a former IOI Japan representative and ACM-ICPC World Finalist, as well as lead developer of Readable, an AI-powered PDF translation service.
'This book is a practical guide to DNN and LLM acceleration, bridging the gap between theory and practice. Moving beyond 'black-box' tricks, it pairs the latest techniques-like FlashAttention-with runnable code and empirical data. Readers will gain both the technical tools and the fundamental understanding to optimize models effectively.' Masashi Sugiyama, RIKEN and University of Tokyo 'This book effectively bridges theory and practice in accelerating deep learning. It offers clear insights into modern architectures such as Mamba, while also elucidating fundamental concepts and practical techniques for efficient deep learning. It will be a valuable resource for researchers and graduate students seeking a deep understanding of modern deep learning.' Makoto Yamada, Okinawa Institute of Science and Technology