Data Discovery in Data Lakes

Sale price: $116.99 (regular price: $129.99)

Abedjan, Ziawasch; Esmailoghli, Mahdi; Galhotra, Sainyam

As data lakes have become a prominent foundation for enterprise and scientific data management, organizations increasingly face the challenge of locating relevant datasets and building ad-hoc integration pipelines across heterogeneous, poorly documented, and rapidly evolving data collections. In this setting, data discovery becomes a critical capability for turning raw, distributed data assets into usable knowledge.

This book examines data discovery and its evolution across industry and academia. It covers the principles, systems, and techniques that enable users to find, understand, and use relevant data across increasingly complex data ecosystems. The book discusses modern approaches to efficient and effective data discovery, including novel system architectures, search and matching methods, metadata use, dataset profiling, and human-in-the-loop techniques.

Beyond core technical concepts, the book offers insight into how data discovery systems are evaluated and benchmarked. It highlights practical challenges faced in real-world deployments, compares emerging academic and industrial approaches, and identifies open research questions that continue to shape the field. The book is intended for researchers, practitioners, and students interested in data management, data integration, data lakes, and the future of intelligent data access.

Details

Published by: Springer

Publication Date: 2026-08-15

Format: Hardcover

ISBN-13: 9783032308214

DOI:

Dimensions: 23.5 cm x 15.5 cm

Pages:
