Practical Big Data Analytics with Hadoop and Spark

Name: Practical Big Data Analytics with Hadoop and Spark
Price: 68.54 AUD
Availability: OutOfStock
ISBN: 9789365894745

Shikha Mehta

$80.95 $68.54

Paperback

Not in-store but you can order this
How long will it take?

Availability Information

We source books from suppliers in Australia and overseas. For books we don't currently have in stock, the time it takes to get them from our suppliers can vary widely - from a few days to a few months - so we check each book with each supplier to determine the expected time it will take to be supplied to us.

We will update you on expected arrival time to us. If this delay is too long, please let us know within 2 business days and we can give you options regarding cancelling or adjusting your order. More details on cancelling your order can be found in our Terms of Trade.

To find out the anticipated arrival time for specific items prior to ordering, please contact us by phone or email:

Phone +61 2 9264 3111, or 1800 4 BOOKS (1800 4 26657) if outside Sydney:
option 1 Abbey's Bookshop (Crime, History, Science, Kids & more) • info@abbeys.com.au
option 2 Language Book Centre (ESL & Foreign Languages) • language@abbeys.com.au
option 3 Galaxy Bookshop (Sci-fi, Fantasy, Romance, Graphic Novels) • sf@galaxybooks.com.au

QTY:

English

BPB Publications
13 May 2026

Database design & theory; Data capture & analysis; Data mining; Information architecture

Summary
Details

Technologies like Hadoop and Spark, powered by the Cloudera platform, have become essential for storing, processing, and analyzing big data across various industries, including finance, healthcare, e-commerce, and research in today's data-driven world.

This book systematically navigates the entire ecosystem, starting with big data fundamentals, security, and HDFS architecture before mastering MapReduce through weather and stock data case studies. Readers will gain hands-on experience with the Cloudera framework, learning high-level scripting with Pig Latin and structured data warehousing using HiveQL's Metastore and partitions. Additionally, it explores NoSQL versatility with HBase and MongoDB's CAP theorem, followed by Scala programming and Spark's high-speed in-memory engine. You will learn to optimize queries with the Catalyst optimizer and process complex Parquet or JSON files using Spark SQL DataFrames. The book also covers machine learning pipelines with spark.ml for professional-grade classification and clustering applications.

By the end of this book, readers will be able to develop strong conceptual clarity and practical expertise in big data analytics. This will enable them to confidently design, implement, and manage scalable data processing solutions, preparing them to solve real-world data challenges and take on professional roles in big data engineering and analytics.

WHAT YOU WILL LEARN

● Understand big data concepts, architecture, ethics, and applications.

● Build scalable storage using HDFS and MapReduce.

● Perform data analysis using Pig and Hive.

● Develop NoSQL solutions using HBase and MongoDB.

● Process large datasets using Apache Spark.

● Analyze data using Spark SQL and DataFrames.

● Implement machine learning using PySpark.

WHO THIS BOOK IS FOR

This book is ideal for students, researchers, and academicians. It empowers aspiring big data engineers, data scientists, and software engineers. Readers should possess basic programming knowledge and database fundamentals to master Hadoop and Spark for professional-grade data science and faculty-level instruction.

By: Shikha Mehta
Imprint: BPB Publications
ISBN: 9789365894745
ISBN 10: 9365894743
Pages: 386
Publication Date: 13 May 2026
Audience: General/trade , ELT Advanced
Format: Paperback
Publisher's Status: Active