Buy new:
-17% $41.57
$3.99 delivery May 15 - 17
Ships from: 365giftshop
Sold by: 365giftshop
$41.57 with 17 percent savings
List Price: $49.99

The List Price is the suggested retail price of a new product as provided by a manufacturer, supplier, or seller. Except for books, Amazon will display a List Price if the product was purchased by customers on Amazon or offered by other retailers at or above the List Price in at least the past 90 days. List prices may not necessarily reflect the product's prevailing market price.
Learn more
$3.99 delivery May 15 - 17. Details
Or fastest delivery May 14 - 16. Details
Only 1 left in stock - order soon.
$$41.57 () Includes selected options. Includes initial monthly payment and selected options. Details
Price
Subtotal
$$41.57
Subtotal
Initial payment breakdown
Shipping cost, delivery date, and order total (including tax) shown at checkout.
Ships from
365giftshop
Ships from
365giftshop
Sold by
Sold by
Returns
Eligible for Return, Refund or Replacement within 30 days of receipt
Eligible for Return, Refund or Replacement within 30 days of receipt
This item can be returned in its original condition for a full refund or replacement within 30 days of receipt. You may receive a partial or no refund on used, damaged or materially different returns.
Returns
Eligible for Return, Refund or Replacement within 30 days of receipt
This item can be returned in its original condition for a full refund or replacement within 30 days of receipt. You may receive a partial or no refund on used, damaged or materially different returns.
Payment
Secure transaction
Your transaction is secure
We work hard to protect your security and privacy. Our payment security system encrypts your information during transmission. We don’t share your credit card details with third-party sellers, and we don’t sell your information to others. Learn more
Payment
Secure transaction
We work hard to protect your security and privacy. Our payment security system encrypts your information during transmission. We don’t share your credit card details with third-party sellers, and we don’t sell your information to others. Learn more
$8.52
Get Fast, Free Shipping with Amazon Prime FREE Returns
100% satisfaction guaranteed. Ships directly from Amazon. 100% satisfaction guaranteed. Ships directly from Amazon. See less
FREE delivery Thursday, May 16 on orders shipped by Amazon over $35. Order within 8 hrs 11 mins
Only 6 left in stock - order soon.
$$41.57 () Includes selected options. Includes initial monthly payment and selected options. Details
Price
Subtotal
$$41.57
Subtotal
Initial payment breakdown
Shipping cost, delivery date, and order total (including tax) shown at checkout.
Access codes and supplements are not guaranteed with used items.
Loading your book clubs
There was a problem loading your book clubs. Please try again.
Not in a club? Learn more
Amazon book clubs early access

Join or create book clubs

Choose books together

Track your books
Bring your club to Amazon Book Clubs, start a new book club and invite your friends to join, or find a club that’s right for you for free.
Kindle app logo image

Download the free Kindle app and start reading Kindle books instantly on your smartphone, tablet, or computer - no Kindle device required.

Read instantly on your browser with Kindle for Web.

Using your mobile phone camera - scan the code below and download the Kindle app.

QR code to download the Kindle App

Something went wrong. Please try your request again later.

Big Data: Principles and best practices of scalable realtime data systems 1st Edition

4.2 4.2 out of 5 stars 100 ratings

{"desktop_buybox_group_1":[{"displayPrice":"$41.57","priceAmount":41.57,"currencySymbol":"$","integerValue":"41","decimalSeparator":".","fractionalValue":"57","symbolPosition":"left","hasSpace":false,"showFractionalPartIfEmpty":true,"offerListingId":"NCgF1vl2buNGpNQZAKavphD6Ci57Xoj8Wm1YilPJ8v0ATfdF45PREcO7tDfumxgkGBZKk96P9vSxyT%2FrKPL9ce4anySsQlLiWK6Xkh3lC6%2FFHqYcWUcHc8p%2FmpeBSDgoH6kDOcjVTnCbgBhnGYtgHEKvKFasTrlnm%2BCuaZ3yminLBTqj3KZ%2BcNbCiADtUanw","locale":"en-US","buyingOptionType":"NEW","aapiBuyingOptionIndex":0}, {"displayPrice":"$8.52","priceAmount":8.52,"currencySymbol":"$","integerValue":"8","decimalSeparator":".","fractionalValue":"52","symbolPosition":"left","hasSpace":false,"showFractionalPartIfEmpty":true,"offerListingId":"NCgF1vl2buNGpNQZAKavphD6Ci57Xoj8Y8%2BbnjTIonUWvd8ssguQDrY90uAVbC2PsxE29ySuF%2FuX6wSm6wHl2m1n4PYKx9DacdevrLyiGW1bsVSH9DMOZuqjQxrwHb6ndog7InZGckdIDq%2FrxtFmogYfLFYIhHQLVzaMlgGWPQjCmGUTLlXJoA%3D%3D","locale":"en-US","buyingOptionType":"USED","aapiBuyingOptionIndex":1}]}

Purchase options and add-ons

Summary

Big Data teaches you to build big data systems using an architecture that takes advantage of clustered hardware along with new tools designed specifically to capture and analyze web-scale data. It describes a scalable, easy-to-understand approach to big data systems that can be built and run by a small team. Following a realistic example, this book guides readers through the theory of big data systems, how to implement them in practice, and how to deploy and operate them once they're built.

Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.

About the Book

Web-scale applications like social networks, real-time analytics, or e-commerce sites deal with a lot of data, whose volume and velocity exceed the limits of traditional database systems. These applications require architectures built around clusters of machines to store and process data of any size, or speed. Fortunately, scale and simplicity are not mutually exclusive.

Big Data teaches you to build big data systems using an architecture designed specifically to capture and analyze web-scale data. This book presents the Lambda Architecture, a scalable, easy-to-understand approach that can be built and run by a small team. You'll explore the theory of big data systems and how to implement them in practice. In addition to discovering a general framework for processing big data, you'll learn specific technologies like Hadoop, Storm, and NoSQL databases.

This book requires no previous exposure to large-scale data analysis or NoSQL tools. Familiarity with traditional databases is helpful.

What's Inside

  • Introduction to big data systems
  • Real-time processing of web-scale data
  • Tools like Hadoop, Cassandra, and Storm
  • Extensions to traditional database skills

About the Authors

Nathan Marz is the creator of Apache Storm and the originator of the Lambda Architecture for big data systems. James Warren is an analytics architect with a background in machine learning and scientific computing.

Table of Contents

  1. A new paradigm for Big Data
  2. PART 1 BATCH LAYER
  3. Data model for Big Data
  4. Data model for Big Data: Illustration
  5. Data storage on the batch layer
  6. Data storage on the batch layer: Illustration
  7. Batch layer
  8. Batch layer: Illustration
  9. An example batch layer: Architecture and algorithms
  10. An example batch layer: Implementation
  11. PART 2 SERVING LAYER
  12. Serving layer
  13. Serving layer: Illustration
  14. PART 3 SPEED LAYER
  15. Realtime views
  16. Realtime views: Illustration
  17. Queuing and stream processing
  18. Queuing and stream processing: Illustration
  19. Micro-batch stream processing
  20. Micro-batch stream processing: Illustration
  21. Lambda Architecture in depth
Read more Read less

Amazon First Reads | Editors' picks at exclusive prices

Frequently bought together

$41.57
Get it May 15 - 17
Only 1 left in stock - order soon.
Ships from and sold by 365giftshop.
+
$28.80
Get it as soon as Monday, May 20
Only 1 left in stock - order soon.
Sold by Pearlzone and ships from Amazon Fulfillment.
+
$43.99
Get it as soon as Thursday, May 16
In Stock
Ships from and sold by Amazon.com.
Total price:
To see our price, add these items to your cart.
Details
Added to Cart
Some of these items ship sooner than the others.
Choose items to buy together.

From the Publisher

About This Book

Services like social networks, web analytics, and intelligent e-commerce often need to manage data at a scale too big for a traditional database. Complexity increases with scale and demand, and handling Big Data is not as simple as just doubling down on your RDBMS or rolling out some trendy new technology. Fortunately, scalability and simplicity are not mutually exclusive—you just need to take a different approach. Big Data systems use many machines working in parallel to store and process data, which introduces fundamental challenges unfamiliar to most developers.

Big Data teaches you to build these systems using an architecture that takes advantage of clustered hardware along with new tools designed specifically to capture and analyze web-scale data. It describes a scalable, easy-to-understand approach to Big Data systems that can be built and run by a small team. Following a realistic example, this book guides readers through the theory of Big Data systems and how to implement them in practice.

Big Data requires no previous exposure to large-scale data analysis or NoSQL tools. Familiarity with traditional databases is helpful, though not required. The goal of the book is to teach you how to think about data systems and how to break down difficult problems into simple solutions. We start from first principles and from those deduce the necessary properties for each component of an architecture.

Editorial Reviews

About the Author

Nathan Marz is currently working on a new startup. Previously, he was the lead engineer at BackType before being acquired by Twitter in 2011. At Twitter, he started the streaming compute team which provides and develops shared infrastructure to support many critical realtime applications throughout the company. Nathan is the creator of Cascalog and Storm, open-source projects which are relied upon by over 50 companies around the world, including Yahoo!, Twitter, Groupon, The Weather Channel, Taobao, and many more companies.

James Warren is an analytics architect at Storm8 with a background in big data processing, machine learning and scientific computing.

Product details

  • Publisher ‏ : ‎ Manning Publications; 1st edition (May 19, 2015)
  • Language ‏ : ‎ English
  • Paperback ‏ : ‎ 328 pages
  • ISBN-10 ‏ : ‎ 1617290343
  • ISBN-13 ‏ : ‎ 978-1617290343
  • Item Weight ‏ : ‎ 1.21 pounds
  • Dimensions ‏ : ‎ 7.38 x 0.6 x 9.25 inches
  • Customer Reviews:
    4.2 4.2 out of 5 stars 100 ratings

About the author

Follow authors to get new release updates, plus improved recommendations.
Nathan Marz
Brief content visible, double tap to read full content.
Full content visible, double tap to read brief content.

Discover more of the author’s books, see similar authors, read author blogs and more

Customer reviews

4.2 out of 5 stars
4.2 out of 5
100 global ratings
Great content. Bad structure/assembling quality
4 Stars
Great content. Bad structure/assembling quality
I just received this book. Content is great but as I started to turn the pages, they started to fall off. I buy a lot of books and it has been a very long time since I saw such a bad quality in the book physical structure. 5 stars on the content. 0 stars on the book physical structure.
Thank you for your feedback
Sorry, there was an error
Sorry we couldn't load the review

Top reviews from the United States

Reviewed in the United States on April 10, 2016
If you are looking for a survey of different approaches of handling big data, you want to read "ELEMENTS OF SCALE: COMPOSING AND SCALING DATA PLATFORMS". ([...]) This book is dedicated to Lambda Architecture (one that is surveyed in the above article.)

The book is very organized. Introduction in chapter 1 will be the road map of the whole book. Motivating with a simple web application based on RDBMS, the author showed how the approach to scale it becomes undesirable. After enumerating a list of desired properties, he proposed Lambda architecture, an approach in contrast to fully incremental architecture (with RDBMS).

The Lambda architecture is partitioned into three layers:
1. batch layer that computes different views on big data
2. serving layer that answers user queries using views from the batch layer and speed layer.
3. speed layer that compensates an approximate answer over a period time when the batch layer is working on the complete answers.

In the remaining chapters, the author dive deep into the rationale and requirements of all the different pieces of Lambda Architecture.

To understand the context of Lambda Architecture, also refer to the wikipedia for crticism.
9 people found this helpful
Report
Reviewed in the United States on June 14, 2015
Great explanation of both the theory and practice of the lambda architecture. While the practice chapters are nice, it's the theory chapters that really shine. The book explains down to the byte level why components are implemented the way they are. For example, there's an immense amount of detail as to why using a db that doesn't support random writes allows for an application to query the batch layer's results without locking.

The only downside to the book is that the architecture and exosystem is so new that there's not really a lot of pragmatic solutions. For example, the theory describes a query layer that can merge the results of batch and real time processing for client applications. However, in real life there are no pragmatic solutions for doing this so you'd have to write your own.

It'll be interesting to see how the lambda architecture matures and to see future editions of this book. Hopefully, future editions will be as well written and have a better ecosystem for practice chapters.
14 people found this helpful
Report
Reviewed in the United States on November 4, 2015
In all honesty, the book has simplified big data architecture and its general premise in an eye opening way. Starting from the batch layer and spending a good amount of time addressing different aspects of it gave me a valuable lesson as a developer in understanding the complexity as well as the necessity of evaluating my data entries and their impact in the future formation of worthy analytics/results.

My girlfriend and I enjoyed every chapter in this book. I guarantee you that you won't regret buying this book. I am looking forward to another book from you guys on the topic because its the first time where I couldn't wait to pick up the book and get to the end of it.
2 people found this helpful
Report
Reviewed in the United States on February 23, 2020
Right up there with Paul's Letter to the Romans! Well, not equal with Paul's Letter to the Romans.

But it brought Paul's letter to the Romans to mind!

Clear, just enough detail, well-ordered.

I work at a large corporation, on a real-time data system. If we had followed the author's recommendations, I wouldn't have the problem I've been dealing with for the last several weeks.
Reviewed in the United States on March 13, 2016
This book is written by a specialist in big data. I know that because I worked on the big data pipeline. And now I read the book and I see that all my problems are addressed in this book. Virtually every problem discussed appeared in my pipeline too, as if the author worked with me on my project.

The other very useful for me feature of this book is that it is the first book where I could find a concise explanation of Storm Trident framework, even though the book is not about Storm.
5 people found this helpful
Report
Reviewed in the United States on October 21, 2015
I feel really sorry for those who gave 5 stars for this book. I purchased the book and started reading it eagerly as soon as I received it. It got my attention until I got to page 20 with a statement saying "...... If anything ever goes wrong, you can discard the state for the entire speed layer, and everything will be back to normal within a few hours." Within a few hours? No high-traffic production sites can afford a few hours down-time. At that point, I decided to return the book, which I did.

I did scan through the rest of the book, though. First, the so-called lambda architecture might sound like a new term, but many high concurrency websites already work that way. For a high concurrency web site, the first-layer would be memcached-based, which gives O(1) low latency on all queries. The second layer would be a clustered app-server layer. The third layer could be a high-concurrency, extremely low-latency layer like a NoSQL cluster. The far backend could be Hadoop- or Spark-based for batch jobs. This is the known architecture in production for high traffic websites that need to support millions of concurrent users.

Secondly, the bulk of the book is actually about Hadoop in the so-called batch layer. Hadoop once generated some excitement, but has lost its steam due to the new kid in the spot named Spark, which can do whatever Hadoop can do, but 10x - 20x faster with a fractional cost.
9 people found this helpful
Report
Reviewed in the United States on May 4, 2020
This book has a bad binding. I bought this book and opened it only twice and it is already broken
Reviewed in the United States on December 22, 2021
I just received this book. Content is great but as I started to turn the pages, they started to fall off. I buy a lot of books and it has been a very long time since I saw such a bad quality in the book physical structure. 5 stars on the content. 0 stars on the book physical structure.
Customer image
4.0 out of 5 stars Great content. Bad structure/assembling quality
Reviewed in the United States on December 22, 2021
I just received this book. Content is great but as I started to turn the pages, they started to fall off. I buy a lot of books and it has been a very long time since I saw such a bad quality in the book physical structure. 5 stars on the content. 0 stars on the book physical structure.
Images in this review
Customer image Customer image
Customer imageCustomer image

Top reviews from other countries

Translate all reviews to English
jonathanGabriel
5.0 out of 5 stars Buen libro sobre arquitectura
Reviewed in Mexico on January 21, 2020
Es un muy buen libro con las bases de Lambda Architecture. Fácil de leer y entender. Lo recomiendo. Es teórico sobre arquitectura con algunos ejemplos prácticos.
Amazon Customer
5.0 out of 5 stars Pleasant and interesting
Reviewed in the United Kingdom on December 5, 2016
Bid Data and technologies around this subject can be very hard and low-level to understand.
With this book i found it clear, concise and explained in such a way that everyone with little or no background in IT can understand.
A very good Big Data insight and also helpful for understanding which are the best tools to achieve good results with Hadoop and other technologies.
I found it very interesting, well written and pleasant to read as well. This book helped me a lot and i'm sure it can help a lot beginners with this subject.
Eduard
5.0 out of 5 stars Clasico de la arquitectura Big Data
Reviewed in Spain on October 24, 2016
Si estas en diseñando arquitecturas para big data o incluso si piensas que algun dia tu aplicación podria llegar a big data este es tu libro. Los conceptos serian CQRS y Event Sourcing pero a gran escala y para dar respuestas en real time.
frchatel
5.0 out of 5 stars excellent
Reviewed in France on November 6, 2016
Excellent ouvrage précis, détaillé sur un cas de big data.
Ouvrage didactique, mais qui nécessite une certaine concentration en raison de la complexité technologique décrite.
S'adresse à un public averti de développeurs (nombreuses illustrations avec échantillons de code Java)
One person found this helpful
Report
Alfred Huang
5.0 out of 5 stars Five Stars
Reviewed in Canada on June 6, 2015
it is a good book.