In the second edition of this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example. Updated for Spark 2.1, this edition acts as an introduction to these techniques and other best practices in Spark programming.
This collection represents the full spectrum of data-related content we’ve published on O’Reilly Radar over the last year. Mike Loukides kicked things off in June 2010 with “What is data science?” and from there we’ve pursued the various threads and themes that naturally emerged. Now, roughly a year later, we can look back over all we’ve covered and identify a number of core data areas:
Design, build, and analyze your data intricately using Cassandra
About This Book
Build professional data models in Cassandra using CQL and appropriate indexes
Grasp the Model-By-Query techniques through working examples
Step-by-step tutorial of a stock market technical analysis application
With your knowledge of Java and this guide, you can take the analysis of your big data to new levels using Pentaho. Covers all the essential tools, techniques, tips, and tricks in one handy volume.
Overview
A guide to using Pentaho Business Analytics for big data analysis
Learn Pentaho's visualization and reporting tools with practical examples and tips
Precise insights into turning big data into meaningful knowledge with Pentaho