Mining of massive datasets

Authors:Jurij Leskovec (Author), Anand Rajaraman (Author), Jeffrey D. Ullman (Author)

Summary:This book focuses on practical algorithms that have been used to solve key problems in data mining and can be applied successfully to even the largest datasets. It begins with a discussion of the map-reduce framework, an important tool for parallelizing algorithms automatically. The authors explain the tricks of locality-sensitive hashing and stream processing algorithms for mining data that arrives too fast for exhaustive processing. Other chapters cover the PageRank idea and related tricks for organizing the Web, the problems of finding frequent itemsets and clustering. This second edition includes new and extended coverage of social networks, machine learning and dimensionality reduction. It includes a range of over 150 challenging exercises. -- Edited summary from book

eBook, English, 2014

Edition:2nd edition

Publisher: Cambridge University Press, Cambridge, 2014

Physical Description:1 online resource (xii, 467 pages) : illustrations

ISBN:

9781316147313, 9781139924801, 9781316147047, 9781107077232, 1316147312, 113992480X, 1316147045, 1107077230

OCLC Number / Unique Identifier:888463433

Subjects:

Big data

Data mining (Computer science)

Additional Physical Form Entry:

Print version:

Contents:

Data mining

MapReduce and the new software stack

Finding similar items

Mining data streams

Link analysis

Frequent itemsets

Clustering

Advertising on the Web

Recommendation systems

Mining social-network graphs

Dimensionality reduction

Large-scale machine learning

Notes:

Previous edition: 2012

More Information:

Cambridge University Press

Safari Books Online

resolver.library.cornell.edu Connect to full text. Access restricted to authorized subscribers.

Available from Skillsoft Books ITPro

Contributor biographical information

Publisher description

Table of contents only

archive.org Free eBook from the Internet Archive

openlibrary.org Additional information and access via Open Library