Amazon cover image
Image from Amazon.com

Mining of massive datasets / by Jure Leskovec

By: Contributor(s): Material type: TextTextPublication details: New York, Cambridge University Press, 2022.Edition: 3rd editionDescription: xi, 553 pISBN:
  • 9781108476348
Subject(s): DDC classification:
  • 006.312 LES/M
Summary: "The Web, social media, mobile activity, sensors, Internet commerce, and many other modern applications provide many extremely large datasets from which information can be gleaned by data mining. This book focuses on practical algorithms that have been used to solve key problems in data mining and can be used on even the largest datasets. It begins with a discussion of the MapReduce framework and related techniques for efficient parallel programming. The tricks of locality-sensitive hashing are explained. This body of knowledge, which deserves to be more widely known, is essential when seeking similar objects in a very large collection without having to compare each pair of objects. Stream-processing algorithms for mining data that arrives too fast for exhaustive processing are also explained. The PageRank idea and related tricks for organizing the Web are covered next. Other chapters cover the problems of finding frequent itemsets and clustering, each from the point of view that the data is too large to fit in main memory. Two applications: recommendation systems and Web advertising, each vital in e-commerce, are treated in detail. Later chapters cover algorithms for analyzing social-network graphs, compressing large-scale data, and machine learning. This third edition includes new and extended coverage on decision trees, deep learning, and mining social-network graphs. Written by leading authorities in database and Web technologies, it is essential reading for students and practitioners alike"--
Tags from this library: No tags from this library for this title. Log in to add tags.
Star ratings
    Average rating: 0.0 (0 votes)
Holdings
Item type Current library Collection Call number Status Date due Barcode
Books Books Kerala University of Digital Sciences, Innovation and Technology Knowledge Centre Computer Science Non Fiction 006.312 LES/M (Browse shelf(Opens below)) Available 6433
Browsing Kerala University of Digital Sciences, Innovation and Technology Knowledge Centre shelves, Shelving location: Computer Science, Collection: Non Fiction Close shelf browser (Hides shelf browser)
006.31 SUT/R;5 Reinforcement learning: An introduction 006.31 SUT/R;7 Reinforcement learning: An introduction 006.31 SUT/R;8 Reinforcement learning: An introduction 006.312 LES/M Mining of massive datasets / 006.33 AGA/S Statistical methods for recommender systems 006.33 AKE/B Big data computing 006.33 HUR/C Cognitive computing and big data analytics

Includes index.

"The Web, social media, mobile activity, sensors, Internet commerce, and many other modern applications provide many extremely large datasets from which information can be gleaned by data mining. This book focuses on practical algorithms that have been used to solve key problems in data mining and can be used on even the largest datasets. It begins with a discussion of the MapReduce framework and related techniques for efficient parallel programming. The tricks of locality-sensitive hashing are explained. This body of knowledge, which deserves to be more widely known, is essential when seeking similar objects in a very large collection without having to compare each pair of objects. Stream-processing algorithms for mining data that arrives too fast for exhaustive processing are also explained. The PageRank idea and related tricks for organizing the Web are covered next. Other chapters cover the problems of finding frequent itemsets and clustering, each from the point of view that the data is too large to fit in main memory. Two applications: recommendation systems and Web advertising, each vital in e-commerce, are treated in detail. Later chapters cover algorithms for analyzing social-network graphs, compressing large-scale data, and machine learning. This third edition includes new and extended coverage on decision trees, deep learning, and mining social-network graphs. Written by leading authorities in database and Web technologies, it is essential reading for students and practitioners alike"--

There are no comments on this title.

to post a comment.