TY - BOOK AU - Parsian,Mahmoud TI - Data algorithms U1 - 004 22 PY - 2015/// CY - Beijing PB - O'Reily KW - Apache Hadoop KW - SPARK (Electronic resource) KW - BUEsh KW - Computer programming KW - COMSCI KW - October2016 N1 - Includes bibliographic references (page 721-723) and index; Secondary sort : introduction -- Secondary sort : a detailed example -- Top 10 list -- Left outer join -- Order inversion -- Moving average -- Market basket analysis -- Common friends -- Recommendation engines using MapReduce -- Content-based recommendation : movies -- Smarter email marketing with the Markov Model -- K-means clustering -- k-nearest neighbors -- Naive bayes -- Sentiment analysis -- Finding, counting, and listing all triangles in large graphs -- K-mer counting -- DNA sequencing -- Cox regression -- Cochran-Armitage test for trend -- Allelic frequency -- The T-test -- Pearson correlation -- DNA base count -- RNA sequencing -- Gene aggregation -- Linear regression -- MapReduce and monoids -- The small files problem -- Huge cache for MapReduce -- The bloom filter ER -