000 01733cam a22002895a 4500
008 161024b xxu||||| |||| 00| 0 eng d
040 _bENG
_dEG-ScBUE
082 0 4 _a004
_222
_bPAR
100 1 _aParsian, Mahmoud,
_eauthor.
245 1 0 _aData algorithms /
_cMahmoud Parsian.
246 1 4 _aData algorithms : recipes for scaling up with Hadoop and Spark
250 _a1st ed.
260 _aBeijing :
_bO'Reily ;
_cc. 2015.
300 _axxxvii, 737 p. :
_bill. ;
_c23 cm.
500 _aIncludes bibliographic references (page 721-723) and index.
505 0 _aSecondary sort : introduction -- Secondary sort : a detailed example -- Top 10 list -- Left outer join -- Order inversion -- Moving average -- Market basket analysis -- Common friends -- Recommendation engines using MapReduce -- Content-based recommendation : movies -- Smarter email marketing with the Markov Model -- K-means clustering -- k-nearest neighbors -- Naive bayes -- Sentiment analysis -- Finding, counting, and listing all triangles in large graphs -- K-mer counting -- DNA sequencing -- Cox regression -- Cochran-Armitage test for trend -- Allelic frequency -- The T-test -- Pearson correlation -- DNA base count -- RNA sequencing -- Gene aggregation -- Linear regression -- MapReduce and monoids -- The small files problem -- Huge cache for MapReduce -- The bloom filter.
590 _ashima
630 0 0 _aApache Hadoop.
630 0 0 _aSPARK (Electronic resource)
650 7 _aApache Hadoop
_2BUEsh
650 7 _aComputer programming.
_2BUEsh
651 _2BUEsh
653 _bCOMSCI
_cOctober2016
906 _a7
_bcbc
_ccopycat
_d2
_encip
_f20
_gy-gencatlg
942 _2ddc
999 _c22837
_d22809