-
Notifications
You must be signed in to change notification settings - Fork 0
/
README
53 lines (34 loc) · 1.37 KB
/
README
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
NOTE
====
It seems that someone has already built this. See:
- https://github.com/gray/statistics-topk
- http://search.cpan.org/dist/Statistics-TopK/
I'd recommend using that module unless you're particularly fond of mine
for some reason.
Algorithm-TopPercent version 0.02
=================================
A Perl extension for tracking the most popular items seen in a large
stream of data using fixed memory.
This module implements a simple algorithm first described to my by Udi
Manber when he was the Chief Scientist at Yahoo! Inc. It implements
a set of data structures and a counting technique that allow you to
track the top-N (or top-N percent) in a stream of data using fixed
memory, provided that certain conditions are met. See the DETAILS
section for more information.
I have reimplemented it mostly from my memory of his description
roughly 8 years ago.
INSTALLATION
To install this module type the following:
perl Makefile.PL
make
make test
make install
DEPENDENCIES
This module requires these other modules and libraries:
None.
COPYRIGHT AND LICENCE
Put the correct copyright and licence information here.
Copyright (C) 2010 by Jeremy Zawodny
This library is free software; you can redistribute it and/or modify
it under the same terms as Perl itself, either Perl version 5.10.1 or,
at your option, any later version of Perl 5 you may have available.