Streaming Flashcards
(21 cards)
what are the performance metric considered for the streaming model
size of working memory, sequential passes over the stream, processing time per item
majority problem
Boyer-Moore algorithm pseudocode + description
Prove the correctness of the Boyer-Moore algorithm
What is a m-sample of a set X
what is a m-sample of a streaming
Prof of the correctness of the reservoir sampling
Reservoir sampling pseudo
Describe the frequent items problem
Describe the e approximate frequent items problem
Describe sticky sampling + pseudo + analysis of the performance + prove the correctness of the algorithm
Prove the working memory of the sticky sampling
Prove the correctness of the sticky sampling
formula of the frequency of moments Fk (what are F0, F1 AND etc)
Probabilistic counting algorithm elements + prof
Count min sketch elements + pseudo
Count min sketch analysis (formula/relation + dim)
Count sketch ingredients + pseudo
Prove that count sketch is an unbiased estimator
Bloom filter (description + ingredients + pseudo)
False positive rate of the bloom filter + prof