Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up

Set of algorithms, used for estimation statistic characteristics on streaming data.

License

NotificationsYou must be signed in to change notification settings

xeronm/rtstat

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Set of algorithms, used for estimation statistic characteristics on streaming data.

License: MIT

1. References

  • P-sqared: R. Jain and I. Chlamtac. (1986) "The P 2 algorithm for dynamic calculation of quantile and histograms without storing observations. Communications of the ACM."
  • P-sqared-extended: Kimmo E. E. Raatikainen. (1987) "Simultaneous estimation of several percentiles. SIMULATION"
  • T-digest: Dunning, T., Ertl, O. (2014) “Computing Extremely Accurate Quantiles Using t-Digests”

2. Requirements

CMake >= 3.1

3. List of algorithms

3.1. Quantile estimation

For test we used three different distribution:

  • Normal
  • Log-normal
  • Concatenation of two normal with rates 45:55

Distributions CDF, PDF

Overall table (Intel(R) Xeon(R) CPU E3-1265L V2 @ 2.50GHz):

distributionalgosamplesrmseitem(ns)
NormalP^21000.9071468.60
NormalT-digest1000.0187568.80
NormalT-digest(M)1000.0102234.60
Log-normalP^21000.4846406.20
Log-normalT-digest1000.0172556.80
Log-normalT-digest(M)1000.0099234.80
Normal-2P^210014.9544466.80
Normal-2T-digest1000.1228516.60
Normal-2T-digest(M)1000.0731237.80
NormalP^210000.0303449.60
NormalT-digest10000.0187584.60
NormalT-digest(M)10000.0226298.40
Log-normalP^210000.1575455.00
Log-normalT-digest10000.0326601.80
Log-normalT-digest(M)10000.0131302.80
Normal-2P^210000.0992431.80
Normal-2T-digest10000.0201570.40
Normal-2T-digest(M)10000.0164303.80
NormalP^2100000.0168347.80
NormalT-digest100000.1379446.20
NormalT-digest(M)100000.0244220.80
Log-normalP^2100000.0076352.60
Log-normalT-digest100000.0061422.80
Log-normalT-digest(M)100000.0326218.80
Normal-2P^2100000.0336366.20
Normal-2T-digest100000.0472460.80
Normal-2T-digest(M)100000.1525219.60

3.1.1. Extended P-Sqared

Normal Distribution: 10k samples

quantileOP^2
0.250052.983652.9851
0.500059.927759.9154
0.750066.516266.5038
0.950076.161876.1764
0.990082.466582.4827
0.999089.884989.5684

Ordinary statistics RMSE: 0.016831

Log-normal Distribution: 10k samples

quantileOP^2
0.250058.156458.1564
0.500059.997859.9985
0.750062.346962.3442
0.950066.351366.3564
0.990070.306770.3616
0.999075.501475.7077

Ordinary statistics RMSE: 0.007599

3.1.2. T-digest

The scaling function differs from original paper is used (borrowed from implementation offolly T-digest)

T-digest Scaling Functions

Clustering algorithm with delta=30, K=3

Normal Distribution: 10k samples

quantileOT-digest
0.250052.983652.9787
0.500059.927759.9392
0.750066.516266.4970
0.950076.161876.1881
0.990082.466582.5131
0.999089.884990.3902

Ordinary statistics RMSE: 0.043114

Log-normal Distribution: 10k samples

quantileOT-digest
0.250058.156458.1645
0.500059.997859.9928
0.750062.346962.3377
0.950066.351366.3247
0.990070.306770.2080
0.999075.501476.0272

Ordinary statistics RMSE: 0.047841

Merge algorithm with delta=100, K=2 and batch_size=200

Normal Distribution: 10k samples

quantileOT-digest
0.250052.983652.9591
0.500059.927759.8356
0.750066.516266.4400
0.950076.161876.1029
0.990082.466582.4698
0.999089.884989.5275

Ordinary statistics RMSE: 0.024352

Log-normal Distribution: 10k samples

quantileOT-digest
0.250058.156458.1529
0.500059.997859.9786
0.750062.346962.3438
0.950066.351366.3019
0.990070.306770.3211
0.999075.501475.0626

Ordinary statistics RMSE: 0.032607

About

Set of algorithms, used for estimation statistic characteristics on streaming data.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

[8]ページ先頭

©2009-2025 Movatter.jp