Number of sketched packets
Our next step is to study how the value being predicted, the number of different packets, affects the accuracy of the estimation. To run the experiments with different numbers of packets, execute the following commands:
Basic estimator
First, we consider a sketch with a single row and 256 columns. Except for small numbers of sketched packets, every percentile tends to grow proportionally to the number of packets, and the proposed estimation predicts the percentile accurately.
Parameter | Value |
---|---|
Packets | {5,10,50,100,500,1000,5000,10000} |
Columns | 256 |
Rows | 1 |
Digest size | 32 |
Hash function | default |
Xi function | default |
Pcap | CAIDA |
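
To make the single-row setup concrete, here is a minimal simulation sketch. It assumes a Fast-AGMS-style sketch, in which each packet digest selects one column and adds +1 or -1 to it, and the estimate is the sum of the squared counters. That construction, the BLAKE2b stand-ins for the hash and xi functions, the synthetic packet identifiers used instead of the CAIDA trace, and the names `sketch_row`, `estimate` and `error_percentile` are all assumptions made for illustration, not the repository's code.

```python
import hashlib


def sketch_row(packet_ids, columns=256, seed=0):
    """Build one sketch row: each packet adds +1 or -1 to a single column."""
    counters = [0] * columns
    salt = seed.to_bytes(8, "big")
    for pid in packet_ids:
        digest = hashlib.blake2b(
            pid.to_bytes(8, "big"), digest_size=8, salt=salt
        ).digest()
        value = int.from_bytes(digest, "big")
        column = (value >> 1) % columns  # stand-in for the hash function
        sign = 1 if value & 1 else -1    # stand-in for the xi function
        counters[column] += sign
    return counters


def estimate(counters):
    """Estimate the number of sketched packets as the sum of squared counters."""
    return sum(c * c for c in counters)


def error_percentile(packets, percentile=50, trials=30, columns=256):
    """Nearest-rank percentile of |estimate - packets| over seeded trials."""
    errors = sorted(
        abs(estimate(sketch_row(range(packets), columns, seed)) - packets)
        for seed in range(trials)
    )
    rank = max(1, -(-percentile * trials // 100))  # ceil(percentile * trials / 100)
    return errors[rank - 1]


if __name__ == "__main__":
    # Synthetic packet identifiers; a subset of the packet counts in the table.
    for packets in (5, 10, 50, 100, 500, 1000):
        print(packets, error_percentile(packets))
```

Under these assumptions, the printed error percentile can be compared across packet counts to see how it scales with the number of sketched packets.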
Showing all the sketch types in the same figure, to better appreciate the differences between them, we can see that
Square sketch
For a sketch with 32 columns and 32 rows, the results are very similar.
Parameter | Value |
---|---|
Packets | {5,10,50,100,500,1000,5000,10000} |
Columns | 32 |
Rows | 32 |
Digest size | 32 |
Hash function | default |
Xi function | default |
Pcap | CAIDA |
Average function | mean |
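
The role of the average function in a multi-row sketch can be illustrated as follows. This is a sketch under the same Fast-AGMS-style assumption: every row hashes each packet independently, each row yields its own estimate, and the row estimates are combined with the mean. The helper names `square_sketch` and `estimate`, and the use of BLAKE2b salted with the row index, are hypothetical choices for this example.

```python
import hashlib
from statistics import mean


def square_sketch(packet_ids, rows=32, columns=32):
    """Build a rows x columns sketch; each row uses its own salted hash."""
    counters = [[0] * columns for _ in range(rows)]
    for pid in packet_ids:
        key = pid.to_bytes(8, "big")
        for row in range(rows):
            digest = hashlib.blake2b(
                key, digest_size=8, salt=row.to_bytes(8, "big")
            ).digest()
            value = int.from_bytes(digest, "big")
            column = (value >> 1) % columns
            counters[row][column] += 1 if value & 1 else -1
    return counters


def estimate(counters, average=mean):
    """Combine the per-row estimates (sum of squared counters) with `average`."""
    return average([sum(c * c for c in row) for row in counters])


if __name__ == "__main__":
    packets = 1000
    print(packets, estimate(square_sketch(range(packets))))
```

Passing a different callable as `average`, for example `statistics.median`, swaps the combining step without touching the sketch itself.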
Conclusion
As expected, the sketch's estimation becomes less accurate in absolute terms as the number of sketched packets increases. However, because its standard error is proportional to the number of sketched packets, the relative standard error is constant. All sketches also show very similar accuracy.
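
The step from a proportional standard error to a constant relative standard error can be written out explicitly. Here $n$ is the number of sketched packets, $\sigma(n)$ is the standard error of the estimate, and $k$ is a proportionality constant. These symbols are introduced only for this illustration.

```latex
\sigma(n) = k \, n
\quad\Longrightarrow\quad
\frac{\sigma(n)}{n} = k
```

The absolute error therefore grows with $n$, while the error relative to the value being estimated does not.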