Skip to content

Commit

Permalink
[TCP] tcp_cubic: faster cube root
Browse files Browse the repository at this point in the history
The Newton-Raphson method is quadratically convergent so
only a small fixed number of steps are necessary.
Therefore it is faster to unroll the loop. Since div64_64 is no longer
inline it won't cause code explosion.

Also fixes a bug that can occur if x^2 was bigger than 32 bits.

Signed-off-by: Stephen Hemminger <[email protected]>
Signed-off-by: David S. Miller <[email protected]>
  • Loading branch information
Stephen Hemminger authored and David S. Miller committed Apr 26, 2007
1 parent 8570419 commit c5f5877
Showing 1 changed file with 5 additions and 11 deletions.
16 changes: 5 additions & 11 deletions net/ipv4/tcp_cubic.c
Original file line number Diff line number Diff line change
Expand Up @@ -96,23 +96,17 @@ static void bictcp_init(struct sock *sk)
*/
static u32 cubic_root(u64 a)
{
u32 x, x1;
u32 x;

/* Initial estimate is based on:
* cbrt(x) = exp(log(x) / 3)
*/
x = 1u << (fls64(a)/3);

/*
* Iteration based on:
* 2
* x = ( 2 * x + a / x ) / 3
* k+1 k k
*/
do {
x1 = x;
x = (2 * x + (uint32_t) div64_64(a, x*x)) / 3;
} while (abs(x1 - x) > 1);
/* converges to 32 bits in 3 iterations */
x = (2 * x + (u32)div64_64(a, (u64)x*(u64)x)) / 3;
x = (2 * x + (u32)div64_64(a, (u64)x*(u64)x)) / 3;
x = (2 * x + (u32)div64_64(a, (u64)x*(u64)x)) / 3;

return x;
}
Expand Down

0 comments on commit c5f5877

Please sign in to comment.