Integer factorization¶
In this article we list several algorithms for the factorization of integers, each of which can be either fast or varying levels of slow depending on their input.
Notice, if the number that you want to factorize is actually a prime number, most of the algorithms will run very slowly. This is especially true for Fermat's, Pollard's p-1 and Pollard's rho factorization algorithms. Therefore, it makes the most sense to perform a probabilistic (or a fast deterministic) primality test before trying to factorize the number.
Trial division¶
This is the most basic algorithm to find a prime factorization.
We divide by each possible divisor
The smallest divisor must be a prime number.
We remove the factored number, and continue the process.
If we cannot find any divisor in the range
vector<long long> trial_division1(long long n) {
vector<long long> factorization;
for (long long d = 2; d * d <= n; d++) {
while (n % d == 0) {
factorization.push_back(d);
n /= d;
}
}
if (n > 1)
factorization.push_back(n);
return factorization;
}
Wheel factorization¶
This is an optimization of the trial division.
Once we know that the number is not divisible by 2, we don't need to check other even numbers.
This leaves us with only
vector<long long> trial_division2(long long n) {
vector<long long> factorization;
while (n % 2 == 0) {
factorization.push_back(2);
n /= 2;
}
for (long long d = 3; d * d <= n; d += 2) {
while (n % d == 0) {
factorization.push_back(d);
n /= d;
}
}
if (n > 1)
factorization.push_back(n);
return factorization;
}
This method can be extended further.
If the number is not divisible by 3, we can also ignore all other multiples of 3 in the future computations.
So we only need to check the numbers
Here is an implementation for the prime number 2, 3 and 5. It is convenient to store the skipping strides in an array.
vector<long long> trial_division3(long long n) {
vector<long long> factorization;
for (int d : {2, 3, 5}) {
while (n % d == 0) {
factorization.push_back(d);
n /= d;
}
}
static array<int, 8> increments = {4, 2, 4, 2, 4, 6, 2, 6};
int i = 0;
for (long long d = 7; d * d <= n; d += increments[i++]) {
while (n % d == 0) {
factorization.push_back(d);
n /= d;
}
if (i == 8)
i = 0;
}
if (n > 1)
factorization.push_back(n);
return factorization;
}
If we continue exending this method to include even more primes, better percentages can be reached, but the skip lists will become larger.
Precomputed primes¶
Extending the wheel factorization method indefinitely, we will only be left with prime numbers to check.
A good way of checking this is to precompute all prime numbers with the Sieve of Eratosthenes until
vector<long long> primes;
vector<long long> trial_division4(long long n) {
vector<long long> factorization;
for (long long d : primes) {
if (d * d > n)
break;
while (n % d == 0) {
factorization.push_back(d);
n /= d;
}
}
if (n > 1)
factorization.push_back(n);
return factorization;
}
Fermat's factorization method¶
We can write an odd composite number
Fermat's factorization method tries to exploit this fact by guessing the first square
int fermat(int n) {
int a = ceil(sqrt(n));
int b2 = a*a - n;
int b = round(sqrt(b2));
while (b * b != b2) {
a = a + 1;
b2 = a*a - n;
b = round(sqrt(b2));
}
return a - b;
}
This factorization method can be very fast if the difference between the two factors
However, there are still a large number of optimization options regarding this approach.
By looking at the squares
Pollard's method¶
It is very likely that at least one factor of a number is
The idea comes from Fermat's little theorem.
Let a factorization of
This also means that
So for any
Therefore, if
It is clear, that the smallest
Notice, if
Some composite numbers don't have
In the following implementation we start with
long long pollards_p_minus_1(long long n) {
int B = 10;
long long g = 1;
while (B <= 1000000 && g < n) {
long long a = 2 + rand() % (n - 3);
g = gcd(a, n);
if (g > 1)
return g;
// compute a^M
for (int p : primes) {
if (p >= B)
continue;
long long p_power = 1;
while (p_power * p <= B)
p_power *= p;
a = power(a, p_power, n);
g = gcd(a - 1, n);
if (g > 1 && g < n)
return g;
}
B *= 2;
}
return 1;
}
Observe that this is a probabilistic algorithm. A consequence of this is that there is a possibility of the algorithm being unable to find a factor at all.
The complexity is
Pollard's rho algorithm¶
Pollard's Rho Algorithm is yet another factorization algorithm from John Pollard.
Let the prime factorization of a number be
In this instance, we are not interested in the sequence
Here is a visualization of such a sequence

Yet, there is still an open question.
How can we exploit the properties of the sequence
It's actually quite easy.
There is a cycle in the sequence
Therefore, if we find two indices
To find the cycle, we can use any common cycle detection algorithm.
Floyd's cycle-finding algorithm¶
This algorithm finds a cycle by using two pointers moving over the sequence at differing speeds.
During each iteration, the first pointer will advance one element over, while the second pointer advances to every other element.
Using this idea it is easy to observe that if there is a cycle, at some point the second pointer will come around to meet the first one during the loops.
If the cycle length is
This algorithm is also known as the Tortoise and Hare algorithm, based on the tale in which a tortoise (the slow pointer) and a hare (the faster pointer) have a race.
It is actually possible to determine the parameter
function floyd(f, x0):
tortoise = x0
hare = f(x0)
while tortoise != hare:
tortoise = f(tortoise)
hare = f(f(hare))
return true
Implementation¶
First, here is an implementation using the Floyd's cycle-finding algorithm.
The algorithm generally runs in
long long mult(long long a, long long b, long long mod) {
return (__int128)a * b % mod;
}
long long f(long long x, long long c, long long mod) {
return (mult(x, x, mod) + c) % mod;
}
long long rho(long long n, long long x0=2, long long c=1) {
long long x = x0;
long long y = x0;
long long g = 1;
while (g == 1) {
x = f(x, c, n);
y = f(y, c, n);
y = f(y, c, n);
g = gcd(abs(x - y), n);
}
return g;
}
The following table shows the values of
The implementation uses a function mult
, that multiplies two integers __int128
for 128-bit integer.
If GCC is not available, you can using a similar idea as binary exponentiation.
long long mult(long long a, long long b, long long mod) {
long long result = 0;
while (b) {
if (b & 1)
result = (result + a) % mod;
a = (a + a) % mod;
b >>= 1;
}
return result;
}
Alternatively you can also implement the Montgomery multiplication.
As stated previously, if
Brent's algorithm¶
Brent implements a similar method to Floyd, using two pointers.
The difference being that instead of advancing the pointers by one and two places respectively, they are advanced by powers of two.
As soon as
function floyd(f, x0):
tortoise = x0
hare = f(x0)
l = 1
while tortoise != hare:
tortoise = hare
repeat l times:
hare = f(hare)
if tortoise == hare:
return true
l *= 2
return true
Brent's algorithm also runs in linear time, but is generally faster than Floyd's, since it uses less evaluations of the function
Implementation¶
The straightforward implementation of Brent's algorithm can be sped up by omitting the terms
long long brent(long long n, long long x0=2, long long c=1) {
long long x = x0;
long long g = 1;
long long q = 1;
long long xs, y;
int m = 128;
int l = 1;
while (g == 1) {
y = x;
for (int i = 1; i < l; i++)
x = f(x, c, n);
int k = 0;
while (k < l && g == 1) {
xs = x;
for (int i = 0; i < m && i < l - k; i++) {
x = f(x, c, n);
q = mult(q, abs(y - x), n);
}
g = gcd(q, n);
k += m;
}
l *= 2;
}
if (g == n) {
do {
xs = f(xs, c, n);
g = gcd(abs(xs - y), n);
} while (g == 1);
}
return g;
}
The combination of a trial division for small prime numbers together with Brent's version of Pollard's rho algorithm makes a very powerful factorization algorithm.