Sieve of Eratosthenes¶
Sieve of Eratosthenes is an algorithm for finding all the prime numbers in a segment
The algorithm is very simple:
at the beginning we write down all numbers between 2 and
In the following image you can see a visualization of the algorithm for computing all prime numbers in the range

The idea behind is this: A number is prime, if none of the smaller prime numbers divides it. Since we iterate over the prime numbers in order, we already marked all numbers, which are divisible by at least one of the prime numbers, as divisible. Hence if we reach a cell and it is not marked, then it isn't divisible by any smaller prime number and therefore has to be prime.
Implementation¶
int n;
vector<bool> is_prime(n+1, true);
is_prime[0] = is_prime[1] = false;
for (int i = 2; i <= n; i++) {
if (is_prime[i] && (long long)i * i <= n) {
for (int j = i * i; j <= n; j += i)
is_prime[j] = false;
}
}
This code first marks all numbers except zero and one as potential prime numbers, then it begins the process of sifting composite numbers.
For this it iterates over all numbers from int
, the additional verification is done using type long long
before the second nested loop.
Using such implementation the algorithm consumes
Asymptotic analysis¶
It's simple to prove a running time of is_prime
check, the inner loop runs (at most)
Let's prove that algorithm's running time is
Let's recall two known facts.
- The number of prime numbers less than or equal to
- The
Thus we can write down the sum in the following way:
Here we extracted the first prime number 2 from the sum, because
Now, let's evaluate this sum using the integral of a same function over
The antiderivative for the integrand is
Now, returning to the original sum, we'll get its approximate evaluation:
You can find a more strict proof (that gives more precise evaluation which is accurate within constant multipliers) in the book authored by Hardy & Wright "An Introduction to the Theory of Numbers" (p. 349).
Different optimizations of the Sieve of Eratosthenes¶
The biggest weakness of the algorithm is, that it "walks" along the memory multiple times, only manipulating single elements.
This is not very cache friendly.
And because of that, the constant which is concealed in
Besides, the consumed memory is a bottleneck for big
The methods presented below allow us to reduce the quantity of the performed operations, as well as to shorten the consumed memory noticeably.
Sieving till root¶
Obviously, to find all the prime numbers until
int n;
vector<bool> is_prime(n+1, true);
is_prime[0] = is_prime[1] = false;
for (int i = 2; i * i <= n; i++) {
if (is_prime[i]) {
for (int j = i * i; j <= n; j += i)
is_prime[j] = false;
}
}
Such optimization doesn't affect the complexity (indeed, by repeating the proof presented above we'll get the evaluation
Sieving by the odd numbers only¶
Since all even numbers (except
First, it will allow us to halve the needed memory. Second, it will reduce the number of operations performed by algorithm approximately in half.
Memory consumption and speed of operations¶
We should notice, that these two implementations of the Sieve of Eratosthenes use vector<bool>
.
vector<bool>
is not a regular container that stores a series of bool
(as in most computer architectures a bool
takes one byte of memory).
It's a memory-optimization specialization of vector<T>
, that only consumes
Modern processors architectures work much more efficiently with bytes than with bits as they usually cannot access bits directly.
So underneath the vector<bool>
stores the bits in a large continuous memory, accesses the memory in blocks of a few bytes, and extracts/sets the bits with bit operations like bit masking and bit shifting.
Because of that there is a certain overhead when you read or write bits with a vector<bool>
, and quite often using a vector<char>
(which uses 1 byte for each entry, so 8x the amount of memory) is faster.
However, for the simple implementations of the Sieve of Eratosthenes using a vector<bool>
is faster.
You are limited by how fast you can load the data into the cache, and therefore using less memory gives a big advantage.
A benchmark (link) shows, that using a vector<bool>
is between 1.4x and 1.7x faster than using a vector<char>
.
The same considerations also apply to bitset
.
It's also an efficient way of storing bits, similar to vector<bool>
, so it takes only bitset
performs a bit worse than vector<bool>
.
Another drawback from bitset
is that you need to know the size at compile time.
Segmented Sieve¶
It follows from the optimization "sieving till root" that there is no need to keep the whole array is_prime[1...n]
at all times.
For sieving it is enough to just keep the prime numbers until the root of prime[1... sqrt(n)]
, split the complete range into blocks, and sieve each block separately.
Let
As discussed previously, the typical implementation of the Sieve of Eratosthenes is limited by the speed how fast you can load data into the CPU caches.
By splitting the range of potential prime numbers vector<bool>
with a vector<char>
, and gain some additional performance as the processors can handle read and writes with bytes directly and don't need to rely on bit operations for extracting individual bits.
The benchmark (link) shows, that using a vector<char>
is about 3x faster in this situation than using a vector<bool>
.
A word of caution: those numbers might differ depending on architecture, compiler, and optimization levels.
Here we have an implementation that counts the number of primes smaller than or equal to
int count_primes(int n) {
const int S = 10000;
vector<int> primes;
int nsqrt = sqrt(n);
vector<char> is_prime(nsqrt + 2, true);
for (int i = 2; i <= nsqrt; i++) {
if (is_prime[i]) {
primes.push_back(i);
for (int j = i * i; j <= nsqrt; j += i)
is_prime[j] = false;
}
}
int result = 0;
vector<char> block(S);
for (int k = 0; k * S <= n; k++) {
fill(block.begin(), block.end(), true);
int start = k * S;
for (int p : primes) {
int start_idx = (start + p - 1) / p;
int j = max(start_idx, p) * p - start;
for (; j < S; j += p)
block[j] = false;
}
if (k == 0)
block[0] = block[1] = false;
for (int i = 0; i < S && start + i <= n; i++) {
if (block[i])
result++;
}
}
return result;
}
The running time of block sieving is the same as for regular sieve of Eratosthenes (unless the size of the blocks is very small), but the needed memory will shorten to
Find primes in range¶
Sometimes we need to find all prime numbers in a range
To solve such a problem, we can use the idea of the Segmented sieve.
We pre-generate all prime numbers up to
vector<char> segmentedSieve(long long L, long long R) {
// generate all primes up to sqrt(R)
long long lim = sqrt(R);
vector<char> mark(lim + 1, false);
vector<long long> primes;
for (long long i = 2; i <= lim; ++i) {
if (!mark[i]) {
primes.emplace_back(i);
for (long long j = i * i; j <= lim; j += i)
mark[j] = true;
}
}
vector<char> isPrime(R - L + 1, true);
for (long long i : primes)
for (long long j = max(i * i, (L + i - 1) / i * i); j <= R; j += i)
isPrime[j - L] = false;
if (L == 1)
isPrime[0] = false;
return isPrime;
}
It's also possible that we don't pre-generate all prime numbers:
vector<char> segmentedSieveNoPreGen(long long L, long long R) {
vector<char> isPrime(R - L + 1, true);
long long lim = sqrt(R);
for (long long i = 2; i <= lim; ++i)
for (long long j = max(i * i, (L + i - 1) / i * i); j <= R; j += i)
isPrime[j - L] = false;
if (L == 1)
isPrime[0] = false;
return isPrime;
}
Obviously, the complexity is worse, which is
Linear time modification¶
We can modify the algorithm in a such a way, that it only has linear time complexity. This approach is described in the article Linear Sieve. However, this algorithm also has its own weaknesses.
Practice Problems¶
- Leetcode - Four Divisors
- Leetcode - Count Primes
- SPOJ - Printing Some Primes
- SPOJ - A Conjecture of Paul Erdos
- SPOJ - Primal Fear
- SPOJ - Primes Triangle (I)
- Codeforces - Almost Prime
- Codeforces - Sherlock And His Girlfriend
- SPOJ - Namit in Trouble
- SPOJ - Bazinga!
- Project Euler - Prime pair connection
- SPOJ - N-Factorful
- SPOJ - Binary Sequence of Prime Numbers
- UVA 11353 - A Different Kind of Sorting
- SPOJ - Prime Generator
- SPOJ - Printing some primes (hard)
- Codeforces - Nodbach Problem
- Codeforces - Colliders