Problem
You have an array with all the numbers from 1 to N, where N is at most 32,000. The array may have duplicate entries and you do not know what N is. With only 4KB of memory available, how would you print all duplicate elements in the array?
Solution
4KB = 32768 bits.
We can use a byte array to represent if we have seen number i. If so, we make it 1. If we encounter a duplicate, we would have marked the particular position with 1, so we would know we have a duplicate. We have 4KB of memory which means we can address up to 8 * 4 * (2^10) bits. Note that 32*(2^10) bits is greater than 32000. We can create a bit vector with 32000 bits, where each bit represents one integer.
Then we print it out.
public static void findDuplicates(int[] array) {
byte[] bitVector = new byte[32000 / 8];
for (int number: array) {
int posInArray = number >> 3;
int posInBit = number % 8;
if ((bitVector[posInArray] & (1<< posInBit)) > 0)
// found duplicates
System.out.println(number);
else
bitVector[posInArray] |= (1<< posInBit);
}
}
Lets use a cleaner approach, by using BitSet in java. Note that bit set starts with 0, but number starts with 1.
public static void checkDuplicates(int[] array) {
BitSet bs = new BitSet(32000);
for (int i = 0; i < array.length; i++) {
int num = array[i];
int num0 = num - 1; // bitset starts at 0, numbers start at 1
if (bs.get(num0)) {
System.out.println(num);
} else {
bs.set(num0);
}
}
}