Find duplicates between arrays

Question

Find duplicates between arrays

Suppose you are given two arrays of integers of constant length that are 3, and you are always sure that the two data elements of the two attributes will have the same value.

therefore, suppose array A has three values: a, b, c. and array B has three values: d, e, f.

We are sure that the two values will be the same. we are invited to put these four different values into an array of size 4, so that the output array C should have the same values from arrays A and B in indices 1 and 2, and at indices 0 and 3 it should have different values from array A and B I implemented it but am not really satisfied with this solution ... does anyone have a better solution? besides the one that puts my counters in an array ... :)

int[] a = { 1, 201, 354 }; int[] b = { 404, 201, 354 }; int[] c = new int[4]; for (int i = 0; i < c.Length; i++) { Console.WriteLine(c[i]); }

+7

c # algorithm

kl. Feb 11 '10 at 21:14

source share

16 answers

Replace

 // IRQ. 20100211. Deleted unncessary code

from

 var c = a.Concat(b).Distinct().ToArray();

Update:

New:

 var same = a.Intersect(b); var c = a.Except(same).Concat(same).Concat(b.Except(same)).ToArray();

or these

 var c = a.Except(b).Concat(a.Intersect(b)).Concat(b.Except(a)); var c = a.Except(b).Concat(a).Concat(b).Distinct();

+1

dh. Feb 11 '10 at 21:38

source share

What you are looking for is just a set of two arrays (the set contains each element no more than once). Solution in C ++:

 #include <set> int main () { int a[] = { 1,2,3 }; int b[] = { 4,2,3 }; std::set<int> s(a, a+3); s.insert(b, b+3); }

+1

rmn Feb 11 '10 at 21:39

source share

If you have LINQ at your disposal, the following code will suffice:

 int[] c = a.Union(b).ToArray();

The Union checks duplicates, so no further verification is required:

Returns: System.Collections.Generic.IEnumerable which contains elements from both input sequences, excluding duplicates.

+1

Aistina Feb 11 '10 at 21:52

source share

Here's a cool solution in C (++)

 int a[3], b[3]; /* the two arrays */ int c[4]; /* target */ int s=0, t=0, k; int i; for (i=0;i<3;i++) { k = a[i]-b[i]; s += k; t += k*(a[i]+b[i]); } /* At this point s is the difference of the two distinct elements and t is the difference of their squares, ie s = x - y and t = x^2 - y^2 because (xy)(x+y) = x^2-yx+yx-y^2 = x^2-y^2 Because the two elements are distinct, s != 0 and we can easily divide t by s to get (x + y), from which then we have s == x - y t == x + y ie x = (s+t)/2 and y=(ts)/2 */ t /= s; int x = (s + t) / 2; int y = (t - s) / 2; /* Now x, y are the distinct elements, x from array a and y from array b */ /* Fill in the results */ c[0] = x; c[3] = y; /* If a[0] is non-shared, then a[1] must be the first shared element; otherwise a[0] */ c[1] = (a[0] == x ? a[1] : a[0]); /* If a[2] is non-shared, then a[1] must be the last shared element; otherwise a[2] */ c[2] = (a[2] == x ? a[1] : a[2]);

Example: a = {1, 3, 5}, b = {3, 5, 2}

 s = (1-3)+(3-5)+(5-2) = -2-2+3 = -1 t = (1-3)*(1+3)+(3-5)*(3+5)+(5-2)*(5+2) = -8-16+21 = -3 t / s = 3 x = (-1 + 3) / 2 = 1 y = (3 - (-1)) / 2 = 2 c[0] = 1 c[3] = 2 c[1] = 3 c[2] = 5

therefore c gets the value {1,3,5,2} as desired!

For fun, the following version is here:

 /* Declarations */ int a[3], b[3], c[4]; int s = 0, t = 0, k, i; /* Actual algorithm */ for (i = 0; i < 3; i++) { s += (k = a[i]-b[i]); t += k * (a[i]+b[i]); } t /= s; c[0] = (s + t) >> 1; c[3] = (t - s) >> 1; c[1] = (a[0] == x ? a[1] : a[0]); c[2] = (a[2] == x ? a[1] : a[2]);

Note that it is cold enough if the problem is generalized, so that n-1 elements are separated, and in both arrays there is one unique element, this is the O (n) algorithm, while many intersection and / or union algorithms are based on O (n log n) :)

+1

Antti huima Feb 12 '10 at 2:09

source share

This part

  if (a[0] == b[0]) { counter0++; } if (a[0] == b[1]) { counter0++; } if (a[0] == b[2]) { counter0++; } if (a[1] == b[0]) { counter1++; } if (a[1] == b[1]) { counter1++; } if (a[1] == b[2]) { counter1++; } if (a[2] == b[0]) { counter2++; } if (a[2] == b[1]) { counter2++; } if (a[2] == b[2]) { counter2++; }

It may have been rewritten as

 for (i=0; i<3; i++) { for (j=0; j<3; j++) { switch(i) { case 0: if (a[i] == b[j]) { counter0++; } break; case 1: if (a[i] == b[j]) { counter1++; } break; case 2: if (a[i] == b[j]) { counter2++; } break; } } }

The other part with other counters should be written similarly. Then you could reorganize this into a separate method and just pass arrays and counters to it.

LINQ might be another option, but I don’t know exactly how to write something like this.

(Did not try to compile this, but the idea is clear?)

UPDATE: If you can put counters in an array, this might work:

 for (i=0; i<3; i++) { for (j=0; j<3; j++) { if (a[i] == b[j]) { counters[i]++; } } }

0

FrustratedWithFormsDesigner Feb 11 '10 at 21:17

source share

I am sure I do not understand the question.

You speak:

We are sure that the two values will be the same. we will be asked to put these four different meanings

What four different meanings do you mean? Two are the same? Because it means the word "these."

You mean: Take 4 unique values and put them in an array?

So that:

1, 2, 3
2, 3, 4

becomes:

1, 2, 3, 4?

This is easy:

 int[] c = a.Concat(b).Distinct().ToArray();

It uses the Linq extension methods for .NET 3.5. If you are not using .NET 3.5, you can do this:

 Dictionary<int, int> c1 = new Dictionary<int, int>(); foreach (var x in a) c1[x] = 1; foreach (var x in b) c1[x] = 1; int[] c = new List<int>(c1.Keys).ToArray();

Now if you need an order:

The first value that happened only once
First value that happened twice
The second value that happened twice
The second value that happened only once

Then, I'm afraid I don’t have a single-line interface for you, I’ll have to think about it a little more.

May I ask what is context? Why is this a requirement?

0

Lasse Vågsæther Karlsen Feb 11 '10 at 21:23

source share

 bool isUsed[6]={true, true, true, true, true, true}; int values[6]; int valuesCount = 0; int i,j; for( i = 0 ; i < 3 ; i++) { bool goodValue = true; for ( j = 0; j < valuesCount; j++) { if(values[j] == a[i]) { isUsed[j] = false; goodValue = false; break; } } if(goodValue) { values[valuesCount++]=a[i]; } } //same for b[i] for( i = 0 ; i < valuesCount; i++) { if( isUsed[i] ) printf("%i ", values[i]); }

0

botismarius Feb 11 '10 at 21:24

source share

Instead of counter1, counter2, counter3:

 counter[3];

A lot of things get easier from there. You can refer to everything in cycles, for starters.

0

Dean j Feb 11 '10 at 21:25

source share

I came up with this as the first draft, but I think it might require some improvement. It also does not satisfy the requirement for duplicates at positions 1 and 2 and unique numbers at 0 and 3 in the array. I thought I would post it anyway, so you can understand how this might look:

  int[] a = { 1, 201, 354 }; int[] b = { 404, 201, 354 }; int[] c = new int[ 4 ]; // Start by just copying over one of the arrays completely. a.CopyTo( c, 0 ); // Loop through b and compare each number against each // each number in a. foreach( int i in b ) { // Assume that you're not dealing with a duplicate bool found = true; foreach( int j in a ) { // If you find a duplicate, set found to false if( i == j ) { found = false; } } // If you haven't found a duplicate this is the // number you want - add it to the array. if (found == true) { c[3] = i; } }

0

Anne schuessler Feb 11 '10 at 21:34

source share

I am trying to give a short answer. However, he suggests that the input will be correct.

 int c1, c2, i; c1 = a[0] == b[0] ? 0 : (a[0] == b[1] ? 1 : 2); // index of a[0] in array 'b' c2 = a[1] == b[0] ? 0 : (a[1] == b[1] ? 1 : 2); // index of a[1] in array 'b' for(i=0; i<2; i++) Console.WriteLine(a[i]); Console.WriteLine(b[3-c1-c2]); // looks quite hacky but it is actually element of 'b' not in array 'a'

0

Niraj nawanit Feb 11 '10 at 21:51

source share

Here is simple code, but it assumes that the values of a and b are always positive.

 int[] a = { 1, 201, 354 }; int[] b = { 404, 201, 354 }; int[] c = { -1, -1, -1, -1}; for(int i = 0; i < 3; i++){ int notfound = 1; for(int j = 0; j < 3; j++){ if(b[j] == -1) continue; if(a[i] == b[j]){ b[j] = -1; if(c[1] == -1) c[1] = a[i]; else c[2] = a[i]; notfound = 0; break; } } if(notfound) c[0] = a[i]; } int k = 0; while(b[k++] == -1); c[3] = b[k];

I have not tested it, but I hope you understand this idea. This uses very little extra space (just space for notfound, which can be made by boolean and index variables) and should be pretty fast.

0

Justin peel Feb 11 '10 at 22:11

source share

 int[] a = { 204, 534, 1 }; int[] b = { 204, 534, 401 }; int[] c = new int[4]; int x = 3, y = 3, k = 1; for(int i=0; i<3; i++) for(int j=0; j<3; j++) if (a[i] == b[j]) { c[k++] = a[i]; x -= i; y -= j; break; } c[0] = a[x]; c[3] = b[y];

0

Jimmy Feb 11 '10 at 22:40

source share

Sapph provided an answer that is about as clean as it is, but here is one of them if performance is extremely important. Checking the boundaries of a .NET array is likely to add some overhead, but in C this compiles up to 64 instructions without branches.

 int[] a = { 204, 534, 1 }; int[] b = { 204, 534, 401 }; int[] c = new int[4]; // pick the value from a that is not in b for c[0] // a[0] not in b is implied by a[1] in b and a[2] in b int a1_not_in_b = Convert.ToInt32(a[1] != b[0] & a[1] != b[1] & a[1] != b[2]); int a2_not_in_b = Convert.ToInt32(a[2] != b[0] & a[2] != b[1] & a[2] != b[2]); // bitfield of 2 bit values equivalent to the array {0,1,2,0,1} int idxs = 0 | 1 << 2 | 2 << 4 | 0 << 6 | 1 << 8; // if a[1] not in b start at 1, if a[2] not in b start at 2, else start at 0 idxs >>= 2*a1_not_in_b | 4*a2_not_in_b; c[0] = a[(idxs >> 0) & 3]; c[1] = a[(idxs >> 2) & 3]; c[2] = a[(idxs >> 4) & 3]; // pick the value from b that is not in a // b[0] not in a is implied by b[1] in a and b[2] in a int b1_not_in_a = Convert.ToInt32(a[0] != b[1] & a[1] != b[1] & a[2] != b[1]); int b2_not_in_a = Convert.ToInt32(a[0] != b[2] & a[1] != b[2] & a[2] != b[2]); c[3] = b[b1_not_in_a | 2*b2_not_in_a];

0

Ants aasma Feb 14 '10 at 23:57

source share

Faster?

 using System; using System.Linq; using sw = System.Diagnostics.Stopwatch; class Program { static void Main() { int[] a = new int[] { 1, 2, 3 }, // try: a={1,2,2} b={2,2,3} b = new int[] { 4, 2, 3 }, c = new int[4]; sw sw = sw.StartNew(); for (int i = 5000000; i > 0; i--) { dssd1(a, b, c); dssd1(b, a, c); } Console.Write(sw.ElapsedMilliseconds); Console.Read(); } static void dssd0(int[] a, int[] b, int[] c) // 6710 ms. { int[] s = a.Intersect(b).ToArray(); // same int[] d = a.Union(b).Except(s).ToArray(); // diff c[0] = d[0]; c[1] = s[0]; c[2] = s[1]; c[3] = d[1]; } static void dssd1(int[] a, int[] b, int[] c) // 61 ms. { if (a[0] != b[0] && a[0] != b[1] && a[0] != b[2]) { c[0] = a[0]; c[1] = a[1]; c[2] = a[2]; goto L0; } if (a[1] != b[0] && a[1] != b[1] && a[1] != b[2]) { c[0] = a[1]; c[1] = a[0]; c[2] = a[2]; goto L0; } c[0] = a[2]; c[1] = a[0]; c[2] = a[1]; L0: if (b[0] != c[1] && b[0] != c[2]) { c[3] = b[0]; return; } if (b[1] != c[1] && b[1] != c[2]) { c[3] = b[1]; return; } c[3] = b[2]; } }

Fastest?

  L0: c[3] = b[0] != c[1] && b[0] != c[2] ? b[0] : // 49 ms. b[1] != c[1] && b[1] != c[2] ? b[1] : b[2];

0

P_P Nov 28 '16 at 10:54

source share

How about this?

 private static int[] FindDuplicates(int[] arrA,int[] arrB) { var aList=new List<int>(); Array.Sort(arrA); Array.Sort(arrB); for(int i=0;i<arrA.Length;i++) { if(arrB.Contains(arrA[i])) { aList.Add(arrA[i]); } } return aList.ToArray(); }

0

abhi Apr 15 '17 at 0:42

source share

Sapph · Accepted Answer · 2010-02-11T21:23:35+0000

Sorry, I am re-reading more carefully, and I think this is what you want. Please advise.:)

 int[] same = a.Intersect(b).ToArray(); ; int[] diff = a.Union(b).Except(same).ToArray(); int[] c = new int[] { diff[0], same[0], same[1], diff[1] };

Find duplicates between arrays

More articles: