How to fill a column with random numbers in SQL? I get the same value on every line

Question

How to fill a column with random numbers in SQL? I get the same value on every line

UPDATE CattleProds SET SheepTherapy=(ROUND((RAND()* 10000),0)) WHERE SheepTherapy IS NULL

If I then do a SELECT, I see that my random number is identical on every line . Any ideas on how to create unique random numbers?

+49

sql-server

NibblyPig Feb 15 '11 at 11:26

source share

5 answers

If you are using SQL Server 2008, you can also use

  CRYPT_GEN_RANDOM(2) % 10000

Which seems a bit simpler (it is also evaluated once per line as newid - shown below)

 DECLARE @foo TABLE (col1 FLOAT) INSERT INTO @foo SELECT 1 UNION SELECT 2 UPDATE @foo SET col1 = CRYPT_GEN_RANDOM(2) % 10000 SELECT * FROM @foo

Returns (2 random, probably different )

 col1 ---------------------- 9693 8573

Thinking about an inexplicable downstream the only legitimate reason I can think of is that since the random number generated is between 0-65535, which is not evenly divided by 10,000, some numbers will be slightly exceeded. The way to do this would be to wrap it in a scalar UDF that throws out any number over 60,000 and calls recursively to get the replacement number.

 CREATE FUNCTION dbo.RandomNumber() RETURNS INT AS BEGIN DECLARE @Result INT SET @Result = CRYPT_GEN_RANDOM(2) RETURN CASE WHEN @Result < 60000 OR @@NESTLEVEL = 32 THEN @Result % 10000 ELSE dbo.RandomNumber() END END

+18

Martin Smith Feb 15 '11 at 11:48

source share

While I really enjoy using CHECKSUM, I believe that the best way is to use NEWID (), simply because you don't have to go through complicated math to generate prime numbers.

 ROUND( 1000 *RAND(convert(varbinary, newid())), 0)

You can replace 1000 with any number that you want to set as a limit, and you can always use the plus sign to create a range, let's say you need a random number from 100 to 200, you can do something like:

 100 + ROUND( 100 *RAND(convert(varbinary, newid())), 0)

Combining it in your query:

 UPDATE CattleProds SET SheepTherapy= ROUND( 1000 *RAND(convert(varbinary, newid())), 0) WHERE SheepTherapy IS NULL

+5

Segev -CJ- Shmueli May 12 '13 at 5:46

source share

I tested 2 set-based randomization methods for RAND (), generating 100,000,000 lines each. To level the field, the output is a float between 0-1 and simulate RAND (). Most of the code is a testing framework, so I generalize the algorithms here:

 -- Try #1 used (CAST(CRYPT_GEN_RANDOM(8) AS BIGINT)%500000000000000000+500000000000000000.0)/1000000000000000000 AS Val -- Try #2 used RAND(Checksum(NewId())) -- and to have a baseline to compare output with I used RAND() -- this required executing 100000000 separate insert statements

Using CRYPT_GEN_RANDOM was by far the most random, since there is only a chance .000000001% can see even 1 duplicate when it plucks 10 ^ 8 numbers from a set of 10 ^ 18 numbers. IOW we should not have seen duplicates, and this was not! This kit took 44 seconds to generate on my laptop.

 Cnt Pct ----- ---- 1 100.000000 --No duplicates

SQL Server Runtime: CPU time = 134795 ms, elapsed time = 39274 ms.

 IF OBJECT_ID('tempdb..#T0') IS NOT NULL DROP TABLE #T0; GO WITH L0 AS (SELECT c FROM (VALUES (1),(1),(1),(1),(1),(1),(1),(1),(1),(1),(1),(1),(1),(1),(1),(1)) AS D(c)) -- 2^4 ,L1 AS (SELECT 1 AS c FROM L0 AS A CROSS JOIN L0 AS B) -- 2^8 ,L2 AS (SELECT 1 AS c FROM L1 AS A CROSS JOIN L1 AS B) -- 2^16 ,L3 AS (SELECT 1 AS c FROM L2 AS A CROSS JOIN L2 AS B) -- 2^32 SELECT TOP 100000000 (CAST(CRYPT_GEN_RANDOM(8) AS BIGINT)%500000000000000000+500000000000000000.0)/1000000000000000000 AS Val INTO #T0 FROM L3; WITH x AS ( SELECT Val,COUNT(*) Cnt FROM #T0 GROUP BY Val ) SELECT x.Cnt,COUNT(*)/(SELECT COUNT(*)/100 FROM #T0) Pct FROM X GROUP BY x.Cnt;

Almost 15 orders of magnitude less than random, this method was not quite twice as fast, in just 23 seconds, to generate 100 M numbers.

 Cnt Pct ---- ---- 1 95.450254 -- only 95% unique is absolutely horrible 2 02.222167 -- If this line were the only problem I'd say DON'T USE THIS! 3 00.034582 4 00.000409 -- 409 numbers appeared 4 times 5 00.000006 -- 6 numbers actually appeared 5 times

SQL Server Runtime: CPU time = 77156 ms, elapsed time = 24613 ms.

 IF OBJECT_ID('tempdb..#T1') IS NOT NULL DROP TABLE #T1; GO WITH L0 AS (SELECT c FROM (VALUES (1),(1),(1),(1),(1),(1),(1),(1),(1),(1),(1),(1),(1),(1),(1),(1)) AS D(c)) -- 2^4 ,L1 AS (SELECT 1 AS c FROM L0 AS A CROSS JOIN L0 AS B) -- 2^8 ,L2 AS (SELECT 1 AS c FROM L1 AS A CROSS JOIN L1 AS B) -- 2^16 ,L3 AS (SELECT 1 AS c FROM L2 AS A CROSS JOIN L2 AS B) -- 2^32 SELECT TOP 100000000 RAND(Checksum(NewId())) AS Val INTO #T1 FROM L3; WITH x AS ( SELECT Val,COUNT(*) Cnt FROM #T1 GROUP BY Val ) SELECT x.Cnt,COUNT(*)*1.0/(SELECT COUNT(*)/100 FROM #T1) Pct FROM X GROUP BY x.Cnt;

Only RAND () is useless for set-based generation, so generating a baseline for comparing randomness takes more than 6 hours and needs to be restarted several times to finally get the correct number of output lines. It also seems that randomness leaves much to be desired, although it is better than using a checksum (newid ()) to re-feed each line.

 Cnt Pct ---- ---- 1 99.768020 2 00.115840 3 00.000100 -- at least there were comparitively few values returned 3 times

Due to restarts, runtime cannot be captured.

 IF OBJECT_ID('tempdb..#T2') IS NOT NULL DROP TABLE #T2; GO CREATE TABLE #T2 (Val FLOAT); GO SET NOCOUNT ON; GO INSERT INTO #T2(Val) VALUES(RAND()); GO 100000000 WITH x AS ( SELECT Val,COUNT(*) Cnt FROM #T2 GROUP BY Val ) SELECT x.Cnt,COUNT(*)*1.0/(SELECT COUNT(*)/100 FROM #T2) Pct FROM X GROUP BY x.Cnt;

0

bielawski Jul 06 '17 at 19:30

source share

 require_once('db/connect.php'); //rand(1000000 , 9999999); $products_query = "SELECT id FROM products"; $products_result = mysqli_query($conn, $products_query); $products_row = mysqli_fetch_array($products_result); $ids_array = []; do { array_push($ids_array, $products_row['id']); } while($products_row = mysqli_fetch_array($products_result)); /* echo '<pre>'; print_r($ids_array); echo '</pre>'; */ $row_counter = count($ids_array); for ($i=0; $i < $row_counter; $i++) { $current_row = $ids_array[$i]; $rand = rand(1000000 , 9999999); mysqli_query($conn , "UPDATE products SET code='$rand' WHERE id='$current_row'"); }

0

user3351200 Dec 13 '17 at 8:35

source share

Andomar · Accepted Answer · 2011-02-15 11:28

Instead of rand() use newid() , which is recounted for each row of the result. The usual way is to use a modulus of the checksum. Note that checksum(newid()) can generate -2,147,483,648 and cause an integer overflow on abs() , so we need to use modulo to return the checksum value before converting it to an absolute value.

 UPDATE CattleProds SET SheepTherapy = abs(checksum(NewId()) % 10000) WHERE SheepTherapy IS NULL

This generates a random number between 0 and 9999.

How to fill a column with random numbers in SQL? I get the same value on every line

More articles: