SQL Server Insert if not exists

Question

I want to insert data into my table, but insert only data that doesn't already exist in my database.

Here is my code:

ALTER PROCEDURE [dbo].[EmailsRecebidosInsert]
  (@_DE nvarchar(50),
   @_ASSUNTO nvarchar(50),
   @_DATA nvarchar(30) )
AS
BEGIN
   INSERT INTO EmailsRecebidos (De, Assunto, Data)
   VALUES (@_DE, @_ASSUNTO, @_DATA)
   WHERE NOT EXISTS ( SELECT * FROM EmailsRecebidos 
                   WHERE De = @_DE
                   AND Assunto = @_ASSUNTO
                   AND Data = @_DATA);
END

And the error is:

Msg 156, Level 15, State 1, Procedure EmailsRecebidosInsert, Line 11
Incorrect syntax near the keyword 'WHERE'.

You should not rely on this check alone to prevent duplicates: it is not thread safe, and you will get duplicates when a race condition occurs. If you really need unique data, add a unique constraint to the table and then catch the unique constraint violation error. See this answer.
– GarethD, Commented Jan 7, 2014 at 12:54
You can use a MERGE statement, or IF NOT EXISTS (SELECT ...) BEGIN INSERT ... END.
– Abdul Hannan Ijaz, Commented Jan 20, 2016 at 6:50
Whether you should rely on this check depends on the scenario. If you are developing a deploy script that writes data to a "static" table, for example, this is not an issue.
– AxelWass, Commented Nov 9, 2016 at 16:48
@GarethD: what do you mean by "not thread safe"? It may not be elegant, but it looks correct to me. A single insert statement is always a single transaction. It's not as if SQL Server evaluates the subquery first and then, at some later point and without holding a lock, goes on to do the insert.
– Ed Avis, Commented Aug 17, 2017 at 11:40
@EdAvis That is exactly what happens. Unless you explicitly use a transaction and the UPDLOCK and HOLDLOCK query hints, the lock on EmailsRecebidos is released as soon as the check is done, moments before the write to the same table. In that split second, another thread can still read the table, assume the records don't exist, and hit the race condition. By using an explicit transaction and the locking hints, you can stop the lock on the table from being released after the SELECT statement finishes; the lock is then held until the transaction is committed.
– GarethD, Commented Aug 17, 2017 at 12:11
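
The transaction-plus-hints approach described in the last comment could look roughly like the following. This is a sketch, not the commenter's code: the table dbo.EmailsRecebidosDemo and the sample values are made up for illustration, and in practice the hints would go on the real table inside the stored procedure.

```sql
-- Hypothetical stand-in for the real EmailsRecebidos table.
CREATE TABLE dbo.EmailsRecebidosDemo
(
    De      nvarchar(50) NOT NULL,
    Assunto nvarchar(50) NOT NULL,
    Data    nvarchar(30) NOT NULL
);
GO

-- Sample values standing in for the procedure parameters.
DECLARE @_DE      nvarchar(50) = N'a@example.com',
        @_ASSUNTO nvarchar(50) = N'Hello',
        @_DATA    nvarchar(30) = N'2014-01-07';

BEGIN TRANSACTION;

-- UPDLOCK takes update locks during the check; HOLDLOCK keeps the
-- checked range locked until the transaction ends, so a concurrent
-- session running the same check has to wait.
IF NOT EXISTS (SELECT 1
               FROM dbo.EmailsRecebidosDemo WITH (UPDLOCK, HOLDLOCK)
               WHERE De = @_DE
                 AND Assunto = @_ASSUNTO
                 AND Data = @_DATA)
BEGIN
    INSERT INTO dbo.EmailsRecebidosDemo (De, Assunto, Data)
    VALUES (@_DE, @_ASSUNTO, @_DATA);
END;

COMMIT TRANSACTION;
```

The locks are only held because the check and the insert share one explicit transaction; without BEGIN TRANSACTION the hints would be released at the end of the SELECT.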

Community · Accepted Answer · 2017-05-23 12:18:15Z

524

Instead of the code below:

BEGIN
   INSERT INTO EmailsRecebidos (De, Assunto, Data)
   VALUES (@_DE, @_ASSUNTO, @_DATA)
   WHERE NOT EXISTS ( SELECT * FROM EmailsRecebidos 
                   WHERE De = @_DE
                   AND Assunto = @_ASSUNTO
                   AND Data = @_DATA);
END

use this:

BEGIN
   IF NOT EXISTS (SELECT * FROM EmailsRecebidos 
                   WHERE De = @_DE
                   AND Assunto = @_ASSUNTO
                   AND Data = @_DATA)
   BEGIN
       INSERT INTO EmailsRecebidos (De, Assunto, Data)
       VALUES (@_DE, @_ASSUNTO, @_DATA)
   END
END

Update (thanks to @Marc Durdin for pointing this out):

Note that under high load, this will still sometimes fail, because a second connection can pass the IF NOT EXISTS test before the first connection executes the INSERT, i.e. a race condition. See stackoverflow.com/a/3791506/1836776 for a good answer on why even wrapping in a transaction doesn't solve this.
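
One way to stay correct under that race, as suggested in the comments on the question, is to let a unique constraint reject the duplicate and swallow only that error. The sketch below assumes a hypothetical temp table #EmailsDemo and made-up sample values; 2627 and 2601 are SQL Server's duplicate-key error numbers, and THROW requires SQL Server 2012 or later.

```sql
-- Hypothetical demo table; the UNIQUE constraint is what actually
-- guarantees that no duplicate (De, Assunto, Data) row can exist.
CREATE TABLE #EmailsDemo
(
    De      nvarchar(50) NOT NULL,
    Assunto nvarchar(50) NOT NULL,
    Data    nvarchar(30) NOT NULL,
    UNIQUE (De, Assunto, Data)
);

DECLARE @_DE      nvarchar(50) = N'a@example.com',
        @_ASSUNTO nvarchar(50) = N'Hello',
        @_DATA    nvarchar(30) = N'2014-01-07';

BEGIN TRY
    IF NOT EXISTS (SELECT 1 FROM #EmailsDemo
                   WHERE De = @_DE AND Assunto = @_ASSUNTO AND Data = @_DATA)
    BEGIN
        INSERT INTO #EmailsDemo (De, Assunto, Data)
        VALUES (@_DE, @_ASSUNTO, @_DATA);
    END;
END TRY
BEGIN CATCH
    -- 2627: unique/primary key constraint violation
    -- 2601: duplicate key row in a unique index
    -- Anything else is a real error and is re-raised.
    IF ERROR_NUMBER() NOT IN (2627, 2601)
        THROW;
END CATCH;

DROP TABLE #EmailsDemo;
```

The IF NOT EXISTS check then only avoids raising the error in the common case; the constraint is what prevents duplicates when two connections pass the check at the same time.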

edited May 23, 2017 at 12:18

CommunityBot

11 silver badge

answered Jan 7, 2014 at 12:35

I A Khan

8,92716 gold badges57 silver badges78 bronze badges


9 Comments

Marc Durdin Over a year ago

Note that under high load, this will still sometimes fail, because a second connection can pass the IF NOT EXISTS test before the first connection executes the INSERT, i.e. a race condition. See stackoverflow.com/a/3791506/1836776 for a good answer on why even wrapping in a transaction doesn't solve this.

Reno Over a year ago

Using 1 instead of * would be more efficient: SELECT 1 FROM EmailsRecebidos WHERE De = @_DE AND Assunto = @_ASSUNTO AND Data = @_DATA

Kevin Finkenbinder Over a year ago

Put a write lock around the whole thing and then you won't have any chance of duplicates.

Loudenvier Over a year ago

@jazzcat select * in this case makes no difference whatsoever because it's being used in an EXISTS clause. SQL Server will always optimize it and has been doing it for ages. Since I'm very old I usually write these queries as EXISTS (SELECT 1 FROM...) but it is not needed anymore.
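
A commonly cited way to see that the select list inside EXISTS is not evaluated is to put an expression there that would fail if it were. This is only an illustrative sketch, not something from the thread:

```sql
-- On its own, SELECT 1/0 raises a divide-by-zero error.
-- Inside EXISTS only the presence of a row matters, so the
-- expression in the select list is never computed.
IF EXISTS (SELECT 1/0)
    PRINT 'EXISTS did not evaluate the select list';
```

If the batch completes and prints the message, the choice between SELECT * and SELECT 1 inside EXISTS is a matter of style rather than performance.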

drowa Over a year ago

Why does this kind of simple question generate more doubt than certainty?


score 159 · Accepted Answer · 2016-01-20 06:15:25Z

159

For those looking for the fastest way, I recently came across these benchmarks where apparently using "INSERT SELECT... EXCEPT SELECT..." turned out to be the fastest for 50 million records or more.

Here's some sample code from the article (the 3rd block of code was the fastest):

INSERT INTO #table1 (Id, guidd, TimeAdded, ExtraData)
SELECT Id, guidd, TimeAdded, ExtraData
FROM #table2
WHERE NOT EXISTS (Select Id, guidd From #table1 WHERE #table1.id = #table2.id)
-----------------------------------
MERGE #table1 as [Target]
USING  (select Id, guidd, TimeAdded, ExtraData from #table2) as [Source]
(id, guidd, TimeAdded, ExtraData)
    on [Target].id =[Source].id
WHEN NOT MATCHED THEN
    INSERT (id, guidd, TimeAdded, ExtraData)
    VALUES ([Source].id, [Source].guidd, [Source].TimeAdded, [Source].ExtraData);
------------------------------
INSERT INTO #table1 (id, guidd, TimeAdded, ExtraData)
SELECT id, guidd, TimeAdded, ExtraData from #table2
EXCEPT
SELECT id, guidd, TimeAdded, ExtraData from #table1
------------------------------
INSERT INTO #table1 (id, guidd, TimeAdded, ExtraData)
SELECT #table2.id, #table2.guidd, #table2.TimeAdded, #table2.ExtraData
FROM #table2
LEFT JOIN #table1 on #table1.id = #table2.id
WHERE #table1.id is null

edited Jan 20, 2016 at 6:15

answered Jan 20, 2016 at 6:10

user4023224

7 Comments

Bryan Over a year ago

I like EXCEPT SELECT

Aasish Kr. Sharma Over a year ago

But EXCEPT may not be efficient for bulk operations.

user4023224 Over a year ago

@Biswa: Not according to those benchmarks. The code is available from the site. Feel free to run it on your system to see how the results compare.

Alex from Jitbit Over a year ago

Beware: those benchmarks test "inserting from another table", not inserting explicit values as requested in the question.

Jason C Over a year ago

I've been playing with these. The downside of the EXCEPT form seems to be that it only skips the insert if every field in the existing record matches the record being inserted, which might be fine. But I just wanted to skip the insert if the id existed, regardless of the other values, so I had to go with the first form. Super cool, though; I had never used EXCEPT before.
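
The difference described in this comment can be reproduced with two small table variables. The tables, columns, and values below are invented for the illustration:

```sql
DECLARE @existing TABLE (id int, name nvarchar(20));
DECLARE @incoming TABLE (id int, name nvarchar(20));

INSERT INTO @existing (id, name) VALUES (1, N'old name');
INSERT INTO @incoming (id, name) VALUES (1, N'new name'), (2, N'other');

-- EXCEPT compares whole rows. (1, 'new name') differs from
-- (1, 'old name'), so id 1 is still returned as a row to insert.
SELECT id, name FROM @incoming
EXCEPT
SELECT id, name FROM @existing;

-- NOT EXISTS compares only the columns named in the subquery,
-- so id 1 is skipped no matter what the other columns contain.
SELECT i.id, i.name
FROM @incoming AS i
WHERE NOT EXISTS (SELECT 1 FROM @existing AS e WHERE e.id = i.id);
```

The first query returns both incoming rows; the second returns only the row with id 2. With a primary key on id, the EXCEPT form would therefore try to insert a duplicate key.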


Malcolm Swaine · Accepted Answer · 2020-04-26 19:27:26Z

67

Different SQL, same principle: the row is inserted only when the subquery in WHERE NOT EXISTS returns no rows.

INSERT INTO FX_USDJPY
            (PriceDate, 
            PriceOpen, 
            PriceLow, 
            PriceHigh, 
            PriceClose, 
            TradingVolume, 
            TimeFrame)
    SELECT '2014-12-26 22:00',
           120.369000000000,
           118.864000000000,
           120.742000000000,
           120.494000000000,
           86513,
           'W'
    WHERE NOT EXISTS
        (SELECT 1
         FROM FX_USDJPY
         WHERE PriceDate = '2014-12-26 22:00'
           AND TimeFrame = 'W')

answered Apr 26, 2020 at 19:27

Malcolm Swaine

2,31226 silver badges16 bronze badges

6 Comments

Fabio Pagano Over a year ago

Very elegant solution, this should also solve the race condition problem.

Andrew Over a year ago

@FabioPagano how?

Rob Over a year ago

Because it's in the query itself instead of ahead of it in the procedure. So there is no way this value could get duplicated as the existence is checked during the insert.

ojonasplima Over a year ago

I don't know about performance, but that works fine.

Hamish Moffatt Over a year ago

Does it work if the table is empty? Because the sub-select won't return any rows, so nothing will be inserted.
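
On the empty-table question: an empty subquery result makes NOT EXISTS true, so the outer SELECT still produces its one row. A small sketch with a hypothetical temp table #Prices, cut down to two columns:

```sql
CREATE TABLE #Prices
(
    PriceDate datetime2(0) NOT NULL,
    TimeFrame char(1)      NOT NULL
);

-- The table is empty: the subquery returns no rows, NOT EXISTS is
-- true, and the constant row is inserted.
INSERT INTO #Prices (PriceDate, TimeFrame)
SELECT '2014-12-26 22:00', 'W'
WHERE NOT EXISTS (SELECT 1 FROM #Prices
                  WHERE PriceDate = '2014-12-26 22:00' AND TimeFrame = 'W');

-- Same statement again: the row now exists, so nothing is inserted.
INSERT INTO #Prices (PriceDate, TimeFrame)
SELECT '2014-12-26 22:00', 'W'
WHERE NOT EXISTS (SELECT 1 FROM #Prices
                  WHERE PriceDate = '2014-12-26 22:00' AND TimeFrame = 'W');

SELECT COUNT(*) AS RowsInTable FROM #Prices;  -- 1

DROP TABLE #Prices;
```

The outer SELECT has no FROM clause, so it does not depend on the table's contents; only the NOT EXISTS predicate does.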


Dale K · Accepted Answer · 2020-03-17 09:35:06Z

34

I would use a merge:

create PROCEDURE [dbo].[EmailsRecebidosInsert]
  (@_DE nvarchar(50),
   @_ASSUNTO nvarchar(50),
   @_DATA nvarchar(30) )
AS
BEGIN
   with data as (select @_DE as de, @_ASSUNTO as assunto, @_DATA as data)
   merge EmailsRecebidos t
   using data s
      on s.de = t.de
     and s.assunto = t.assunto
     and s.data = t.data
    when not matched by target
    then insert (de, assunto, data) values (s.de, s.assunto, s.data);
END

edited Mar 17, 2020 at 9:35

Dale K

28.1k15 gold badges59 silver badges85 bronze badges

answered Jan 7, 2014 at 12:46

Brett Schneider

4,1032 gold badges19 silver badges33 bronze badges

3 Comments

jokab Over a year ago

I'm going with this because it's fancier.

Don Sam Over a year ago

I would love to use merge...but it does not work for Memory Optimized Tables.

2br-2b Over a year ago

Merge also allows you to update already-existing rows should the need arise, for example, if you're updating a list of groceries with new prices
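
The grocery example from this comment might be sketched as below. The temp table #Groceries, its columns, and the prices are all hypothetical:

```sql
CREATE TABLE #Groceries
(
    Item  nvarchar(50)  NOT NULL PRIMARY KEY,
    Price decimal(9, 2) NOT NULL
);

INSERT INTO #Groceries (Item, Price) VALUES (N'Milk', 1.20);

MERGE #Groceries AS t
USING (VALUES (N'Milk', 1.35), (N'Bread', 2.10)) AS s (Item, Price)
   ON t.Item = s.Item
WHEN MATCHED THEN
    -- The item already exists: update its price.
    UPDATE SET t.Price = s.Price
WHEN NOT MATCHED BY TARGET THEN
    -- New item: insert it.
    INSERT (Item, Price) VALUES (s.Item, s.Price);

-- Milk now costs 1.35 and Bread has been added at 2.10.
SELECT Item, Price FROM #Groceries ORDER BY Item;

DROP TABLE #Groceries;
```

Dropping the WHEN MATCHED branch turns this back into the insert-only form used in the answer.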

Turnip · Accepted Answer · 2015-03-26 16:46:40Z

30

Try the code below:

ALTER PROCEDURE [dbo].[EmailsRecebidosInsert]
  (@_DE nvarchar(50),
   @_ASSUNTO nvarchar(50),
   @_DATA nvarchar(30) )
AS
BEGIN
   INSERT INTO EmailsRecebidos (De, Assunto, Data)
   select @_DE, @_ASSUNTO, @_DATA
   EXCEPT
   SELECT De, Assunto, Data from EmailsRecebidos
END

edited Mar 26, 2015 at 16:46

Turnip

36.8k15 gold badges92 silver badges118 bronze badges

answered Jan 7, 2014 at 12:56

SaravanaC

4263 silver badges4 bronze badges

1 Comment

Abdelghani AINOUSS Over a year ago

Please provide a nice explanation of why your code works!

Dale K · Accepted Answer · 2020-03-17 09:34:47Z

23

I did the same thing with SQL Server 2012 and it worked

Insert into #table1 With (ROWLOCK) (Id, studentId, name)
SELECT '18769', '2', 'Alex'
WHERE not exists (select * from #table1 where Id = '18769' and studentId = '2')

edited Mar 17, 2020 at 9:34

Dale K

28.1k15 gold badges59 silver badges85 bronze badges

answered Aug 16, 2016 at 11:13

Hovhannes Babayan

3824 silver badges12 bronze badges

3 Comments

drowa Over a year ago

Of course it worked, you are using a temporary table (i.e. you don't need to worry about concurrency when using temporary tables).

Blaisem Over a year ago

I got a "from not at the expected position" error

Hovhannes Babayan Over a year ago

what code are you writing and which SQL server do you use?

Leon Tayson · Accepted Answer · 2021-09-23 03:10:36Z

23

Just change your code to use SELECT instead of VALUES

   INSERT INTO EmailsRecebidos (De, Assunto, Data)
   SELECT @_DE, @_ASSUNTO, @_DATA
   WHERE NOT EXISTS (SELECT * FROM EmailsRecebidos 
                   WHERE De = @_DE
                   AND Assunto = @_ASSUNTO
                   AND Data = @_DATA);

answered Sep 23, 2021 at 3:10

Leon Tayson

5,0517 gold badges41 silver badges36 bronze badges

4 Comments

gordy Over a year ago

only correct answer here, incredible

Konrad Viltersten Over a year ago

To me, it looks like two, separate statements: one inserting regardless of the current state of the DB and another one selecting 3 columns based on some sub-query. Naturally, I assume I'm mistaken, which leads to the question what I'm missing. Please advise.

BenderBoy Over a year ago

@KonradViltersten it is a single statement. INSERT INTO can’t function without any values (try it out, it’s a syntax error), so whatever comes next will be interpreted as the values to be inserted. This can be a VALUES clause with static values or, as in this case, the result set of a fully featured SELECT.

Konrad Viltersten Over a year ago

@BenderBoy Woosh! I just saw it. I knew I was missing something! Can you imagine I've been doing backend for 17 years or so and never got there?! I always use values(...) and put in variables declared based on previous queries. This was an epiphany. Darn... Thanks, mate!

marc_s · Accepted Answer · 2014-01-07 12:33:19Z

15

The INSERT command doesn't have a WHERE clause - you'll have to write it like this:

ALTER PROCEDURE [dbo].[EmailsRecebidosInsert]
  (@_DE nvarchar(50),
   @_ASSUNTO nvarchar(50),
   @_DATA nvarchar(30) )
AS
BEGIN
   IF NOT EXISTS (SELECT * FROM EmailsRecebidos 
                   WHERE De = @_DE
                   AND Assunto = @_ASSUNTO
                   AND Data = @_DATA)
   BEGIN
       INSERT INTO EmailsRecebidos (De, Assunto, Data)
       VALUES (@_DE, @_ASSUNTO, @_DATA)
   END
END

answered Jan 7, 2014 at 12:33

marc_s

760k186 gold badges1.4k silver badges1.5k bronze badges

5 Comments

Filip De Vos Over a year ago

You need to handle errors for this procedure because there will be cases where an insert will happen between the check and insert.

marc_s Over a year ago

@FilipDeVos: true - a possibility, maybe not very likely, but still a possibility. Good point.

David Over a year ago

What if you wrap both within a transaction? Would that block the possibility? (I'm no expert on transactions, so please forgive if this is a stupid question.)

Marc Durdin Over a year ago

See stackoverflow.com/a/3791506/1836776 for a good answer on why a transaction doesn't solve this, @David.

Wessam El Mahdy Over a year ago

In the IF statement, BEGIN and END are not needed when the body is a single statement, even if that statement spans more than one line, so you can omit them here.

codejockie · Accepted Answer · 2022-02-27 16:19:04Z

12

If your clustered index consists of only those fields, the simple, fast, and reliable option is IGNORE_DUP_KEY.

If you create the unique clustered index with IGNORE_DUP_KEY = ON, you can just use:

INSERT INTO EmailsRecebidos (De, Assunto, Data) VALUES (@_DE, @_ASSUNTO, @_DATA)

This should be safe in all cases!
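
For reference, creating such an index could look like the sketch below. The temp table #EmailsDemo and the index name IX_EmailsDemo are made up for the illustration; IGNORE_DUP_KEY is only allowed on unique indexes.

```sql
CREATE TABLE #EmailsDemo
(
    De      nvarchar(50) NOT NULL,
    Assunto nvarchar(50) NOT NULL,
    Data    nvarchar(30) NOT NULL
);

-- Duplicate keys are discarded with a warning instead of failing the
-- whole statement with an error.
CREATE UNIQUE CLUSTERED INDEX IX_EmailsDemo
    ON #EmailsDemo (De, Assunto, Data)
    WITH (IGNORE_DUP_KEY = ON);

INSERT INTO #EmailsDemo (De, Assunto, Data)
VALUES (N'a@example.com', N'Hello', N'2014-01-07');

-- Same key again: the row is skipped and SQL Server reports
-- "Duplicate key was ignored."
INSERT INTO #EmailsDemo (De, Assunto, Data)
VALUES (N'a@example.com', N'Hello', N'2014-01-07');

SELECT COUNT(*) AS RowsInTable FROM #EmailsDemo;  -- 1

DROP TABLE #EmailsDemo;
```

Because the uniqueness check happens inside the index itself, this does not depend on a separate existence test that another connection could slip past.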

edited Feb 27, 2022 at 16:19

codejockie

11.1k5 gold badges52 silver badges58 bronze badges

answered Oct 28, 2020 at 15:40

Alexander Bartosh

9,0451 gold badge24 silver badges24 bronze badges

1 Comment

Christoph Over a year ago

I didn't know this one, thank you very much, that is such a neat solution!

Don · Accepted Answer · 2014-01-07 12:50:55Z

9

Depending on your version of SQL Server (2012?), besides IF NOT EXISTS you can also use MERGE, like so:

ALTER PROCEDURE [dbo].[EmailsRecebidosInsert]
    ( @_DE nvarchar(50)
    , @_ASSUNTO nvarchar(50)
    , @_DATA nvarchar(30))
AS BEGIN
    MERGE [dbo].[EmailsRecebidos] [Target]
    USING (VALUES (@_DE, @_ASSUNTO, @_DATA)) [Source]([De], [Assunto], [Data])
         ON [Target].[De] = [Source].[De] AND [Target].[Assunto] = [Source].[Assunto] AND [Target].[Data] = [Source].[Data]
     WHEN NOT MATCHED THEN
        INSERT ([De], [Assunto], [Data])
        VALUES ([Source].[De], [Source].[Assunto], [Source].[Data]);
END

answered Jan 7, 2014 at 12:50

Don

9,6814 gold badges28 silver badges25 bronze badges

Comments

Qureshi Taha · Accepted Answer · 2024-01-29 07:14:51Z

0

I found another way to do this: INSERT IGNORE

INSERT IGNORE INTO `admin` (`name`, `email`, `password`) VALUES ('admin', '[email protected]', 'admin@123');

Try that, or use this:

INSERT INTO `admin` (`name`, `email`, `password`) VALUES ('admin', '[email protected]', 'admin@123') ON DUPLICATE KEY UPDATE `name` = VALUES(`name`), `password` = VALUES(`password`);

answered Jan 29, 2024 at 7:14

Qureshi Taha

554 bronze badges

1 Comment

gneric Over a year ago

Not supported in SQL Server; that's for MySQL.

Dale K · Accepted Answer · 2020-03-17 09:33:45Z

-1

You could use the GO command. That will restart the execution of SQL statements after an error. In my case I have a few 1000 INSERT statements, where a handful of those records already exist in the database, I just don't know which ones. I found that after processing a few 100, execution just stops with an error message that it can't INSERT as the record already exists. Quite annoying, but putting a GO solved this. It may not be the fastest solution, but speed was not my problem.

GO
INSERT INTO mytable (C1,C2,C3) VALUES(1,2,3)
GO
INSERT INTO mytable (C1,C2,C3) VALUES(4,5,6)
 etc ...

edited Mar 17, 2020 at 9:33

Dale K

28.1k15 gold badges59 silver badges85 bronze badges

answered May 17, 2018 at 16:19

mljm

3373 silver badges13 bronze badges

1 Comment

Dale K Over a year ago

GO is a batch separator? It doesn't assist with preventing duplicate records.

Jay · Accepted Answer · 2022-05-06 00:33:36Z

-2

If you want to check whether a key exists or not, you can use:

INSERT INTO tableName (...) VALUES (...) 
ON DUPLICATE KEY 
UPDATE ...

Using this, if there is already an entry for the particular key, then it will UPDATE, else, it will INSERT.

answered May 6, 2022 at 0:33

Jay

951 silver badge3 bronze badges

1 Comment

Diego Perez Over a year ago

Hello @Jay, if I'm not wrong, "ON DUPLICATE KEY" only works in MySql, and not SQL Server.

Dale K · Accepted Answer · 2020-03-17 09:33:18Z

-4

Execute the queries below and verify the behavior for yourself.

CREATE TABLE `table_name` (
  `id` int(11) NOT NULL auto_increment,
  `name` varchar(255) NOT NULL,
  `address` varchar(255) NOT NULL,
  `tele` varchar(255) NOT NULL,
  PRIMARY KEY  (`id`)
) ENGINE=InnoDB;

Insert a record:

INSERT INTO table_name (name, address, tele)
SELECT * FROM (SELECT 'Nazir', 'Kolkata', '033') AS tmp
WHERE NOT EXISTS (
    SELECT name FROM table_name WHERE name = 'Nazir'
) LIMIT 1;
Query OK, 1 row affected (0.00 sec)
Records: 1 Duplicates: 0 Warnings: 0

SELECT * FROM `table_name`;

+----+--------+-----------+------+
| id | name   | address   | tele |
+----+--------+-----------+------+
|  1 | Nazir  | Kolkata   | 033  |
+----+--------+-----------+------+

Now, try to insert the same record again:

INSERT INTO table_name (name, address, tele)
SELECT * FROM (SELECT 'Nazir', 'Kolkata', '033') AS tmp
WHERE NOT EXISTS (
    SELECT name FROM table_name WHERE name = 'Nazir'
) LIMIT 1;

Query OK, 0 rows affected (0.00 sec)
Records: 0  Duplicates: 0  Warnings: 0

+----+--------+-----------+------+
| id | name   | address   | tele |
+----+--------+-----------+------+
|  1 | Nazir  | Kolkata   | 033  |
+----+--------+-----------+------+

Insert a different record:

INSERT INTO table_name (name, address, tele)
SELECT * FROM (SELECT 'Santosh', 'Kestopur', '044') AS tmp
WHERE NOT EXISTS (
    SELECT name FROM table_name WHERE name = 'Santosh'
) LIMIT 1;

Query OK, 1 row affected (0.00 sec)
Records: 1 Duplicates: 0 Warnings: 0

SELECT * FROM `table_name`;

+----+--------+-----------+------+
| id | name   | address   | tele |
+----+--------+-----------+------+
|  1 | Nazir  | Kolkata   | 033  |
|  2 | Santosh| Kestopur  | 044  |
+----+--------+-----------+------+

edited Mar 17, 2020 at 9:33

Dale K

28.1k15 gold badges59 silver badges85 bronze badges

answered Dec 5, 2018 at 9:44

Vadiraj S J

7271 gold badge10 silver badges18 bronze badges

2 Comments

Douglas Gaskell Over a year ago

Isn't this for MySQL and the question is for SQL Server?

Vadiraj S J Over a year ago

Yes, it's for MySQL.
