How to remove duplicate rows with the same column in SQL

Question

I have the following table in SQL:

I want to remove rows with the same id value.

Which query in SQL should I use?

The table should be as follows after the delete operation:

What is your rule for deciding which row to delete?

– NickW, Commented May 3, 2023 at 20:17
It doesn't matter. I just need one.

– user, Commented May 3, 2023 at 20:19
Which RDBMS are you using? MS SQL Server?

– Tim Schmelter, Commented May 3, 2023 at 21:45

Kostas Nitaf · Accepted Answer · 2023-05-03 20:40:28Z

2

You can select one row per id by using GROUP BY:

SELECT MAX(a), id
FROM Table
GROUP BY id

If those are the rows you want to keep in your table, you can delete everything else:

DELETE FROM Table
WHERE a NOT IN (
    SELECT MAX(a)
    FROM Table
    GROUP BY id
)
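
To try the delete above without a database server, here is a minimal sketch using Python's built-in sqlite3 module. The table name t and the sample rows are hypothetical, since the question's table is not shown, and the approach assumes that a is unique across the table and never NULL.

```python
import sqlite3

# Hypothetical sample data: ids 10 and 20 each appear twice.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (a INTEGER, id INTEGER, c TEXT)")
conn.executemany(
    "INSERT INTO t (a, id, c) VALUES (?, ?, ?)",
    [(1, 10, "x"), (2, 10, "y"), (3, 20, "z"), (4, 20, "w"), (5, 30, "v")],
)

# Keep the row with the largest a for each id and delete the others.
conn.execute("DELETE FROM t WHERE a NOT IN (SELECT MAX(a) FROM t GROUP BY id)")

remaining = conn.execute("SELECT a, id, c FROM t ORDER BY id").fetchall()
print(remaining)
```

If a is not unique, a row from one id group can survive only because its a value happens to be the maximum of a different group, so check that assumption before running this on real data.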


Paranoid0X0x0 · Accepted Answer · 2023-05-03 20:34:42Z

1

Use the DISTINCT keyword:

SELECT DISTINCT a, id, c FROM table_name;

The DISTINCT keyword ensures that only unique rows are returned in the query result. It compares all selected columns and does not change the table itself.
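
As a rough illustration of what DISTINCT does and does not do, the sketch below runs the query with Python's sqlite3 module. The table name t and the rows are made up for the example: exact copies collapse in the result, rows that share an id but differ in a are both returned, and the stored table is left unchanged.

```python
import sqlite3

# Hypothetical sample data: one exact duplicate plus one row sharing only the id.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (a INTEGER, id INTEGER, c TEXT)")
conn.executemany(
    "INSERT INTO t (a, id, c) VALUES (?, ?, ?)",
    [(1, 10, "x"), (1, 10, "x"), (2, 10, "y")],
)

# DISTINCT compares whole rows of the result set.
distinct_rows = conn.execute(
    "SELECT DISTINCT a, id, c FROM t ORDER BY a"
).fetchall()

# The table still holds every row; nothing was deleted.
stored_rows = conn.execute("SELECT COUNT(*) FROM t").fetchone()[0]

print(distinct_rows)
print(stored_rows)
```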

Or use GROUP BY with a HAVING clause to find the rows that occur more than once:

SELECT a,id,c, COUNT(*) AS CNT FROM [SampleDB].[dbo].[Employee]
GROUP BY a,id,c HAVING COUNT(*) > 1;

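To delete the duplicates and keep the row with the highest a in each group (group by id alone if rows count as duplicates whenever they share an id):

DELETE FROM [SampleDB].[dbo].[Employee]
WHERE a NOT IN (
    SELECT MAX(a) AS MaxRecordID
    FROM [SampleDB].[dbo].[Employee]
    GROUP BY id, c
);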

An explanation is given in the source below. If it does not work, read the source and adapt the query accordingly.

Source: https://www.sqlshack.com/different-ways-to-sql-delete-duplicate-rows-from-a-sql-table/

edited May 3, 2023 at 20:34

answered May 3, 2023 at 20:21


2 Comments

user Over a year ago

Thanks for your answer! But how do I delete the rows?

Paranoid0X0x0 Over a year ago

@user Edited to cover deleting.

Mega · Accepted Answer · 2023-05-03 20:43:57Z

1

Let's say the table is called t. One way to remove duplicates from the query result is to use DISTINCT, which compares all selected columns, so in your case:

SELECT DISTINCT id, a, c FROM t;

Bye!


marc_s · Accepted Answer · 2023-05-04 04:35:27Z

1

If you use SQL Server, I'd prefer a common table expression with an OVER clause, since it makes the SQL query readable and maintainable:

WITH CTE AS 
(
   SELECT 
       [a], [id], [c], 
       RN = ROW_NUMBER() OVER (PARTITION BY id ORDER BY a)
   FROM dbo.TableName
)
DELETE FROM CTE 
WHERE RN > 1

For example, you can easily change it to a SELECT to see what will be deleted:

WITH CTE AS
(
   SELECT 
       [a], [id], [c], 
       RN = ROW_NUMBER() OVER (PARTITION BY id ORDER BY a)
   FROM dbo.TableName
)
SELECT * 
FROM CTE 
WHERE RN > 1

You can also easily control which row is kept. In this case I keep the first row of each id group, ordered by a.
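
Deleting through a CTE as shown is SQL Server syntax. For experimenting locally, the sketch below adapts the same ROW_NUMBER idea to SQLite through Python's sqlite3 module. This is a swapped-in variant, not the answer's query: SQLite cannot delete from a CTE, so the rows are matched by SQLite's implicit rowid instead. The table name t and the sample rows are hypothetical, and window functions need SQLite 3.25 or newer.

```python
import sqlite3

# Hypothetical sample data: ids 10 and 20 each appear twice.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (a INTEGER, id INTEGER, c TEXT)")
conn.executemany(
    "INSERT INTO t (a, id, c) VALUES (?, ?, ?)",
    [(1, 10, "x"), (2, 10, "y"), (3, 20, "z"), (4, 20, "w"), (5, 30, "v")],
)

# Number the rows inside each id group, lowest a first.
numbered = """
    SELECT rowid AS rid, a, id, c,
           ROW_NUMBER() OVER (PARTITION BY id ORDER BY a) AS rn
    FROM t
"""

# Preview: every row numbered 2 or higher is a duplicate that would go.
to_delete = conn.execute(
    f"SELECT a, id, c FROM ({numbered}) WHERE rn > 1 ORDER BY a"
).fetchall()

# Delete those rows, matching them by rowid.
conn.execute(
    f"DELETE FROM t WHERE rowid IN (SELECT rid FROM ({numbered}) WHERE rn > 1)"
)

kept = conn.execute("SELECT a, id, c FROM t ORDER BY id").fetchall()
print(to_delete)
print(kept)
```

Changing the ORDER BY inside the OVER clause, for example to ORDER BY a DESC, changes which row of each group gets number 1 and therefore which row is kept.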

edited May 4, 2023 at 4:35 by marc_s

answered May 3, 2023 at 21:49 by Tim Schmelter
