How to delete duplicate rows

Question

Some rows share same primary keys(ID) but the rest of the row maybe different. For example

ID   Age   Info
2    21    2763
2    21    6276
3    31    82756

In this case, both the first and second rows has same ID and Age, but different Info. What I want do with duplicate ID rows is to randomly keep one of them and delete the others. I have so many this kind of records in my Data Sets so I can not delete them one by one. Is there any solutions? Thanks

How can a PK allow duplicate values? Anyway, you want to remove duplicate IDs, right? or is it a combination of ID and Age that has to be treated as duplicate? — Adish
– Adish, Commented Nov 13, 2015 at 16:24
@Adish Remove duplicate IDs is good enough for my case Thanks — Gavin Niu
– Gavin Niu, Commented Nov 13, 2015 at 16:28

Giorgos Betsos · Accepted Answer · 2015-11-13 17:11:22Z

1

Try this:

DELETE t1
FROM mytable AS t1
INNER JOIN mytable AS t2 
ON t1.ID = t2.ID AND t1.Age = t2.Age AND t1.Info > t2.Info

The above should work in MySQL, SQL Server. The statement deletes all rows in a (ID, Age) slice but the one having the smallest Info value.

Note: The above works provided that Info values are unique per (ID, Age) slice.

edited Nov 13, 2015 at 17:11

answered Nov 13, 2015 at 16:38

Giorgos Betsos

72.3k10 gold badges69 silver badges103 bronze badges

Sign up to request clarification or add additional context in comments.

3 Comments

Gavin Niu Over a year ago

Thanks for your answer, What are t1 and t2?

Giorgos Betsos Over a year ago

@GavinNiu They are table aliases

Adish Over a year ago

This will not delete rows where IDs match but Age does not. This will not delete rows where all three columns are identical.

Giorgi Nakeuri · Accepted Answer · 2015-11-13 17:18:08Z

1

With window function:

;with cte as(select *, row_number() over(partition by id order by info) rn 
             from table)
delete from cte where rn <> 1

answered Nov 13, 2015 at 17:18

Giorgi Nakeuri

35.9k8 gold badges50 silver badges78 bronze badges

Comments

Sergio S · Accepted Answer · 2015-11-13 16:02:22Z

0

I think you're looking for something like this:

delete from TableName where info not in 
(select min(info) from TableName group by ID,Age);

try the select statement first to make sure it's returning the right rows then add the delete part to it

answered Nov 13, 2015 at 16:02

Sergio S

1927 bronze badges

11 Comments

Gavin Niu Over a year ago

Let me try! Thanks for your response!

Bacon Bits Over a year ago

This will only work if info is unique. A row of ID = 2, Info = 82756 would throw it off.

Sergio S Over a year ago

Correct, the assumption per the example is that Info is unique per grouped ID and Age.

Giorgos Betsos Over a year ago

For this query to work Info must be unique on table level

Gavin Niu Over a year ago

Yes, it works with the example, but in my real case info is not unique. I apologize that I gave a bad example..

|

Adish · Accepted Answer · 2015-11-13 17:03:13Z

I would have suggested a set based solution, but I could not get to take care of rows in which all 3 rows are identical. Therefore suggesting a solution that uses ROWCOUNT and a while loop. The ROWCOUNT will ensure that only 1 record is deleted at a time. The while loop is so that you don't have to do it manually one by one.

SET ROWCOUNT 1

DECLARE @ctr INT
SELECT TOP 1 @ctr = COUNT(*) FROM table GROUP BY ID HAVING COUNT(*) > 1 ORDER BY COUNT(*) desc
SELECT @ctr
WHILE @ctr > 1
BEGIN
    DELETE FROM table WHERE ID IN (SELECT ID FROM table GROUP BY ID HAVING COUNT(*) > 1)
    SELECT @ctr = NULL
    SELECT TOP 1 @ctr = COUNT(*) FROM table GROUP BY ID HAVING COUNT(*) > 1 ORDER BY COUNT(*) desc
If @Ctr IS NULL
    Break
ELSE
    Continue
END
SET ROWCOUNT 0

You can alter the order by clause in the delete statement to suit your requirement.

Collectives™ on Stack Overflow

How to delete duplicate rows

4 Answers 4

3 Comments

Comments

11 Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

3 Comments

Comments

11 Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related