How to keep only one row of a table, removing duplicate rows?

Question

I have a table that has a lot of duplicates in the Name column. I'd like to only keep one row for each.

The following lists the duplicates, but I don't know how to delete the duplicates and just keep one:

SELECT name FROM members GROUP BY name HAVING COUNT(*) > 1;

Thank you.

Also see Deleting duplicate rows from sqlite database.

– jww, Commented Nov 2, 2019 at 2:53

Community · Accepted Answer · 2017-05-23 12:10:15Z

61

See the following question: Deleting duplicate rows from a table.

The adapted accepted answer from there (which is my answer, so no "theft" here...):

You can do it in a simple way assuming you have a unique ID field: you can delete all records that are the same except for the ID, but don't have "the minimum ID" for their name.

Example query:

DELETE FROM members
WHERE ID NOT IN
(
    SELECT MIN(ID)
    FROM members
    GROUP BY name
)

In case you don't have a unique index, my recommendation is to simply add an auto-incremental unique index. Mainly because it's good design, but also because it will allow you to run the query above.
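As a minimal sketch of the query above running against SQLite, driven from Python's standard sqlite3 module. The members(id, name) layout and the sample names are assumptions made up for illustration, not taken from the question:

```python
import sqlite3

# In-memory database with a hypothetical members(id, name) table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE members (id INTEGER PRIMARY KEY, name TEXT)")
conn.executemany(
    "INSERT INTO members (name) VALUES (?)",
    [("alice",), ("bob",), ("alice",), ("carol",), ("bob",), ("alice",)],
)

# Keep the row with the smallest id for each name and delete the rest.
conn.execute(
    """
    DELETE FROM members
    WHERE id NOT IN (SELECT MIN(id) FROM members GROUP BY name)
    """
)
conn.commit()

remaining = conn.execute("SELECT id, name FROM members ORDER BY id").fetchall()
print(remaining)  # [(1, 'alice'), (2, 'bob'), (4, 'carol')]
```

Using MAX(id) instead of MIN(id) would keep the most recently inserted row of each group rather than the oldest.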

edited May 23, 2017 at 12:10

CommunityBot

answered Aug 17, 2009 at 9:01

Roee Adler


5 Comments

Gulbahar Over a year ago

Here's how I understand the above: it groups the rows by name (a group of one if the name is unique, several rows in one group if it is duplicated), selects the smallest ID from each group, and then deletes any row whose ID is not in that set of smallest IDs. Brilliant :) Thanks much Rax.

David LeBauer Over a year ago

In MySQL I get the following error when sending this query: "ERROR 1093 (HY000): You can't specify target table 'members' for update in FROM clause". Any ideas?

David LeBauer Over a year ago

The problem was that 'members' was both the field and table name. This is what worked: delete from members where id not in (select min(id) from (select * from members) as x group by name)

mach2 Over a year ago

Can we do DELETE FROM members WHERE ID NOT IN ( SELECT name FROM members GROUP BY name HAVING COUNT(*) > 1; )

xamgore Over a year ago

ROWID (instead of name) can be used to differentiate rows if they are completely identical.
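To illustrate the ROWID suggestion, here is a sketch that assumes a members table with no id column at all (table layout and sample data are made up). Ordinary SQLite tables have an implicit rowid; tables declared WITHOUT ROWID do not, so this would not apply to them:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# No explicit id column: SQLite still assigns each row an implicit rowid.
conn.execute("CREATE TABLE members (name TEXT)")
conn.executemany(
    "INSERT INTO members (name) VALUES (?)",
    [("alice",), ("alice",), ("bob",), ("alice",)],
)

# Same idea as the accepted answer, with rowid standing in for the ID column.
conn.execute(
    """
    DELETE FROM members
    WHERE rowid NOT IN (SELECT MIN(rowid) FROM members GROUP BY name)
    """
)
conn.commit()

names = [row[0] for row in conn.execute("SELECT name FROM members ORDER BY name")]
print(names)  # ['alice', 'bob']
```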

Paul Dixon · Accepted Answer · 2009-08-17 08:53:01Z

4

It would probably be easier to select the unique ones into a new table, drop the old table, then rename the temp table to replace it.

#create a table with same schema as members
CREATE TABLE tmp (...);

#insert the unique records
INSERT INTO tmp SELECT * FROM members GROUP BY name;

#swap it in
RENAME TABLE members TO members_old, tmp TO members;

#drop the old one
DROP TABLE members_old;
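The statements above use MySQL syntax (RENAME TABLE). A rough SQLite equivalent, as a sketch only: it assumes a hypothetical members(id, name) schema and uses ALTER TABLE ... RENAME TO for the swap. Any indexes or triggers defined on the old table are not copied and would have to be recreated by hand:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE members (id INTEGER PRIMARY KEY, name TEXT)")
conn.executemany(
    "INSERT INTO members (name) VALUES (?)",
    [("alice",), ("bob",), ("alice",), ("carol",)],
)
conn.commit()

conn.executescript(
    """
    BEGIN;
    -- new table with the same (assumed) schema as members
    CREATE TABLE tmp (id INTEGER PRIMARY KEY, name TEXT);
    -- one row per name, keeping the lowest id
    INSERT INTO tmp (id, name)
        SELECT MIN(id), name FROM members GROUP BY name;
    -- swap it in, then drop the old table
    ALTER TABLE members RENAME TO members_old;
    ALTER TABLE tmp RENAME TO members;
    DROP TABLE members_old;
    COMMIT;
    """
)

rows = conn.execute("SELECT id, name FROM members ORDER BY id").fetchall()
tables = [r[0] for r in conn.execute(
    "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name")]
print(rows)    # [(1, 'alice'), (2, 'bob'), (4, 'carol')]
print(tables)  # ['members']
```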

answered Aug 17, 2009 at 8:53

Paul Dixon


2 Comments

Gulbahar Over a year ago

Thanks Paul. For those interested...

CREATE TEMP TABLE tmp_members (name VARCHAR);
INSERT INTO tmp_members SELECT name FROM members GROUP BY name;
SELECT COUNT(name) FROM tmp_members;
DELETE FROM members;
VACUUM members;
SELECT COUNT(name) FROM members;
INSERT INTO members (name) SELECT * FROM tmp_members;
SELECT COUNT(name) FROM members;
SELECT DISTINCT COUNT(name) FROM members;
SELECT name FROM members LIMIT 10;
DROP TABLE tmp_members;

Paul Dixon Over a year ago

Sorry, I missed that you were using SQLite!

Pidoski · Accepted Answer · 2023-08-17 15:17:18Z

1

DELETE FROM tablename
WHERE ID IN
(
    SELECT MAX(ID) AS ID
    FROM tablename
    GROUP BY IDNumber
    HAVING COUNT(IDNumber) > 1
)

edited Aug 17, 2023 at 15:17

answered Aug 17, 2023 at 15:11

Pidoski


G Berdal · Accepted Answer · 2009-08-17 09:06:30Z

0

We have a huge database where deleting duplicates is part of the regular maintenance process. We use DISTINCT to select the unique records then write them into a TEMPORARY TABLE. After TRUNCATE we write back the TEMPORARY data into the TABLE.

That is one way of doing it and works as a STORED PROCEDURE.
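A minimal sketch of that copy-out, empty, copy-back routine, adapted to SQLite rather than a stored procedure. The members(name, city) table and its rows are assumptions for illustration; SQLite has no TRUNCATE statement, so a plain DELETE without a WHERE clause is used instead:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE members (name TEXT, city TEXT)")
conn.executemany(
    "INSERT INTO members VALUES (?, ?)",
    [("alice", "Oslo"), ("alice", "Oslo"), ("bob", "Rome"),
     ("alice", "Oslo"), ("bob", "Rome")],
)
conn.commit()

# The with-block commits on success and rolls back pending changes on error.
with conn:
    # 1. Copy the distinct rows into a temporary table.
    conn.execute(
        "CREATE TEMP TABLE tmp_members AS SELECT DISTINCT name, city FROM members"
    )
    # 2. Empty the original table.
    conn.execute("DELETE FROM members")
    # 3. Write the distinct rows back.
    conn.execute("INSERT INTO members SELECT name, city FROM tmp_members")

conn.execute("DROP TABLE tmp_members")

rows = conn.execute("SELECT name, city FROM members ORDER BY name").fetchall()
print(rows)  # [('alice', 'Oslo'), ('bob', 'Rome')]
```

Note that DISTINCT only collapses rows that are identical in every selected column, so this fits tables where whole rows are duplicated rather than just one column.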

answered Aug 17, 2009 at 9:06

G Berdal


1 Comment

G Berdal Over a year ago

I have to admit Rax Olgud's answer is much-much more sophisticated and probably runs 100 times quicker! :) - I'm thinking about adopting the solution... Deserves +1!

Lauri Lubi · Accepted Answer · 2016-09-05 18:56:55Z

0

If you want to first see which rows you are about to delete, and then delete them:

with MYCTE as (
    SELECT DuplicateKey1
        ,DuplicateKey2 --optional
        ,count(*) X
    FROM MyTable
    group by DuplicateKey1, DuplicateKey2
    having count(*) > 1
) 
SELECT E.*
FROM MyTable E
JOIN MYCTE cte
ON E.DuplicateKey1=cte.DuplicateKey1
    AND E.DuplicateKey2=cte.DuplicateKey2
ORDER BY E.DuplicateKey1, E.DuplicateKey2, CreatedAt

Full example at http://developer.azurewebsites.net/2014/09/better-sql-group-by-find-duplicate-data/
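The preview-then-delete idea can be sketched for SQLite as follows. This is not the query from the answer: it assumes a hypothetical members(id, name) table, lists the rows that are not the lowest id of their name, and then deletes exactly the ids that were listed:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE members (id INTEGER PRIMARY KEY, name TEXT)")
conn.executemany(
    "INSERT INTO members (name) VALUES (?)",
    [("alice",), ("bob",), ("alice",), ("carol",), ("bob",)],
)
conn.commit()

# Step 1: preview. These are the rows a delete would remove.
preview_sql = """
    SELECT id, name FROM members
    WHERE id NOT IN (SELECT MIN(id) FROM members GROUP BY name)
    ORDER BY id
"""
doomed = conn.execute(preview_sql).fetchall()
print(doomed)  # [(3, 'alice'), (5, 'bob')]

# Step 2: after reviewing the list, delete exactly those ids.
conn.executemany("DELETE FROM members WHERE id = ?", [(row[0],) for row in doomed])
conn.commit()

remaining = conn.execute("SELECT id, name FROM members ORDER BY id").fetchall()
print(remaining)  # [(1, 'alice'), (2, 'bob'), (4, 'carol')]
```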

answered Sep 5, 2016 at 18:56

Lauri Lubi


AnyKey · Accepted Answer · 2017-07-17 19:23:13Z

0

You can join the table with itself on the matching field and delete every row that has a matching row with a smaller id, which keeps the row with the lowest id in each group:

DELETE t1 FROM table_name t1
JOIN table_name t2 ON t1.match_field = t2.match_field
WHERE t1.id > t2.id;

answered Jul 17, 2017 at 19:23

AnyKey


Akhil Singh · Accepted Answer · 2017-09-26 04:39:31Z

This deletes duplicate rows and keeps one of each. The table may contain some names that are duplicated and some that are not; in both cases exactly one row per name is kept. The table has two columns, id and name, and the goal is to remove duplicate names while keeping one row for each. It works fine on my end with this query:

DELETE FROM tablename
WHERE id NOT IN (
    SELECT id FROM (
        SELECT MIN(id) AS id
        FROM tablename
        GROUP BY name
        HAVING COUNT(*) > 1
    ) AS a
)
AND id NOT IN (
    SELECT ids FROM (
        SELECT MIN(id) AS ids
        FROM tablename
        GROUP BY name
        HAVING COUNT(*) = 1
    ) AS a1
)

The original answer showed screenshots of the table before and after the delete (images not available here). The query deletes the duplicate amit and akhil rows and keeps one record for each of them.

pappu kumar · Accepted Answer · 2019-02-02 10:21:21Z

0

If you want to remove duplicate records from a table:

CREATE TABLE tmp AS
SELECT lastname, firstname, sex
FROM user_tbl
GROUP BY lastname, firstname;

DROP TABLE user_tbl;

ALTER TABLE tmp RENAME TO user_tbl;

answered Feb 2, 2019 at 10:21

pappu kumar


Ibrahim Hammed · Accepted Answer · 2022-12-06 20:04:32Z

0

Show the duplicated records:

SELECT `page_url`,count(*) FROM wl_meta_tags GROUP BY page_url HAVING count(*) > 1

Delete the duplicates, keeping one row per page_url:

DELETE FROM wl_meta_tags
WHERE meta_id NOT IN (
    SELECT meta_id FROM (
        SELECT MIN(meta_id) AS meta_id
        FROM wl_meta_tags
        GROUP BY page_url
        HAVING COUNT(*) > 1
    ) AS a
)
AND meta_id NOT IN (
    SELECT ids FROM (
        SELECT MIN(meta_id) AS ids
        FROM wl_meta_tags
        GROUP BY page_url
        HAVING COUNT(*) = 1
    ) AS a1
)

Source url

edited Dec 6, 2022 at 20:04

Ibrahim Hammed


answered Aug 19, 2021 at 11:49

subash pandey


Jayendran · Accepted Answer · 2018-10-02 11:26:28Z

-1

WITH CTE AS
(
    SELECT ROW_NUMBER() OVER (PARTITION BY [emp_id] ORDER BY [emp_id]) AS Row, * FROM employee_salary
)


DELETE FROM CTE
WHERE ROW <> 1
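The query above appears to be SQL Server syntax, and as far as I know SQLite does not allow deleting through a CTE. A possible SQLite adaptation, sketched under these assumptions: SQLite 3.25 or newer (needed for window functions), a hypothetical salary column, made-up sample rows, and the implicit rowid used to identify each physical row:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employee_salary (emp_id INTEGER, salary INTEGER)")
conn.executemany(
    "INSERT INTO employee_salary VALUES (?, ?)",
    [(1, 100), (1, 100), (2, 200), (1, 100), (3, 300)],
)

# Number the rows within each emp_id, then delete everything numbered above 1.
conn.execute(
    """
    DELETE FROM employee_salary
    WHERE rowid IN (
        SELECT rid FROM (
            SELECT rowid AS rid,
                   ROW_NUMBER() OVER (PARTITION BY emp_id ORDER BY rowid) AS rn
            FROM employee_salary
        )
        WHERE rn > 1
    )
    """
)
conn.commit()

emp_ids = [r[0] for r in conn.execute(
    "SELECT emp_id FROM employee_salary ORDER BY emp_id")]
print(emp_ids)  # [1, 2, 3]
```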

edited Oct 2, 2018 at 11:26

Jayendran


answered Oct 2, 2018 at 10:56

user3125000
