select unique rows based on single distinct column [duplicate]

Question

I want to select rows that have a distinct email, see the example table below:

+----+---------+-------------------+-------------+
| id | title   | email             | commentname |
+----+---------+-------------------+-------------+
|  3 | test    | [email protected]   | rob         |
|  4 | i agree | [email protected]   | rob         |
|  5 | its ok  | [email protected]   | rob         |
|  6 | hey     | [email protected]   | rob         |
|  7 | nice!   | [email protected] | simon       |
|  8 | yeah    | [email protected]  | john        |
+----+---------+-------------------+-------------+

The desired result would be:

+----+-------+-------------------+-------------+
| id | title | email             | commentname |
+----+-------+-------------------+-------------+
|  3 | test  | [email protected]   | rob         |
|  7 | nice! | [email protected] | simon       |
|  8 | yeah  | [email protected]  | john        |
+----+-------+-------------------+-------------+

Where I don't care which id column value is returned. What would be the required SQL?

ypercubeᵀᴹ · Accepted Answer · 2016-03-16 14:06:14Z

121

Quick one in TSQL

SELECT a.*
FROM emails a
INNER JOIN 
  (SELECT email,
    MIN(id) as id
  FROM emails 
  GROUP BY email 
) AS b
  ON a.email = b.email 
  AND a.id = b.id;

edited Mar 16, 2016 at 14:06

ypercubeᵀᴹ

116k19 gold badges181 silver badges249 bronze badges

answered Nov 25, 2011 at 20:51

Turbot

5,2511 gold badge24 silver badges31 bronze badges

Sign up to request clarification or add additional context in comments.

5 Comments

Adam Over a year ago

Wow that was fast guys!:) laptop's answer was the shortest and easiest, thanks!

Adam Robinson Over a year ago

The distinct keyword is not necessary here. Also, it seems like a join on just id would do the trick as well.

AurA Over a year ago

I have a huge table with primary key an aggregate of two columns, it is not working in that case

Turbot Over a year ago

@downvoter , what do you mean by not working, perhaps will be another question?

drmaa Over a year ago

Excellent, I changed the min to max to get the last row in the duplicate instead of first

Adam Robinson · Accepted Answer · 2011-11-25 20:40:10Z

45

I'm assuming you mean that you don't care which row is used to obtain the title, id, and commentname values (you have "rob" for all of the rows, but I don't know if that is actually something that would be enforced or not in your data model). If so, then you can use windowing functions to return the first row for a given email address:

select
    id,
    title,
    email,
    commentname

from
(
select 
    *, 
    row_number() over (partition by email order by id) as RowNbr 

from YourTable
) source

where RowNbr = 1

answered Nov 25, 2011 at 20:40

Adam Robinson

186k36 gold badges294 silver badges351 bronze badges

3 Comments

Antony Booth Over a year ago

This is the best solution, because it can apply to duplicate rows that do not have a unique identity column, or ones that do.

A S Over a year ago

....Yes this solved the issue for me....the solution above only grouped the table data together.....i.e for Microsoft SQL 2008 Server/data .........thanks Adam......

David Over a year ago

This is a really good solution that works great for smaller tables. Is there a way to do this without having to list each column in the SELECT statement?

Community · Accepted Answer · 2017-05-23 10:31:12Z

6

If you are using MySql 5.7 or later, according to these links (MySql Official, SO QA), we can select one record per group by with out the need of any aggregate functions.

So the query can be simplified to this.

select * from comments_table group by commentname;

Try out the query in action here

edited May 23, 2017 at 10:31

CommunityBot

11 silver badge

answered Jun 9, 2016 at 12:31

RamValli

4,4952 gold badges37 silver badges47 bronze badges

4 Comments

starwed Over a year ago

Unfortunately, the question is tagged with tsql and sqlserver.

Rick Kukiela Over a year ago

Even though it was the right answer to the wrong question I ended up here looking for this solution for mysql so take my updoot

Kai Wang Over a year ago

Nice solution deserves more respects

UMR Over a year ago

didn't work with mysql Ver 8.0.29-0ubuntu0.20.04.3 for Linux on x86_64 ((Ubuntu))

sll · Accepted Answer · 2011-11-25 20:43:15Z

3

Since you don't care which id to return I stick with MAX id for each email to simplify SQL query, give it a try

;WITH ue(id)
 AS
 (
   SELECT MAX(id)
   FROM table
   GROUP BY email
 )
 SELECT * FROM table t
 INNER JOIN ue ON ue.id = t.id

answered Nov 25, 2011 at 20:43

sll

62.7k22 gold badges109 silver badges157 bronze badges

Comments

Deepak Raj · Accepted Answer · 2022-07-23 09:57:40Z

-2

SELECT * FROM emails GROUP BY email;

answered Jul 23, 2022 at 9:57

Deepak Raj

273 bronze badges

Collectives™ on Stack Overflow

select unique rows based on single distinct column [duplicate]

5 Answers 5

5 Comments

3 Comments

4 Comments

Comments

Comments

Linked

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

5 Comments

3 Comments

4 Comments

Comments

Comments

Linked

Related