萬盛學電腦網

Filtering out duplicate rows in MySQL with DISTINCT and GROUP BY

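In MySQL, DISTINCT removes duplicates directly, while GROUP BY groups rows for display. In practice, though, some people find that DISTINCT is not as useful as the official documentation makes it sound. Below I walk through how each of them filters out duplicate rows.

Let's start with an example: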

table

id  name
1   a
2   b
3   c
4   c
5   b

The table looks roughly like this. It is only a simple example; real cases are far more complicated.
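To follow along, the sample data can be recreated with something like the sketch below. The column types and the primary key are assumptions (the article only shows the values), and because table is a reserved word in MySQL, the name has to be wrapped in backticks.

```sql
-- Assumed definition: the article shows only the values, not the column types.
CREATE TABLE `table` (
    id   INT PRIMARY KEY,
    name VARCHAR(10) NOT NULL
);

-- The five sample rows from the example.
INSERT INTO `table` (id, name) VALUES
    (1, 'a'),
    (2, 'b'),
    (3, 'c'),
    (4, 'c'),
    (5, 'b');
```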

Suppose I want a single statement that returns every name without repeats. That calls for DISTINCT to drop the redundant duplicates.

select distinct name from `table`

The result is:

name
a
b
c

That seems to do the job. But what if I also want the id values? Let's change the query:

select distinct name, id from `table`

The result will be:

name  id
a     1
b     2
c     3
c     4
b     5

Why did DISTINCT have no effect? It did take effect, but it applies to both columns at once: a row is dropped only when its id and its name both match another row.
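One way to see that DISTINCT works on the whole combination of selected columns is to count the distinct values both ways. A small sketch against the sample table, using MySQL's multi-expression COUNT(DISTINCT ...); the column aliases are made up for illustration.

```sql
SELECT
    COUNT(DISTINCT name)     AS distinct_names,  -- a, b, c
    COUNT(DISTINCT name, id) AS distinct_pairs   -- every (name, id) pair is unique
FROM `table`;
-- With the five sample rows this gives distinct_names = 3 and distinct_pairs = 5.
```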

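Let's try changing the query again:

select id, distinct name from `table`

This one fails with a syntax error, because DISTINCT is only allowed immediately after SELECT. A statement that does return one row per name, together with an id, combines GROUP BY with count(distinct name). Here it is in full: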

select *, count(distinct name) from `table` group by name

Result:

id  name  count(distinct name)
1   a     1
2   b     1
3   c     1
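Selecting * together with GROUP BY name relies on MySQL picking an arbitrary id from each group, and the statement is rejected with error 1055 when the ONLY_FULL_GROUP_BY SQL mode is on (the default since MySQL 5.7.5). A sketch of a variant that states explicitly which id to keep, here the smallest one per name; the alias first_id is only illustrative.

```sql
-- One row per name, paired with the smallest id that carries that name.
SELECT MIN(id) AS first_id, name
FROM `table`
GROUP BY name
ORDER BY first_id;
-- With the sample rows: (1, a), (2, b), (3, c)
```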

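The approach above is simple, but there are cases it cannot handle. Below are some commonly used statements for working with duplicate records.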

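Methods for finding and deleting duplicate records
(I)
1. Find the redundant duplicate records in a table, where duplicates are identified by a single column (peopleId)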

select * from people
where peopleId in (select peopleId from people group by peopleId having count(peopleId) > 1)
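When only the duplicated values are needed rather than the full rows, the grouped subquery can also be run on its own. A minimal sketch; the alias copies is made up for illustration.

```sql
-- List each peopleId that occurs more than once, with its number of occurrences.
SELECT peopleId, COUNT(*) AS copies
FROM people
GROUP BY peopleId
HAVING COUNT(*) > 1;
```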

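2. Delete the redundant duplicate records in a table, where duplicates are identified by a single column (peopleId), keeping only the record with the smallest rowid (rowid here is the Oracle-style row identifier; MySQL tables have no such column)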

delete from people
where peopleId in (select peopleId from people group by peopleId having count(peopleId) > 1)
and rowid not in (select min(rowid) from people group by peopleId having count(peopleId) > 1)
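MySQL rejects a DELETE whose subquery reads from the table being deleted from (error 1093), so one common way around it is a multi-table DELETE with a self-join. The sketch below assumes people has a unique auto-increment column, here called id, standing in for rowid; that column is not part of the original example.

```sql
-- Assumes a unique column `id` (hypothetical) that plays the role of rowid.
-- For every pair of rows sharing a peopleId, delete the one with the larger id,
-- which leaves only the row with the smallest id in each group.
DELETE p1
FROM people AS p1
JOIN people AS p2
  ON p1.peopleId = p2.peopleId
 AND p1.id > p2.id;
```

Replacing DELETE p1 with SELECT DISTINCT p1.* and running that first shows which rows would be removed.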

3. Find the redundant duplicate records in a table (multiple columns)

select * from vitae a
where (a.peopleId, a.seq) in (select peopleId, seq from vitae group by peopleId, seq having count(*) > 1)

4. Delete the redundant duplicate records in a table (multiple columns), keeping only the record with the smallest rowid

delete from vitae a
where (a.peopleId, a.seq) in (select peopleId, seq from vitae group by peopleId, seq having count(*) > 1)
and rowid not in (select min(rowid) from vitae group by peopleId, seq having count(*) > 1)
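In MySQL, where there is no rowid and a DELETE cannot select from its own target table in a subquery, a multi-column version can be sketched as a self-join. As an assumption, vitae is given a unique column called id that takes the place of rowid; it does not appear in the original example.

```sql
-- Assumes a unique column `id` (hypothetical) standing in for rowid.
-- Rows match when both peopleId and seq are equal; of each matching pair,
-- the row with the larger id is deleted, so the smallest id per group survives.
DELETE v1
FROM vitae AS v1
JOIN vitae AS v2
  ON v1.peopleId = v2.peopleId
 AND v1.seq = v2.seq
 AND v1.id > v2.id;
```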

5. Find the redundant duplicate records in a table (multiple columns), excluding the record with the smallest rowid

select * from vitae a
where (a.peopleId, a.seq) in (select peopleId, seq from vitae group by peopleId, seq having count(*) > 1)
and rowid not in (select min(rowid) from vitae group by peopleId, seq having count(*) > 1)
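For MySQL, the query can be sketched with a derived table that records which row to keep in each duplicated group. The column id is an assumption (a unique column standing in for rowid), and the aliases d and keep_id are made up for illustration.

```sql
-- Assumes a unique column `id` (hypothetical) standing in for rowid.
SELECT v.*
FROM vitae AS v
JOIN (
    -- One row per duplicated (peopleId, seq) group, with the id to keep.
    SELECT peopleId, seq, MIN(id) AS keep_id
    FROM vitae
    GROUP BY peopleId, seq
    HAVING COUNT(*) > 1
) AS d
  ON v.peopleId = d.peopleId
 AND v.seq = d.seq
WHERE v.id <> d.keep_id;  -- everything in the group except the kept row
```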