萬盛學電腦網

浏覽器 windows 10 wps office 殺毒軟件 數據庫 excel教程 文件管理 word教程 網頁制作 裝機必備軟件 linux教程

萬盛學電腦網 >> 數據庫 >> mysql教程 >> mysql中innodb表中count()優化

mysql中innodb表中count()優化

count()是用來統計數據表中所有記錄的一個函數了，但在此函數在innodb中性能不怎麼樣了，下面我們來看看mysql中innodb表中count()優化，希望例子對各位有幫助．

起因：在innodb表上做count(*)統計實在是太慢了，因此想辦法看能不能再快點。

現象：先來看幾個測試案例，如下

一、 sbtest 表上的測試

show create table sbtest＼G
*************************** 1. row ***************************
Table: sbtest
Create Table: CREATE TABLE `sbtest` (
`aid` bigint(20) unsigned NOT NULL auto_increment,
`id` int(10) unsigned NOT NULL default '0',
`k` int(10) unsigned NOT NULL default '0',
`c` char(120) NOT NULL default '',
`pad` char(60) NOT NULL default '',
PRIMARY KEY (`aid`),
KEY `k` (`k`),
KEY `id` (`id`)
) ENGINE=InnoDB AUTO_INCREMENT=1000001 DEFAULT CHARSET=latin1

1、直接 count(*)

2、count(*) 使用 primary key 字段做條件

3、 count(*) 使用 secondary index 字段做條件

explain SELECT COUNT(*) FROM sbtest WHERE id>=0;
+----+-------------+--------+-------+---------------+------+---------+------+--------+--------------------------+
| id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra |
+----+-------------+--------+-------+---------------+------+---------+------+--------+--------------------------+
| 1 | SIMPLE | sbtest | range | id | id | 4 | NULL | 500049 | Using where; Using index |
+----+-------------+--------+-------+---------------+------+---------+------+--------+--------------------------+
SELECT COUNT(*) FROM sbtest WHERE id>=0;
+----------+
| COUNT(*) |
+----------+
| 1000000 |
+----------+
1 row in set (0.43 sec)
可以看到，采用這種方式查詢會非常快。有人也許會問了，會不會是因為 id 字段的長度比 aid 字段的長度來的小，導致它掃描起來比較快呢？先不著急下結論，咱們來看看下面的測試例子。

二、 sbtest1 表上的測試

show create table sbtest1＼G
*************************** 1. row ***************************
Table: sbtest1
Create Table: CREATE TABLE `sbtest1` (
`aid` int(10) unsigned NOT NULL AUTO_INCREMENT,
`id` bigint(20) unsigned NOT NULL DEFAULT '0',
`k` int(10) unsigned NOT NULL DEFAULT '0',
`c` char(120) NOT NULL DEFAULT '',
`pad` char(60) NOT NULL DEFAULT '',
PRIMARY KEY (`aid`),
KEY `k` (`k`),
KEY `id` (`id`)
) ENGINE=InnoDB AUTO_INCREMENT=1000001 DEFAULT CHARSET=latin1
show index from sbtest1;
+---------+------------+----------+--------------+-------------+-----------+-------------+----------+--------+------+------------+---------+
| Table   | Non_unique | Key_name | Seq_in_index | Column_name | Collation | Cardinality | Sub_part | Packed | Null | Index_type | Comment |
+---------+------------+----------+--------------+-------------+-----------+-------------+----------+--------+------+------------+---------+
| sbtest1 |          0 | PRIMARY |            1 | aid         | A         |     1000099 |     NULL | NULL   |      | BTREE      |         |
| sbtest1 |          1 | k        |            1 | k           | A         |          18 |     NULL | NULL   |      | BTREE      |         |
| sbtest1 |          1 | id       |            1 | id          | A         |     1000099 |     NULL | NULL   |      | BTREE      |         |
+---------+------------+----------+--------------+-------------+-----------+-------------+----------+--------+------+------------+---------+

這個表裡，把 aid 和 id 的字段長度調換了一下，也填充了 1000萬條記錄。

1、直接 count(*)

可以看到，如果不加任何條件，那麼優化器優先采用 primary key 來進行掃描。

2、count(*) 使用 primary key 字段做條件

可以看到，盡管優化器認為只需要掃描 485600 條記錄(其實是索引)，比剛才少多了，但其實仍然要做全表(索引)掃描。因此耗時和第一種相當。

3、 count(*) 使用 secondary index 字段做條件

explain SELECT COUNT(*) FROM sbtest1 WHERE id>=0;
+----+-------------+---------+-------+---------------+------+---------+------+--------+--------------------------+
| id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra |
+----+-------------+---------+-------+---------------+------+---------+------+--------+--------------------------+
| 1 | SIMPLE | sbtest1 | range | id | id | 8 | NULL | 500049 | Using where; Using index |
+----+-------------+---------+-------+---------------+------+---------+------+--------+--------------------------+
1 row in set (0.00 sec)
SELECT COUNT(*) FROM sbtest1 WHERE id>=0;
+----------+
| COUNT(*) |
+----------+
| 1000000 |
+----------+
1 row in set (0.45 sec)

可以看到，采用這種方式查詢會非常快。

上面的所有測試，均在 mysql 5.1.24 環境下通過，並且每次查詢前都重啟了 mysqld。

可以看到，把 aid 和 id 的長度調換之後，采用 secondary index 查詢仍然是要比用 primary key 查詢來的快很多。看來主要不是字段長度引起的索引掃描快慢，而是采用 primary key 以及 secondary index 引起的區別。那麼，為什麼用 secondary index 掃描反而比 primary key 掃描來的要快呢？我們就需要了解innodb的 clustered index 和secondary index 之間的區別了。

innodb 的 clustered index 是把 primary key 以及 row data 保存在一起的，而 secondary index 則是單獨存放，然後有個指針指向 primary key。因此，需要進行 count(*) 統計表記錄總數時，利用 secondary index 掃描起來，顯然更快。而primary key則主要在掃描索引，同時要返回結果記錄時的作用較大，例如：

SELECT * FROM sbtest WHERE aid = xxx;

那既然是使用 secondary index 會比 primary key 更快，為何優化器卻優先選擇 primary key 來掃描呢，Heikki Tuuri 的回答是：

in the example table, the secondary index is inserted into in a perfect order! That is
very unusual. Normally the secondary index would be fragmented, causing random disk I/O,
and the scan would be slower than in the primary index.
I am changing this to a feature request: keep 'clustering ratio' statistics on a secondary
index and do the scan there if the order is almost the same as in the primary index. I
doubt this feature will ever be implemented, though.

上一頁:linux中Mysql的登陸與設置密碼步驟
下一頁:MySQL數據庫慢日志分析工具mysqlsla使用教程

萬盛學電腦網

萬盛學電腦網 >> 數據庫 >> mysql教程 >> mysql中innodb表中count()優化

mysql中innodb表中count()優化

mysql教程排行

程序編程推薦

熱門文章

相關文章

圖片文章

網站交互設計的小細節：表單必填項設計思考

Dreamweaver怎樣清理繁瑣多余的網頁代碼

ubuntu12.04安裝tftp、配置tftp服務錯誤

增強網站與用戶的互動性

萬盛學電腦網 | 設為首頁 | 加入收藏