慢速内部联接订单查询

时间:2016-02-04 15:20:01

标签: mysql performance sql-order-by inner-join

我的查询速度有问题。问题与此类似,但无法找到解决方案。解释说MySQL在公司表上使用:Using index condition; Using where; Using temporary; Using filesort

Mysql slow query: INNER JOIN + ORDER BY causes filesort

慢查询:

SELECT * FROM companies 
    INNER JOIN post_indices
        ON companies.post_index_id = post_indices.id
    WHERE companies.deleted_at is NULL
    ORDER BY post_indices.id
    LIMIT 1;
# 1 row in set (5.62 sec)

但是如果我从查询中删除where语句,它真的很快:

SELECT * FROM companies
    INNER JOIN post_indices
        ON companies.post_index_id = post_indices.id
    ORDER BY post_indices.id
    LIMIT 1;
# 1 row in set (0.00 sec)

我尝试在companies表上使用不同的索引:

  1. index_companies_on_deleted_at
  2. index_companeis_on_post_index_id
  3. index_companies_on_deleted_at_and_post_index_id
  4. index_companies_on_post_index_id_and_deleted_at
  5. MySQL会自动选择

    index_companies_on_deleted_at索引。使用上述索引进行相同查询的统计信息:

    1. 5.6秒
    2. 3.4秒
    3. 8.5秒
    4. 3.5秒
    5. 任何想法如何提高我的查询速度?再说一遍 - 没有where deleted_at is null条件查询是即时的..

      • 公司表有1.3密耳的行。
      • PostIndices表有3k行。

      更新1:

      post_indices.id的排序用于简单,因为它已被索引。但它将用于连接表的其他列(post_indices)。所以排序companies.post_index_id不会解决这个问题

      Rick James的

      更新2:

      您的查询只需0.04秒即可完成。并解释说使用index_companies_on_deleted_at_and_post_index_id索引。所以是的,它效果更好,但这并不能解决我的问题(需要在post_indices列上订购,将来会这样做,所以id post_indices.id用于简单示例。将来它将是例如{ {1}})。

      我使用WHERE查询,但没有ORDER BY是即时的。

      更新3:

      EXPLAIN查询。我也注意到索引的顺序很重要。如果post_indices.city索引更高(先前创建)然后index_companies_on_deleted_at,则使用index_companies_on_deleted_at_and_post_index_id索引。否则使用后面的索引。我的意思是由MySQL自动选择。

      mysql> EXPLAIN SELECT * FROM companies INNER JOIN post_indices ON post_indices.id = companies.post_index_id WHERE companies.deleted_at IS NULL ORDER BY post_indices.id LIMIT 1;
      +----+-------------+--------------+------------+--------+----------------------------------------------------------------------------------------------------------------+-------------------------------+---------+------------------------------------------------------+--------+----------+---------------------------------------------------------------------+
      | id | select_type | table        | partitions | type   | possible_keys                                                                                                  | key                           | key_len | ref                                                  | rows   | filtered | Extra                                                               |
      +----+-------------+--------------+------------+--------+----------------------------------------------------------------------------------------------------------------+-------------------------------+---------+------------------------------------------------------+--------+----------+---------------------------------------------------------------------+
      |  1 | SIMPLE      | companies    | NULL       | ref    | index_companies_on_post_index_id,index_companies_on_deleted_at,index_companies_on_deleted_at_and_post_index_id | index_companies_on_deleted_at | 6       | const                                                | 638692 |   100.00 | Using index condition; Using where; Using temporary; Using filesort |
      |  1 | SIMPLE      | post_indices | NULL       | eq_ref | PRIMARY                                                                                                        | PRIMARY                       | 4       | enbro_purecrm_eu_development.companies.post_index_id |      1 |   100.00 | NULL                                                                |
      +----+-------------+--------------+------------+--------+----------------------------------------------------------------------------------------------------------------+-------------------------------+---------+------------------------------------------------------+--------+----------+---------------------------------------------------------------------+
      2 rows in set, 1 warning (0.00 sec)
      
      
      mysql> EXPLAIN SELECT * FROM companies USE INDEX(index_companies_on_post_index_id) INNER JOIN post_indices ON post_indices.id = companies.post_index_id WHERE companies.deleted_at IS NULL ORDER BY post_indices.id LIMIT 1;
      +----+-------------+--------------+------------+--------+----------------------------------+---------+---------+------------------------------------------------------+---------+----------+----------------------------------------------+
      | id | select_type | table        | partitions | type   | possible_keys                    | key     | key_len | ref                                                  | rows    | filtered | Extra                                        |
      +----+-------------+--------------+------------+--------+----------------------------------+---------+---------+------------------------------------------------------+---------+----------+----------------------------------------------+
      |  1 | SIMPLE      | companies    | NULL       | ALL    | index_companies_on_post_index_id | NULL    | NULL    | NULL                                                 | 1277385 |    10.00 | Using where; Using temporary; Using filesort |
      |  1 | SIMPLE      | post_indices | NULL       | eq_ref | PRIMARY                          | PRIMARY | 4       | enbro_purecrm_eu_development.companies.post_index_id |       1 |   100.00 | NULL                                         |
      +----+-------------+--------------+------------+--------+----------------------------------+---------+---------+------------------------------------------------------+---------+----------+----------------------------------------------+
      2 rows in set, 1 warning (0.00 sec)
      
      
      mysql> EXPLAIN SELECT * FROM companies USE INDEX(index_companies_on_deleted_at_and_post_index_id) INNER JOIN post_indices ON post_indices.id = companies.post_index_id WHERE companies.deleted_at IS NULL ORDER BY post_indices.id LIMIT 1;
      +----+-------------+--------------+------------+--------+-------------------------------------------------+-------------------------------------------------+---------+------------------------------------------------------+--------+----------+--------------------------------------------------------+
      | id | select_type | table        | partitions | type   | possible_keys                                   | key                                             | key_len | ref                                                  | rows   | filtered | Extra                                                  |
      +----+-------------+--------------+------------+--------+-------------------------------------------------+-------------------------------------------------+---------+------------------------------------------------------+--------+----------+--------------------------------------------------------+
      |  1 | SIMPLE      | companies    | NULL       | ref    | index_companies_on_deleted_at_and_post_index_id | index_companies_on_deleted_at_and_post_index_id | 6       | const                                                | 638692 |   100.00 | Using index condition; Using temporary; Using filesort |
      |  1 | SIMPLE      | post_indices | NULL       | eq_ref | PRIMARY                                         | PRIMARY                                         | 4       | enbro_purecrm_eu_development.companies.post_index_id |      1 |   100.00 | NULL                                                   |
      +----+-------------+--------------+------------+--------+-------------------------------------------------+-------------------------------------------------+---------+------------------------------------------------------+--------+----------+--------------------------------------------------------+
      2 rows in set, 1 warning (0.00 sec)
      

      更新4:

      我删除了非相关列:

      | companies | CREATE TABLE `companies` (
      `id` int(11) NOT NULL AUTO_INCREMENT,
      `name` varchar(255) COLLATE utf8_unicode_ci DEFAULT NULL,
      `address` varchar(255) COLLATE utf8_unicode_ci DEFAULT NULL,
      `post_index_id` int(11) DEFAULT NULL,
      `vat` varchar(255) COLLATE utf8_unicode_ci DEFAULT NULL,
      `note` text COLLATE utf8_unicode_ci,
      `state` varchar(255) COLLATE utf8_unicode_ci NOT NULL DEFAULT 'new',
      `deleted_at` datetime DEFAULT NULL,
      `lead_list_id` int(11) DEFAULT NULL,
      PRIMARY KEY (`id`),
      UNIQUE KEY `index_companies_on_vat` (`vat`),
      KEY `index_companies_on_post_index_id` (`post_index_id`),
      KEY `index_companies_on_state` (`state`),
      KEY `index_companies_on_deleted_at` (`deleted_at`),
      KEY `index_companies_on_deleted_at_and_post_index_id` (`deleted_at`,`post_index_id`),
      KEY `index_companies_on_lead_list_id` (`lead_list_id`),
      CONSTRAINT `fk_rails_5fc7f5c6b9` FOREIGN KEY (`lead_list_id`) REFERENCES `lead_lists` (`id`),
      CONSTRAINT `fk_rails_79719355c6` FOREIGN KEY (`post_index_id`) REFERENCES `post_indices` (`id`)
      ) ENGINE=InnoDB AUTO_INCREMENT=2523518 DEFAULT CHARSET=utf8 COLLATE=utf8_unicode_ci |
      
      | post_indices | CREATE TABLE `post_indices` (
      `id` int(11) NOT NULL AUTO_INCREMENT,
      `county` varchar(255) COLLATE utf8_unicode_ci DEFAULT NULL,
      `postal_code` int(11) DEFAULT NULL,
      `group_part` int(11) DEFAULT NULL,
      `group_number` int(11) DEFAULT NULL,
      `group_name` varchar(255) COLLATE utf8_unicode_ci DEFAULT NULL,
      `city` varchar(255) COLLATE utf8_unicode_ci DEFAULT NULL,
      `created_at` datetime NOT NULL,
      `updated_at` datetime NOT NULL,
      PRIMARY KEY (`id`)
      ) ENGINE=InnoDB AUTO_INCREMENT=3101 DEFAULT CHARSET=utf8 COLLATE=utf8_unicode_ci |
      

      更新5:

      另一位开发人员在本地计算机上使用完全相同的数据集(转储/恢复)测试了相同的查询。他得到了完全不同的解释:

      mysql> explain SELECT * FROM companies      INNER JOIN post_indices         ON companies.post_index_id = post_indices.id     WHERE companies.deleted_at is NULL     ORDER BY post_indices.id     LIMIT 1;
      +----+-------------+--------------+-------+----------------------------------------------------------------------------------------------------------------+-------------------------------------------------+---------+----------------------------------------------------+------+-----------------------+
      | id | select_type | table        | type  | possible_keys                                                                                                  | key                                             | key_len | ref                                                | rows | Extra                 |
      +----+-------------+--------------+-------+----------------------------------------------------------------------------------------------------------------+-------------------------------------------------+---------+----------------------------------------------------+------+-----------------------+
      |  1 | SIMPLE      | post_indices | index | PRIMARY                                                                                                        | PRIMARY                                         | 4       | NULL                                               |    1 | NULL                  |
      |  1 | SIMPLE      | companies    | ref   | index_companies_on_post_index_id,index_companies_on_deleted_at,index_companies_on_deleted_at_and_post_index_id | index_companies_on_deleted_at_and_post_index_id | 11      | const,enbro_purecrm_eu_development.post_indices.id |  283 | Using index condition |
      +----+-------------+--------------+-------+----------------------------------------------------------------------------------------------------------------+-------------------------------------------------+---------+----------------------------------------------------+------+-----------------------+
      2 rows in set (0,00 sec)
      

      他的电脑上的相同查询是即时的。不知道为什么会发生这种情况..我也试过使用STRAIGHT_JOIN。当我强制post_indices表被MySQL首先读取时,它也非常快速。但是对我来说仍然是个谜,为什么另一台机器上的同一查询速度很快(mysql -v 5.6.27)而且机器上的速度很慢(mysql -v 5.7.10)

      所以似乎问题是MySQL使用错误的表作为第一个表来阅读。

2 个答案:

答案 0 :(得分:2)

这会更好吗?

SELECT * FROM companies AS c
INNER JOIN post_indices AS pi
    ON c.post_index_id = pi.id
WHERE    c.deleted_at is NULL
ORDER BY c.post_index_id           -- Note
LIMIT 1;

INDEX(deleted_at, post_index_id)  -- note

就此而言,使用 WHERE运行的速度有多快,但没有运行ORDER BY

答案 1 :(得分:0)

使用以下优化程序提示,应强制MySQL使用您的同事观察到的计划:

SELECT * FROM post_indices 
    STRAIGHT_JOIN companies FORCE INDEX(index_companies_on_deleted_at_and_post_index_id)
        ON companies.post_index_id = post_indices.id
    WHERE companies.deleted_at is NULL
    ORDER BY post_indices.id
    LIMIT 1;

如果要对post_indices的其他列进行排序,则需要对这些列进行索引才能使此计划正常运行。

请注意,最佳计划是什么取决于deleted_at为NULL的频率。如果deleted_at经常为NULL,则上述计划将很快。如果没有,使用上述计划,必须在找到匹配项之前运行多行post_indices。另请注意,对于使用OFFSET的查询,同一计划可能不是最有效的。

我认为这里的问题是MySQL决定了连接顺序而没有考虑ORDER BY和LIMIT的影响。换句话说,它将选择它认为最快执行完全连接的连接顺序。 由于公司表有一个限制(deleted_at为NULL),我不会惊讶它将从这个表开始。