多个表上的mysql group_concat

时间:2018-11-26 11:11:36

标签: mysql join group-concat

这是我第一次使用stackoverflow,..请对我温柔;)

在映射表上使用多个JOIN时,从GROUP_CONCAT获得重复的结果在这里没有什么问题。

这很难解释,但我会尝试:

我已经创建了一个SQLFiddle进行测试:http://sqlfiddle.com/#!9/d2b347/3

对于所有帖子,我希望查询仅是一个查询,而不是1个,然后在每次测试时进行查询。但是,由于GROUP_CONCAT合并了这些测试结果,因此我获得的数据量是我想要的两倍。

可以通过某种方式使查询更可靠。要始终使GROUP_CONCAT成为测试的确切数量?

我希望/希望输出为:

|---------|-----------------|------------|---------|-------------|
| post_id | flows           | flow_types | powers  | power_types |
|---------|-----------------|------------|---------|-------------|
|       1 | 100,140         | a,b        | 1,1     | a,b         |
|---------|-----------------|------------|---------|-------------|
|       2 | 200,200,200     | a,b,c      | (null)  | (null)      |
|---------|-----------------|------------|---------|-------------|

但它是:

|---------|-----------------|------------|---------|-------------|
| post_id | flows           | flow_types | powers  | power_types |
|---------|-----------------|------------|---------|-------------|
|       1 | 100,100,140,140 | a,a,b,b    | 1,1,1,1 | a,b,a,b     |
|---------|-----------------|------------|---------|-------------|
|       2 | 200,200,200     | a,b,c      | (null)  | (null)      |
|---------|-----------------|------------|---------|-------------|

通过GROUP_CONCAT DISTINCT,我得到:

|---------|-----------------|------------|---------|-------------|
| post_id | flows           | flow_types | powers  | power_types |
|---------|-----------------|------------|---------|-------------|
|       1 | 100,140         | a,b        | 1       | a,b         |
|---------|-----------------|------------|---------|-------------|
|       2 | 200             | a,b,c      | (null)  | (null)      |
|---------|-----------------|------------|---------|-------------|

这是创建查询:

DROP TABLE IF EXISTS `posts`;
CREATE TABLE IF NOT EXISTS `posts` (
  `post_id` bigint(20) unsigned NOT NULL AUTO_INCREMENT,
  `post` varchar(256) CHARACTER SET ascii NOT NULL,
  PRIMARY KEY (`post_id`),
  UNIQUE KEY `UNQ_post` (`post`) USING HASH
) ENGINE=MyISAM DEFAULT CHARSET=utf8;

DROP TABLE IF EXISTS `posts_test1`;
CREATE TABLE IF NOT EXISTS `posts_test1` (
  `post_id` bigint(20) unsigned NOT NULL,
  `test1_id` bigint(20) unsigned NOT NULL,
  `type_id` int(10) unsigned NOT NULL DEFAULT 1,
  PRIMARY KEY (`post_id`,`test1_id`,`type_id`)
) ENGINE=MyISAM DEFAULT CHARSET=utf8;

DROP TABLE IF EXISTS `test1`;
CREATE TABLE IF NOT EXISTS `test1` (
  `test1_id` bigint(20) unsigned NOT NULL AUTO_INCREMENT,
  `flow` int(10) unsigned NOT NULL,
  PRIMARY KEY (`test1_id`),
  KEY `IDX_FLOW` (`flow`)
) ENGINE=MyISAM DEFAULT CHARSET=utf8;

DROP TABLE IF EXISTS `posts_test2`;
CREATE TABLE IF NOT EXISTS `posts_test2` (
  `post_id` bigint(20) unsigned NOT NULL,
  `test2_id` bigint(20) unsigned NOT NULL,
  `type_id` int(10) unsigned NOT NULL DEFAULT 1,
  PRIMARY KEY (`post_id`,`test2_id`,`type_id`)
) ENGINE=MyISAM DEFAULT CHARSET=utf8;

DROP TABLE IF EXISTS `test2`;
CREATE TABLE IF NOT EXISTS `test2` (
  `test2_id` bigint(20) unsigned NOT NULL AUTO_INCREMENT,
  `power` int(10) unsigned NOT NULL,
  PRIMARY KEY (`test2_id`),
  KEY `IDX_POWER` (`power`)
) ENGINE=MyISAM DEFAULT CHARSET=utf8;

DROP TABLE IF EXISTS `types`;
CREATE TABLE IF NOT EXISTS `types` (
  `type_id` int(10) unsigned NOT NULL AUTO_INCREMENT,
  `type` varchar(50) CHARACTER SET ascii DEFAULT NULL,
  PRIMARY KEY (`type_id`)
) ENGINE=MyISAM DEFAULT CHARSET=utf8;


INSERT INTO `types` (`type_id`, `type`) VALUES
  (1, 'a'),
  (2, 'b'),
  (3, 'c');

INSERT INTO `posts` (`post_id`, `post`) VALUES
  (1, 'test1'),
  (2, 'test2');

INSERT INTO `test1` (`test1_id`, `flow`) VALUES
  (1, 100),
  (2, 140),
  (3, 200),
  (4, 200),
  (5, 200);

INSERT INTO `posts_test1` (`post_id`, `test1_id`, `type_id`) VALUES
  (1, 1, 1),
  (1, 2, 2),
  (2, 3, 1),
  (2, 4, 2),
  (2, 5, 3);

INSERT INTO `test2` (`test2_id`, `power`) VALUES
  (1, 1),
  (2, 1);

INSERT INTO `posts_test2` (`post_id`, `test2_id`, `type_id`) VALUES
  (1, 1, 1),
  (1, 2, 2);

这是我的精选查询。

SELECT
p.post_id, p.post,
GROUP_CONCAT(t1.flow) flow,
GROUP_CONCAT(t1t.type) flow_types
FROM posts p
LEFT JOIN posts_test1 pt1 USING (post_id)
    LEFT JOIN test1 t1 USING (test1_id)
        LEFT JOIN types t1t ON (t1t.type_id = pt1.type_id)
GROUP BY p.post_id; # works fine


SELECT
p.post_id, p.post,
GROUP_CONCAT(t2.power) powers,
GROUP_CONCAT(t2t.type) power_types
FROM posts p
LEFT JOIN posts_test2 pt2 USING (post_id)
    LEFT JOIN test2 t2 USING (test2_id)
        LEFT JOIN types t2t ON (t2t.type_id = pt2.type_id)
GROUP BY p.post_id; # works fine


SELECT
p.post_id, p.post,
GROUP_CONCAT(t1.flow) flow,
GROUP_CONCAT(t1t.type) flow_types,
GROUP_CONCAT(t2.power) powers,
GROUP_CONCAT(t2t.type) power_types
FROM posts p
LEFT JOIN posts_test1 pt1 USING (post_id)
    LEFT JOIN test1 t1 USING (test1_id)
        LEFT JOIN types t1t ON (t1t.type_id = pt1.type_id)
LEFT JOIN posts_test2 pt2 USING (post_id)
    LEFT JOIN test2 t2 USING (test2_id)
        LEFT JOIN types t2t ON (t2t.type_id = pt2.type_id)
GROUP BY p.post_id; # getting duplicated GROUP_CONCAT results

SELECT
p.post_id, p.post,
GROUP_CONCAT(DISTINCT t1.flow) flow,
GROUP_CONCAT(DISTINCT t1t.type) flow_types,
GROUP_CONCAT(DISTINCT t2.power) powers,
GROUP_CONCAT(DISTINCT t2t.type) power_types
FROM posts p
LEFT JOIN posts_test1 pt1 USING (post_id)
    LEFT JOIN test1 t1 USING (test1_id)
        LEFT JOIN types t1t ON (t1t.type_id = pt1.type_id)
LEFT JOIN posts_test2 pt2 USING (post_id)
    LEFT JOIN test2 t2 USING (test2_id)
        LEFT JOIN types t2t ON (t2t.type_id = pt2.type_id)
GROUP BY p.post_id; # DISTINCT wipes the GROUP_CONCAT if same result...

谢谢,祝你有美好的一天!

编辑:根据建议添加了预期结果,谢谢:)

1 个答案:

答案 0 :(得分:1)

这里的问题是有两个不同的联结表(和两个不同的连接链),它们起源于单个表post。因此,线性JOIN链不起作用。线性连接完成后,其中一个连接表中的重复会导致其他链中的重复。

一种方法是在两个单独的Derived TablesJOIN子句中的子查询)中考虑这两个不同的FROM链,并确定它们各自的分组/聚集表达式。然后,我们可以使用JOIN post_id退回这两个链。

查询

SELECT
  dt1.post_id, 
  dt1.flows, 
  dt1.flow_types, 
  dt2.powers, 
  dt2.power_types 
FROM 
(
  SELECT 
    p.post_id, 
    GROUP_CONCAT(t1.flow) AS flows, 
    GROUP_CONCAT(typ.type) AS flow_types
  FROM posts p
  LEFT JOIN posts_test1 pt1 
    ON pt1.post_id = p.post_id 
  LEFT JOIN test1 t1 
    ON t1.test1_id = pt1.test1_id 
  LEFT JOIN types typ 
    ON typ.type_id = pt1.type_id 
  GROUP BY p.post_id 
) AS dt1 
JOIN 
(
  SELECT 
    p.post_id, 
    GROUP_CONCAT(t2.power) AS powers, 
    GROUP_CONCAT(typ.type) AS power_types 
  FROM posts p
  LEFT JOIN posts_test2 pt2 
    ON pt2.post_id = p.post_id 
  LEFT JOIN test2 t2 
    ON t2.test2_id = pt2.test2_id 
  LEFT JOIN types typ 
    ON typ.type_id = pt2.type_id 
  GROUP BY p.post_id 
) AS dt2
  ON dt1.post_id = dt2.post_id;

结果

| post_id | flows       | flow_types | powers | power_types |
| ------- | ----------- | ---------- | ------ | ----------- |
| 1       | 100,140     | a,b        | 1,1    | a,b         |
| 2       | 200,200,200 | a,b,c      |        |             |

View on DB Fiddle