Scala中几个Seqs的懒惰笛卡尔积

时间:2011-11-30 06:19:36

标签: scala scala-collections

我实现了一个简单的方法来生成几个Seq的笛卡尔积,如下所示:

object RichSeq {
  implicit def toRichSeq[T](s: Seq[T]) = new RichSeq[T](s)
}

class RichSeq[T](s: Seq[T]) {

  import RichSeq._

  def cartesian(ss: Seq[Seq[T]]): Seq[Seq[T]] = {

    ss.toList match {
      case Nil        => Seq(s)
      case s2 :: Nil  => {
        for (e <- s) yield s2.map(e2 => Seq(e, e2))
      }.flatten
      case s2 :: tail => {
        for (e <- s) yield s2.cartesian(tail).map(seq => e +: seq)
      }.flatten
    }
  }
}

显然,这个很慢,因为它一次计算整个产品。有人在Scala中为这个问题实现了一个懒惰的解决方案吗?

UPD

好的,所以我在笛卡尔产品上实现了一个非常愚蠢但是工作的迭代器版本。在这里发布给未来的爱好者:

object RichSeq {
  implicit def toRichSeq[T](s: Seq[T]) = new RichSeq(s) 
}

class RichSeq[T](s: Seq[T]) {

  def lazyCartesian(ss: Seq[Seq[T]]): Iterator[Seq[T]] = new Iterator[Seq[T]] {

    private[this] val seqs = s +: ss

    private[this] var indexes = Array.fill(seqs.length)(0)

    private[this] val counts = Vector(seqs.map(_.length - 1): _*)

    private[this] var current = 0

    def next(): Seq[T] = {
      val buffer = ArrayBuffer.empty[T]
      if (current != 0) {
        throw new NoSuchElementException("no more elements to traverse")
      }
      val newIndexes = ArrayBuffer.empty[Int]
      var inside = 0
      for ((index, i) <- indexes.zipWithIndex) {
        buffer.append(seqs(i)(index))
        newIndexes.append(index)
        if ((0 to i).forall(ind => newIndexes(ind) == counts(ind))) {
          inside = inside + 1
        }
      }
      current = inside
      if (current < seqs.length) {
        for (i <- (0 to current).reverse) {
          if ((0 to i).forall(ind => newIndexes(ind) == counts(ind))) {
            newIndexes(i) = 0
          } else if (newIndexes(i) < counts(i)) {
            newIndexes(i) = newIndexes(i) + 1
          }
        }
        current = 0
        indexes = newIndexes.toArray
      }
      buffer.result()
    }

    def hasNext: Boolean = current != seqs.length
  }
}

4 个答案:

答案 0 :(得分:17)

这是我对给定问题的解决方案。请注意,懒惰只是在用于理解的“根集合”上使用.view引起的。

scala> def combine[A](xs: Traversable[Traversable[A]]): Seq[Seq[A]] =
     |  xs.foldLeft(Seq(Seq.empty[A])){
     |    (x, y) => for (a <- x.view; b <- y) yield a :+ b }
combine: [A](xs: Traversable[Traversable[A]])Seq[Seq[A]]
scala> combine(Set(Set("a","b","c"), Set("1","2"), Set("S","T"))) foreach (println(_))
List(a, 1, S)
List(a, 1, T)
List(a, 2, S)
List(a, 2, T)
List(b, 1, S)
List(b, 1, T)
List(b, 2, S)
List(b, 2, T)
List(c, 1, S)
List(c, 1, T)
List(c, 2, S)
List(c, 2, T)

为了实现这一点,我从https://stackoverflow.com/a/4515071/53974中定义的函数combine开始,将函数(a, b) => (a, b)传递给它。但是,这并没有直接起作用,因为该代码需要类型为(A, A) => A的函数。所以我稍微调整了一下代码。

答案 1 :(得分:3)

答案 2 :(得分:1)

怎么样:

  def cartesian[A](list: List[Seq[A]]): Iterator[Seq[A]] = {
    if (list.isEmpty) {
      Iterator(Seq())
    } else {
      list.head.iterator.flatMap { i => cartesian(list.tail).map(i +: _) }
    }
  }

简单而懒惰;)

答案 3 :(得分:0)

您可以在这里查看:https://stackoverflow.com/a/8318364/312172如何将数字转换为所有可能值的索引,而不生成每个元素。

此技术可用于实现流。

相关问题