How can I combine two scalaz threads with a predicate selector?

Question

How can I combine two scalaz threads with a predicate selector?

I would like to combine two scalaz threads with a predicate that selects the next item from any thread. For example, I would like this test to pass:

val a = Process(1, 2, 5, 8) val b = Process(3, 4, 5, 7) choose(a, b)(_ < _).toList shouldEqual List(1, 2, 3, 4, 5, 5, 7, 8)

As you can see, we cannot do something smart like zip and order two elements, because one of the processes can be selected sequentially at times.

I took a hit on a decision that I thought would work. It is compiled! But hell, if you do nothing. JVM just freezes :(

 import scalaz.stream.Process._ import scalaz.stream._ object StreamStuff { def choose[F[_], I](a:Process[F, I], b:Process[F, I])(p: (I, I) => Boolean): Process[F, I] = (a.awaitOption zip b.awaitOption).flatMap { case (Some(ai), Some(bi)) => if(p(ai, bi)) emit(ai) ++ choose(a, emit(bi) ++ b)(p) else emit(bi) ++ choose(emit(ai) ++ a, b)(p) case (None, Some(bi)) => emit(bi) ++ b case (Some(ai), None) => emit(ai) ++ a case _ => halt } }

Please note that this was my second attempt. In my first attempt, I tried to create a Tee , but I could not figure out how not to use the loser item. I felt that I needed something recursive, as I have here.

I am using thread version 0.7.3a .

Any tips (including incremental hints, because I would just like to learn how to describe these things myself) are greatly appreciated!

+5

scala scalaz scalaz-stream

joescii Jul 13 '16 at 13:32

source share

2 answers

Travis brown · Answer 1 · 2016-07-13T17:29:46+0000

I will give some tips and an implementation below, so you may want to close the screen if you want to solve the problem yourself.

Disclaimer: this is only the first approach that came to mind, and my acquaintance with the scalaz-stream API is a bit rusty, so there may be better ways to implement this operation, it may be completely wrong in some terrible way, etc.

Tip 1

Instead of trying to “buy out” the losers, you can pass them in the next recursive call.

Hint 2

You can avoid accumulating more than one lost item by indicating which side lost the last.

Tip 3

It is often easier for me to sketch out an implementation using regular collections when I work with Scalaz streams. Here is a helper method that we will need for lists:

 /** * @param p if true, the first of the pair wins */ def mergeListsWithHeld[A](p: (A, A) => Boolean)(held: Either[A, A])( ls: List[A], rs: List[A] ): List[A] = held match { // Right is the current winner. case Left(l) => rs match { // ...but it empty. case Nil => l :: ls // ...and it still winning. case r :: rt if p(r, l) => r :: mergeListsWithHeld(p)(held)(ls, rt) // ...upset! case r :: rt => l :: mergeListsWithHeld(p)(Right(r))(ls, rt) } // Left is the current winner. case Right(r) => ls match { case Nil => r :: rs case l :: lt if p(l, r) => l :: mergeListsWithHeld(p)(held)(lt, rs) case l :: lt => r :: mergeListsWithHeld(p)(Left(l))(lt, rs) } }

It is assumed that we already have a lost element, but now we can write a method that we really want to use:

 def mergeListsWith[A](p: (A, A) => Boolean)(ls: List[A], rs: List[A]): List[A] = ls match { case Nil => rs case l :: lt => rs match { case Nil => ls case r :: rt if p(l, r) => l :: mergeListsWithHeld(p)(Right(r))(lt, rt) case r :: rt => r :: mergeListsWithHeld(p)(Left(l))(lt, rt) } }

And then:

 scala> org.scalacheck.Prop.forAll { (ls: List[Int], rs: List[Int]) => | mergeListsWith[Int](_ < _)(ls.sorted, rs.sorted) == (ls ++ rs).sorted | }.check + OK, passed 100 tests.

Good Excellent. There are nicer ways to write this for lists, but this implementation matches the form of what we need to do for Process .

Implementation

And here is more or less equivalent to stream-stream:

 import scalaz.{ -\/, \/, \/- } import scalaz.stream.Process.{ awaitL, awaitR, emit } import scalaz.stream.{ Process, Tee, tee } def mergeWithHeld[A](p: (A, A) => Boolean)(held: A \/ A): Tee[A, A, A] = held.fold(_ => awaitR[A], _ => awaitL[A]).awaitOption.flatMap { case None => emit(held.merge) ++ held.fold(_ => tee.passL, _ => tee.passR) case Some(next) if p(next, held.merge) => emit(next) ++ mergeWithHeld(p)(held) case Some(next) => emit(held.merge) ++ mergeWithHeld(p)( held.fold(_ => \/-(next), _ => -\/(next)) ) } def mergeWith[A](p: (A, A) => Boolean): Tee[A, A, A] = awaitL[A].awaitOption.flatMap { case None => tee.passR case Some(l) => awaitR[A].awaitOption.flatMap { case None => emit(l) ++ tee.passL case Some(r) if p(l, r) => emit(l) ++ mergeWithHeld(p)(\/-(r)) case Some(r) => emit(r) ++ mergeWithHeld(p)(-\/(l)) } }

And check it again:

 scala> org.scalacheck.Prop.forAll { (ls: List[Int], rs: List[Int]) => | Process.emitAll(ls.sorted).tee(Process.emitAll(rs.sorted))( | mergeWith(_ < _) | ).toList == (ls ++ rs).sorted | }.check + OK, passed 100 tests.

I would not do this without additional tests, but it seems to work.

Captain o. · Answer 2 · 2017-04-30T05:09:28+0000

You must implement a custom tee, as Travis Brown suggested. Here is my tee implementation :

 /* A tee which sequentially compares elements from left and right and passes an element from left if predicate returns true, otherwise passes an element from right. */ def predicateTee[A](predicate: (A, A) => Boolean): Tee[A, A, A] = { def go(stack: Option[A \/ A]): Tee[A, A, A] = { def stackEither(l: A, r: A) = if (predicate(l, r)) emit(l) ++ go(\/-(r).some) else emit(r) ++ go(-\/(l).some) stack match { case None => awaitL[A].awaitOption.flatMap { lo => awaitR[A].awaitOption.flatMap { ro => (lo, ro) match { case (Some(l), Some(r)) => stackEither(l, r) case (Some(l), None) => emit(l) ++ passL case (None, Some(r)) => emit(r) ++ passR case _ => halt } } } case Some(-\/(l)) => awaitR[A].awaitOption.flatMap { case Some(r) => stackEither(l, r) case None => emit(l) ++ passL } case Some(\/-(r)) => awaitL[A].awaitOption.flatMap { case Some(l) => stackEither(l, r) case None => emit(r) ++ passR } } } go(None) } val p1: Process[Task, Int] = Process(1, 2, 4, 5, 9, 10, 11) val p2: Process[Task, Int] = Process(0, 3, 7, 8, 6) p1.tee(p2)(predicateTee(_ < _)).runLog.run //res0: IndexedSeq[Int] = Vector(0, 1, 2, 3, 4, 5, 7, 8, 6, 9, 10, 11)

How can I combine two scalaz threads with a predicate selector?

Tip 1

Hint 2

Tip 3

Implementation

More articles: