Merge Sort
> module MergeSort where
> -- see http://www.seas.upenn.edu/~cis5520/current/hw/hw02/MergeSort.html
> import SortedList (SortedList)
> import qualified SortedList as SL
> import qualified Data.List as List
> import Data.Monoid
> ( Product(Product, getProduct), Sum(Sum, getSum) )
> import Test.HUnit ( (~?=), Test(TestList), runTestTT )
A warm-up exercise: write the function insert
, which takes an element and a
sorted list and inserts the element into the list at the first position where
it is less than or equal to the next element. For this definition, do not use
any functions from the Data.List
library (which, indeed, contains such a
function).
> -- >>> insert 2 [0,1,3,4]
> -- [0,1,2,3,4]
> insert :: Ord a => a -> [a] -> [a]
> insert = undefined
Using this function, we can define the insertion sort algorithm over
lists. Insertion sort, like several other sorting algorithms, is incremental
-- it works by processing a single element of the input unsorted list at a
time, and when it finishes processing the input, its work is done. Using the
insert
function above, write a function which takes a list and returns the
list sorted. You must express your answer in terms of a fold.
> insertionSort :: Ord a => [a] -> [a]
> insertionSort = undefined
Keep in mind that this sorting algorithm, although succinct, is not efficient -- it has O(N ^ 2) asymptotic complexity.
You may have noticed that the insert
function above has an invariant
attached to its description; namely, that its list argument is already
sorted. If that invariant holds, then insert
guarantees that its output will
also be sorted; otherwise, there is no such guarantee.
A leading question: what if we could keep track of the "sorted-ness" of a list
using the type system? You already know that we can -- that's precisely what
we did in SortedList
.
> -------------------------------------------------------------
SortedLists to the Rescue
I previously promised that the interface we built for SortedList
s would be
sufficient to do useful things with them. One particularly obvious useful
thing to do with SortedList
s is to construct them from arbitrary lists --
that is, to sort an arbitrary list and produce a SortedList
with the result.
> sortedFromList :: Ord a => [a] -> SortedList a
> sortedFromList = undefined
By projecting out the underlying list, we get a sorting function, that we'll
call sortedListSort
.
> sortedListSort :: Ord a => [a] -> [a]
> sortedListSort = SL.toList . sortedFromList
> testSortedFromList :: Test
> testSortedFromList =
> let unsorted = [51,67,89,95,14,31,28,87,0,25]
> sorted = [0,14,25,28,31,51,67,87,89,95] in
> sortedListSort unsorted ~?= sorted
> -- >>> runTestTT testSortedFromList
> -- Counts {cases = 1, tried = 1, errors = 0, failures = 0}
One thing you may have noticed while writing the above function is that there
is only one place you could have made reference to the SortedList
type
specifically: in the use of the singleton
operation. Indeed, this operation
is the only SortedList
-specific way to create new SortedList
s -- any other
way comes through the Monoid
instance. Perhaps there's some common pattern
here that we could abstract! (There is.) Let's express it by making the
singleton
function into a parameter of a new function, that we will call
foldMapList
(see below):
> sortedFromList' :: Ord a => [a] -> SortedList a
> sortedFromList' = foldMapList SL.singleton
In order to make this work, define the foldMapList
combinator.
> foldMapList :: Monoid m => (a -> m) -> [a] -> m
> foldMapList f = undefined
Again, we can project out the underlying list to get a list sorting function.
> sortedListSort' :: Ord a => [a] -> [a]
> sortedListSort' = SL.toList . sortedFromList'
> testSortedFromList' :: Test
> testSortedFromList' =
> let unsorted :: [Int]
> unsorted = [47,80,28,47,45,76,1,35,19,1] in
> sortedListSort' unsorted ~?= sortedListSort unsorted -- old & new agree
> -- >>> runTestTT testSortedFromList'
> -- Counts {cases = 1, tried = 1, errors = 0, failures = 0}
The type of foldMapList
is very general --- we can use this function to
combine arbitrary lists by providing a function that maps their contents to
some particular Monoid
(such as SortedList
). For instance, foldMapList Sum
gives us the sum of the numbers in the list; foldMap Product
gives us
their product.
> -- >>> foldMapList Sum [1,2,3,4]
> -- Sum {getSum = 10}
> -- >>> foldMapList Product [1,2,3,4]
> -- Product {getProduct = 24}
A small exercise: using foldMapList
and the Sum
and Product
newtypes you
learned about in SortedList
, implement the following function, which takes
a doubly-nested list of numbers as input, and returns the sum of the product of
each of the inner lists. In your solution, do not explicitly use any of the
ordinary numeric operations like (+)
, (*)
, sum
, or product
, and eschew
explicit recursion.
> sumOfProducts :: Num a => [[a]] -> a
> sumOfProducts = undefined
> testSumOfProducts :: Test
> testSumOfProducts = sumOfProducts [[1],[2,3],[4,5,6],[7,8,9,10]] ~?= (5167 :: Int)
> -- >>> runTestTT testSumOfProducts
> -- Counts {cases = 1, tried = 1, errors = 0, failures = 0}
Like Merge Sort, the sortedListSort
function is based on merging sorted
lists together. This merge-sort-like algorithm has a flaw, though: it's
quadratic in runtime. Why?
> benchmark :: IO ()
> benchmark = (print . last . sortedListSort') ([10000,9999..0] :: [Int])
(Try it yourself by setting :set +s
in ghci. On my machine, it take 26.61 secs
and allocates 17,604,539,672 bytes))
For any singleton SortedList [a]
and any other SortedList as
, computing
SortedList [a] <> SortedList as
is identical not only in resultant value,
but also in algorithmic structure to computing the result of insert a as
. The definition of foldMapList
linearly scans across its input list,
successively combining values using (<>)
-- and so, like insertion sort, the
whole whole algorithm ends up executing a quadratic number of comparisons.
A real merge sort algorithm, as you likely know, divides its input more
intelligently than the one we've written above in terms of foldMapList
. By
dividing its input roughly in half every iteration, it only has to do a
logarithmic amount of merging.
To make our merge sort do this, we need to use a different kind of foldMap
!
The Foldable Typeclass
At this point, I'd like to point something out: foldMapList
can itself be
even further generalized. We already know that lists are not the only data
structures which support folding -- we've seen folds for trees of various kinds
and for other data structures as well. As a result, it makes sense to allow
some kind of foldMap
operation for those structures also. In the standard
library, we therefore have:
foldMap :: (Monoid m, Foldable t) => (a -> m) -> t a -> m
That is to say, foldMap
is a method of yet another type class, Foldable
, of
which the type constructor []
is an instance. Implementing this interface
roughly corresponds to saying, "this data structure contains some elements, and
I know how to do a fold across them." To implement the Foldable
class for some
type, we just need to implement foldMap
.
Implement the Functor
and Foldable
instances for the Crispy
datatype.
Remember to keep in mind a guiding principle: when you are confused, don't think
about what things are supposed to mean; just follow the types and see where they
take you.
> data Crispy a = Snap a [a] a
> | Crackle [[Crispy a]]
> | Pop Integer deriving (Eq,Show)
> instance Functor Crispy where
> fmap = undefined
> instance Foldable Crispy where
> foldMap = undefined
> testCrispy :: Test
> testCrispy =
> let c1, c2, c3, c5 :: Crispy Integer
> c1 = fmap (+1) (Snap 0 [1,2,3] 4)
> c2 = Snap 700 [] 600
> c3 = Pop 1234567890
> c5 = fmap (subtract 1) (Crackle [[c1, c2], [c1, c3]]) in
> TestList [ 15 ~?= getSum (foldMap Sum c1)
> , 1 ~?= getProduct (foldMap Product c3)
> , "0123469959901234" ~?= foldMap show c5]
> -- >>> runTestTT testCrispy
> -- Counts {cases = 3, tried = 3, errors = 0, failures = 0}
> -------------------------------------------------------------
Back to Sorting
In order to express an efficient merge sort in terms of foldMap
, we need to
design a data structure that represents a sequence of elements (just like a
list), but whose Foldable
instance uses a divide-and-conquer strategy, rather
than the []
instance's linear fold pattern of recursion.
> newtype DivideList a = DivideList { getDivideList :: [a] } deriving (Eq, Show)
That means that we need DivideList
s to be Foldable
, but in a
different way. First, implement the divide
function, which splits a
DivideList
in its middle, returning the result of the split.
> divide :: DivideList a -> (DivideList a, DivideList a)
> divide = undefined
> testDivide :: Test
> testDivide = TestList [ divide (DivideList "abcd") ~?=
> (DivideList "ab", DivideList "cd"),
> divide (DivideList "abcde") ~?=
> (DivideList "ab", DivideList "cde"),
> divide (DivideList "") ~?=
> (DivideList "", DivideList "") ]
> -- >>> runTestTT testDivide
> -- Counts {cases = 3, tried = 3, errors = 0, failures = 0}
Using this function, we can define the Foldable
instance for DivideList
s.
Note that this definition is trickier than it seems. If you encounter an
infinite loop, it means that you have not covered one of a particular set of
slightly non-trivial edge cases.
> instance Foldable DivideList where
> foldMap f xs =
> case divide xs of
> (DivideList as, DivideList bs) -> undefined
> testDivideList :: Test
> testDivideList =
> let xs = DivideList [1,2,3]
> ys = DivideList [] in
> TestList [ Product (6 :: Int) ~?= foldMap Product xs
> , Sum (0 :: Int) ~?= foldMap Sum ys
> ]
> -- >>> runTestTT testDivideList
> -- Counts {cases = 2, tried = 2, errors = 0, failures = 0}
Now that we know how general the foldMap
function is, have a look at the
implementation of sortedListSort'
above -- does its input type need to only be
a list? Generalize its type signature so that it outputs a list of sorted
elements located inside an arbitrary Foldable
structure.
> -- foldSort ::
> foldSort = undefined -- implementation should use foldMap
By parameterizing over any Foldable
container, what we've done is to factor
out the folding strategy into the choice of original container! To pick a
different divide-and-conquer strategy, we need only specify a different
container type, and give it a Foldable
instance that folds along different
creases.
So, while our sortedListSort
was O(N ^ 2), we can produce a differently
structured algorithm by instead folding over a DivideList
instead:
> realMergeSort :: Ord a => [a] -> [a]
> realMergeSort = foldSort . DivideList
If you've done everything correctly, this main function should return rather quickly. This is much faster than the example above. On my machine it takes (0.07 secs, 41,595,632 bytes).
> main :: IO ()
> main = (print . last . realMergeSort) ([10000,9999..0] :: [Int])
If you find that your merge sort implementation works correctly but isn't
any faster, you may have an error in your instance of Foldable
for DivideList
. For
each use of foldMap
in this definition, make sure that you know what
instance is used in your code. You can force the type checker to use a particular
instance with a type application: foldMap @DivideList f x
will produce an error
if x
is a list instead of a DivideList
.
Concluding Thoughts About This Exercise
The important takeaway here is this: foldMap
defines once and for all a
universal "divide-and-conquer" algorithm -- all we need to do to use it is to
provide a way to "divide" an input container (i.e. give a Foldable instance
),
then give a way to compute on those elements (i.e. the mapped function a -> m
)
and a way to combine ("conquer") the results of that computation (i.e. a
Monoid
instance for the result type).
Almost any divide-and-conquer algorithm can be fit into this framework, and that
means we can avoid repeating ourselves when writing such programs. We can reuse
the division strategy of DivideList
when writing some other algorithm, and
likewise for the sorted-merging combination strategy of SortedList
.