Wrote blog post on the knapsack problem.

6 years ago · 580983076d
--- a/blogContent/posts/programming/knapsack-problem.md
+++ b/blogContent/posts/programming/knapsack-problem.md
@ -0,0 +1,144 @@
 In this post I will go over how you can frame a question in terms of recursion and then slowly massage the question into a dynamic programming question.

 # Problem Description

 The knapsack problem is a famous optimization problem in computer science.
 Given a set of items, a thief has to determine which items he should steal to maximize his total profits.
 The thief knows the maximum capacity of his knapsack and the weights and values of all the items.

 The fractional knapsack problem is where a thief can steal fractions of the items instead of being mandated to take the whole item.
 This problem can be solved really easily using a greedy algorithm where the thief just steals the most valuable dense objects first.
 The 0-1 knapsack problem is where the thief can't steal fractions of items.
 By nature this object is more difficult since it requires more computations.
 A naive brute force algorithm would take exponential time, but, dynamic programming can slash the complexity.

 # Recursive Formula

 The initial recursive way of defining the problem will look something like this:

 $$
 k([], W) = 0\\
 k((v,w)::items, W) =
  \begin{cases}
   k(items, W) & \text{if } w > W\\
   max(k(items, W), k(items, W- w) + v) & otherwise\\
  \end{cases}
 $$

 In this example the knapsack problem takes in two parameters, a list of items and a max capacity of the knapsack.
 We have three cases in this formulation. If the knapsack is empty, the max value we can steal is 0.
 If the weight of the item at the start of the list is greater than the capacity of the knapsack, we can't steal it.
 The final case is were we decide whether it would be better to take the item or not.

 It is often nice to work with lists when theoretically formulating a recursive definition.
 However, it is better to have arrays when actually coding the problem.

 $$
 k(a, W) =
  \begin{cases}
   0 & \text{if } |a| = 0\\
   k(a[1:], W) & \text{if } a[0].w > W\\
   max(k(a[1:], W), k(a[1:], W- a[0].w) + a[0].v) & otherwise\\
  \end{cases}
 $$

 In this formulation we converted the list into a array. The array is filled with objects which has a "v" value and "w" weight property.
 The [1:] syntax represents array slicing where you take everything after the first element.
 Slicing is nice theoretically, but, in practice they are slow.
 To stop using slicing we will have to introduce a slicing index to our formulation.
 This will make all of our array operations constant time and prevent our memory from growing exponentially under recursive calls.

 $$
 k'(a, W) =
  \begin{cases}
   0 & \text{if } i = |a|\\
   k'(a, W, i+1) & \text{if } a[i].w > W\\
   max(k'(a, W, i+1), k'(a, W- a[i].w, i+1) + a[i].v) & otherwise\\
  \end{cases}
 $$

 # Dynamic Programming Formulation

 Now that we have a recursive definition which has overlapping sub problems, we can convert it to imperative pseudo code which uses dynamic programming.
 Think of the two-dimensional array as a way to store the results of the recursive calls bottom up.
 This will prevent us from having an exponential number of computations.

 ````Python
 def knapsack(a, W):
    n = |a|
    k = array(0...W, 0... n)

    for w = 0 to W:  # base case i = n
        k[w, n] = 0

    for i = n-1 down to 0:
        for w = 0 to W:
            if a[i].w > W: # unable to take current item
                k[w, i] = k[w, i+1]
            else: #decides whether to include item
                k[w, i] = max(k[w,i+1], k[w-a[i].w, i+1] + a[i].v)
    return k[W, 0]
 ````

 We can now easily implement the pseudo code in python.
 Instead of just returning the maximum value which our thief can steal, we can return a list of objects which the thief actually stole.

 ````Python
 def knapsack(V, W, capacity):
    """
    Dynamic programming implementation of the knapsack problem

    :param V: List of the values
    :param W: List of weights
    :param capacity: max capacity of knapsack
    :return: List of tuples of objects stolen in form (w, v)
    """
    choices = [[[] for i in range(capacity + 1)] for j in range(len(V) + 1)]
    cost = [[0 for i in range(capacity + 1)] for j in range(len(V) + 1)]

    for i in range(0, len(V)):
        for j in range(0, capacity + 1):
            if W[i] > j:  # don't include another item
                cost[i][j] = cost[i -1][j]
                choices[i][j] = choices[i - 1][j]
            else:  # Adding another item
                cost[i][j] = max(cost[i-1][j], cost[i-1][j - W[i]] + V[i])
                if cost[i][j] != cost[i-1][j]:
                    choices[i][j] = choices[i - 1][j - W[i]] + [(W[i], V[i])]
                else:
                    choices[i][j] = choices[i - 1][j]
    return choices[len(V) -1][capacity]


 def printSolution(S):
    """
    Takes the output of the knapsack function and prints it in a
    pretty format.

    :param S: list of tuples representing items stolen
    :return: None
    """
    print("Thief Took:")
    for i in S:
        print("Weight: " + str(i[0]) + "\tValue: \t" + str(i[1]))

    print()
    print("Total Value Stolen: " + str(sum(int(v[0]) for v in S)))
    print("Total Weight in knapsack: " + str(sum(int(v[1]) for v in S)))


 values =  [1,1,1,1,1,1]
 weights = [1,2,3,4,5,1]
 printSolution(knapsack(values, weights, 5))
 ````

 # Time Complexity

 Since we simply calculate each value in our two dimensional array, the complexity is 0(n*W).
 The W is the max capacity of the knapsack and the n is the number of objects you have.

 The interesting caveat with this problem is that it is NP-Complete.
 Notice that we have a solution which runs in "polynomial" time.
 NP hard problems currently don't have any known polynomial solutions for them.
 In our case here, we simply have a pseudo polynomial solution -- size required to solve solution grows exponentially with respect to input.
 Since it has this pseudo polynomial solution, the knapsack is [weak NP-complete](https://en.wikipedia.org/wiki/Weak_NP-completeness).