Solving the Cubic Equation for Dummies

Everybody learns the “quadratic formula” for solving equations of the form $Ax^2+Bx+C=0$, even though you don’t really need such a formula, because you can solve for $x$ through the technique of “completing the square”. What you need a formula for is the solution to the cubic equation: $Ax^3+Bx^2+Cx+D=0$. There is no obvious way that “completing the cube” makes the solution into a matter of just taking cube roots in the same way that “completing the square” solves the quadratic in terms of square roots. So how could we derive such a cubic formula? Let’s see if I can convince you that you could derive it, if you thought about it hard enough, in the right way.

We’re given $Ax^3+Bx^2+Cx+D=0$. Let’s assume that $A=1$, so we’re interested in solving an equation of the form $x^3+Bx^2+Cx+D=0$. (A moment’s thought will show you that if you can solve that case, you can solve the general case: just divide the whole equation by $A$, which is nonzero for a genuine cubic.) So by finding a “cubic formula”, we mean finding a function $f(B,C,D)$ that gives the solution. However, in general, a cubic equation has three solutions. So such a function must be multi-valued, with three possible values.

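In the quadratic case, writing the equation with lowercase coefficients as $ax^2+bx+c=0$, the solution is $x=\frac{-b\pm\sqrt{b^2-4ac}}{2a}$. We can rewrite this in the following form:
$x=-\frac{b}{2a}+r\,\frac{\sqrt{b^2-4ac}}{2a}$, where $r$ is either plus or minus 1, that is, one of the two solutions of $r^2=1$.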

Generalizing from the quadratic case, let’s assume that the multi-valuedness comes from the three branches of the cube root function. In other words, let’s look for a function of the form:

$f(B,C,D,r)$

such that you get a solution whenever the parameter $r$ is one of the cube roots of 1. Let’s fix $B,C,D$ and consider $f$ as a function of $r$. The simplest possible non-constant functions are polynomials. So let’s assume that $f$ has the form $f(r)=\alpha+\beta r+\gamma r^2$ (where $\alpha,\beta,\gamma$ are implicitly dependent on the coefficients $B,C,D$ of the original cubic equation). Since we ultimately want $r$ to be a cube root of 1, we don’t need to consider terms of the form $r^3$ or higher-order, because those terms are reducible to lower-order terms if $r^3=1$.

A few facts about the cube roots of 1. There are three cube roots, and they can be written in the form $e^{2n\pi i/3}$. Concretely,

$r_1=-\frac{1}{2}+\frac{\sqrt{3}}{2}i$

$r_2=-\frac{1}{2}-\frac{\sqrt{3}}{2}i$

$r_3=1$, corresponding to $n=1,2,3$.

We can see that $r_2$ is just the square of $r_1$, so we can just write the three solutions as:

$r_1=q$

$r_2=q^2$

$r_3=1$

where $q$ is $e^{2\pi i/3}$. Another interesting fact about the solutions is that

$r_1+r_2+r_3=0$

a fact that can be proved by substitution, or geometrically, by representing the cube roots of 1 as vectors in 2-D complex space.

[Figure: the cube roots of 1 drawn as vectors in the complex plane]

So we know that $1+q+q^2=0$, where $q$ is as defined.
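These facts about the cube roots of 1 are easy to check numerically. The following is only a quick sanity check in Python, not part of the derivation; the helper `close` is a hypothetical name introduced here to compare complex numbers up to rounding error.

```python
import cmath

# q is the cube root of 1 used in the text: q = e^{2*pi*i/3}
q = cmath.exp(2j * cmath.pi / 3)

# The three cube roots of 1, for n = 1, 2, 3
roots = [cmath.exp(2j * cmath.pi * n / 3) for n in (1, 2, 3)]


def close(z, w, tol=1e-12):
    """Hypothetical helper: True if z and w agree up to rounding error."""
    return abs(z - w) < tol


cube_check = all(close(r**3, 1) for r in roots)   # each root cubes to 1
square_check = close(roots[1], roots[0] ** 2)     # r2 is the square of r1
sum_check = close(1 + q + q**2, 0)                # 1 + q + q^2 = 0

print(cube_check, square_check, sum_check)
```

All three checks should come out true, up to the floating-point tolerance assumed in `close`.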

So in terms of our function $f(r)$, we want the three solutions to our cubic equation to be $f(q)$, $f(q^2)$, $f(1)$. Using our polynomial for $f$, this means that we want the three solutions to be:

$x_1=\alpha+\beta q+\gamma q^2$

$x_2=\alpha+\beta q^2+\gamma q^4=\alpha+\beta q^2+\gamma q$

$x_3=\alpha+\beta+\gamma$

Let’s work out a few symmetric combinations of these solutions that will be needed:

$x_1+x_2+x_3=3\alpha$ (All other terms vanish when we use the fact that $1+q+q^2=0$.)
$x_1x_2+x_1x_3+x_2x_3=3(\alpha^2-\beta\gamma)$ (Again, we used $1+q+q^2=0$ to get this.)
$x_1x_2x_3=\alpha^3+\beta^3+\gamma^3-3\alpha\beta\gamma$
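If you would rather not multiply these out by hand, the three identities can be spot-checked numerically. This is a minimal sketch, assuming arbitrary complex test values for α, β, γ; the function names `roots_from` and `symmetric_sums` are made up for this illustration.

```python
import cmath

q = cmath.exp(2j * cmath.pi / 3)  # cube root of 1 from the text


def roots_from(alpha, beta, gamma):
    """The three candidate solutions f(q), f(q^2), f(1)."""
    x1 = alpha + beta * q + gamma * q**2
    x2 = alpha + beta * q**2 + gamma * q
    x3 = alpha + beta + gamma
    return x1, x2, x3


def symmetric_sums(x1, x2, x3):
    """Sum, sum of pairwise products, and product of three numbers."""
    return (x1 + x2 + x3,
            x1 * x2 + x1 * x3 + x2 * x3,
            x1 * x2 * x3)


# Arbitrary test values (an assumption of this sketch, not from the text)
alpha, beta, gamma = 0.7 - 0.2j, 1.3 + 0.5j, -0.4 + 1.1j

computed = symmetric_sums(*roots_from(alpha, beta, gamma))
expected = (3 * alpha,
            3 * (alpha**2 - beta * gamma),
            alpha**3 + beta**3 + gamma**3 - 3 * alpha * beta * gamma)

# Largest discrepancy between the two sides of the three identities
max_error = max(abs(c - e) for c, e in zip(computed, expected))
print(max_error)
```

A discrepancy at the level of floating-point rounding is what the identities predict; a numerical check like this is of course evidence, not a proof.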

The significance of these combinations is that the zeros of a polynomial can be used to factor the polynomial: if $x_1,x_2,x_3$ are the three solutions of $x^3+Bx^2+Cx+D=0$, then we can write:

$x^3+Bx^2+Cx+D=(x-x_1)(x-x_2)(x-x_3)=x^3-(x_1+x_2+x_3)x^2+(x_1x_2+x_1x_3+x_2x_3)x-(x_1x_2x_3)$

Comparing coefficients tells us that

$-(x_1+x_2+x_3)=B$
$(x_1x_2+x_1x_3+x_2x_3)=C$
$-(x_1x_2x_3)=D$
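As a concrete instance of this coefficient comparison, take the roots 1, 2, 3. The sketch below builds B, C, D from the roots and checks the result against the factored form; the function names are hypothetical and chosen only for this example.

```python
def expand_monic_cubic(x1, x2, x3):
    """Coefficients (B, C, D) with (x-x1)(x-x2)(x-x3) = x^3 + B x^2 + C x + D."""
    B = -(x1 + x2 + x3)
    C = x1 * x2 + x1 * x3 + x2 * x3
    D = -(x1 * x2 * x3)
    return B, C, D


def evaluate(B, C, D, x):
    """Value of x^3 + B x^2 + C x + D at the point x."""
    return x**3 + B * x**2 + C * x + D


coeffs = expand_monic_cubic(1, 2, 3)
print(coeffs)  # (-6, 11, -6)

# The polynomial vanishes at each root ...
residuals = [evaluate(*coeffs, x) for x in (1, 2, 3)]

# ... and agrees with the factored form at a point that is not a root.
factored_at_5 = (5 - 1) * (5 - 2) * (5 - 3)
expanded_at_5 = evaluate(*coeffs, 5)
```

So the roots 1, 2, 3 correspond to the equation $x^3-6x^2+11x-6=0$.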

So using our previous results for those combinations of solutions, we find:

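$3\alpha=-B$. So $\alpha=-\frac{B}{3}$
$3(\alpha^2-\beta\gamma)=C$. So $\gamma=\frac{\frac{B^2}{9}-\frac{C}{3}}{\beta}$
$\alpha^3+\beta^3+\gamma^3-3\alpha\beta\gamma=-D$.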

In the last equation, we can substitute for α and γ to get:

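$-\frac{B^3}{27}+D+B\left(\frac{B^2}{9}-\frac{C}{3}\right)+\beta^3+\frac{\left(\frac{B^2}{9}-\frac{C}{3}\right)^3}{\beta^3}=0$

We can multiply by $\beta^3$ to get:

$\beta^6+\left(-\frac{B^3}{27}+D+B\left(\frac{B^2}{9}-\frac{C}{3}\right)\right)\beta^3+\left(\frac{B^2}{9}-\frac{C}{3}\right)^3=0$

Oh no! After all that work to find the solution to a 3rd degree equation, we end up with a 6th degree equation for the coefficient $\beta$. Is that progress? Well, yes it is. Because it’s only a quadratic equation in $\beta^3$:

$(\beta^3)^2+\left(-\frac{B^3}{27}+D+B\left(\frac{B^2}{9}-\frac{C}{3}\right)\right)\beta^3+\left(\frac{B^2}{9}-\frac{C}{3}\right)^3=0$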
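To make the quadratic structure explicit, it helps to abbreviate the two recurring expressions. The names $p$ and $s$ below are shorthand introduced only for this worked step; they do not appear elsewhere in the post. With them, the ordinary quadratic formula gives $\beta^3$ directly:

```latex
\begin{align*}
p &= \frac{B^2}{9} - \frac{C}{3}, \qquad
s = -\frac{B^3}{27} + D + B\,p, \\
(\beta^3)^2 + s\,\beta^3 + p^3 &= 0, \\
\beta^3 &= \frac{-s \pm \sqrt{s^2 - 4p^3}}{2}.
\end{align*}
```

The product of the two roots of this quadratic is $p^3$, and $\beta\gamma=p$, so as long as $\beta\neq 0$ the root you did not pick is $\gamma^3$. Choosing the other sign therefore just swaps the roles of $\beta$ and $\gamma$.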

So we can solve this equation for $\beta^3$, take the cube root of the result to get $\beta$ (any cube root will do), and then use the previously derived relationship between $\beta$ and $\gamma$ to find $\gamma$. We already solved for $\alpha$. So then we know all three coefficients in our function.

Then in terms of these values for $\alpha,\beta,\gamma$, we can find our three solutions to the original cubic equation:
$x_1=\alpha+\beta q+\gamma q^2$
$x_2=\alpha+\beta q^2+\gamma q$
$x_3=\alpha+\beta+\gamma$
(where, remember, $q$ is one of the complex cube roots of 1)
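The whole recipe fits in a few lines of code. What follows is a minimal sketch in Python, assuming floating-point arithmetic and the monic form $x^3+Bx^2+Cx+D=0$. The function name `solve_cubic` and the intermediate names `p`, `s`, `u` are my own labels. The handling of the degenerate case $\beta=0$, where the relation $\gamma=p/\beta$ breaks down, is an addition the derivation above does not discuss.

```python
import cmath


def solve_cubic(B, C, D):
    """Sketch: roots of x^3 + B x^2 + C x + D = 0 via x = alpha + beta*r + gamma*r^2."""
    q = cmath.exp(2j * cmath.pi / 3)   # complex cube root of 1
    alpha = -B / 3
    p = B**2 / 9 - C / 3               # p equals beta * gamma
    s = -B**3 / 27 + D + B * p         # linear coefficient of the quadratic in beta^3

    # Solve u^2 + s*u + p^3 = 0 for u = beta^3 with the quadratic formula.
    disc = cmath.sqrt(s**2 - 4 * p**3)
    # Either root works in principle; take the larger one so beta is not
    # needlessly zero (an assumption of this sketch, for numerical safety).
    u = max((-s + disc) / 2, (-s - disc) / 2, key=abs)

    if u == 0:
        # Both roots of the quadratic are zero: triple root at alpha.
        beta = gamma = 0
    else:
        beta = u ** (1 / 3)            # principal cube root; any cube root will do
        gamma = p / beta

    x1 = alpha + beta * q + gamma * q**2
    x2 = alpha + beta * q**2 + gamma * q
    x3 = alpha + beta + gamma
    return [x1, x2, x3]


# Usage: x^3 - 6x^2 + 11x - 6 = (x - 1)(x - 2)(x - 3)
for x in solve_cubic(-6, 11, -6):
    print(x)
```

For this example the printed values should be the roots 1, 2, 3 in some order, as complex numbers with tiny rounding errors in the real and imaginary parts. Note that the intermediate quantities are genuinely complex here even though all three roots are real.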

I’m not going to go to the trouble of writing out the explicit solutions in terms of the coefficients of the original cubic equation. The point here is to convince you that you could compute them yourself, if forced to at gunpoint.

Obviously, this general strategy for solving a polynomial equation generalizes to any degree. However, the last stroke of luck, that the 6th degree polynomial equation for one of the coefficients can be rewritten as a quadratic equation, was unexpected. There is no reason to think that such luck will hold out for higher-degree polynomials. As a matter of fact, it doesn’t: the Abel-Ruffini theorem says that a general equation of 5th degree or higher cannot be solved in radicals, and the French mathematician Évariste Galois, who tragically died in a duel, developed the theory that explains why techniques along the lines of those used here can’t work there.