Convex Functions and Jensen's Inequality

by always_correct, Nov 20, 2016, 3:21 AM

What does it mean for a function to be convex? Concave? Intuitively we see that all convex functions on an interval $I$ look like a smooth valley, and all concave functions look like smooth hills. We want a rigorous definition of convex and concave functions, we first tackle this by realizing a concave function is just the negative of some convex function.

We can notice by drawing a convex function $f(x)$ on an interval $I$ , that for any line $L(x)$ through any two points on the function, $L(x)$ is above the graph, more precisely, $L(x) \geq f(x)$ for $x \in I$ .

For a non-convex function, we see that this line may be below the graph.

Let's define a convex function $f(x)$ this way. First, we need a concise way to say all points on this line segment are above or on the graph. Let $(a,f(a))$ and $(b,f(b))$ be the endpoints of a line in the interval where the function is convex, let this interval be $I$ . The midpoint of these two points is the average of the $x$ and $y$ coordinates. This is in the line segment, as the average of two numbers is between those numbers. Generally, we can say any for any $\mu \in [0,1]$ ,(Letting $\mu = \frac{1}{2}$ gives us the midpoint case.)
$\mu a + [1-\mu]b \in [a,b]$ To prove this we can write $\mu a + [1-\mu]b = b+\mu(a-b)$ and note this is the equation of a line $L(\mu)$ , and use the Intermediate Value Theorem. We can equally say that for any $\mu \in [0,1]$ ,
$\mu f(a) + [1-\mu]f(b) \in [f(a),f(b)]$ Note that if $L(x)$ is the equation of the line segment from $(a,f(a))$ to $(b,f(b))$ , by linearity*:
$L(\mu a + [1-\mu]b)=\mu L(a) + [1-\mu]L(b) =\mu f(a) + [1-\mu]f(b)$ The last equality comes from the fact, looking at the picture, that the line is equal to the function at $x=a$ and $x=b$ , in other words, $L(a)=f(a)$ and $L(b)=f(b)$ . Thus we come to the final conclusion that all points on the line segment connecting any two points of $f(x)$ can be written as:
$(h,k)=(\mu a + [1-\mu]b, \mu f(a) + [1-\mu]f(b))$ By our definition, this point has to be above or on $f(x)$ for $f(x)$ to be convex, where $x=h$ . Thus, if a function is convex in the interval $I$ , for all $a,b \in I$ and $\mu \in [0,1]$ ,
$\mu f(a) + [1-\mu]f(b) \geq f(\mu a + [1-\mu]b)$ *Actually, the function $L(x)$ is affine not linear, meaning $L(x)=ax+b$ and NOT $L(x)=ax$ . However we can still note that if $m + n = 1$ , then:
$\begin{align*} L(mx+ny) &= a(mx+ny)+b \\ &= amx+any+b \\ &= amx + mb + any + nb \\ &= m(ax+b) + n(ay+b) \\ &= mL(x)+nL(y) \end{align*}$ So "linearity" sometimes applies if the coefficients $m,n$ add up to one... make sure you understand this!

Great! We just rigorously defined convex functions, now I need to make sure you aren't lying when you say you understand.

Try proving the inequality in the definition of convexity fails for $\mu \in \mathbb{R}\setminus [0,1]$ .

Moving onward, we can now discuss Jensen's Inequality. Here is the statement of Jensen's Inequality.

Let $f(x)$ be a real convex function defined in the interval $I$ , and let $x_1,x_2,\ldots,x_n \in I$ and $\mu_1,\mu_2,\ldots,\mu_n$ be positive real numbers such that $\mu_1 + \mu_2 + \ldots + \mu_n = 1$ . Then
$\mu_1 f(x_1)+\mu_2 f(x_2)+\ldots+\mu_n f(x_n) \geq f(\mu_1 x_1 + \mu_2 x_2 + \ldots + \mu_n x_n)$
See if you can prove this. Here I will discuss an outline to think about.

Now that we understand Jensen's inequality(hopefully), we can prove powerful inequalities, and find maximums where no other methods provide help. For example look at this problem:

In $\triangle ABC$ , find the maximum of $\sin A + \sin B + \sin C$ .

Solution

Thank you for reading, and comment if you have any questions or feedback.

This post has been edited 7 times. Last edited by always_correct, Nov 20, 2016, 10:35 PM

inequalities

convex

concave

Jensen

Comment

U VIEW ATTACHMENTS T PREVIEW J CLOSE PREVIEW rREFRESH

2 Comments

The post below has been deleted. Click to close.

This post has been deleted. Click here to see post.

This is really good. Thanks.

by shiningsunnyday, Nov 20, 2016, 7:00 AM

Report

The post below has been deleted. Click to close.

This post has been deleted. Click here to see post.

Excelente sir!

by Entrenamiento, May 24, 2021, 10:55 PM

Report

Submit

there are over 6000! anyway, I'm resting with the whole Euler line problem right now
by always_correct, Aug 6, 2017, 8:16 PM
Have you seen something about Kimberling centers (triangle centers) and center lines?
by alevini98, Aug 5, 2017, 11:19 PM
no its not dead
by always_correct, Jul 12, 2017, 1:19 AM
Wow this is great, is this dead?
by Ankoganit, Jul 3, 2017, 8:41 AM
oh its a math blog.....
by Swimmer2222, Apr 13, 2017, 12:58 PM
Why is everyone shouting
Oh wait...
by ArsenalFC, Mar 26, 2017, 8:45 PM
Shout! ^^
by DigitalDagger, Mar 4, 2017, 3:49 AM
14th shout!
by arkobanerjee, Feb 8, 2017, 2:48 AM
14th shout! be more active
by Mathisfun04, Jan 25, 2017, 9:13 PM
13th shout
by Ryon123, Jan 3, 2017, 3:47 PM
Dang, so OP. Keep it up!
by monkey8, Dec 28, 2016, 6:23 AM
How much time do you spend writing these posts???
by Designerd, Dec 5, 2016, 5:02 AM
good blog!
by AlgebraFC, Dec 5, 2016, 2:09 AM
Nice finally material for calculus people to read thanks
by Math1331Math, Nov 27, 2016, 2:50 AM
Whoa how much time do you spend typing these posts?

They're nice posts!
by MathLearner01, Nov 24, 2016, 3:30 AM
i am remove it please
by always_correct, Nov 22, 2016, 2:18 AM
5th shout!
by algebra_star1234, Nov 20, 2016, 2:41 PM
fourth shout >
by budu, Nov 20, 2016, 4:59 AM
3rd shout!
by monkey8, Nov 20, 2016, 3:24 AM
2nd shout!
by RYang, Nov 20, 2016, 3:01 AM
first shout >
by doitsudoitsu, Oct 9, 2016, 7:54 PM

21 shouts

Mathematical

Convex Functions and Jensen's Inequality

by always_correct, Nov 20, 2016, 3:21 AM

2 Comments