The interplay between independence, conditional probability, and the information one variable contains about another has always fascinated me, so I am sharing some thoughts on it here.
When do you think some data is useless?
A piece of data or information is useless if it plays no role in understanding the hypothesis we are interested in.
We are interested in understanding the following problem.
We can model an event by a random variable. So, let's reframe the problem as follows.
There is something called entropy, but I will set that aside for now and give a purely probabilistic view. This is where conditional probability marches in. Using the information in \(Y\) means conditioning on \(Y\), so the question becomes how \(X \mid Y\) behaves.
If \(Y\) has any effect on \(X\), then the distribution of \(X \mid Y\) would differ from that of \(X\), right?
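But if \(Y\) has no effect on \(X\), then \(X \mid Y\) will not change and will have the same distribution as \(X\). Mathematically, it means \( P(X = x \mid Y = y) = P(X = x) \) for every \(x\) and every \(y\) in the support of \(Y\).
In other words, we cannot distinguish the distribution of \(X\) before conditioning from the one after conditioning on \(Y\).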
\(X\) and \(Y\) are independent \( \iff \) \( f(x,y) = P(X =x \mid Y = y) \) is only a function of \(x\).
\( \Rightarrow\)
\(X\) and \(Y\) are independent \( \Rightarrow \) \( f(x,y) = P(X =x \mid Y = y) = P(X = x)\) is only a function of \(x\).
\( \Leftarrow \)
Let \( \Omega \) be the support of \(Y\).
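Suppose \( P(X = x \mid Y = y) = g(x) \) for every \( y \in \Omega \). Then
\( P(X = x) = \int_{\Omega} P(X = x \mid Y = y) \cdot P(Y = y) \, dy \)
\( = g(x) \int_{\Omega} P(Y = y) \, dy = g(x) = P(X = x \mid Y = y) \), so conditioning on \(Y\) does not change the distribution of \(X\); that is, \(X\) and \(Y\) are independent. (For a discrete \(Y\), read the integral as a sum over \( \Omega \).)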
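The criterion just proved can be checked numerically for discrete variables. The following is a minimal sketch, assuming a finite joint table with rows indexed by \(x\), columns indexed by \(y\), and every \(P(Y = y) > 0\); the function names and the two example tables are hypothetical and chosen only for illustration.

```python
import numpy as np

def conditional_of_x_given_y(joint):
    """Return the table P(X = x | Y = y); rows index x, columns index y."""
    joint = np.asarray(joint, dtype=float)
    p_y = joint.sum(axis=0)   # marginal P(Y = y), one entry per column
    return joint / p_y        # broadcasting divides each column by its P(Y = y)

def is_independent(joint, tol=1e-12):
    """True when P(X = x | Y = y) is the same for every y (a function of x only)."""
    cond = conditional_of_x_given_y(joint)
    # Compare every column of the conditional table with the first column.
    return bool(np.all(np.abs(cond - cond[:, [0]]) < tol))

# Hypothetical joint tables (assumptions made for this example).
independent_joint = np.outer([0.3, 0.7], [0.5, 0.25, 0.25])  # P(x) * P(y)
dependent_joint = np.array([[0.4, 0.1],
                            [0.1, 0.4]])

print(is_independent(independent_joint))
print(is_independent(dependent_joint))
```

In the independent table, every column of the conditional table coincides with the marginal of \(X\), which is the \(g(x)\) of the proof.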
The information contained in \(X\) is measured by its entropy \(H(X)\), defined by \( H(X) = E(-\log P(X)) \).
Now define the information of \(Y\) contained in \(X\) as \( \left| H(X) - H(X \mid Y) \right| \).
Thus, it turns out that \( H(X) - H(X \mid Y) = E_{(X,Y)} \left( \log \frac{P(X \mid Y)}{P(X)} \right) = H(Y) - H(Y \mid X) = D(X,Y) \).
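The chain of equalities above can be unpacked in a few lines. This sketch assumes the usual definition of conditional entropy, \( H(X \mid Y) = E_{(X,Y)}(-\log P(X \mid Y)) \), which the post does not state explicitly, and writes \(P(X,Y)\) for the joint probability.

```latex
\begin{align*}
H(X) - H(X \mid Y)
  &= E_{(X,Y)}\bigl(-\log P(X)\bigr) - E_{(X,Y)}\bigl(-\log P(X \mid Y)\bigr) \\
  &= E_{(X,Y)}\left(\log \frac{P(X \mid Y)}{P(X)}\right) \\
  &= E_{(X,Y)}\left(\log \frac{P(X, Y)}{P(X)\,P(Y)}\right) \\
  &= E_{(X,Y)}\left(\log \frac{P(Y \mid X)}{P(Y)}\right) \\
  &= H(Y) - H(Y \mid X).
\end{align*}
```

The third line is symmetric in \(X\) and \(Y\), which is why the two differences agree.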
\(D(X,Y)\) = Amount of information contained in \(X\) and \(Y\) about each other.
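To see \(D(X,Y)\) on concrete numbers, here is a minimal sketch for discrete variables. It assumes natural logarithms, a finite joint table with rows indexed by \(x\) and columns by \(y\), and the chain rule \( H(X \mid Y) = H(X,Y) - H(Y) \), which is a standard fact and is not derived in this post. The function names and example tables are hypothetical.

```python
import numpy as np

def entropy(p):
    """Entropy E(-log P) of a probability table, with 0 * log 0 taken as 0."""
    p = np.asarray(p, dtype=float).ravel()
    p = p[p > 0]
    return float(-np.sum(p * np.log(p)))

def shared_information(joint):
    """Return D(X,Y) computed three ways: H(X)-H(X|Y), H(Y)-H(Y|X), E log ratio."""
    joint = np.asarray(joint, dtype=float)
    p_x = joint.sum(axis=1)   # marginal of X (row sums)
    p_y = joint.sum(axis=0)   # marginal of Y (column sums)
    h_joint = entropy(joint)
    # Assumed chain rule: H(X|Y) = H(X,Y) - H(Y), and likewise for H(Y|X).
    h_x_given_y = h_joint - entropy(p_y)
    h_y_given_x = h_joint - entropy(p_x)
    from_x = entropy(p_x) - h_x_given_y
    from_y = entropy(p_y) - h_y_given_x
    # Direct expectation of log(P(X|Y) / P(X)) = log(P(X,Y) / (P(X) P(Y))).
    mask = joint > 0
    ratio = joint[mask] / np.outer(p_x, p_y)[mask]
    direct = float(np.sum(joint[mask] * np.log(ratio)))
    return from_x, from_y, direct

# Hypothetical joint tables (assumptions made for this example).
independent_joint = np.outer([0.3, 0.7], [0.5, 0.5])
dependent_joint = np.array([[0.4, 0.1],
                            [0.1, 0.4]])

print(shared_information(independent_joint))
print(shared_information(dependent_joint))
```

The three returned numbers should agree up to floating-point error. They should be essentially zero for the independent table and strictly positive for the dependent one.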
Note: This is just a mental construction of mine, and I am not sure whether this measure of information already exists in the literature. Still, I believe it is a natural construction, given that the properties are satisfied, and I hope I have been able to share some statistical wisdom with you. If you get hold of existing literature on it, it would be helpful if you shared it with me in the comments.
