Skip to content
All paths
Advanced5 equations~10 hours

AI and Information Theory

Entropy, optimization, and the math behind intelligence.

0 of 5 equations completed0%
  1. 1

    Shannon Entropy

    Intermediate
    H=ipilog2piH = -\sum_{i} p_i \log_2 p_i

    Measures the average information (surprise) in a random variable—how many bits needed to encode it.

    Formal Sciences
    Information Theory
  2. 2
    θt+1=θtηθL(θt)\theta_{t+1} = \theta_t - \eta \nabla_\theta L(\theta_t)

    Update parameters by stepping opposite to the gradient of the loss—learning by hill descent.

    Information Sciences
    Machine Learning
  3. 3

    Cross-Entropy Loss

    Intermediate
    L=iyilog(y^i)L = -\sum_{i} y_i \log(\hat{y}_i)

    Measures difference between true labels and predicted probabilities in classification.

    Information Sciences
    Machine Learning
    Open equation
  4. 4
    Attention(Q,K,V)=softmax(QKTdk)V\text{Attention}(Q,K,V) = \text{softmax}\left(\frac{QK^T}{\sqrt{d_k}}\right)V

    Each token attends to all others—weighted by query-key similarity, scaled by dimension.

    Information Sciences
    Deep Learning
  5. 5
    PR(pi)=1dN+dpjM(pi)PR(pj)L(pj)PR(p_i) = \frac{1-d}{N} + d \sum_{p_j \in M(p_i)} \frac{PR(p_j)}{L(p_j)}

    Ranks web pages by importance based on the quality and quantity of links pointing to them.

    Formal Sciences
    Computer Science
    Open equation