Skip to content

Entropy, Coarse-Graining, and the Nature of Time

An expanded treatment of what entropy means when microstate information is ignored, compressed, hidden in correlations, or rendered inaccessible by horizons—with Shannon and von Neumann entropy included.

.

Entropy, Coarse-Graining, and the Nature of Time
Deep analysis

Entropy, Coarse-Graining, and the Nature of Time

An expanded treatment of what entropy means when microstate information is ignored, compressed, hidden in correlations, or rendered inaccessible by horizons—with Shannon and von Neumann entropy included.

Entropy here means the information missing from a restricted description. It is often not a new substance added to the world. It is the logarithmic measure of how many fine-grained physical possibilities are compatible with the description available to some observer, subsystem, measuring apparatus, or effective theory.

That is the bridge between thermodynamics, Shannon information, von Neumann entropy, decoherence, horizons, and the idea that time becomes meaningful through interaction, information exchange, and local entropy production. A sharper formulation is: the arrow of time becomes meaningful when physical processes create stable, coarse-grained records while hiding microscopic reversibility in inaccessible correlations.

1. Microstate vs macrostate: the basic move

A microstate is the most detailed state description available in a theory. For a classical gas, a microstate would specify every particle’s position and momentum:

\[ x = (q_1,p_1,q_2,p_2,\ldots,q_N,p_N). \]

A macrostate is a compressed description: pressure, volume, temperature, density profile, magnetization, charge distribution, pointer position, detector click, neural record, and so on.

So there is a many-to-one map:

\[ \text{microstate } x \longrightarrow \text{macrostate } M. \]

Many distinct microscopic configurations produce the same macro-description. Boltzmann entropy measures the size of that compatible set:

\[ S_B(M)=k_B\ln |\Gamma_M|, \]

where \(\Gamma_M\) is the region of phase space corresponding to macrostate \(M\).

At this level, entropy is the logarithm of microscopic multiplicity hidden behind a macroscopic description. A low-entropy macrostate is special because only a small number of microstates realize it. A high-entropy macrostate is generic because an enormous number of microstates realize it.

2. Shannon entropy: uncertainty over classical alternatives

Shannon entropy is the cleanest mathematical expression of missing information. For a discrete random variable \(X\) with probabilities \(p_i\),

\[ H(X)=-\sum_i p_i\log_2 p_i. \]

Its units are bits when the logarithm is base 2.

Interpretation:

\[ H(X)=\text{average number of bits needed to specify the actual outcome}. \]

For a certain coin outcome,

\[ P(H)=1,\quad P(T)=0, \]

we get \(H=0\). There is no uncertainty. But for a fair coin,

\[ P(H)=P(T)=1/2, \]

we get \(H=1\) bit. One bit is needed to specify which outcome occurred.

For \(N\) equally likely alternatives,

\[ H=\log_2 N. \]

So if a macrostate is compatible with \(N\) equally likely microstates, the missing information is \(\log_2 N\) bits.

Thermodynamic entropy is Shannon entropy with physical units:

\[ S = k_B \ln 2 \, H. \]

Equivalently, using natural logarithms,

\[ S=-k_B\sum_i p_i\ln p_i. \]

That is the Gibbs entropy.

3. Boltzmann, Gibbs, Shannon: three versions of one intuition

Boltzmann entropy

\[ S_B(M)=k_B\ln W. \]

Here \(W\) is the number, or phase-space volume, of microstates compatible with a macrostate. This is entropy of a macrostate.

It asks: given that the system looks macroscopically like \(M\), how many microscopic ways could that be true?

Gibbs entropy

\[ S_G[p]=-k_B\sum_i p_i\ln p_i. \]

This is entropy of a probability distribution over microstates. It asks: given my probability distribution over microstates, how much microscopic uncertainty remains?

Shannon entropy

\[ H[p]=-\sum_i p_i\log_2 p_i. \]

This is the dimensionless information-theoretic version. The relation is:

\[ S_G = k_B\ln 2 \, H. \]

If all \(W\) compatible microstates are equally likely,

\[ p_i=\frac{1}{W}, \]

then Gibbs/Shannon reduces to Boltzmann:

\[ S_G=-k_B\sum_{i=1}^{W}\frac{1}{W}\ln\frac{1}{W} = k_B\ln W = S_B. \]
Entropy = logarithm of unresolved physical alternatives.

4. Conditional entropy: entropy after coarse-graining

Let

\[ X=\text{microstate}, \]

and

\[ M=f(X)=\text{macrostate}. \]

If you know \(M\) but not \(X\), the remaining ignorance is the conditional entropy:

\[ H(X|M). \]

For a particular macrostate \(M=m\), \(H(X|M=m)\) is the number of missing bits needed to specify the exact microstate after the macrostate is already known.

If all microstates compatible with \(m\) are equally likely and there are \(W_m\) of them,

\[ H(X|M=m)=\log_2 W_m, \]

so

\[ S_B(m)=k_B\ln W_m = k_B\ln 2 \, H(X|M=m). \]

This is the most precise version of the sentence: entropy appears when we coarse-grain because coarse-graining replaces \(X\), the exact microstate, with \(M=f(X)\), a many-to-one description. Entropy measures the information lost in that compression.

5. Correlations: entropy is often hidden in relationships

Entropy does not simply mean “lack of order.” Often it means lost access to correlations.

Shannon entropy obeys:

\[ H(X,Y)\leq H(X)+H(Y), \]

with equality only when \(X\) and \(Y\) are independent. Define mutual information:

\[ I(X:Y)=H(X)+H(Y)-H(X,Y). \]

This measures how much information is stored in the correlation between \(X\) and \(Y\).

Example: two perfectly correlated coins.

\[ P(HH)=1/2,\quad P(TT)=1/2. \]

Each coin alone has entropy:

\[ H(X)=1,\quad H(Y)=1. \]

But the pair has only:

\[ H(X,Y)=1, \]

not 2. Once you know one coin, you know the other. The mutual information is:

\[ I(X:Y)=1+1-1=1. \]

If you ignore the correlation, you falsely say the pair has 2 bits of uncertainty. If you keep the correlation, it has only 1 bit.

A closed system may evolve reversibly, preserving all microscopic information, while entropy appears to increase because information becomes stored in inaccessible correlations among many degrees of freedom.

6. Fine-grained entropy may be conserved while coarse-grained entropy increases

This is the subtle core of the arrow of time.

In exact classical Hamiltonian mechanics, the fine-grained Gibbs entropy is conserved under microscopic dynamics. The usual route to entropy increase is coarse-graining: replace a detailed probability distribution with a smoothed one over macrostates.

So the microscopic picture is:

\[ \text{fine-grained information is preserved}. \]

The macroscopic picture is:

\[ \text{accessible information decays}. \]

The missing information has not necessarily been destroyed. It has become practically inaccessible.

This is why entropy is linked to irreversibility. Micro-dynamics may be reversible, but the macro-description is not.

7. Von Neumann entropy: the quantum version

In quantum mechanics, the state of a system is represented by a density matrix \(\rho\). The quantum analogue of Gibbs/Shannon entropy is von Neumann entropy:

\[ S_{\mathrm{vN}}(\rho) = -k_B\mathrm{Tr}(\rho\ln\rho). \]

In bits, one often writes:

\[ H_{\mathrm{vN}}(\rho) = -\mathrm{Tr}(\rho\log_2\rho). \]

If \(\rho\) has eigenvalues \(\lambda_i\), then

\[ S_{\mathrm{vN}}(\rho) = -k_B\sum_i\lambda_i\ln\lambda_i. \]

So von Neumann entropy is the Shannon entropy of the eigenvalue spectrum of \(\rho\).

Important cases:

A pure state,

\[ \rho=|\psi\rangle\langle\psi|, \]

has eigenvalues \(1,0,0,\ldots\), so

\[ S_{\mathrm{vN}}=0. \]

A maximally mixed \(d\)-dimensional state,

\[ \rho=\frac{I}{d}, \]

has eigenvalues \(1/d\), so

\[ S_{\mathrm{vN}}=k_B\ln d. \]

In bits:

\[ H_{\mathrm{vN}}=\log_2 d. \]
Von Neumann entropy measures how mixed a quantum state is, independent of measurement basis.

8. Shannon entropy of measurement outcomes is not always von Neumann entropy

This distinction is essential.

Take the pure qubit state:

\[ |+\rangle=\frac{|0\rangle+|1\rangle}{\sqrt{2}}. \]

Its density matrix is:

\[ \rho=|+\rangle\langle+|. \]

This is pure, so:

\[ S_{\mathrm{vN}}(\rho)=0. \]

But if you measure it in the \(|0\rangle,|1\rangle\) basis, the outcomes are:

\[ P(0)=1/2,\quad P(1)=1/2. \]

The Shannon entropy of that measurement is:

\[ H=1\text{ bit}. \]
Measurement entropy can be nonzero even when von Neumann entropy is zero.

Why? Shannon entropy here refers to uncertainty about the outcome of a particular measurement. Von Neumann entropy refers to mixedness of the quantum state itself.

9. Entanglement entropy: entropy from losing access to part of a pure whole

Consider a Bell pair:

\[ |\Phi^+\rangle = \frac{|00\rangle+|11\rangle}{\sqrt{2}}. \]

The full two-qubit system is pure:

\[ \rho_{AB}=|\Phi^+\rangle\langle\Phi^+|. \]

So:

\[ S_{\mathrm{vN}}(\rho_{AB})=0. \]

But if you only have access to subsystem \(A\), you trace out \(B\):

\[ \rho_A=\mathrm{Tr}_B(\rho_{AB}). \]

This gives:

\[ \rho_A=\frac{I}{2}. \]

So:

\[ S_{\mathrm{vN}}(\rho_A)=k_B\ln 2. \]

Or, in bits:

\[ H_{\mathrm{vN}}(\rho_A)=1. \]

The whole state is perfectly pure, but each part looks maximally mixed. The entropy is not in \(A\) alone. It is not in \(B\) alone. It is in the relation between \(A\) and \(B\), and it appears locally when one side is inaccessible.

Local entropy can arise from global entanglement plus restricted access.

10. Decoherence: entropy appears when the environment takes the phase information

Decoherence is the physical process by which quantum phase relations become inaccessible to local observers.

Suppose a system begins in a superposition:

\[ |\psi_S\rangle = \alpha|0\rangle+\beta|1\rangle. \]

The environment begins in:

\[ |E_0\rangle. \]

Interaction correlates the system with the environment:

\[ (\alpha|0\rangle+\beta|1\rangle)|E_0\rangle \longrightarrow \alpha|0\rangle|E_0'\rangle + \beta|1\rangle|E_1'\rangle. \]

Globally, this may still be a pure state. But if you do not track the environment, you use:

\[ \rho_S=\mathrm{Tr}_E(\rho_{SE}). \]

When \(|E_0'\rangle\) and \(|E_1'\rangle\) become nearly orthogonal, the reduced state of \(S\) loses its interference terms:

\[ \rho_S \approx |\alpha|^2|0\rangle\langle0| + |\beta|^2|1\rangle\langle1|. \]

That looks like a classical probability distribution.

So in decoherence:

\[ \text{quantum coherence} \longrightarrow \text{system-environment correlation} \longrightarrow \text{local entropy}. \]

Nothing mystical is necessarily destroyed. Information becomes delocalized into the environment.

11. Why this matters for time

A closed global quantum state evolving unitarily does not necessarily increase its von Neumann entropy. For unitary evolution,

\[ \rho(t)=U(t)\rho(0)U^\dagger(t), \]

the eigenvalues of \(\rho\) are unchanged, so

\[ S_{\mathrm{vN}}(\rho(t))=S_{\mathrm{vN}}(\rho(0)). \]

At the most exact quantum level:

\[ \text{unitary evolution preserves total von Neumann entropy}. \]

But subsystems become entangled with other subsystems, and their reduced entropies can increase:

\[ S(\rho_A),\quad S(\rho_B),\quad S(\rho_{\text{observer}}),\quad S(\rho_{\text{environment}}). \]

This gives the experienced arrow of time:

\[ \text{more records}, \quad \text{more decoherence}, \quad \text{more inaccessible correlations}, \quad \text{more coarse-grained entropy}. \]
The arrow of time is the growth of entropy in the descriptions available to embedded subsystems that cannot track all microscopic correlations.

12. Entropy and the Unruh/Rindler horizon

This is where the acceleration-temperature argument becomes relevant.

In the Unruh effect, the inertial Minkowski vacuum is globally a vacuum state. But a uniformly accelerated observer has access only to a Rindler wedge of spacetime. The other wedge is causally inaccessible.

The standard result is:

\[ \rho_R = \mathrm{Tr}_L \left( |0_M\rangle\langle 0_M| \right), \]

where \(L\) is the inaccessible left Rindler wedge and \(R\) is the accessible right Rindler wedge. The reduced state \(\rho_R\) is thermal.

Restoring constants, the Unruh temperature is:

\[ T_U=\frac{\hbar a}{2\pi c k_B}. \]
Global pure vacuum + restricted causal access = local thermal mixed state.

This is not ordinary ignorance like “I forgot where I put my keys.” It is physical restricted access. The accelerated observer literally cannot access the full set of field degrees of freedom across the horizon.

So the Unruh effect is a perfect example of the phrase: entropy appears when we lose access to microstate information.

13. Entropy depends on the accessible algebra of observables

In quantum theory, a state gives expectation values for a chosen set of observables. If you restrict the observables, different density matrices may become indistinguishable.

Entropy is not purely subjective, but it is description-relative:

\[ \text{entropy relative to what observables?} \] \[ \text{entropy relative to what subsystem?} \] \[ \text{entropy relative to what horizon?} \] \[ \text{entropy relative to what coarse-graining?} \]

For an omniscient global description, entropy may be low or constant. For an embedded observer with limited access, entropy can be high and increasing.

14. The relation to “time as interaction”

The attached argument says roughly: no interaction, no exchange of information, no entropy increase, therefore no local time.

The mainstream-safe version would be:

Without interaction, there may be no new records. Without new records, there is no local thermodynamic arrow. Without an internal or external clock correlation, time becomes operationally empty for that system.

One must separate three claims:

Claim A: no interaction means no entropy production

Often true locally. If a subsystem is perfectly isolated and remains pure, then its von Neumann entropy does not increase:

\[ S_{\mathrm{vN}}(\rho_S)=0 \]

can remain zero.

Claim B: no entropy production means no arrow of time

Often plausible. The thermodynamic arrow requires irreversible record formation, decoherence, heat flow, coarse-graining, or some analogous increase in accessible entropy.

Claim C: no arrow of time means no time at all

This is speculative. A closed quantum system can still evolve unitarily. A system can have changing relative phases even if its total von Neumann entropy is constant. A perfectly stationary energy eigenstate, by contrast, has no observable internal change except a global phase.

So the better distinction is:

\[ \text{time as parameter} \neq \text{time as experienced change} \neq \text{time as thermodynamic arrow} \neq \text{time as records}. \]

The safest deep synthesis is:

Time as a parameter may remain in the formalism, but time as experienced sequence requires correlations, records, and distinguishable change. Entropy enters because records are formed by irreversible amplification and by the dispersal of microscopic information into inaccessible degrees of freedom.

15. Entropy is not “disorder” exactly

“Disorder” is a useful metaphor, but it breaks down.

A crystal at high temperature may look ordered spatially but have thermal entropy from phonons. A black hole looks simple from the outside but has enormous entropy. A scrambled hard drive can have high Shannon entropy but low semantic usefulness. A pure quantum superposition can produce random measurement results but have zero von Neumann entropy.

Entropy is better understood as:

Entropy = missing distinguishability relative to a chosen physical description.

Or even more compactly:

Entropy is the cost of not tracking the microstate.

16. Where did the information go?

When entropy increases, the information usually goes into one of three places.

1. Unobserved microscopic degrees of freedom

Example: heat in air molecules. You know the temperature, not every molecular velocity.

2. Correlations

Example: two subsystems become correlated, but you examine each one separately. The local entropy rises even if the joint state remains pure.

3. Inaccessible regions

Example: horizons. A Rindler observer traces out the opposite wedge. A black-hole exterior observer cannot access interior degrees of freedom.

In all three cases, entropy means:

\[ \text{the information is not available in the variables you are using}. \]

17. One compact dictionary

Concept Formula Meaning
Shannon entropy \(H(X)=-\sum_i p_i\log_2p_i\) Classical uncertainty in bits
Gibbs entropy \(S_G=-k_B\sum_i p_i\ln p_i\) Physical entropy of a probability distribution over microstates
Boltzmann entropy \(S_B=k_B\ln W\) Entropy of a macrostate with \(W\) compatible microstates
Conditional entropy \(H(X|M)\) Missing micro-information after macrostate is known
Mutual information \(I(X:Y)=H(X)+H(Y)-H(X,Y)\) Information stored in correlations
Von Neumann entropy \(S_{\mathrm{vN}}=-k_B\mathrm{Tr}(\rho\ln\rho)\) Quantum mixedness / entropy of a density matrix
Entanglement entropy \(S(\rho_A)\), with \(\rho_A=\mathrm{Tr}_B\rho_{AB}\) Entropy caused by tracing out part of a pure quantum whole
Thermal entropy \(S=-\partial F/\partial T\) or \(S=-k_B\mathrm{Tr}\rho\ln\rho\) for thermal \(\rho\) Entropy tied to heat, work, and equilibrium

18. The final philosophical compression

In this context, entropy is not merely “messiness.”

It is the quantitative shadow cast by limited access.

A god’s-eye exact description may see a pure state evolving unitarily. An embedded observer sees subsystems, macrostates, records, clicks, temperatures, horizons, and memories.

The gap between those descriptions is entropy.

So the sentence—“Entropy often appears when we coarse-grain, ignore, or lose access to microstate information”—means:

Entropy is what a many-to-one description leaves unspecified.

And in the time/acceleration context:

The arrow of time is what that loss looks like from inside the world.

Interaction creates correlations. Correlations create records. Records require coarse-graining. Coarse-graining creates entropy. Entropy gives the arrow. The arrow gives the experienced sequence we call time.

Sources and context

This HTML export preserves the conceptual structure of the prior answer and its scientific source trail. External references are included for orientation.

  1. Claude Shannon, “A Mathematical Theory of Communication,” for Shannon entropy and information measured in bits: PDF.
  2. Stanford Encyclopedia of Philosophy, “Philosophy of Statistical Mechanics,” for Boltzmann entropy, macroregions, and coarse-graining: entry.
  3. Stanford Encyclopedia of Philosophy, “Information Processing and Thermodynamic Entropy,” for Gibbs entropy and coarse-graining: entry.
  4. Review material on quantum entropy and von Neumann entropy: arXiv PDF.
  5. W. H. Zurek, “Decoherence, einselection, and the quantum origins of the classical,” for decoherence, environment-induced monitoring, and record formation: PDF.
  6. Crispino, Higuchi, and Matsas, “The Unruh effect and its applications,” for the Rindler-wedge tracing interpretation of the Unruh effect: PDF.
  7. Nature Communications article on Page-Wootters-style relational time: article.
  8. User-provided source note: the attached text summarized McCulloch’s claim that time is emergent from change and interaction, and that low-interaction systems may have an enlarged “now.”

Comments

Latest