Whether the data was generated by a substitution rule — a process that replaces symbols with fixed patterns at every scale.
A substitution rule maps each symbol to a word: Fibonacci does a→ab, b→a; Thue-Morse does a→ab, b→ba. Iterating the rule produces an infinite aperiodic sequence whose statistical properties repeat at geometrically spaced scales (powers of the inflation factor λ). This geometry binarizes the data and tests five signatures that substitution sequences must satisfy and random or periodic sequences cannot.
How linear is the growth of distinct subwords? For a substitution sequence, the number of distinct n-grams p(n) grows as c·n + d (exactly linear). For random sequences, p(n) grows exponentially (2ⁿ). For periodic sequences, p(n) flatlines. Fibonacci word and Thue-Morse both score near 1.0 (textbook linear). White noise scores 0.76 (exponential growth, poor linear fit). Logistic period-5 scores 0.46 — its near-periodicity confuses the linearity test.
Topological entropy: does the subword count grow? Zero for substitution and periodic, positive for random and chaos. White noise and PRNG outputs score ~1.0 (exponential growth = maximal entropy). Fibonacci word scores 0.19 (barely above zero — its linear complexity growth produces a small but nonzero growth ratio). Periodic orbits score exactly 0.0.
How evenly are the symbols distributed? Measures max|D(n)|/√N, where D(n) is the deviation of cumulative symbol counts from expected. Substitution sequences have bounded discrepancy (D/√N → 0). Random sequences have D/√N ≈ 0.5 (random walk). Thue-Morse (0.006) and Fibonacci word (0.006) have the lowest nonzero discrepancy in the atlas — their symbols are distributed with almost crystalline uniformity. Devil's staircase (28.1) has the highest: its long constant plateaus create massive cumulative imbalance.
How regular are the gaps between repeated subwords? Substitution sequences have few distinct return times because the hierarchical tiling constrains where each pattern can appear. Logistic period-2 and edge-of-chaos score 1.0 (perfectly regular returns). Fibonacci word scores 0.82. fBm scores 0.08 (widely scattered, irregular returns).
Are autocorrelation peaks at geometric (λ^k) rather than arithmetic (nT) spacings? This is the substitution fingerprint: Fibonacci has ACF peaks at Fibonacci numbers (φ^k spacings), Thue-Morse at powers of 2. Quantum walk scores highest (0.91) — its interference pattern creates geometric-ratio ACF peaks through a completely different mechanism. Fibonacci word (0.88), phyllotaxis (0.89), and circle map quasiperiodic (0.89) cluster together. White noise and logistic chaos score 0.0 (no ACF peaks at all).
| Source | Domain | Value |
|---|---|---|
| Quantum Walk | quantum | 0.9123 |
| Wave Height (Buoy) | geophysics | 0.8984 |
| Phyllotaxis | bio | 0.8888 |
| ··· | ||
| Logistic Chaos | chaos | 0.0000 |
| Tent Map | chaos | 0.0000 |
| Prime Gaps | number_theory | 0.0000 |
| Source | Domain | Value |
|---|---|---|
| Projectile with Drag | motion | 1.0000 |
| Damped Pendulum | motion | 1.0000 |
| Lotka-Volterra | bio | 1.0000 |
| ··· | ||
| Constant 0xFF | noise | 0.0000 |
| Constant 0x00 | noise | 0.0000 |
| Logistic r=3.74 (Period-5 Window) | chaos | 0.4619 |
| Source | Domain | Value |
|---|---|---|
| Devil's Staircase | exotic | 28.0993 |
| Hilbert Walk | exotic | 27.2802 |
| BTC Close Price | financial | 27.2032 |
| ··· | ||
| Constant 0xFF | noise | 0.0000 |
| Constant 0x00 | noise | 0.0000 |
| Forest Fire | exotic | 0.0000 |
| Source | Domain | Value |
|---|---|---|
| Wichmann-Hill | binary | 1.0000 |
| BSL Residues | number_theory | 0.9999 |
| Benford's Law | number_theory | 0.9999 |
| ··· | ||
| Constant 0xFF | noise | 0.0000 |
| Constant 0x00 | noise | 0.0000 |
| Logistic r=3.5 (Period-4) | chaos | 0.0000 |
| Source | Domain | Value |
|---|---|---|
| Logistic r=3.2 (Period-2) | chaos | 1.0000 |
| Logistic Edge-of-Chaos | chaos | 1.0000 |
| Forest Fire | exotic | 1.0000 |
| ··· | ||
| Constant 0xFF | noise | 0.0000 |
| Constant 0x00 | noise | 0.0000 |
| fBm (Persistent) | noise | 0.0847 |
Inflation is the framework's substitution-rule detector. The combination of linear complexity + near-zero discrepancy + geometric ACF peaks is essentially unique to substitution sequences. In the ordinal view, it separates quasicrystalline sources (Fibonacci, Thue-Morse, L-System) from both periodic and random, which the other ordinal geometries struggle to do — Ordinal Partition sees Fibonacci as "moderately constrained" without distinguishing it from other aperiodic sequences. Inflation's acf_geometric metric also catches quantum walk and ocean/wave signals, suggesting that geometric self-similarity in autocorrelation structure is a broader phenomenon than pure substitution dynamics.