2. Indicator Engineering

2. Indicator Engineering Apr 23, 2026 8 min

2.1 The Indicator Is More Important Than the Model

The indicator sets the ceiling that no model can break through. A linear regression on a high-quality indicator beats a deep neural network on a low-quality one. Most R&D effort is spent on the model, where the marginal returns are smallest. The bigger gains live in the inputs.

2. Indicator Engineering Apr 24, 2026 9 min

2.2 Garbage Indicators, Garbage Predictions

A garbage indicator has four structural defects: non-stationary distribution, heavy tails, clumped values, or lookback artifacts. The model treats your input as the truth and propagates the defect into the forecast. Diagnose the indicator before you train anything.

2. Indicator Engineering Apr 25, 2026 10 min

2.3 Why Most Indicators Should Be Transformed Before Modeling

Raw indicators rarely satisfy the geometric assumptions a model needs: stable scale, spread distribution, bounded tails. Six transforms cover most repairs. Each fixes a specific defect, costs a hyperparameter, and risks lookahead if computed non-causally.

2. Indicator Engineering Apr 26, 2026 10 min

2.4 Relative Entropy as an Indicator Quality Score

Relative entropy as a quality score is the cheapest single-number test for whether an indicator uses the range it lives on. The catch: a well-shaped histogram of pure noise scores as well as a well-shaped histogram of signal.

2. Indicator Engineering Apr 27, 2026 9 min

2.5 Range/IQR: A Simple Test for Indicator Tail Problems

R/IQR is the ratio of total range to interquartile range. The denominator is anchored to the body of the distribution. The numerator follows the tails. The ratio is the only honest tail measurement on data where the standard deviation is already contaminated by the tails it is supposed to describe.

2. Indicator Engineering Apr 28, 2026 9 min

2.6 Why Predictive Power Often Lives in the Tails

R/IQR detects stretched distributions but says nothing about whether the stretch carries the signal. On market data the stretch usually carries it. The Tail Concentration Ratio splits per-decile mutual information and tells you whether the tails are noise to squash or signal to preserve.

2. Indicator Engineering Apr 29, 2026 9 min

2.7 How to Test Indicator Thresholds Without Fooling Yourself

scanning 41 RSI thresholds and reporting the best one inflates the naive p-value by an order of magnitude. The right test shuffles the target, re-runs the full threshold scan thousands of times, and compares the observed best statistic to the distribution of best statistics from noise.

2. Indicator Engineering Apr 30, 2026 12 min

2.8 Why You Should Test Long and Short Thresholds Separately

Long-side and short-side threshold scans on the same indicator are two hypotheses, not one. Equity drift, return skew, and conditional-distribution asymmetry break the mirror.

2. Indicator Engineering May 1, 2026 12 min

2.9 The Case Against Raw Price Indicators

Raw price is non-stationary in mean, non-stationary in variance, and incomparable across instruments. A model trained on SPX from 1990 to 2010 sees 71% of the 2010 to 2026 test rows outside its training support. The in-sample AUC of 0.582 collapses to 0.498 live.

2. Indicator Engineering May 2, 2026 13 min

2.10 How to Build Stationary Indicators from Non-Stationary Prices

Six transforms turn non-stationary prices into stationary indicators, log returns to forced centering. Pick the lightest passing ADF, coverage, rolling-variance. ATR-norm 20d momentum wins on SPX.

2. Indicator Engineering May 3, 2026 12 min

2.11 Why ATR Normalization Is More Than a Volatility Trick

ATR captures within-bar and between-bar movement in the instrument's own units. On SPX 20d momentum, 2020/2017 std ratio drops 5.0 raw → 1.05 ATR-normalized, no MI loss. Structural, not heuristic.

2. Indicator Engineering May 4, 2026 13 min

2.12 CMMA: A Better Momentum Primitive Than Price-minus-MA Alone

CMMA fixes Close − MA(k) with log prices, prev-bar MA, ATR with current bar, √(k+1) divisor. On SPX, test/train std ratio drops 5.25 → 1.03; MI lifts 1.4 → 2.0. Pool-compatible.

2. Indicator Engineering May 5, 2026 11 min

2.13 Why Indicator Histograms Matter

The histogram is the primary diagnostic. Scalars (R/IQR, RE, MI, TCR) confuse shapes needing different fixes. Identical scalars hide light-tail, heavy-tail, or bimodal — only bimodal needs a split.

2. Indicator Engineering May 6, 2026 12 min

2.14 Taming Indicator Tails with Sigmoid Transforms

A wrong-scale sigmoid destroys the signal. Center on training median; scale α so IQR lands in the linear region (α≈1.2 for tanh). On SPX Range/Close, α=1.2 lifts AUC 0.503 → 0.519.

2. Indicator Engineering May 7, 2026 12 min

2.15 Why the Median Often Beats the Mean in Trading Features

Mean breakdown 1/n; median 0.5. On SPX 100-bar center through March 2020, mean spikes 5σ; median moves 0.4. Median for centering, MAD/IQR for spread, Spearman for correlation, LAD for regression.

2. Indicator Engineering May 8, 2026 11 min

2.16 Feature Engineering Before Machine Learning

Feature engineering contributes 90-95% of the edge; model selection 5-10%. Same XGBoost: raw OHLC → SPX AUC 0.502; full statistical pipeline → 0.524. Features are IP. The model is a commodity.

2. Indicator Engineering May 9, 2026 11 min

2.17 No Filter Is Predictive: What Traders Misunderstand About Smoothing

Every linear filter is y(t)=Σ b_k x(t−k)−Σ a_k y(t−k). H(z) has only negative powers, no z⁺¹. Filters summarize the past; they don't extrapolate. SMA(50) on next-day return: R²≈0.0002.

2. Indicator Engineering May 10, 2026 10 min

2.18 The Hidden Cost of Every Moving Average: Lag

Every MA pays a lag tax. Symmetric FIR length N has lag (N−1)/2. On SPX, 200-day SMA captures 18% of a 10% move; 50-day captures 56%. WMA buys lag with phase distortion. Structural trade.

2. Indicator Engineering May 11, 2026 11 min

2.19 Why the SMA Is Often a Terrible Smoother

SMA's sinc response has the worst sidelobes of any common smoother: −13.3 dB (22% leakage) regardless of N. Same lag, Hann leaks 2.6%, Blackman 0.12%. Critical period ~2N. Use Hann or Blackman.

2. Indicator Engineering May 12, 2026 11 min

2.20 EMA vs SMA: Why Simplicity Still Matters

EMA is the 1-pole IIR low-pass: y_t = α x_t + (1−α) y_{t−1}. One state, one parameter, no sidelobes. EMA wins on O(1) state and composability — the building block for HPF, BPF, AGC, decycler.

2. Indicator Engineering May 13, 2026 9 min

2.21 The Trader's Guide to Low-Pass Filters

Each pole adds 6 dB/oct rolloff and one bar of lag at critical period T. 1-pole EMA: 6 dB/oct. 2-pole and super-smoother: 12 dB/oct. The super-smoother is critically damped with clean step response.

2. Indicator Engineering May 14, 2026 11 min

2.22 High-Pass Filters for Traders

HPF output = input − LPF. Each pole adds 6 dB/octave rolloff and one bar of lag. The 2-pole HPF at critical period T is the cleanest detrender for daily mean-reversion features.

2. Indicator Engineering May 15, 2026 11 min

2.23 Band-Pass Filters: The Most Underused Tool in Technical Analysis

The band-pass rejects both trend and noise, keeping only the cycle band. Direct second-order form has two parameters (T, δ). On SPX at T = 20, δ = 0.3, BPF MI is 1.9 vs 1.2 for close-minus-MA.

2. Indicator Engineering May 16, 2026 10 min

2.24 Decyclers: Extracting Trend by Removing Cycle Energy

The decycler extracts trend by subtracting the cycle, not by smoothing. On SPX, decycler at T=30 cuts EMA(50)'s 18% cycle leakage to 4% with less lag. Specify the cycle to cancel, not a lookback.

2. Indicator Engineering May 17, 2026 12 min

2.25 Why Moving Averages Can Lie at Turning Points

At a turning point, the MA reports the prior regime as the present. EMA(50) signaled the SPX March 2020 bottom 27 bars late; cycle-mode detectors fired in 2-5 bars. Structural lag, not a tuning bug.

2. Indicator Engineering May 18, 2026 13 min

2.26 The Frequency Response of Trading Indicators

Every linear indicator is a filter with a unique frequency response. RSI(14), MACD(12,26), Stochastic(14) read the same 15-40 bar cycle band three different ways. The confluence is redundancy.

2. Indicator Engineering May 19, 2026 13 min

2.27 How to Think About Indicator Lag Before Backtesting

Every linear indicator has known lag. Sum cascades, compare to time-to-half-move. EMA+RSI on 5-bar mean reversion: 17-bar lag, structurally broken. Audit lag before backtesting, not after deployment.

2. Indicator Engineering May 20, 2026 13 min

2.28 Why Median Filters Are Useful for Volume and Outliers

Linear filters integrate every bar including outliers. Median filters select the typical value. For volume, true range, and tick data, median (or Hampel) is the default; linear smoothers come after.

2. Indicator Engineering May 21, 2026 14 min

2.29 Automatic Gain Control for Trading Indicators

Indicators inherit the input's amplitude. BPF on SPX swings ±0.4% in 2017 and ±3.1% in March 2020. AGC rescales continuously so thresholds become regime-independent. The fix is one pipeline stage.

2. Indicator Engineering May 22, 2026 13 min

2.30 Dominant Cycle Estimation Without Astrology

The dominant cycle is measurable. Autocorrelation periodogram gives a number with known precision. Elliott Wave, Gann, Bradley dates produce no falsifiable measurement. Numbers feed adaptive logic.

2. Indicator Engineering May 23, 2026 13 min

2.31 Why Market Cycles Are Evanescent

Market cycles exist but are evanescent: period drifts, amplitude decays, phase loses coherence. SPX cycle ranged 8 to 28 bars across six regimes in seven years. Gate cycle-mode strategies by regime.

2. Indicator Engineering Jun 16, 2026 23 min

2.75 DSP and Digital Filters for Traders: The Primer Nobody Wrote First

Every indicator is a filter: a machine that reshapes price. Learn signal, frequency, lag, and the four filter jobs once, and the whole cycles literature stops being a wall.