Pillar

2. Indicator Engineering

Treating indicators as engineered signals, not chart art. Transformations, stationarity, tail behavior, entropy, smoothing, lag, and frequency response, the math that decides whether a feature carries real information.

82 articles

Sort:
2. Indicator Engineering 8 min

2.1 The Indicator Is More Important Than the Model

The indicator sets the ceiling that no model can break through. A linear regression on a high-quality indicator beats a deep neural network on a low-quality one. Most R&D effort is spent on the model, where the marginal returns are smallest. The bigger gains live in the inputs.

2. Indicator Engineering 9 min

2.2 Garbage Indicators, Garbage Predictions

A garbage indicator has four structural defects: non-stationary distribution, heavy tails, clumped values, or lookback artifacts. The model treats your input as the truth and propagates the defect into the forecast. Diagnose the indicator before you train anything.

2. Indicator Engineering 9 min

2.5 Range/IQR: A Simple Test for Indicator Tail Problems

R/IQR is the ratio of total range to interquartile range. The denominator is anchored to the body of the distribution. The numerator follows the tails. The ratio is the only honest tail measurement on data where the standard deviation is already contaminated by the tails it is supposed to describe.

2. Indicator Engineering 9 min

2.6 Why Predictive Power Often Lives in the Tails

R/IQR detects stretched distributions but says nothing about whether the stretch carries the signal. On market data the stretch usually carries it. The Tail Concentration Ratio splits per-decile mutual information and tells you whether the tails are noise to squash or signal to preserve.

2. Indicator Engineering 12 min

2.9 The Case Against Raw Price Indicators

Raw price is non-stationary in mean, non-stationary in variance, and incomparable across instruments. A model trained on SPX from 1990 to 2010 sees 71% of the 2010 to 2026 test rows outside its training support. The in-sample AUC of 0.582 collapses to 0.498 live.

2. Indicator Engineering 5 min

2.65 ADX Done Right: Two-Level Smoothing

The ADX is not a price indicator, it is a ratio of ATR-normalized range expansions with two hidden smoothing stages. Build it right and you get a stationary trend-strength filter, but respect the lag: it stays high after sharp trends die.

2. Indicator Engineering 5 min

2.68 The Price Change Oscillator

Short-term movement minus a long-term baseline, normalized by ATR, reads volatility expansion against contraction. But normalize too hard and you delete the very regime signal you wanted: the stationarity-vs-information trade-off, with a dial.

2. Indicator Engineering 6 min

2.71 Normalized On-Balance Volume

OBV's running sum wanders like a random walk and depends on when you started counting. Window it, divide signed volume by total volume, scale by root-lookback, and you get a bounded, stationary flow oscillator.

2. Indicator Engineering 5 min

2.72 The Volume-Weighted MA Ratio

The volume-weighted MA ratio asks where the crowd transacted, not which way it pushed: VWMA over SMA, above one when expensive bars carried the volume. Log it, scale by root-lookback, compress, and it's model-ready.

2. Indicator Engineering 5 min

2.73 Volume Momentum

Volume Momentum ignores price and asks one thing: is the tape hotter than its own baseline. Short volume over long volume, logged and CDF-squashed into a bounded regime gauge that tells you if your signals have fuel.

2. Indicator Engineering 6 min Premium

2.76 The Limits of Linear Models

The clean linear factor model fails because its weights are not constants. Momentum and carry only pay in the right vol regime, and interactions are the one thing a linear model cannot represent.

2. Indicator Engineering 4 min

2.81 Volume and Volatility Are the Same Feature

Replace the volatility term with volume in most alphas and the backtest barely moves, because they ride the same information clock. Feeding a model both is double-counting one factor. Keep one scale, and add their ratio, Amihud illiquidity, as the residual that actually carries new information.