What Is Feature Engineering?
Feature engineering for trading bots starts with a fundamental problem: machine learning algorithms don't understand "price went up" the way humans do. They need structured, numerical representations of market conditions. Feature engineering is the craft of converting messy, raw trading data—timestamps, prices, volumes, blockchain transactions—into clean, predictive input variables that algorithms can actually learn from.
Think of it like cooking. Raw ingredients (price ticks, transaction volumes) aren't immediately useful. You've got to chop, season, and combine them into something digestible. A price of $42,573.18 at 14:23:07 UTC means nothing to a neural network. But a feature like "percentage change from 20-period moving average" or "volume-weighted bid-ask imbalance over 5-minute windows"? That's actionable information a model can pattern-match against.
The difference between a profitable algorithmic strategy and one that bleeds capital often comes down to feature quality. I've seen traders spend months optimizing model architecture when their real problem was feeding garbage features into a perfectly good algorithm.
Core Components of Trading Features
Price-Derived Features
These transform raw OHLCV (open, high, low, close, volume) data into meaningful signals:
Returns and momentum: Simple returns, log returns, cumulative returns over various windows. Most models need percentage changes, not absolute prices. A move from $100 to $110 carries the same signal as $50,000 to $55,000—both are 10% gains.
Moving averages and crossovers: The 50-day/200-day golden cross isn't just chart astrology—it's a compressed representation of medium and long-term trend relationships. Models learn these relationships faster when you explicitly compute them. Check out momentum trading indicators for practical applications.
Volatility measures: Historical volatility, ATR (Average True Range), Bollinger Band width. Crypto markets switch between low-vol consolidation and explosive moves. A good bot needs to know which regime it's operating in.
Range statistics: High-low spread, close position within the range (where close sits relative to the day's high/low). A close near the high in an uptrend means something different than a close near the low.
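A minimal pandas sketch of these price-derived transforms, using toy OHLC data. The 3-bar window here is a stand-in for the 20/50/200-period windows you'd use in practice:

```python
import pandas as pd

# Toy daily OHLC data; real code would pull this from an exchange API
df = pd.DataFrame({
    "high":  [105.0, 108.0, 107.0, 112.0, 115.0],
    "low":   [ 99.0, 101.0, 100.0, 104.0, 109.0],
    "close": [102.0, 106.0, 101.0, 110.0, 114.0],
})

# Percentage return: scale-free, unlike raw prices
df["ret_1"] = df["close"].pct_change()

# Distance from a moving average (3-bar window for this toy series)
sma = df["close"].rolling(3).mean()
df["dist_sma"] = df["close"] / sma - 1

# Close position within the bar's range: 1.0 = close at the high
df["range_pos"] = (df["close"] - df["low"]) / (df["high"] - df["low"])
```

Each of these is normalized or ratio-based, so the same feature works whether the asset trades at $100 or $50,000.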
Volume-Based Features
Volume confirms price action. Or contradicts it. Either way, it's signal:
Volume ratios: Current volume versus 20-period average. A breakout on 5x average volume hits different than one on declining volume.
On-balance volume (OBV): Cumulative volume direction. Divergences between OBV and price often precede reversals.
Volume-weighted metrics: VWAP deviations, volume-weighted momentum. Institutional traders anchor to VWAP—your bot should know where price stands relative to it.
For strategies that depend heavily on volume analysis, grid trading bot performance shows how volume features help identify optimal ranging conditions.
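The volume features above can be sketched in a few lines of pandas. Toy data, and a 2-period rolling average standing in for the 20-period one; the VWAP here uses close as a proxy for typical price:

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "close":  [100.0, 101.0, 100.5, 102.0],
    "volume": [500.0, 800.0, 300.0, 1200.0],
})

# Volume ratio vs. a short rolling average (20 periods in real use)
df["vol_ratio"] = df["volume"] / df["volume"].rolling(2).mean()

# On-balance volume: add volume when price rises, subtract when it falls
direction = np.sign(df["close"].diff()).fillna(0)
df["obv"] = (direction * df["volume"]).cumsum()

# Cumulative VWAP and deviation from it
vwap = (df["close"] * df["volume"]).cumsum() / df["volume"].cumsum()
df["vwap_dev"] = df["close"] / vwap - 1
```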
Time-Based Features
Markets have rhythms. Asian session behaves differently than NY open. Weekends see lower liquidity. Your features should capture this:
Hour of day, day of week: Encoded as cyclical features (sine/cosine transformations) so the model understands that 23:00 and 01:00 are close together.
Time since last significant event: Hours since last 5% move, time since volume spike, bars since crossover signal. These context features matter.
Volatility regime indicators: Binary flags for high/low volatility environments help models switch behavior.
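Here's what the sine/cosine trick looks like in practice. A sketch with three toy timestamps, showing why 23:00 and 01:00 end up near each other on the encoded circle:

```python
import numpy as np
import pandas as pd

ts = pd.to_datetime(["2024-01-01 23:00", "2024-01-02 01:00", "2024-01-02 12:00"])
hour = pd.Series(ts.hour, dtype=float)

# Map the 24-hour clock onto a circle: midnight wrap-around disappears
hour_sin = np.sin(2 * np.pi * hour / 24)
hour_cos = np.cos(2 * np.pi * hour / 24)

# Euclidean distance between 23:00 and 01:00 in the encoded space
dist = np.hypot(hour_sin[0] - hour_sin[1], hour_cos[0] - hour_cos[1])
```

Raw hour values put 23:00 and 01:00 a distance of 22 apart; the encoded distance is about 0.52. The same transform works for day of week (divide by 7 instead of 24).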
On-Chain Features (Crypto-Specific)
This is where crypto feature engineering diverges from traditional markets:
Exchange flow metrics: Net inflow/outflow to exchanges signals accumulation or distribution. A model that tracks centralized exchange reserves can anticipate selling pressure.
Active addresses and transaction counts: Network activity correlates with price—sometimes. The relationship isn't linear, but it's signal. See the active addresses metric for deeper context.
Gas prices and network congestion: High gas often coincides with volatility peaks. On Ethereum, when gas hits 300+ gwei, retail is FOMOing hard.
Whale wallet movements: Large wallet transfers, especially to exchanges. Understanding whale wallet movements explains why these matter for predictive models.
Advanced Feature Engineering Techniques
Lag Features
Don't just use current values. Include previous periods:
- Price 1 bar ago, 5 bars ago, 20 bars ago
- Volume from yesterday same hour
- Volatility from last week
This gives models temporal context. A single snapshot doesn't reveal momentum or trend changes.
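Lag features are one `shift` call in pandas. A minimal sketch (lag depths here are toy values):

```python
import pandas as pd

close = pd.Series([100.0, 101.0, 99.0, 103.0, 104.0, 102.0])

lags = pd.DataFrame({
    "close":      close,
    "close_lag1": close.shift(1),   # 1 bar ago
    "close_lag3": close.shift(3),   # 3 bars ago (5/20+ in real use)
})
# Rows before the longest lag contain NaNs and get dropped
lags = lags.dropna()
```

Note the side effect: every lag you add costs you that many rows of training data at the start of the series.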
Rolling Window Statistics
Compute metrics over multiple timeframes:
- 5-minute, 15-minute, 1-hour, 4-hour, daily windows
- Rolling standard deviation, skewness, kurtosis
- Percentile ranks (where current price sits in the last 100 periods)
Multi-timeframe features let a single model operate across different trading horizons. A scalping strategy needs 1-minute features. Swing trading needs daily. Why not include both and let the model figure out which matters when?
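A sketch of rolling statistics, including the percentile-rank idea (a 4-bar window stands in for the 100-period one):

```python
import pandas as pd

close = pd.Series([100.0, 102.0, 101.0, 105.0, 103.0, 108.0, 107.0, 110.0])

window = 4  # 100+ periods in real use
feats = pd.DataFrame({
    "roll_std":  close.rolling(window).std(),
    "roll_skew": close.rolling(window).skew(),
    # Fraction of the window strictly below the latest value, in [0, 1]
    "pct_rank":  close.rolling(window).apply(
        lambda w: (w < w[-1]).mean(), raw=True),
})
```

Computing the same block over 5-minute, 1-hour, and daily bars, then joining on timestamp, gives you the multi-timeframe feature set described above.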
Interaction Features
Multiply or divide features to create relationships:
- (Volume × Volatility) — explosive moves combine both
- (RSI / Bollinger Band Width) — overbought in tight ranges versus wide ranges
- (Price Change / Average Volume) — abnormal price movement per unit volume
Simpler models like linear regression can't discover these interactions on their own, and even tree-based models like random forests learn them faster when you compute them explicitly.
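Interaction features are just element-wise arithmetic on existing columns. A toy sketch:

```python
import pandas as pd

df = pd.DataFrame({
    "volume":     [500.0, 2000.0, 600.0],
    "volatility": [0.010, 0.050, 0.012],
    "price_chg":  [0.002, 0.040, -0.003],
})

# Hand the model the relationship directly instead of hoping it finds it
df["vol_x_volat"] = df["volume"] * df["volatility"]   # explosive-move proxy
df["chg_per_vol"] = df["price_chg"] / df["volume"]    # move per unit volume
```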
Feature Crosses
Categorical combinations:
- (Hour of Day × Day of Week) — Monday 9am behaves differently than Friday 4pm
- (Volatility Regime × Momentum Direction) — upward momentum in low-vol versus high-vol are different beasts
These let models learn context-dependent rules without complex neural architecture.
The Overfitting Trap
Here's where most algo traders screw up: they engineer 200+ features, backtest shows 85% win rate, then live trading implodes within a week. That's overfitting in action.
You've essentially taught your model to memorize historical noise rather than learn genuine patterns. It's like studying for a test by memorizing the answer key instead of understanding the material. Works great on the practice test. Fails spectacularly on new questions.
The fix: Feature selection and regularization. Use techniques like:
- Correlation analysis: Drop features that are 95%+ correlated. You don't need both SMA20 and EMA20—they're saying the same thing.
- Feature importance scores: Random forests and gradient boosting models rank features by predictive power. Drop the bottom 50%.
- Recursive feature elimination: Iteratively remove least important features and retest.
- L1 regularization (LASSO): Penalizes models for using too many features, forcing sparse solutions.
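The correlation-analysis step is easy to implement directly. A sketch with a hypothetical `drop_correlated` helper and synthetic data where `ema20` is a near-duplicate of `sma20`:

```python
import numpy as np
import pandas as pd

def drop_correlated(X: pd.DataFrame, threshold: float = 0.95) -> pd.DataFrame:
    """Drop each feature whose correlation with an earlier-kept one exceeds threshold."""
    corr = X.corr().abs()
    # Keep only the upper triangle so each pair is checked once
    upper = corr.where(np.triu(np.ones(corr.shape, dtype=bool), k=1))
    to_drop = [c for c in upper.columns if (upper[c] > threshold).any()]
    return X.drop(columns=to_drop)

rng = np.random.default_rng(0)
base = rng.normal(size=200)
X = pd.DataFrame({
    "sma20": base,
    "ema20": base * 1.01 + rng.normal(scale=0.01, size=200),  # near-duplicate
    "rsi":   rng.normal(size=200),                            # independent
})
X_reduced = drop_correlated(X)  # ema20 gets dropped, sma20 and rsi survive
```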
I've seen strategies improve after cutting from 180 features down to 15. Less is more when those 15 actually capture the signal.
Domain Knowledge Beats Automation
You can use automated feature generation tools—libraries that create hundreds of polynomial combinations, statistical transforms, and time-series decompositions. But that's a shotgun approach: spray features and hope something hits.
Domain expertise matters. Knowing that funding rates on perpetual futures predict short-term price pressure means you engineer a feature specifically for that. Knowing arbitrage opportunities exist across DEX pairs means you create spread features between Uniswap and Sushiswap liquidity pools.
Generic statistical features work. Domain-informed features work better.
Practical Implementation Example
Let's engineer features for a mean reversion bot:
Core price features:
- Distance from 20-period SMA (standardized by ATR)
- Z-score of current price versus 100-period distribution
- RSI deviation from 50 (absolute distance)
Volume confirmation:
- Volume ratio (current / 20-period average)
- OBV slope over last 10 periods
Regime filters:
- Bollinger Band width percentile rank
- ATR percentile rank (volatility regime)
- Linear regression slope of 50-period SMA (trend strength)
Time context:
- Hour of day (sine/cosine encoded)
- Bars since last reversion signal
- Days since last high-volatility event
That's 11 features. Clean, interpretable, each serving a specific purpose. Compare this to blindly dumping in 50 technical indicators and hoping the model sorts it out.
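As a partial sketch of the core price features above, here's the SMA distance and z-score on a synthetic random-walk series. Note one substitution: rolling standard deviation stands in for ATR, since this toy series has no high/low columns:

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(1)
close = pd.Series(100 + np.cumsum(rng.normal(scale=0.5, size=300)))

# Distance from 20-period SMA, standardized by rolling volatility
# (rolling std used here in place of ATR for a close-only series)
sma20 = close.rolling(20).mean()
vol20 = close.rolling(20).std()
dist_sma = (close - sma20) / vol20

# Z-score of current price vs. the 100-period distribution
z100 = (close - close.rolling(100).mean()) / close.rolling(100).std()

features = pd.DataFrame({"dist_sma": dist_sma, "z100": z100}).dropna()
```

The remaining features in the list follow the same pattern: each is a normalized, rolling computation over the raw series.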
For implementation details, see how to build a simple mean reversion trading bot.
Testing Feature Quality
You've engineered features. How do you know they're good?
Backtesting matters, but backtest on out-of-sample data. Train your model on 2022-2024 data, test on 2025. If performance collapses, your features aren't generalizing. See backtesting strategy for proper methodology.
Walk-forward analysis: Roll your training window forward month by month. Retrain, test on next month, repeat. If features are robust, performance stays relatively consistent. If it swings wildly, you're overfitting to specific market conditions.
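The rolling-retrain loop can be sketched as a split generator. `walk_forward_splits` is a hypothetical helper name, and the monthly granularity is one reasonable choice among many:

```python
import pandas as pd

def walk_forward_splits(index: pd.DatetimeIndex, train_months: int = 12):
    """Yield (train_mask, test_mask) pairs: train on `train_months` of data,
    test on the following month, then roll the window forward."""
    monthly = index.to_period("M")
    periods = monthly.unique().sort_values()
    for i in range(train_months, len(periods)):
        train_p = periods[i - train_months:i]
        yield monthly.isin(train_p), monthly == periods[i]

idx = pd.date_range("2023-01-01", "2024-06-30", freq="D")
splits = list(walk_forward_splits(idx, train_months=12))
# 18 months of data, 12-month training window -> 6 train/test pairs
```

Inside the loop you'd retrain the model on each training mask and score it on the following month; wildly varying scores across splits are the overfitting signal described above.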
Feature stability: Do your engineered features maintain consistent statistical properties over time? If your "volume surge" feature triggered 20 times in 2023 but 200 times in 2024, something changed in the underlying market structure. That feature might not age well.
Correlation with returns: Plot each feature against forward returns. Binned scatter plots work great. If a feature shows no monotonic relationship with future price movement, it's probably noise.
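The binned-scatter check is a `qcut` plus a `groupby`. A sketch on synthetic data with a deliberately weak positive relationship built in:

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(2)
n = 2000
feature = rng.normal(size=n)
# Synthetic forward returns with a weak dependence on the feature
fwd_ret = 0.01 * feature + rng.normal(scale=0.05, size=n)

df = pd.DataFrame({"feature": feature, "fwd_ret": fwd_ret})
# 10 equal-count bins; mean forward return per bin should trend upward
# if the feature actually carries signal
df["bin"] = pd.qcut(df["feature"], 10, labels=False)
binned = df.groupby("bin")["fwd_ret"].mean()
```

Plot `binned` as a bar chart: a roughly monotonic staircase means signal, a flat or random pattern means the feature is probably noise.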
Common Mistakes
Mistake #1: Look-ahead bias. You engineer a feature using information not available at decision time. Computing "distance from day's high" at 10am using the actual daily high (which includes price action from 11am-4pm) creates magical backtest results that evaporate live.
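The fix for this specific example is a running maximum instead of the full-day maximum. A sketch with four hourly bars:

```python
import pandas as pd

df = pd.DataFrame(
    {"high": [103.0, 105.0, 104.0, 110.0]},
    index=pd.date_range("2024-01-02 09:30", periods=4, freq="60min"),
)

# WRONG: uses the full day's high, which includes future bars
df["dist_high_leaky"] = df["high"].max() - df["high"]

# RIGHT: running high up to the current bar only
df["dist_high_ok"] = df["high"].cummax() - df["high"]
```

At the first bar, the leaky version already "knows" the day will top out at 110; the `cummax` version only ever sees information available at decision time.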
Mistake #2: Data leakage. Your validation set influences feature engineering decisions. You notice your model struggles with weekend data, so you add a "is_weekend" flag and retest on the same validation set. You've leaked information. Re-engineer features, then test on a fresh dataset.
Mistake #3: Non-stationary features. Creating features from absolute price levels in trending markets. A feature that "price is above $50,000" worked great in the 2021 bull run, then became useless post-crash. Use normalized, ratio-based, or percentage-based features instead.
Mistake #4: Ignoring computational costs. Engineering a feature that requires 50 API calls per decision works fine in backtesting with cached data. Live trading? Your bot times out and misses entries because it's busy computing some complex on-chain aggregation.
Real-World Performance Impact
Feature engineering isn't theoretical. The difference between mediocre and profitable bots lives here.
A basic copy trading bot using only raw signals might achieve 52% win rate. Add properly engineered features that capture regime changes, volatility clusters, and correlation breakdowns? You're looking at 58-62% win rates. That gap means everything over thousands of trades.
The most successful algorithmic traders I know spend 60% of their time on feature engineering, 20% on model selection, and 20% on infrastructure. Not the other way around.
Because you can throw a sophisticated neural network at bad features and get bad results. But give a simple linear model genuinely predictive features? It'll print money while the deep learning enthusiast is still tuning hyperparameters.