Lumin — Structural Signal Model for Cryptocurrency Futures

Author: Houssem Ben Salem
Date: March 7, 2026 | White Paper
Asset Class: Cryptocurrency Perpetual Futures (Binance USDT-M)
Strategy: Systematic Directional (LONG + SHORT), Multi-Timeframe, Fully Autonomous


1. Abstract

Lumin is a systematic signal model for cryptocurrency futures that mines statistically robust price patterns from historical data across 22 pairs selected by top market capitalization, validates them out-of-sample, and deploys on 4 institutional-grade pairs (BTCUSDT, ETHUSDT, SOLUSDT, SUIUSDT) across four timeframes (1d, 8h, 4h, 2h) on Binance perpetual futures. The model decomposes every price bar into 125+ independent, measurable dimensions — momentum, volume structure, cluster density, macro context, prior signal outcomes — and discovers combinations (signatures) where a specific directional outcome has occurred with ≥90% probability over ≥200 historical instances. The current database holds 2,959 validated signatures. Every signal is generated, validated, and executed autonomously — no human intervention, no discretionary decisions, no manual overrides.

2. Executive Proposition

Currently deployed with $500,000 initial capital, scalable to $10M+ on these liquid pairs without structural modification. Capital is sought to scale deployment.

MetricOut-of-Sample (Jan–Mar 2026)
Return+41.5% in 63 days (as of March 7, 2026)
Sharpe Ratio12.67 (annualized)
Max Drawdown-0.7%
Profit Factor17.34
Win Rate85.7%
Round-trip costs deducted0.50% per trade (fees + slippage)

The remainder of this document explains how.

LUMIN> open terminal


3. Structure in Apparent Chaos

Price action in financial markets appears chaotic. Conventional wisdom treats short-term price movements as random walks, unpredictable by construction. This view is incomplete. Price action is chaotic but not random — it contains compressible structure, patterns that repeat because they are generated by the same underlying market mechanics.

This distinction is foundational. Consider two analogies from mathematics and computer science:

Prime Factorization

Every number, no matter how large, decomposes uniquely into a product of primes. The number 60 looks arbitrary — but it is entirely determined by four irreducible factors: 2 × 2 × 3 × 5. Each prime is a distinct factor of the number's structure. The factorization is hidden — finding it requires work — but once found, it reveals the complete internal architecture. A composite number is not random; it is a deterministic combination of irreducible components.

Large Language Models (like ChatGPT)

Natural language appears infinitely variable. Yet modern language models achieve coherent output by decomposing language into dimensions — hundreds of billions of parameters, each capturing a different aspect of linguistic structure: syntax, semantics, tone, context. The model does not memorize sentences; it learns the generative structure. More parameters capture more dimensions, and performance scales accordingly. Language is not random; it is a high-dimensional system with learnable structure.

View diagram: Language model parameter scaling
LLM Parameter Scaling

Our Model (Lumin)

Our signal model applies the same principle to price action. Each candlestick bar encodes a high-dimensional state: where price closed relative to its range, how volume compared to recent history, whether the bar challenged or respected prior highs and lows, whether price clusters formed support or resistance. These are the dimensions — distinct components of market state, each encoding different information about the same bar. Market dimensions are correlated — they share underlying drivers — but each captures structure that the others cannot express. It is their specific combinations that create unique, predictive signatures.

A single dimension tells you little. Close position alone is not predictive. Volume alone is not predictive. But specific combinations of 2–4 dimensions occurring simultaneously — what we call signatures — create conditions where the next directional move becomes highly predictable. Like a fingerprint composed of multiple ridge patterns, a signature is a unique combination of dimension states that identifies a specific market condition — one that historically precedes a directional outcome with measurable probability.

The model currently operates on ~125 dimensions. These are not arbitrary features. Each dimension captures a distinct aspect of market microstructure: price positioning, momentum, volatility regime, structural context, and the outcomes of prior signals. The system is inherently scalable — adding dimensions that capture information not expressible through existing ones increases the model's capacity to distinguish market states, just as increasing parameter count in language models increases their capacity to understand linguistic nuance.

On fundamental analysis. These dimensions capture the aggregate outcome of all market participants' actions — including those driven by fundamental analysis, macroeconomic events, and news flow. The model operates at the data layer where fundamentals have already been expressed as price and volume. It does not predict news; it reads the market's reaction to it.


4. How Dimensions Are Derived

Every completed bar on a price chart is an event — a structured record of what happened in a fixed interval. Like events in physics, each can be decomposed into three questions:

This What / How / Context decomposition is how dimensions are derived: not from textbook technical indicators, but from observable market behavior. Standard indicators (RSI, MACD, Bollinger Bands) are simplified summaries that discard information. Our dimensions capture the raw mechanics.

Two concrete examples illustrate this decomposition:

4.1 Stop Loss Hunt — SHORT Trigger

View diagram: Stop Loss Hunt anatomy
Stop Loss Hunt -- SHORT Signal Anatomy

Note: This diagram is a simplified illustration for explanation purposes only. The model does not trade based on any single pattern in isolation. A stop loss hunt is just one of 11 trigger types. Whether any specific occurrence is tradeable depends entirely on the dimensional signature that accompanies it — the specific combination of context, mechanics, and prior signal outcomes that the data has shown to be consistently profitable across hundreds of historical occurrences.

What (the event)

A bar makes a new high compared to the previous bar, then fails to close above it. This is a stop-loss hunt — short positions with tight stops above recent highs are forced to close (buy to cover), temporarily pushing price higher. If this forced buying doesn't attract genuine continuation buying, price falls back. The hunt reveals directional intent: the stops were triggered to fill sell orders at higher prices.

How (the mechanics)

Context (the surroundings)

Each of these is an independent, measurable dimension. The combination determines whether the hunt leads to reversal or continuation.

4.2 Bullish Engulfing at Lows — LONG Trigger

View diagram: Bullish Engulfing anatomy
Bullish Engulfing -- LONG Signal Anatomy

What (the event)

Price sweeps below a previous low (triggering stop losses on long positions), then reverses and engulfs the prior bar, closing at the extreme high of its range. The sweep-and-reverse traps late shorts who entered on the breakdown.

How (the mechanics)

Context (the surroundings)

4.3 Sample Dimensional Categories

The examples above illustrate the decomposition principle. The model applies it systematically across multiple categories. The following is a representative sample:

#CategoryWhat it captures
1Close positioningWhere price closed within the bar’s range, at multiple granularity levels. Distinguishes conviction from indecision.
2Volume regimeVolume relative to a smoothed adaptive baseline. Detects divergences where price and volume disagree — a hallmark of exhaustion.
3Volatility structureBar spread relative to recent norms. Expansion signals directional order-book imbalance; contraction signals exhaustion or indecision.
4Price clustersStatistical density analysis of historical trading activity. Identifies zones where trading has concentrated — data-driven support and resistance.
5Structural contextCharacteristics of recent preceding bars. The same trigger after a climax event vs. after quiet consolidation produces statistically different outcomes.
6Cross-pollinationOutcomes of recent signals on the same instrument. Captures inter-signal dynamics and sequential dependencies that single-bar analysis misses.
7Trend positioningPrice position relative to smoothed structural averages. The same pattern resolves differently depending on the prevailing trend context.

These categories are illustrative, not exhaustive. Each dimension is computed from different market data — they are complementary: close position captures different information than volume, volume captures different information than cluster density, cluster density captures different information than trend position. Dimensions may correlate — they share underlying market dynamics — but none is redundant given the others. This is what makes the system scalable: each new dimension that carries information not expressible through existing ones expands the combinatorial space of distinguishable market states.

4.4 Technical Tools

The only element borrowed from conventional technical analysis is RSI — but not the indicator itself. We use only its derivatives: breakout detection (when RSI crosses a pivot level) and divergence detection (when RSI direction disagrees with price direction). These are structural events, not indicator readings.

Even these derivatives are computed using a modified RSI based on logarithmic returns instead of standard percentage changes. Crypto price action is asymmetric — “stairs up, elevator down” — and log returns naturally weight downside moves more heavily, producing a more accurate real strength measurement.

All other dimensions are derived directly from market behavior (price structure, volume dynamics, cluster analysis) — not from standard technical indicators.


5. From Dimensions to Signatures

A signature is a specific combination of 2 to 4 dimension values occurring simultaneously on a single bar. Example structure:

TRIGGER_TYPE | condition_1 + condition_2 + condition_3

The trigger defines the event category. The conditions specify which dimensional values must be active. When a live bar's computed state contains all codes specified by a signature, the engine checks whether that combination exists in the validated signature database. If found, the signature fires and a trade is initiated.

The combinatorial search space is vast. With ~125 active dimensions, the number of possible 2–4 code combinations exceeds hundreds of thousands. The mining process evaluates each one, computing its historical win rate, trade count, and temporal stability across the full in-sample period. Only those meeting strict qualification criteria are retained as production signatures.

Current database: 2,959 validated signatures across four timeframes (1d: 1,636; 8h: 245; 4h: 790; 2h: 288). Each signature has been independently mined and validated. The databases are independently constructed per timeframe — there is no sharing or leakage between them.


6. Mining Methodology: How Overfitting Is Eliminated

The credibility of any systematic model rests on its protection against overfitting — patterns that appear profitable in historical data but fail on new data. The mining methodology employs five layers of defense:

6.1 Universe and Data

Signatures are mined across 22 institutional-grade pairs — the union of CoinMarketCap's historical top 20 by market capitalization (January snapshots, 2022–2025), filtered for Binance USDT-M availability and sufficient liquidity. This universe covers >85% of total crypto market capitalization.

GroupSymbols
CoreBTC, ETH, BNB, SOL, ADA, XRP, AVAX, DOGE
ExtendedDOT, SHIB, LTC, TRX, MATIC, UNI, LINK, BCH, TON, XLM, HBAR, ALGO, ICP, SUI

Mining uses the full 22-pair universe. Deployment is validated on a focused 4-pair subset (BTC, ETH, SOL, SUI) — the core institutional allocation — using the same databases without re-mining.

6.2 Five-Filter Qualification

FilterCriterionPurpose
1. Minimum trades200+ historical occurrencesEliminates statistical flukes — a pattern must recur consistently across 22 symbols over 4 years
2. Win rate90%+ at optimal targetOnly near-certain outcomes survive; timeouts count as losses
3. Temporal robustness80+ / 100 stability scorePattern must perform consistently across calendar quarters — a pattern that only works in one market regime is discarded
4. Target optimizationEV = WR% × Target% − (1−WR%) × AvgLoss%Each signature is assigned its profit target maximizing expected value, not simply the highest win rate
5. Strict IS/OOS separation2022–2025 In Sample, 2026 Out of SampleThe model was frozen before any 2026 data was examined. No parameters, thresholds, or selections were influenced by OOS data

6.3 Conservative Accounting

Every trade simulation deducts 0.50% round-trip cost (0.10% exchange taker fee + 0.40% slippage). Timeouts — trades that neither hit target nor stop within the maximum holding period — are counted as losses. Funding fees (periodic exchange settlement payments between long and short holders) are not included in this simulation; in live trading they represent a small additional cost that varies with market conditions.

6.4 Chronological Integrity

Deployment is chronological — the simulation processes signals in calendar order, exactly as a live system would. Any overfitting would manifest as performance degradation over time. Instead, performance is consistent across all four years, because signatures capture stable structural patterns, not temporary market artifacts. The databases are independent banks of validated signatures that have demonstrated consistent behavior across the entire in-sample period.

6.5 Quarterly Refresh

Signature databases are refreshed quarterly to accommodate regime changes. The mining process re-runs on the expanded dataset, preserving temporal distribution integrity. Signatures that no longer meet qualification criteria are retired; new qualifying signatures are added. The database evolves with the market without compromising statistical rigor.


7. Multi-Timeframe Architecture

The same mining methodology is applied independently to four timeframes:

Parameter1d8h4h2h
Bar interval24 hours8 hours4 hours2 hours
Max holding period7 days2.3 days2.3 days1.2 days
Profit targets5–10% (per-sig)5%4–5% (per-sig)4–5% (per-sig)
Signatures1,636245790288
IS signals (4-pair)1,005209286265
IS win rate (4-pair)72.0%98.0%95.4%92.9%

Each timeframe has its own independently mined signature database. There is no parameter sharing between timeframes. The same dimensional framework operates at each resolution, but the specific signatures that survive the filters differ — capturing resolution-specific market dynamics.

7.1 Combined Deployment

The recommended configuration runs all four timeframes on a single account with shared capital. LONG and SHORT signals are deployed simultaneously — the system is direction-agnostic and does not operate in a regime where only one direction is active. On any given day, the portfolio may hold LONG positions on one pair and SHORT positions on another, across different timeframes.

Each timeframe is allocated a fraction of current equity (see Section 6.3 for the asymmetric allocation). Capital returned from resolved trades re-enters the shared pool immediately, creating temporal capital efficiency: when a 1d trade locks capital for 7 days, the 2h and 4h timeframes deploy the idle pool on faster-resolving opportunities.

7.2 Cross-Timeframe Flip

When a lower timeframe (e.g. 4h) fires opposite to an active higher timeframe position (e.g. 8h) on the same symbol, and they form an adjacent couple (1d/8h, 8h/4h, 4h/2h), the system closes the higher timeframe position early and deploys the lower timeframe signal. The rationale: a fresher signal from a faster-reacting timeframe overrides a stale position from a slower timeframe.

This mechanism reduced maximum drawdown from -7.4% to -5.1% in-sample aggregate while increasing total return by +5,800 percentage points, by cutting losing positions early when the market structure shifts.

7.3 Asymmetric Allocation

When multiple signals fire on the same timeframe on the same day, the available TF budget is equally pre-allocated across all qualifying signals. There is no priority ranking or signal ordering — each signal receives an equal share of the timeframe's remaining budget. The per-slot size is then capped at the timeframe's allocation percentage of total equity:

TimeframeAllocationRationale
2h30% of equityFastest resolution (1.1 day avg hold), capital recycles ~3x faster than 1d
4h25%Fast resolution, highest sub-daily WR (95.4%)
8h25%Near-perfect WR (98.0%), medium resolution
1d20%Slowest resolution, lowest deployed WR, locks capital longest

8. Results

8.1 Out-of-Sample Performance (YTD 2026, ongoing)

True blind test. No parameters, thresholds, or selections influenced by this data. 4-pair deployment (BTC, ETH, SOL, SUI), $500,000 initial capital. The out-of-sample period is ongoing and currently spans 63 days (as of March 7, 2026) — a short window by institutional standards, but one that already includes meaningful market conditions (trending, ranging, and volatile regimes). Statistical confidence will strengthen as the live period extends.

MetricCombined1d8h4h2h
Return+41.5%Per-TF contribution within combined deployment
Win Rate85.7%74.2%100.0%100.0%92.9%
Sharpe Ratio12.67
Realized Volatility16.0%
Max Drawdown-0.7%
Profit Factor17.34
Trades Deployed7031131214
Realized PnL$+207,588$+59,508$+57,377$+60,973$+56,859

Realized PnL from $500,000 initial capital.

Sharpe Ratio = (mean(daily returns) − rf) / std(daily returns) × sqrt(365). Risk-free rate = 4% annual. Realized Volatility = std(daily returns) × sqrt(365) × 100.

Note: All columns reflect the actual combined deployment on shared capital. Per-TF columns show each timeframe's contribution within the combined portfolio (trades deployed, win rate, realized PnL). Risk metrics (Sharpe, volatility, drawdown) apply to the combined equity curve.

The combined return (+41.5%) with a maximum drawdown of just −0.7% reflects the structural benefit of multi-timeframe diversification: each timeframe captures independent signal opportunities while sharing capital, producing returns that exceed any single timeframe's standalone performance by a significant margin.

Out-of-Sample Equity Curve (2026)
Out-of-Sample Drawdown (2026)

8.2 In-Sample Consistency (2022–2025)

YearCombined ReturnCombined SharpeCombined Max DDWin RateTrades
2022+488.1%9.77−3.9%85.3%307
2023+312.9%7.76−5.2%80.1%272
2024+773.4%12.02−2.7%86.5%363
2025+492.6%10.16−3.7%85.8%331
Aggregate+38,697% (344% CAGR)9.18−5.2%84.7%1,275

The model is profitable in every year tested, including 2022 (a severe bear market). Maximum drawdown never exceeds −5.2% in any year. The Sharpe ratio ranges from 7.76 to 12.02, consistently above institutional thresholds.

A note on in-sample value. The signature database is mined from the full 2022–2025 period — each signature must demonstrate consistent behavior across all four years to qualify. In-sample deployment results are therefore not traditional backtests of unseen data. However, they are far from meaningless: the database is a bank of validated signatures, and deployment simulates real-world execution with costs, slot sizing, and Binance constraints. The consistency across all four years — including the 2022 bear market — demonstrates that signatures capture stable structural patterns, not transient artifacts. While out-of-sample results on fresh, unseen 2026 data carry the strongest evidential weight, the in-sample returns tell a compelling story of model robustness.

In-Sample Equity Curve (2022-2025)
In-Sample Drawdown (2022-2025)

8.3 Per-Year Metrics Comparison

Per-Year Metrics Comparison

8.4 Monthly Returns

Monthly Returns — In Sample and Out of Sample

8.5 Risk Metrics Summary

MetricOOS (2026)IS Aggregate (2022–2025)
Sharpe Ratio12.679.18
Realized Volatility16.0%15.9%
Max Drawdown−0.7%−5.2%
Profit Factor17.345.48
Win Rate85.7%84.7%
Avg Holding Period2.3 d2.3 d
Avg Capital Utilization30.7%22.7%
Peak Capital Utilization75.4%100%
Cross-TF Flips222

9. Structural Advantages

9.1 Diversification Across Time

Four timeframes on shared capital create natural diversification. The 2h timeframe resolves trades in ~1 day; the 1d timeframe holds for up to 7 days. When one timeframe's capital is locked, the others deploy the idle pool. This temporal diversification produces higher returns from the same capital base without increasing risk.

9.2 Drawdown Compression

The shorter timeframes (8h, 4h, 2h) maintain >92% win rates. When the 1d timeframe — which has the lowest win rate (74.2%) — takes a loss, the high-WR sub-daily trades are typically winning on the same capital base, absorbing the impact in real time. The Combined TF Drawdown is the actual drawdown experienced by the model — all four timeframes deployed together on shared capital.

PeriodWorst Single-TF DrawdownCombined TF DrawdownCompression
OOS 2026−1.6% (1d)−0.7%56%
IS 2022−5.8% (1d)−3.9%33%
IS 2023−9.1% (1d)−5.2%43%
IS 2024−3.5% (1d)−2.7%23%
IS 2025−6.9% (1d)−3.7%46%

9.3 Scalability

The dimensional framework is designed for expansion. Adding dimensions that capture information not expressible through existing ones increases the model's discriminative capacity. More dimensions mean:

Adding dimensions only enhances — it never disrupts what has already been acquired. Existing signatures continue to fire on the same conditions as before; new dimensions simply enable the discovery of additional qualifying signatures that were previously indistinguishable. The current ~125 dimensions represent the initial operating point. The architecture supports systematic expansion as new independent market measurements are identified and validated.

9.4 Capacity and Deployment Flexibility

This creates substantial capacity for growth:

9.5 Quarterly Database Refresh

Signature databases are updated on a quarterly cycle. Each refresh re-mines the full historical dataset, allowing the system to detect shifts in market regime and adapt accordingly. Signatures that no longer meet performance thresholds are retired; newly qualifying signatures enter the database based on their backtested metrics. This continuous curation ensures the model remains temporally stable while staying responsive to evolving market conditions.

The architecture is future-proof by design: the system lives, adapts, and stays performant over time. Rather than fitting to a static snapshot of market behavior, it maintains a living database that reflects the current structural landscape — preserving what works, discarding what no longer does, and incorporating what newly qualifies.


10. Audit and Validation

Every number in this report is verified day by day, trade by trade, through a programmatic audit system. Each deployment period is checked against 11 accounting invariants — capital conservation, budget enforcement, one-way positioning, PnL accuracy, cost verification, and equity continuity among others. Every daily equity value is reconciled: the change from one day to the next is fully explained by the trades that opened, closed, or remained active.

All audit checks pass with zero violations across every period and deployment mode presented in this document. The simulation output is deterministic and reproducible: given the same signature databases and market data, the results are identical to the last decimal.


11. Deployment Specification

ParameterValue
ConfigurationCombined multi-timeframe (1d + 8h + 4h + 2h)
ExchangeBinance USDT-M perpetual futures
UniverseBTC, ETH, SOL, SUI (expandable)
Initial Capital$500,000 (expandable to $10M+ on current pairs)
Per-TF budget1/4 of current equity
Per-slot sizing1d: 20%, 8h: 25%, 4h: 25%, 2h: 30% of equity
Profit targetsPer-signature optimal (1d: 5–10%, 8h: 5%, 4h/2h: 4–5%)
Holding periods1d: 7 days, 8h: 2.3 days, 4h: 2.3 days, 2h: 1.2 days
Costs0.50% round-trip (0.10% taker + 0.40% slippage)
Position modeONE-WAY with adjacent-couple cross-TF flip
Signal frequency~25 signals/month on 4-pair subset
Signature database2,959 validated signatures (refreshed quarterly)

12. Key Takeaways

  1. +41.5% return in 63 days of true out-of-sample trading, with 12.67 Sharpe and −0.7% max drawdown, after 0.50% round-trip costs on $500,000.
  2. The model captures genuine market structure, not noise. Performance is consistent across 4 years of in-sample data (7.76 to 12.02 Sharpe per year), including the 2022 bear market, and confirms out-of-sample.
  3. 2,959 validated signatures, each backed by 200+ historical occurrences and 90%+ win rate at optimal target with temporal robustness across calendar quarters. The signature database is refreshed quarterly to accommodate regime changes.
  4. Multi-timeframe deployment compounds the edge: four timeframes on shared capital produce 2x the return of the best individual timeframe with 78% less drawdown, through temporal capital efficiency, cross-timeframe confluence, and drawdown compression.
  5. The dimensional framework is scalable. Additional dimensions that capture information not expressible through existing ones expand the model's discriminative capacity without invalidating existing signatures — new dimensions only enhance, never disrupt what has already been acquired.
  6. Capacity for growth: universe expansion to additional liquid pairs increases signal frequency and capital utilization as the deployment scales.

Lumin | March 2026 | 2,959 Validated Signatures | 4 Timeframes | 4-Pair Universe (deployed)
110 independent audit checks passed | All results net of 0.50% round-trip costs

LUMIN> open terminal