What's a walk-forward backtest?

Walk-forward means we step through history one period at a time. At each rebalance date (weekly, monthly, or quarterly depending on your strategy's cadence), we mark the portfolio to market, optionally rotate holdings, and roll forward to the next date. Returns compound over time. The output is an equity curve, annualized return, Sharpe ratio, and max drawdown.

How long can my backtest period be?

1, 3, or 5 years. Anything else gets coerced to 3. Five years is the maximum because EODHD historical data costs scale with universe size and lookback length, and we want to keep backtests fast (<10 seconds typically) and free for all tiers.

Why are some of my historical returns suspiciously good?

Survivorship bias. Our backtest uses today's universe snapshot for the entire historical period. A ticker that delisted 4 years ago is invisible (so a 'failed' bet never shows up); a ticker that just IPO'd in 2024 looks like it always existed. This makes historical performance look better than reality. We're transparent about this — eliminating it requires loading the universe-as-it-was on each historical date, which is a future feature.

Does backtest cost credits?

Yes — backtest_strategy is a heavy tool that costs 5 credits per run. Standard quotes and screener calls cost 1 credit. Basic users get 200 free credits per month, which is enough for ~40 backtests. Heavier users buy credit packs (starting at $5 for 500 credits).

Can I backtest a Magic Formula strategy?

Yes — as of v3.5.21 the backtest engine correctly honors ranking blocks. Earlier versions silently used the legacy 'top 10 by market cap' default for ranking strategies, producing meaningless results. If you have a strategy created before v3.5.21 you may want to re-run its backtest now.

Guide 2026-05-05 · By Julie Derthing, Chief Staff Assistant at Seentio

How to Backtest a Strategy on Seentio

Q: Why are some of my historical returns suspiciously good?

Survivorship bias. Our backtest uses today's universe snapshot for the entire historical period. A ticker that delisted 4 years ago is invisible (so a 'failed' bet never shows up); a ticker that just IPO'd in 2024 looks like it always existed. This makes historical performance look better than reality. We're transparent about this — eliminating it requires loading the universe-as-it-was on each historical date, which is a future feature.

Q: Does backtest cost credits?

Yes — backtest_strategy is a heavy tool that costs 5 credits per run. Standard quotes and screener calls cost 1 credit. Basic users get 200 free credits per month, which is enough for ~40 backtests. Heavier users buy credit packs (starting at $5 for 500 credits).

Q: Can I backtest a Magic Formula strategy?

Yes — as of v3.5.21 the backtest engine correctly honors ranking blocks. Earlier versions silently used the legacy 'top 10 by market cap' default for ranking strategies, producing meaningless results. If you have a strategy created before v3.5.21 you may want to re-run its backtest now.

Backtesting answers one question: "If I'd been running this strategy for the last few years, how would it have done?" The answer is never perfect — past performance never guarantees future results, and even our walk-forward methodology has limitations we'll be honest about. But it's the best directional signal available before you commit real capital.

This guide walks you through running a backtest on any strategy you've saved on Seentio.

What you'll learn

How to run a 1, 3, or 5-year walk-forward backtest on any saved strategy via Claude Code, what the output means, and what backtests don't tell you.

Prerequisites

A saved strategy (see How to create a strategy)
Either Claude Code with MCP installed, or access to the Strategies page on the web (both surfaces hit the same backend)
5 AI credits per backtest (Basic tier has 200/month free, plenty for a few dozen runs)

Step-by-step walkthrough

1. Identify the strategy you want to test

Ask Claude:

What strategies do I have?

Claude calls list_strategies and shows you a table with name, status (draft/active/paused), and creation date. Pick the one you want to backtest. If you've just created a Magic Formula, that's the one.

2. Run the backtest

Backtest 'Greenblatt Magic Formula' for 3 years against SPY.

Claude calls backtest_strategy with your strategy's full definition, period_years=3, and benchmark="SPY". This call takes 3-8 seconds typically — we're fetching daily price history for up to 30 stocks plus the benchmark, then walking forward through ~750 trading days.

3. Read the output

You'll get back something like this:

Metric	Strategy	Benchmark (SPY)
Total return	+47.2%	+28.5%
Annualized return	+13.7%	+8.7%
Sharpe ratio	0.96	0.71
Max drawdown	-18.4%	-14.2%
Volatility (annualized)	14.3%	12.4%
Number of rebalances	36	n/a

Plus a rebalance log showing the most recent 3 rebalance events: which tickers were added, which dropped, the order quantities. Useful for sanity-checking that the strategy actually did what you expected.

4. Compare against the benchmark

The benchmark is your "do nothing different" baseline. SPY (S&P 500), QQQ (Nasdaq 100), IWM (Russell 2000), and DIA (Dow 30) are the four supported choices.

Some questions to ask:

Did your strategy beat the benchmark? Annualized return is the headline number. A 5%+ outperformance over 3 years is meaningful; less than 2% is noise.
Did it do so with similar risk? Look at Sharpe ratio (higher = better risk-adjusted return) and max drawdown (less-negative = lower worst-case loss). A strategy that beats SPY by 5% but has a 40% drawdown is not necessarily a good trade.
Is the win consistent? A strategy that's +50% one year and -10% the next has the same average as one that's +20% per year, but the second is much easier to live with. The volatility number captures this.

5. Decide what to do next

Three honest options:

Activate if the backtest looks reasonable and you understand the caveats below: "Activate it on my ETrade, email alerts only."*
Iterate if it doesn't: ask Claude to tweak the spec (different top_n, different exclusions, different cadence) and re-backtest. Each iteration costs 5 credits.
Walk away if no version looks compelling. Save the strategy as a draft for later or delete it.

What backtests DON'T show you

Be honest with yourself about the limitations:

Survivorship bias

Our backtest uses today's universe snapshot for the entire historical period. A company that delisted 3 years ago is invisible to the backtest. If your Magic Formula would have picked a stock that subsequently went to zero, you don't see that loss — because the stock isn't in today's universe.

This biases historical returns upward. The bias is bigger for longer lookback periods and for strategies that pick from broader universes. Eliminating it requires loading the universe-as-it-was on each historical rebalance date, which is a feature on our roadmap.

Trading costs

Backtests assume you got fills at the official close price with zero commission and zero slippage. Real-world trading has spreads, market impact (especially for smaller stocks), and commission (though most retail brokers are zero-commission for stocks now). For monthly-cadence strategies on liquid large-caps, this is small. For weekly strategies on small-caps, it's not.

Behavioral risk

The backtest shows what would have happened if you'd held the strategy through every drawdown without flinching. Real investors flinch. A backtest with -18% max drawdown means at some point you'd have been down 18% from the peak — would you really have stayed the course?

The future is not the past

The most important caveat. Strategies that worked historically can stop working. Factor strategies (value, momentum, quality) all go through extended periods of underperformance. The Magic Formula spent 2018-2020 underperforming SPY before recovering. Backtests show what the past looked like — they cannot predict what the next 5 years will look like.

Tips & common questions

Run multiple periods, not just one. A 1-year backtest can be dominated by recent narrative-driven moves; a 5-year backtest tells you whether the strategy works across regimes (rising rates vs falling rates, growth vs value rotations). Compare 1y, 3y, and 5y outputs side by side.
Read the rebalance log. It tells you which tickers actually drove the returns. If 80% of the gain came from one or two names, the strategy might just be a momentum bet on those names rather than a real factor strategy.
Don't over-optimize. It's tempting to tweak parameters until the backtest looks perfect. That's curve-fitting, and it doesn't generalize. If a strategy needs 7 specific parameters to look good, it's probably not a real strategy.
Compare to a simpler alternative. If your fancy multi-factor strategy barely beats top_n_large_cap (the boring buy-the-biggest-stocks template), the complexity isn't earning its keep.
The rebalance count matters. A monthly-rebalance strategy over 3 years should have ~36 rebalances. If yours has 5, something's wrong with the cadence config — ask Claude to check.

What's next

Create your first strategy using Claude Code if you haven't already.
Once you activate, learn how the alert system fires when monthly rebalances are due.
Understand what "live strategy" means before connecting your brokerage.

Need more help? Visit our Chat page to ask Seentio's AI assistant, or reach out at contact@app-seentio.com.