Evidence-based investing research
Value Investing Strategy (Strategy Overview)
Allocations for June 2026 (Final)
Cash TLT LQD SPY
Momentum Investing Strategy (Strategy Overview)
Allocations for June 2026 (Final)
1st ETF 2nd ETF 3rd ETF

Investing Expertise

Can analysts, experts and gurus really give you an investing/trading edge? Should you track the advice of as many as possible? Are there ways to tell good ones from bad ones? Recent research indicates that the average “expert” has little to offer individual investors/traders. Finding exceptional advisers is no easier than identifying outperforming stocks. Indiscriminately seeking the output of as many experts as possible is a waste of time. Learning what makes a good expert accurate is worthwhile.

U.S. Treasuries Selection by AI Panel

Is the evolving set of artificial intelligence (AI) platforms based on large language models interesting as U.S. Treasuries selection advisors? Are they monolithic, or diverse? As a simple exploration, we pose to each of Grok, ChatGPT, Claude, Perplexity and Gemini the following prompt regarding the seven U.S. Treasuries exchange-traded funds (ETF) considered in “Treasuries ETFs Momentum Strategy Update/Extension”:

Using all training and real-time data available to you, please provide your unique view on whether investing in each of the following ETFs is favorable, neutral or unfavorable for the balance of 2026: BIL, SHY, VTIP, IEI, IEF, TIP and TLT. Do not provide any explanations.

We then compare and contrast results. Using responses to the prompt as posed in late May 2026, we find that: Keep Reading

Whale Wisdom

Does the informativeness of markets come from wisdom of the crowd or the wisdom of a few? If a few, who are they? In their April 2026 paper entitled “Beyond the Wisdom of the Crowd: Concentrated Informed Trading in Earnings Prediction Markets” Wan Chu Cheong and Ane Tamayo examine who makes Polymarket earnings prediction markets (whether firms will beat earnings forecasts) informative. Specifically, for each market, they:

  • Rank each wallet (trader) by the value of their net positions.
  • Compare the predictive accuracy of top-ranked traders (whales) to that of other traders.

To ensure market liquidity, they focus on 435 earnings prediction markets created since September 2025, which involve median 184 traders and $16,665 volume. They employ all 435 markets to assess prediction accuracy and 381 markets matched to stock prices for tests involving equity returns. Using wallet addresses, timestamps, trade directions (buy or sell) and dollar values, predicted outcomes (Yes or No) and prices for 309,574 Polymarket trades by 21,083 traders, actual firm earnings beats/misses and matched stock returns as available during September 2025 through February 2026, they find that:

Keep Reading

How Are AI-powered ETFs Doing?

How do exchange-traded-funds (ETF) that employ artificial intelligence (AI) to pick assets perform? To investigate, we consider ten such ETFs, eight of which are currently available:

We use SPDR S&P 500 ETF Trust (SPY) for comparison, though it is not conceptually matched to some of the ETFs. We focus on monthly return statistics, along with compound annual growth rates (CAGR) and maximum drawdowns (MaxDD). Using monthly total returns for the ten AI-powered ETFs and SPY as available through April 2026, we find that: Keep Reading

Prediction Markets Are Better than Humans as Earnings Analysts?

Are prediction markets better at forecasting firm earnings than professional analysts? In their April 2026 paper entitled “Beating the Earnings Game: Why Do Prediction Markets Outperform Professional Analysts?”, Daniel Rabetti, Jiaqi Shao and Che Zhang investigate whether and, if so, why a blockchain-based prediction market such as Polymarket outperforms professional analysts in forecasting U.S. stock earnings. The earnings predictions of this market are public and unchangeable contracts, taking the form:

“Will [Company] beat earnings for [Quarter] [Fiscal Year]?”

relative to analyst consensus as of contract creation date. Using data for 469 Polymarket firm-quarter earnings beat contracts, corresponding analyst earnings forecast data and associated daily stock prices during September 2025 through February 2026, they find that:

Keep Reading

Wisdom of a Few?

Does the empirical accuracy of prediction markets derive from crowd wisdom or an informed few? In their April 2026 paper entitled “Prediction Market Accuracy: Crowd Wisdom or Informed Minority?”, Roberto Cram, Yunhan Guo, Theis Jensen and Howard Kung investigate why prediction markets exhibit accuracy. Specifically, they compare the distribution of actual trade directions with a hypothetical distribution of random trades, and thereby classify traders as:

  • Market makers, who provide liquidity by posting limit orders.
  • Skilled traders, winners whose gains cannot be attributed to chance.
  • Other winners and other losers, who respectively earn positive and negative returns but whose performance is not statistically significant.
  • Persistent losers, who consistently and significantly lose.

Using the Polymarket universe of transactions and accounts with at least 10 trades across propositions created after the beginning of January 2023 and resolved by the end of December 2025, they find that:

Keep Reading

Managing AI Researchers

Can artificial intelligence (AI) agents based on a large language model (LLM) carry most of the load in strategic asset allocation? In their April 2026 paper entitled “The Self-Driving Portfolio: Agentic Architecture for Institutional Asset Management”, Andrew Ang, Nazym Azimbayev and Andrey Kim present a 6-step strategic asset allocation system in which:

  1. A macro agent identifies the economic regime (expansion, late-cycle, recession or recovery).
  2. Asset class agents each assigned one class run in parallel to estimate respective expected returns, expected volatilities and confidence levels.
  3. A covariance agent generates an asset class covariance matrix.
  4. Portfolio construction agents each independently employ Step 2 and 3 outputs to proposed a portfolio based on an assigned method (such as equal weight, inverse volatility, mean-variance optimization or risk parity), including:
    • A researcher agent to propose novel portfolio construction methods.
    • An adversarial agent to uncover unconventional allocation ideas.
  5. Multiple agents review all proposed portfolios and vote on them.
  6. A chief investment officer agent scores, selects and combines surviving proposed portfolios using an ensemble of seven combination methods. This agent then summarizes a final recommendation/reasoning/dissenting views.

They include a meta-agent that compares forecasted and realized returns and rewrites agent scripts to improve future performance. They specify each agent in this system via a description, a set of scripts, a collection of skills and a structured output. An Investment Policy Statement (specifying asset class universe, objective, tracking error) constrains the AI agents. Overall, this system compresses days or weeks of human work into minutes. Based on prior research and experience with LLM-based AI agents, they observe that: Keep Reading

Differences in AI and Human Financial Research

When assigned to perform the same empirical financial research, do the findings of human researchers and large language models (LLM) as a kind of artificial intelligence (AI) differ? If so, why? In their March 2026 paper entitled “AI ‘Errors'”, Wenqian Huang, Albert Menkveld and Shihao Yu compare outcomes for 158 AI model iterations (agents) to those from prior research for 164 independent human teams employing the same sample of 720 million equity index futures trades to test the same six hypotheses. They choose the GPT-5.2 LLM to construct AI agents, with variability in outcomes driven by its probabilistic decision-making. They further examine which types of research decisions drive any differences. Using outcomes from the AI and human researcher test runs, they find that: Keep Reading

Do LLM Outages Affect the Stock Market?

Is growing investor/trader use of large language models (LLM) extinguishing known stock return anomalies? In their March 2026 paper entitled “Do LLMs Make Markets More Efficient?”, Runjing Lu, Yongxin Xu and Luka Vulicevic examine how use of LLMs is affecting reactions of individual stocks to recent newsworthy events with and without outages of LLMs from three major providers (ChatGPT, Claude and Gemini). Together, these three account for nearly 80% of LLM usage. They classify outages as (1) any, (2) single-provider severe or (3) multi-provider, as documented by each provider. They focus on outages that coincide with news releases and persist beyond the NYSE close. They use RavenPack Event Sentiment Scores for articles from the Dow Jones Newswire that have ticker-specific relevance scores above 75. They control for time-varying stock/firm characteristics, past returns, new type and calendar effects. They measure daily abnormal stock returns relative to those of a characteristic-matched benchmark portfolio. Using daily outage, stock/firm and news/sentiment data during March 2023 through November 2025, they find that: Keep Reading

LLMs as Financial Advisors for Individuals

Are large language models (LLM) robust financial advisors for individuals? In their March 2026 paper entitled “AI Financial Advice: Supply, Demand, and Life Cycle Implications”, Taha Choukhmane, Tim de Silva, Weidong Lin and Matthew Akuzawa examine the personal financial advice from LLMs. They mainly use GPT-5.2 but repeat analyses using Gemini 3 Flash as a robustness check. Specifically, they:

  • Construct a life cycle model of income/spending/saving/investment, with labor market shocks and asset returns calibrated to U.S. data.
  • Collect questions (prompts) from a demographically representative sample of about 1,000 U.S. adults about spending and investing, including summaries of respective financial situations.
  • Simulate life cycle paths of individuals for each year from ages 22 to 90 who follow two-pass advice in LLM responses to prompts from survey participants matched by age, income and employment status. The first pass solicits textual advice, and the second translates text to quantified saving, spending and asset allocation recommendations.

They consider two benchmarks: (1) the optimal behaviors for the life cycle model simulations; and, (2) substitution of survey respondent prompts with expert (academic) prompts that ask the LLM to give professional life cycle advice under modern portfolio theory, including explicit personal situations/economic assumptions. Using the specified life cycle model and LLM prompts, they find that: Keep Reading

Epitome of Trading Expertise?

How strongly do profits concentrate among winners in zero-sum prediction market trading? In their March 2026 paper entitled “Who Wins and Who Loses In Prediction Markets? Evidence from Polymarket”, Pat Akey, Vincent Grégoire, Nicolas Harvie and Charles Martineau examine trading profits and losses on Polymarket, the world’s largest prediction market, to measure:

  • Profit concentration.
  • The link between prediction accuracy and profitability.
  • Characteristics of profitable and unprofitable trading.

Using the complete Polymarket transaction history (about 70 million trades by 1.4 million users) during November 2022 through October 2025, they find that: Keep Reading

Research Finder

Search 1,200+ research articles