Universe & data · MoEfolio

The fleet

Three machines, one bot.

Built across heterogeneous hardware so every role gets the right silicon. The Mini orchestrates and owns the database. The Studio's 256 GB unified memory hosts all four persona models simultaneously. A separate Grace Blackwell box renders the entire visual half of the show, portraits, group scenes, episode video, without touching the trade-decision loop.

Orchestrator

Mac Mini M4, the desk

FastAPI server, scanner.db owner, scheduler heartbeat. Pulls every external feed, dispatches the persona panel, manages position lifecycle, talks to Alpaca. LAN-bound at 127.0.0.1:8888 since launch, never reachable from the public internet. The whole bot runs from this box.

SQLite WAL · BEGIN IMMEDIATE writes 30 s heartbeat FastAPI + uvicorn Alpaca broker client

Brain

Mac Studio M3 Ultra, the brain

256 GB unified memory, ~230 GB realistic LLM budget. Runs the OpenAI-compatible oMLX server with all four persona models loaded simultaneously. No swapping mid-debate, no warm-up tax. The single most important hardware purchase the project will ever make.

oMLX inference server 4 personas co-resident

Studio · show production

ThinkStation PGX

NVIDIA Grace Blackwell GB10 inside. 128 GB unified memory, 1 PFLOP FP4. The visual half of MoEfolio is rendered here: persona portraits in Flux 2, identity-locked group scenes via chained PuLID + IPAdapter blocks, episode video via Wan 2.2, and per-cycle scorecard PNGs. Also keeps a very large ~130B-parameter open-weights model resident for second-opinion reasoning on high-conviction calls. Deliberately isolated from the trade-decision loop so a slow render never blocks a debate.

ComfyUI · custom node graphs Flux 2 + Wan 2.2 PuLID + IPAdapter identity chains ~130B model · second-opinion judge

The mind

Mixture of Experts, for real this time.

"Mixture of Experts" usually means a single model with internal expert routing. Here it means four actually different open-weights LLM families served simultaneously from one 256 GB Mac Studio via oMLX. Same family = same blind spots = correlated wrong answers. Different families produce different priors, different vocabularies, different hallucination modes, so when they agree, the agreement carries information.

Architecture

4 open-weights families. 4 personas. 2 teams. 1 cloud host.

Bruce, Meryl, Walter and Layla each run on a different open-weights model family, selected from non-overlapping post-training lineages so their reasoning genuinely diverges. Bull team is Bruce + Meryl; Bear team is Walter + Layla. Donald, the host and post-cycle judge, is the only cloud LLM in the loop, deliberately separated so the trade decisions stay 100% local.

We don't publish which persona maps to which family, they're characters first, models second. The diversity is by construction, the casting is private.

4 model families · zero overlap Asymmetric verdict · one team commits + the other doesn't equally oppose → trade fires Cloud host only · all trade decisions local

Asymmetric optimism floor

A bear-skewed panel can't pile on a survivor.

When a verdict resolves to AVOID, a deterministic post-check runs: if the stock is down >40% from its 52-week high and generates positive free cash flow and shows recent insider buying, the verdict is downgraded to HOLD with confidence floored at 45. Counters the panel's natural pile-on bias on distressed-but-survivable names.

3 signals required · all must trigger

Prompt sanitizer

External text can't speak as the panel.

Every external string injected into a persona prompt, news headlines, Reddit posts, transcripts, reviews, is wrapped with [external/source] markers. The personas treat anything inside those markers as third-party data, not instructions. Indirect prompt injection from a scraped feed can't hijack a verdict.

Applied at _san() · enforced repo-wide

Featured intelligence

The data sources nobody else wires up.

Most of these aren't on any other retail dashboard because they're a pain to integrate, heterogeneous APIs, scrapers that break, transcripts that need parsing, regulatory sites built in 2003. We did the engineering. The personas can read them during research cycles, and persistent feeds refresh in the background.

Intercept

YouTube transcripts of notable investors and macro analysts.

A curated watchlist of investor and macro-analyst YouTube channels, the ones serious money actually listens to. yt-dlp walks each channel's recent uploads, pulls auto-captions, and the personas scan the transcripts before voting on any ticker that was named. Not "did someone say it", "what did they actually argue and was the case good".

14+ channels watched auto-caption parsing SQLite transcript store

Echo

Public Telegram signal channels, what retail is being told to buy.

Public TG signal channels are loud, promotional, and routinely wrong, but the consensus tells you what retail is being marketed. We poll a watchlist of public channels every 15 minutes, parse out entry / stop / target levels with regex, and surface the cluster of names being pumped. The personas treat it as crowd-pressure data, not trade ideas.

30+ channels monitored entry/SL/TP extraction 15-min cron

Excavate

Internet Archive, building the past, snapshot by snapshot.

For backtests we need data the way it looked then, not the way it looks now. We pull Wayback Machine snapshots of historical Wikipedia revisions, Companies House officer pages, pre-paywall news articles, reconstructing FTSE 100 constituents pre-2023, director-resignation timelines, and the headlines that hit before they were edited. The result: a 25-year survivorship-corrected backtest universe.

25 years of S&P 500 history 1,272 distinct historical symbols gzipped snapshot store

Watch

Congressional trade disclosures, STOCK Act filings.

Every senator and representative trade above the disclosure threshold, often within days of relevant committee work. We don't trade off it; we ask the right question, did Pelosi just buy the ticker we're debating?, and let the persona panel weigh the answer. Free, public, and routinely embarrassing.

~535 lawmakers tracked cross-checked at debate time

Shadow

Famous-investor 13Fs, Burry, Ackman, Loeb, Einhorn.

When a legendary fund opens a new position or zeroes one out, it shows up in EDGAR's 13F filings, and on Dataroma within an hour. That's typically 24 to 72 hours before financial Twitter notices. We cache these walks and refresh them when a research cycle or persona tool asks, so the panel can see smart-money moves without hammering the source.

~70 managers tracked ~1 h after EDGAR posting

Cross-check

Court records, is the company we're about to buy being sued?

Federal and state filings via the CourtListener API. Before any BUY recommendation we check whether the company, or its officers by name, appears in active litigation. Securities-fraud class actions, antitrust cases, SEC enforcement actions all surface here. A matched hit doesn't auto-veto, but the panel sees it and weighs it.

~300 queries/day quota officer + entity match

Source freshness

Are the feeds actually pulling?

"81 sources" is only honest if the feeds are refreshing inside their cadence. This panel reads the live state of every claimed adapter and pipeline. Last updated just now.

—loading…

Status grammar: fresh = refreshed inside cadence. warming = barely past cadence or just spinning up. stale = significantly past cadence. failing = last attempt errored. disabled = source switched off in settings. unknown = registered but no data observed yet.

Usage grammar: used recently = cited by a panel verdict in the last 24h. fresh, unused = pulling fine but the panel has not cited it in 7 days. stale, but used = past cadence yet still cited in the last 30 days. never used = registered but no panel has cited it in the last 30 days.

Pipeline

How a verdict gets made.

Every cycle is the same eight stages, scan, gather, take, huddle, challenge, verdict, judge, execute. Each stage is independently inspectable; every output is logged and traceable back to the inputs that produced it.

Scan

Round-robin blended scanner pulls a top-20 pool: 75% alt-scanner output (sector momentum + insider clusters + earnings momentum) and 25% value-screen (below 200-week MA + positive FCF). Symbol cooldown of 4 hours prevents back-to-back debates of the same name.

Gather

83 data adapters fan out in parallel for the chosen ticker. Each cached for its own TTL, heavy 13F walks 24 h, news 15 m, quotes 30 s. Adapters silently skip when their source is unreachable; the panel sees the gap, not stale junk.

Initial takes

Each persona writes its verdict in character, calm-quantitative, growth-evangelist, macro-paranoid, sceptical-referee. Each call carries a confidence (0 to 100) and an explicit thesis the validator can grade.

Team huddles

Bull team aligns. Bear team aligns. Internal disagreement collapses into a single team verdict only if both members commit; otherwise the team is split and the asymmetric rule downgrades it.

Cross-team challenges

Layla challenges Bruce. Meryl challenges Walter. The other side responds in real time. A genuine debate, the personas have read each other's openings and react to specifics, not to a templated rebuttal.

Asymmetric verdict

A trade fires only when one team commits AND the other doesn't oppose with equal conviction. Otherwise HOLD. Conservative by design, false negatives are cheap, false positives lose paper money in public.

Cloud judge + validator

Donald (cloud) grades panel reasoning quality 0 to 100 after each cycle. A separate validation guard runs live during the debate, catching factually-wrong citations (e.g. "Q3 revenue was $X" when EDGAR says otherwise).

Execute & track

OCO bracket on Alpaca paper. Position monitored, exits on time-stop, 50w-MA cross, or verdict-reversal-guard tightening. Every closed trade is linked back to its originating verdict for outcome calibration.

Risk gates

11 deterministic checks between verdict and trade.

A BUY verdict is necessary but not sufficient. Every committed trade also passes through these eleven gates, any one of them can veto. They're deterministic, fast, and inspectable. The persona panel argues the thesis; these gates argue the math.

Gate 01 · Verdict

Final asymmetric verdict must be BUY. AVOID/HOLD/STRONG_SELL all veto here.

Gate 02 · Confidence

Effective confidence (Brier-weighted per persona) must clear the cycle's min_confidence floor.

Gate 03 · Slot

Open-position count must be below the wallet's max-positions cap. No room → veto.

Gate 04 · Already held

If we already hold the symbol, only the conviction-adder pyramid path can buy more, not a fresh entry.

Gate 05 · Sector cap

No sector can exceed its concentration ceiling. A BUY in an over-weighted sector is vetoed even if everything else passes.

Gate 06 · Drawdown

Portfolio-level drawdown from the earliest persisted equity baseline must be inside the live-mode tolerance.

Gate 07 · Wash sale

Recent close on the same symbol within the wash window blocks re-entry, even paper trades follow the rule for backtest fidelity.

Gate 08 · Correlation

Pairwise return correlation against existing book, too-correlated additions get vetoed to keep the portfolio's information ratio honest.

Gate 09 · Event blackout

Earnings, FOMC, scheduled macro release within the blackout window blocks the trade until the event passes.

Gate 10 · Statistical filter

Liquidity, beta, and volatility z-scores must be inside acceptable bands, protects against thinly-traded or pathological names.

Gate 11 · Multi-timeframe

Daily / weekly / monthly trend coherence check. A BUY on a name in active monthly downtrend is downgraded or vetoed.

Fail-CLOSED in live mode: if any signal a gate needs is missing, the trade is blocked rather than waved through. Better a missed BUY than a blind one.

Continuous learning

Every closed trade grades the panel that called it.

A verdict is a prediction. A closed trade is a verified outcome. We link them, the originating verdict ID is stored on the trade, and when the position closes, the realised P&L grades the verdict retroactively. That signal feeds back into the panel as per-persona vote weights.

Brier calibration

Confidence audit · per persona, per outcome.

For every closed trade we compute the Brier score against the originating persona's stated confidence. Persistently overconfident personas get their vote weight halved; well-calibrated ones keep theirs. The panel's effective confidence is the weighted average of its members' modifiers.

Walk-forward

25-year survivorship-corrected window.

The historical S&P 500 universe is reconstructed from Wikipedia revisions + fja05680/sp500 + Siblis Research, 1,272 distinct symbols across 25 years, including delisted names. Walk-forward backtests use the universe-as-it-was on each historical date, not survivors only.

Calibration curve · live

Does confidence 90 actually win 90% of the time?

Per-bucket win rate vs stated confidence, last 60 days of team panel verdicts. The diagonal is perfect calibration; points below are overconfident. Live chart and table at /calibration.

Outcome linking

Verdict ID → trade ID, both ways.

Every BUY verdict that fires a trade gets the trade ID stamped back onto it. Every closed trade resolves the originating verdict's outcome_pnl_pct and outcome_was_correct. Without this linkage the daily learning loop would be wishful thinking.

Autopsy

"Why I was wrong", published, not buried.

When a position closes red, a forensic post-mortem runs, names the cognitive bias, identifies which persona's reasoning broke down, and posts the result publicly to the Open Book. Wrong is fine; wrong-and-quiet is not.

Show production

Every episode is generated, not edited.

The bot doesn't just trade, it produces a daily show. Persona portraits, group scenes, episode video, social cards, and the entire YouTube format are rendered in-house on the Grace Blackwell box. No stock footage, no off-the-shelf templates, no AI-generated slop. The personas have a consistent face across every frame because we built an identity-locked workflow to make sure of it.

Portraits · Flux 2

Each persona has a locked, canonical face.

Bruce, Meryl, Walter and Layla each have a single canonical portrait set rendered in Flux 2 with carefully-tuned prompts and reference images. Once a face is locked it doesn't drift: every subsequent appearance, episode chyron, social card, podcast thumbnail, uses the same identity reference. The cast is recognisable because we made it stay recognisable.

Flux 2 · prompt + ref-image stable

Group scenes · PuLID + IPAdapter

Four people in one scene without face-swap fall-off.

Standard text-to-image collapses identity when you ask for four named people in one frame. We chained PuLID + IPAdapter blocks with per-persona conditioning so the cast portrait holds all four faces correctly. Iterated through several reference-weight schedules until intra-team identity drift stopped happening.

4 conditioning chains · 1 latent

Video · Wan 2.2

Talking-head episodes generated end-to-end.

Episode video, the daily 8 to 10 minute panel format and the 90-second vertical "Crackpot Hour" cuts for TikTok/Reels, is rendered via Wan 2.2. The transcript drives the audio track; the model lip-syncs to it; the canonical portraits anchor identity. Same persona, same voice, same vibe across every episode.

Wan 2.2 · text-to-video Daily long + vertical short

Per-cycle artifacts

Every cycle ships a Scorecard PNG.

After each debate finishes, a PNG scorecard is auto-generated showing the ticker, both team verdicts, the asymmetric outcome, and Donald's reasoning grade. Auto-posted to the broadcast channels and embeddable on social. The show isn't post-edited, it's generated as the bot thinks.

Posted to 6 Discord channels + Telegram

Universe & cadence

Two desks, steady drip.

The US desk is live on Alpaca paper. The UK desk (Sterling Desk) runs overnight UK cycles on FTSE 100 constituents through a UK-paper broker shim. Each desk has its own scheduler, its own throttle, and its own data adapters.

S&P 500 live

~540 active constituents · 1,272 historical (survivorship-corrected) · Alpaca paper-trading

1 / 30 minduring US market sessions

FTSE 100, Sterling Desk live · throttled

100 constituents from iShares ISF daily holdings · UK-paper broker · overnight cycle window

1 / cycleUS-closed only

04:00, 20:00 ETUS active window (Mon-Fri)

~32 / dayweekday US cycles

~6 LLM callsper cycle (4 personas + panel + judge)

100%local inference for trade decisions

Full inventory

Adapters available to every research cycle.

Every signal you see in a panel transcript is traceable to one of these adapters. The cycle gathers the relevant set for each ticker; heavyweight sources are cached or called on demand, and persistent feeds like YouTube, Reddit, Substack, Telegram and podcasts refresh in the background. UK-prefixed sources surface only for FTSE tickers. Adapters silently skip when their source is unreachable, so the persona panel sees the gap, not stale junk.

Fundamentals & filings

edgar10-K/10-Q facts, insider, 13D/G
tiingoEOD prices, IEX quote, meta
finnhubreal-time quote, company news
alphavantagetechnicals, earnings calendar
slickchartslive S&P 500 constituents
stooqdaily OHLCV fallback
earningssurprise history, days-until-next
analyst_revisionsconsensus drift direction
sector_strength63d relative-to-SPY

Macro & event-risk

fredCPI, unemployment, indicators
fomcmeeting calendar + statements
treasuryyield curve, auctions
cftccommitments-of-traders
cross_assetVIX, credit spreads, rates vol
earnings_seasonseason pacing
political_speechpresidential posts, market keywords
gdeltglobal event tone, top neg events
gdelt_v2_graph30d tone timeline + global coverage
weatheractive alerts (commodity-relevant)
earthquakesUSGS feed (supply-chain risk)
eia_petroleumweekly crude/gasoline/distillate
tsa_throughputdaily airline pax + YoY

Sentiment & social

news_sentimentlocally-scored polarity
rss_newsaggregated headlines
redditr/investing + subreddit chatter
stocktwitsretail bull/bear directional
aaii_sentimentweekly retail-investor survey
app_store_reviewsconsumer-app pulse
google_trendsWikipedia pageviews proxy
substack_feedsnewsletter coverage
youtube_transcriptsfamous-investor channels
telegram_signalspaid signal-channel chatter
tldr_newsletterTLDR ai/tech/founders/marketing
hacker_newsHN front-page narrative signal
wikipedia_viewspageview velocity, WoW change
box_officeweekend chart + studio attribution
steam_concurrentlive game player counts + 24h peak

Prediction markets & research

polymarketodds on policy, election, geopolitics
kalshiUS-regulated prediction markets
arxivrecent preprints, AI/biotech labs
fda_adcommFDA AdComm calendar + ClinicalTrials.gov
uspto_patentsrecent patents by company assignee

Smart money

inst_13f12 funds × Q-over-Q changes
superinvestorsDataroma famous-investor 13Fs
congresssenator/rep trades disclosure
fec_filingscorporate PAC + political contributions

Risk & enforcement

optionsIV, put/call vol, unusual strikes
polygon_optionstape-side flow detail
short_reportsactivist short publications
court_recordsCourtListener filings
sec_enforcementSEC press releases
ftc_enforcementFTC actions
doj_fraudDOJ fraud-section feed
ofac_sdnsanctions list match
cyber_alertsCVE / breach feeds
nhtsa_recallsauto-OEM recall overhang

UK desk (Sterling) in dev

ftse_constituents_yfiuacanonical FTSE 100
ishares_isfISF daily holdings CSV
uk_macroBoE base rate, CPI
uk_indicesFTSE / sector indices
uk_pressFT, Guardian, Reuters UK
uk_gazettecompany insolvencies
uk_cmacompetition cases
uk_fca_noticesFCA enforcement
uk_ico_finesdata-protection fines
uk_sfo_casesserious-fraud office
uk_sanctionsUK sanctions list
uk_short_interestFCA shortselling reports
uk_land_registryproperty transactions
uk_powerNational Grid, NESO
uk_redditr/UKInvesting
uk_reddit_deeppaginated post + comment trees
uk_trustpilotconsumer review pulse
companies_houseofficers, filings, charges
investegateUK regulatory news

Engineering principles

The non-negotiables.

Most of these are repo-wide invariants enforced at the chokepoint, not aspirations posted in a README. Each one closed a class of past bug.

Free data only

No paid APIs in the trade-decision loop. If a source needs a credit card it doesn't ship. The cost discipline keeps the project reproducible by anyone with a Mac and a weekend.

100 % local for trade decisions

Every BUY / AVOID / HOLD verdict is produced on owned hardware. The cloud host (Donald) narrates and grades after the fact, never inside the decision.

Fail-CLOSED in live mode

If a required signal is missing in live mode, the trade is blocked rather than waved through. Better a missed BUY than a blind one.

Sanitised prompts

External text is wrapped in [external/source] markers. Indirect prompt injection from a scraped feed cannot impersonate the panel.

Adapters skip silently

A broken source returns nothing, not stale data. The panel sees the gap honestly, no fabricated facts.

Atomic SQLite writes

WAL mode + BEGIN IMMEDIATE on every write. The scheduler heartbeat row is filtered out of last-fire queries, a past audit caught a reader missing the filter and double-firing.

Hard-cap yfinance threads

yf.download(threads=2) always. We were burned once by FD exhaustion wedging the database; the cap is now load-bearing.

Mini never sees the open internet

The orchestrator binds to 127.0.0.1. The public site reads from a separate LXC. Cloudflare Tunnel handles ingress with no inbound port on the home network.

Outcome-linked verdicts

Every BUY verdict that fires a trade has the trade ID stamped back onto it; every closed trade resolves the originating verdict's outcome. Without this link the daily learning loop would be wishful thinking.

Verify before claiming a fix

A bug isn't fixed until the affected flow has been triggered and the observable evidence is checked. "I refactored it and it should work now" is not a fix.

Acknowledgements

Built on the shoulders of open work.

The four local LLMs that argue every stock pick are served on a Mac Studio M3 Ultra by oMLX , an OpenAI-compatible MLX inference server purpose-built for Apple Silicon. It's the quiet backbone that lets us host four different model families simultaneously without paying a per-token bill to anyone. Thank you to the oMLX team.

The adaptive learning layer that watches every gate decision and slowly learns which contexts deserve confidence vs caution is Syntra, an open-source neural runtime by SectorOPS. Today it runs in shadow mode: it observes every trade-gate decision, suggests a policy, and learns from delayed outcome feedback, but never alters a live trade.

Thanks also to the maintainers of every adapter listed above, FRED, SEC EDGAR, Finnhub, Tiingo, the Reddit data ecosystem, the Wikipedia historical S&P 500 contributors, fja05680/sp500, Siblis Research, the iShares ISF daily-holdings export, slickcharts, and the dozens of other free public sources that make this project possible without a paid API.

Three machines. Four model families. Eighty-three data adapters. Zero paid APIs.