Signum

Automated quantitative equity trading system. Trains a LightGBM model weekly on S&P 500 data, selects top 10 stocks by predicted 5-day return, optimizes portfolio weights via HRP, and executes through Alpaca with ATR-based stop-loss/take-profit brackets.

Currently paper trading $100k on a DigitalOcean VPS. This is Bot A (main branch). Bot B (ensemble: LightGBM + CatBoost + RF + Ridge stacking) runs on the feature/comprehensive-improvements branch for A/B comparison via a Next.js dashboard.

How It Works

Every Wednesday at market open, the bot runs a full cycle:

Scrape S&P 500 tickers (Wikipedia)
        │
        ▼
Fetch 2yr daily OHLCV (yfinance, full ~500 S&P 500 universe)
        │
        ▼
Compute 22 alpha features (momentum, volatility, RSI, volume, cross-sectional ranks, VIX)
        │
        ▼
Train LightGBM (Huber loss, residual return target, 80/20 date-split with 5-day embargo)
        │
        ▼
Score all ~500 S&P 500 stocks using saved winsorization bounds
        │
        ▼
Select top 10 by predicted residual return
        │
        ▼
Optimize weights via HRP (Ledoit-Wolf covariance shrinkage)
        │
        ▼
Risk checks (position size, sector exposure, leverage, VaR, drawdown)
        │
        ▼
Execute via Alpaca (sells first, then buys, poll for fills)
        │
        ▼
Attach OCO brackets (SL = 2x ATR below fill, TP = 3x ATR above fill)
        │
        ▼
Send trade summary via Telegram, persist state, sleep until next Wednesday

Between Wednesdays the bot sleeps. GTC stop-loss and take-profit orders sit on Alpaca's servers and fire automatically.

Regime Detection

The bot monitors VIX and SPY drawdown continuously:

Regime	Condition	Action
Normal	VIX < 25, SPY DD < 8%	Full exposure
Caution	VIX 25-35 or SPY DD 8-15%	50% exposure (all weights halved)
Halt	VIX > 35 and SPY DD > 15%	Liquidate everything, wait 1 hour

De-escalation uses OR logic (either VIX or drawdown clearing allows caution).

Model

The live pipeline uses a single LightGBM model (Huber loss, residual return target). An ensemble module (ensemble.py) with LightGBM + RF exists for research but is not wired into the live trading loop on this branch.

Bot B on feature/comprehensive-improvements uses the full ensemble (LightGBM 45% + CatBoost 30% + RF 25% + Ridge stacking meta-learner).

Deployment

The bot runs on a VPS as two systemd services:

Service	What	Port
`signum-bot`	Trading bot (sleeps between Wednesdays)	—
`signum-dashboard`	Dash web UI + JSON API	8050 (localhost only)

Both auto-restart on crash and start on boot.

A Next.js A/B dashboard on Vercel compares Bot A (this branch) vs Bot B (ensemble) in real-time. See A/B Dashboard.

Quick Deploy

# 1. Clone and install
git clone https://github.com/Codeptor/signum.git
cd signum
uv sync

# 2. Configure
cp .env.example .env
# Edit .env: set ALPACA_API_KEY and ALPACA_API_SECRET
# Get keys from https://app.alpaca.markets/paper/dashboard/overview

# 3. Dry run (full ML pipeline, no orders)
uv run python examples/dry_run.py

# 4. Run locally
uv run python examples/live_bot.py

# 5. Or deploy to VPS with systemd
sudo cp deploy/signum-bot.service /etc/systemd/system/
sudo systemctl enable --now signum-bot

CLI Shortcuts (zsh)

After setup, these commands are available:

signum            SSH into the VPS
signum -h         Show all commands
signum-dash       Open dashboard (SSH tunnel + browser)
signum-status     Account, regime, bot state
signum-logs       Stream live bot logs
signum-positions  Current open positions with P&L
signum-regime     Market regime (VIX, SPY drawdown)
signum-restart    Restart the bot service
signum-stop       Stop the bot service
signum-deploy     Push local code to VPS and restart

Monitoring

Telegram Bot (Primary)

Control the bot from your phone via Telegram commands:

Command	What it does
`/status`	Bot state, regime, account overview
`/positions`	Current holdings with weights and P&L
`/equity`	Portfolio value, cash, buying power, total return
`/regime`	Market regime (VIX, SPY drawdown, exposure)
`/health`	System health check
`/trades`	Recent trade info
`/logs`	Last 20 log lines
`/help`	List all commands

Setup: create a bot via @BotFather, set TELEGRAM_BOT_TOKEN and TELEGRAM_CHAT_ID in .env.

Alerts (18 events)

Automatic Telegram alerts for all critical events:

Event	Severity
Bot startup / shutdown	INFO
Trade cycle summary (fills, equity, holdings)	INFO
Hourly heartbeat (silence = problem)	INFO
Stale data, order timeout, partial fill	WARNING
OCO bracket failure, risk violation, order rejection	WARNING
Caution mode (VIX/drawdown elevated)	WARNING
ML pipeline failure, Alpaca connection failure	CRITICAL
Halt mode, drawdown kill switch, liquidation	CRITICAL

CRITICAL alerts bypass rate limiting. All others rate-limited to 20/5min.

Transport priority: Telegram (always) > Resend > SendGrid > SMTP for email.

Dash Web UI (Bot A only)

Public at https://dashboard.bhanueso.dev (nginx reverse proxy + Let's Encrypt SSL).

Two tabs:

Live — account overview, open positions, regime beacon, equity curve, bot log viewer
Backtest — historical performance, drawdown, rolling Sharpe

A/B Dashboard

A Next.js dashboard on Vercel provides real-time side-by-side comparison of Bot A vs Bot B during paper trading. Features: dual equity chart overlay, sector exposure bars, risk metrics, positions, logs, keyboard shortcuts (1/2/R/Space), market session detection, browser notifications on regime changes.

The dashboard source lives on the feature/comprehensive-improvements branch at dashboard/. It uses a server-side API proxy to route requests to each bot's VPS endpoint.

JSON API

12 endpoints, all return structured JSON with CORS headers:

Endpoint	Returns
`GET /api`	Index of all endpoints
`GET /api/status`	System overview (regime + account + bot state)
`GET /api/account`	Alpaca account (equity, cash, buying power)
`GET /api/positions`	Open positions with unrealized P&L
`GET /api/regime`	VIX, SPY drawdown, exposure multiplier
`GET /api/equity`	Equity history time-series
`GET /api/risk`	Full risk engine output (VaR, Sharpe, drawdowns)
`GET /api/drift`	Feature drift report (PSI per feature)
`GET /api/bot`	Bot state (last trade, shutdown reason)
`GET /api/backtest`	Backtest metrics and risk summary
`GET /api/logs`	Bot log lines (`?lines=N`, default 80, max 500)
`GET /healthz`	Health check (bot liveness, alerting, data freshness)

Risk Controls

Trade-Level

Check	Limit	Severity
Max position weight	30%	Critical (blocks trade)
Max sector weight	25%	Critical
Max single trade size	15%	Critical
Max leverage	1.0x (long-only)	Critical
Max daily trades	50	Warning
Max daily turnover	100%	Warning

Portfolio-Level

Check	Limit	Action
Max drawdown	15%	Kill switch — liquidate all
VaR (95%, daily)	6%	Warning logged
Min Sharpe ratio	-0.5	Warning logged
Max volatility	30% annualized	Warning logged

Position Protection

ATR-based stop-loss (2x ATR) and take-profit (3x ATR) via OCO orders
Fallback to fixed 5% SL / 15% TP when ATR unavailable
Orphaned order cleanup every cycle
Duplicate execution prevention (checks _has_traded_today before trading)

Project Structure

signum/
├── examples/
│   ├── live_bot.py              # Main trading bot (entry point)
│   ├── dry_run.py               # ML pipeline test without orders
│   └── paper_trading_tracker.py # CLI portfolio snapshot
├── python/
│   ├── alpha/
│   │   ├── features.py          # 22 alpha features + winsorization
│   │   ├── model.py             # LightGBM (Huber loss) wrapper
│   │   ├── ensemble.py          # LightGBM + RF ensemble (research only, not wired into live)
│   │   ├── predict.py           # End-to-end: data → features → rank → optimize
│   │   └── train.py             # Training pipeline orchestrator
│   ├── portfolio/
│   │   ├── optimizer.py         # HRP, Min-CVaR, Black-Litterman, Risk Parity
│   │   ├── risk.py              # VaR, CVaR, Sharpe, Sortino, drawdowns
│   │   ├── risk_manager.py      # Real-time trade gating
│   │   └── risk_attribution.py  # Marginal/component risk, Brinson-Fachler
│   ├── bridge/
│   │   └── execution.py         # Order submission, position tracking, P&L
│   ├── brokers/
│   │   ├── base.py              # Abstract broker interface
│   │   └── alpaca_broker.py     # Alpaca Markets implementation
│   ├── data/
│   │   ├── ingestion.py         # S&P 500 scrape + yfinance OHLCV
│   │   └── sectors.py           # GICS sector map + dynamic yfinance lookup
│   ├── backtest/
│   │   ├── run.py               # Walk-forward backtest engine
│   │   ├── validation.py        # Purged k-fold CV + deflated Sharpe
│   │   ├── robustness.py        # Monte Carlo, block bootstrap, stress tests
│   │   └── regime_analysis.py   # Per-regime performance breakdown
│   └── monitoring/
│       ├── alerting.py          # Multi-channel alerts (Telegram, Resend, SendGrid, SMTP, webhook)
│       ├── telegram_cmd.py      # Telegram command handler (/status, /positions, etc.)
│       ├── dashboard.py         # Dash web UI + JSON API (12 endpoints + /healthz)
│       ├── drift.py             # KS test + PSI feature drift detection
│       └── regime.py            # VIX/SPY-based regime detector
├── deploy/
│   ├── signum-bot.service       # systemd service file (trading bot)
│   └── signum-dashboard.service # systemd service file (web dashboard)
├── tests/                       # 594 tests
├── rust/matching-engine/        # Lock-free order book (sub-microsecond)
├── run_live_bot.sh              # Bash wrapper with crash recovery
├── .env.example                 # Environment variable template
└── pyproject.toml               # Python 3.11, all dependencies

Configuration

All parameters configurable via .env:

# Alpaca (required)
ALPACA_API_KEY=your_key
ALPACA_API_SECRET=your_secret

# Strategy
TOP_N_STOCKS=10              # Stocks to hold
OPTIMIZER_METHOD=hrp         # hrp, min_cvar, risk_parity
REBALANCE_FREQUENCY=weekly   # daily or weekly
REBALANCE_DAY=2              # 0=Mon ... 4=Fri

# Risk
MAX_POSITION_WEIGHT=0.30     # 30% max per position
MAX_DRAWDOWN_LIMIT=0.15      # 15% kill switch
ATR_SL_MULTIPLIER=2.0        # Stop-loss at 2x ATR
ATR_TP_MULTIPLIER=3.0        # Take-profit at 3x ATR

# Alerts (recommended)
TELEGRAM_BOT_TOKEN=          # From @BotFather
TELEGRAM_CHAT_ID=            # Your chat ID
ALERT_WEBHOOK_URL=           # Slack/Discord webhook (optional)

Backtest Results

Walk-forward backtest on S&P 500, LightGBM alpha (22 features), residual return target, top-20 portfolio, 5-day rebalancing, VIX scaling:

Optimizer	Sharpe (net)	Ann. Return	Max DD	Avg Turnover
Equal Weight	1.66	24.6%	50.0%	36%
HRP	1.28	13.9%	40.8%	39%
Black-Litterman	0.99	10.5%	31.8%	44%

Known backtest limitations (these don't affect live trading):

Survivorship bias: uses current S&P 500 list for historical data (~1-3% annual return inflation)
Forward return overlap: 5-day returns with 5-day rebalancing inflates Sharpe by ~sqrt(overlap)
Feature leakage: backtest computes features on full dataset before train/test split (live pipeline correctly saves/loads bounds per training cycle)

Audit History

Three rounds of code audit (113+ findings resolved):

Round	Findings	Focus
1	40	Initial code review
2	56	Parallel audit by 6 agents across execution + ML pipeline
3	37	Final pre-paper-trading hardening

Key fixes: OCO order construction, train/inference winsorization parity, date-space purged k-fold, geometric Sharpe standardization, Ledoit-Wolf covariance shrinkage, regime de-escalation logic, risk manager weight tracking, dynamic sector classification.

Post-audit additions: yfinance circuit breaker, centralized alerting module (Telegram + email), Telegram command handler, /healthz endpoint, structured JSON logging.

Bot B (feature/comprehensive-improvements) extends this with ensemble models, HMM regime detection, TCA, confidence-weighted sizing, graduated drawdown control, SHAP explainability, and 1443+ tests.

Tests

uv run python -m pytest tests/ -x -q --tb=short
# 594 passed in ~80s

Coverage includes: ML pipeline (features, model, ensemble, predict), portfolio optimization, risk engine, risk manager, execution bridge, broker integration, backtest validation, robustness analysis, live bot helpers, alerting (Telegram, SendGrid, SMTP, webhook), Telegram command handler, and full integration tests.

Tech Stack

Layer	Technology
ML	LightGBM, scikit-learn (Random Forest)
Portfolio	skfolio (HRP, CVaR, BL), Ledoit-Wolf shrinkage
Data	yfinance, pandas
Risk	scipy, numpy
Monitoring	Dash, Plotly, Flask (JSON API)
Alerting	Telegram Bot API, Resend, SendGrid, SMTP, webhooks
Broker	Alpaca Markets API (`alpaca-trade-api`)
Infra	DigitalOcean VPS, systemd, nginx, Let's Encrypt
Matching Engine	Rust, BTreeMap, Criterion.rs

Design Decisions

Residual return target: The model predicts stock-specific alpha (returns minus cross-sectional mean), not absolute returns. This means the model learns which stocks outperform relative to the average — works well in bull/flat markets, but in broad selloffs all positions lose money (regime detection partially mitigates this).

Weekly rebalancing: Reduces transaction costs ~74% vs daily. The model predicts 5-day forward returns, matching the rebalance frequency.

HRP over mean-variance: HRP uses hierarchical clustering + recursive bisection — no covariance matrix inversion required. More stable with 10 stocks and noisy correlation estimates.

Long-only with market-neutral model: The model is trained on residual returns but the bot only takes long positions. This wastes half the model's discriminative power by design. A long-short structure would capture more alpha but adds complexity (margin, locate fees, short squeeze risk) that isn't warranted during paper trading validation.

ATR-based brackets over fixed percentages: 2x ATR stop-loss adapts to each stock's volatility. A volatile stock gets a wider stop, a stable stock gets a tighter one. Falls back to fixed 5%/15% when ATR data is unavailable.

Name		Name	Last commit message	Last commit date
Latest commit History 176 Commits
.dvc		.dvc
.github/workflows		.github/workflows
deploy		deploy
docs		docs
examples		examples
infra		infra
python		python
rust/matching-engine		rust/matching-engine
tests		tests
.dvcignore		.dvcignore
.env.example		.env.example
.gitignore		.gitignore
AGENTS.md		AGENTS.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
dvc.yaml		dvc.yaml
live_bot.log		live_bot.log
pyproject.toml		pyproject.toml
research.md		research.md
run_live_bot.sh		run_live_bot.sh
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Signum

How It Works

Regime Detection

Model

Deployment

Quick Deploy

CLI Shortcuts (zsh)

Monitoring

Telegram Bot (Primary)

Alerts (18 events)

Dash Web UI (Bot A only)

A/B Dashboard

JSON API

Risk Controls

Trade-Level

Portfolio-Level

Position Protection

Project Structure

Configuration

Backtest Results

Audit History

Tests

Tech Stack

Design Decisions

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Signum

How It Works

Regime Detection

Model

Deployment

Quick Deploy

CLI Shortcuts (zsh)

Monitoring

Telegram Bot (Primary)

Alerts (18 events)

Dash Web UI (Bot A only)

A/B Dashboard

JSON API

Risk Controls

Trade-Level

Portfolio-Level

Position Protection

Project Structure

Configuration

Backtest Results

Audit History

Tests

Tech Stack

Design Decisions

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages