Berkeley, CA · Founded 2026

Neural
Intelligence
On-Chain

Merkavian builds cryptographically verified AI infrastructure. We make neural network training trustless through zero-knowledge proofs and decentralized federated learning.

View PoLChain About

ZK+FL Proof-of-Learning Protocol

5.4M Candles Analyzed

Base. Sepolia Deployed

50+ Parallel Experiments

7× Signal Layers

Products

What we build

Two systems at the frontier of AI and decentralized infrastructure.

Research Protocol · UC Berkeley 2026

PoLChain

A proof-of-learning blockchain protocol that uses zero-knowledge proofs to verify gradient updates from distributed miners — making AI model training cryptographically trustless without a central aggregator.

Halo2 / EZKLFederated Learning Base Sepolia$POL Token

Explore protocol →

Live System · Running 24/7

Autonomous Trading

A self-evolving algorithmic trading system with a 7-layer signal stack, reinforcement learning loop, and evolutionary engine running 50 parallel experiments simultaneously across 15 crypto pairs.

Reinforcement LearningEvolution Engine Kraken APIOracle Cloud

View system →

Technology

Core infrastructure

∅

Zero-Knowledge Proofs

Halo2 circuits generated via EZKL verify ML training computations on-chain without revealing model weights or training data. Pixels and logits public; parameters private.

⊕

Federated Learning

Four miners train on independent MNIST shards with distinct augmentation strategies. Quality-gated FedAvg merges gradients — 50/50 if challenger beats global, 80/20 otherwise.

◈

On-Chain Verification

TaskManager, POLToken, and EZKL-generated Halo2Verifier deployed on Base Sepolia. Every gradient submission, block finalization, and reward is a verified on-chain transaction.

⟳

Evolutionary Strategies

50 parallel sandbox experiments with 8 mutation types. Evaluator runs 3-gate validation every 6h. Graduates enter 24h A/B test against champion before promotion.

≋

Multi-Signal Stack

7 independent signal layers: base rules, MTF alignment, VWAP 2σ bands, funding rate, candlestick patterns, volume profile, geometric aggregation with learned weights.

⊛

Safe Learning Pipeline

4-gate validation before any signal weight push: 50+ live trades required, ≤30% change magnitude, <14 days old, RSI threshold match. Human review step before every Oracle push.

Get in touch

Building at the frontier

UC Berkeley research at the intersection of zero-knowledge cryptography and machine learning.

jackson@merkavian.com

Research Protocol · UC Berkeley · April 2026

PoLChain

A proof-of-learning protocol demonstrating that AI model training can be made cryptographically verifiable on a live blockchain — without a trusted aggregator.

The Problem

Leading AI companies face a fundamental economic crisis. In 2024, OpenAI spent $3B on model training against $3.7B in revenue. Anthropic spent $1.5B against $2.55B. Profitability under centralized training economics is nearly impossible.

Blockchain enables a different model: AI developers issue tokens tied to model development, letting funding come from the market rather than burning capital. Miners earn tokens by contributing verified gradient updates. The critical unsolved problem — without a trusted aggregator, how do you verify miners actually trained on real data?

The Solution

Zero-knowledge proofs make gradient fraud cryptographically impossible. Each submission is paired with a Halo2 ZK proof confirming training occurred correctly — without revealing training data or model weights. Pixels and logits are public. Parameters stay private.

polchain — block #1,204

→ TaskManager.postTask() · 100 POL escrowed · deadline T+60s

4 miners: Alpha(shard0) Beta(shard1) Gamma(shard2) Delta(shard3)

→ autoMiner POST /prove/async · Flask EZKL pipeline starts

gen_witness → prove → hex_proof + instances[]

→ Delta.submitWithProof(gradHash, 94.7, proofBytes, instances)

Halo2Verifier.verifyProof() → true · ZKWorkSubmitted event

→ finalizeTask() · Winner: Delta · 100 $POL transferred

spawn aggregate.py --task_id 1204 --winner_score 94.7

FedAvg 50/50 (shard_acc > global_acc) → global_model.pth updated

System Architecture

Three layers interact in every 60-second block cycle:

◈ ML / ZK Layer

Flask Prove Server

:5001 · /train · /prove/async · /prove/status · gen_witness → ezkl.prove → proof.json

MNISTNet (model.py)

784→128→64→10 MLP · 4 shards × 10k · focus-digit oversampling · augmentation per shard

aggregate.py

FedAvg post-finalize · 50/50 or 80/20 merge · writes global_model.pth + accuracy_log.json

↑↓ POST /train · /prove/async · spawn aggregate.py · model.pth reads

◈ Off-Chain Node Services

Admin Server

Express :3001 · spawns miningLoop + autoMiner + Flask · SSE log fan-out at /logs

miningLoop.js

60s cadence · postTask → wait → finalizeTask → spawn aggregate.py · reads mode.json

autoMiner.js

Alpha/Beta/Gamma/Delta · one-block-delayed ZK · submitWithProof while next proof generates

↑↓ submitWithProof · finalizeTask · transferFrom · verifyProof · POL reward

◈ Base Sepolia L2 (chainId 84532)

POLToken.sol

ERC-20 · 1,000,000 supply · escrow via transferFrom · reward via transfer

TaskManager.sol

Core protocol · postTask · submitWork · submitWithProof · finalizeTask · calls Verifier

Halo2Verifier.sol

EZKL-generated · verifyProof(bytes, uint256[]) → bool · pairing + polynomial checks in EVM

ZK Proof Pipeline

Converting floating-point ML training into on-chain verifiable cryptography requires a precise pipeline. EZKL quantizes the neural network's operations into integer arithmetic over a finite field — enabling ZK verification while preserving computational integrity.

MNISTNet exported to ONNX (opset 11) with dummy input shape [1, 784]
ezkl.gen_settings() — public/public/private visibility model; pixels and logits on-chain, weights secret
ezkl.calibrate_settings() — adjusts quantization scales for resources target
ezkl.compile_circuit() → network.compiled (arithmetic circuit over finite field)
ezkl.get_srs() → kzg.srs (KZG structured reference string, one-time download)
ezkl.setup() → pk.key + vk.key (Halo2 proving and verifying keys)
ezkl.create_evm_verifier() → Verifier.sol → deployed to Base Sepolia
Per block: gen_witness → prove → hex_proof + instances → submitWithProof → Halo2Verifier on-chain

The Gradient-Gap Problem

The central research contribution and limitation: EZKL's quantization converts floating-point operations to integer arithmetic over a finite field. This creates a small discrepancy between what the ZK circuit proves and what the neural network actually computed. For MNISTNet this error is inconsequential — the model still converges correctly. For larger production models, this quantization error compounds across layers and could meaningfully degrade proof validity. Solving the gradient-gap problem is the prerequisite for applying PoLChain to frontier models.

Applied Proof-of-Learning: ZK-Verified Federated Learning on a Live Blockchain

Jackson Geiger · UC Berkeley · April 2026

Training AI models at scale requires substantial capital. We propose that blockchain-based federated learning with gradient verification enables a new economic model — one in which AI developers issue tokens that enable decentralized markets to fund model training. PoLChain demonstrates a Proof-of-Learning protocol on a live blockchain, where miners compete to submit gradients verified by Halo2 ZK proofs generated via EZKL.

PREPRINT

April 2026

Live System · Oracle Cloud · Running 24/7

Autonomous
Trading

A self-evolving algorithmic trading system with a 7-layer signal stack, closed reinforcement learning loop, and evolutionary engine running 50 parallel strategy experiments simultaneously.

Signal Architecture

Every trade consideration passes through seven independent signal layers before execution. Each layer produces a modifier that is geometrically aggregated using learned weights from live trade forensics — reloaded every 100 cycles.

Base Rule Fire

RSI < 45 · MACD cross · Bollinger squeeze · EMA crossover · momentum_buy

base_conf [0.45–0.75]

↓

Multi-Timeframe Gate

5m → resample 15m/1h · alignment check · HARD REJECT if BUY into 1h downtrend

modifier 1.2× or 0.7×

↓

VWAP Position + Pivot Levels

2σ standard deviation bands · S1/S2/R1/R2 pivot proximity · dynamic TP targets resistance

modifier 0.85–1.15

↓

Funding Rate

Binance perp BTC/ETH/SOL · every 2h · EXTREME_LONG avoids crowded longs

modifier 0.6–1.4

↓

Candlestick Pattern Modifier

7 patterns: hammer, engulfing, doji, shooting star, morning star, dark cloud, harami

geometric mean ∈ [0.88–1.15]

↓

Volume Profile + ADX Bucket

High/low volume hour-of-day · ADX 15-20 best bucket for mean reversion (+1.15×)

modifier 0.85–1.15

↓

Signal Aggregation

final = base × geomean(modifier_i ^ weight_i) · weights from signal_weights.json (100-cycle reload)

final_conf [0.10–0.95]

↓ if conf > 0.58

EXECUTE

CooldownManager (30m) → check_correlation_limits (4 max) → open_position() → calculate_dynamic_take_profit()

Evolution Engine

50 parallel sandbox experiments run at all times. The system continuously searches for better strategies through mutation, evaluation, and promotion — closing the loop from live trading back to strategy generation.

⊕

Experimenter

8 mutation types · SHA-256 dedupe · refills every 6h

every 6h

⊞

Tier 3 Sandbox

50 portfolios × $1k · 10 threads · hourly score recompute

30s cycle

◎

Evaluator

3-gate: perf → backtest → regime diversity · kills or graduates

every 6h +3h

⟳

Evolution Engine

24h A/B test · challenger vs champion · 1.15× threshold to promote

24h A/B

★

Champion

Hot reload · generation tracked · history archived · forensics logged

live trading

Reinforcement Learning Loop

Every closed trade feeds back into the system — adjusting signal weights, improving position sizing, and directing the next generation of experiments toward what's working in live conditions.

Trade Close → Forensics

Every closed trade records 15+ signal fields: mtf_alignment, vwap_position, funding_signal, near_support_at_entry, indicator_regime, hour_utc, signal_source, take_profit_type, and more. Forensics capped at last 500 trades.

Self-Tuner → Signal Weights

Groups trades by signal field value, calculates win rate per group. Updates signal_weights.json after 20 live trades. Weights reload into SignalAggregator every 100 cycles via _maybe_reload_weights(). Effective modifier = 1 + (raw - 1) × weight.

Adaptive Position Sizer

6 multipliers combined: confidence (0.7–1.3×), volatility (ATR%), pair quality score, adaptive (from 20-trade streak window), global consecutive loss protection (0.6× after 5 losses), pair adaptation override. Persisted to position_sizing_state.json.

Experimenter ← Signal Performance

Self-tuner writes signal_performance.json tracking win rate by source (champion, support_bounce, supertrend_breakout, vwap_reversion). Experimenter reads this when computing spawn weights — templates with >58% win rate get +35% spawn weight.

Mac → Oracle Pipeline

A safe learning pipeline validates all data before pushing to the live bot. Signal weight changes require 50+ live trades, ≤30% change magnitude, data <14 days old, and RSI threshold matching the live champion.

learning_pipeline.py --review

→ Training complete: 50 experiments · 7 periods · 100 Monte Carlo sims/exp

Fee model: 0.26% taker + 0.10% altcoin slippage applied to all results

Fingerprint cache: 12 duplicate mutations skipped instantly

→ Regime scoring complete:

ROBUST: 2 · SOLID: 5 · TRENDING: 8 · REVERSION: 6 · KILL: 29

→ Pushing to Oracle:

✅ 50 experiment classifications applied

✅ 12 high-confidence lessons pushed

✅ Oracle sync complete — evaluator will spawn new generation

→ New experiment generation deploying · next cycle in 6h

About

Building trustless
AI infrastructure

Merkavian was founded on the belief that the next breakthrough in AI economics is decentralization. The current model — where a handful of companies bear the entire cost of model training — is fundamentally unsustainable. OpenAI spent $3B training models in 2024. That's not a business, it's a subsidy.

We're building the infrastructure layer that makes distributed AI training cryptographically verifiable. PoLChain demonstrates that zero-knowledge proofs can solve the honesty problem in federated learning — making it possible for anyone to contribute compute and earn tokens, without requiring a trusted central aggregator.

Our second system proves we can apply the same rigor to production AI applications. The autonomous trading bot runs 50 parallel strategy experiments simultaneously, continuously learning from live market data through a closed reinforcement learning loop. The same principles — trustless verification, decentralized execution, cryptographic guarantees — applied to real capital.

Merkavian is based in Berkeley, California, founded by an undergraduate researcher at UC Berkeley working at the intersection of zero-knowledge cryptography, federated machine learning, and decentralized systems.

UC Berkeley ZK Cryptography Federated Learning Base Sepolia EZKL / Halo2

Jackson Geiger

Founder · Researcher

UC Berkeley undergraduate researcher focused on the intersection of zero-knowledge cryptography, federated learning, and decentralized AI infrastructure. Author of the PoLChain proof-of-learning protocol. Building systems where cryptographic guarantees replace institutional trust.

Founded2026

LocationBerkeley, CA

StagePre-seed / R&D

FocusAI + Blockchain Infra

ResearchUC Berkeley

Emailjackson@merkavian.com

Neural
Intelligence
On-Chain

What we build

PoLChain

Autonomous Trading

Core infrastructure

Zero-Knowledge Proofs

Federated Learning

On-Chain Verification

Evolutionary Strategies

Multi-Signal Stack

Safe Learning Pipeline

Building at the frontier

PoLChain

The Problem

The Solution

System Architecture

ZK Proof Pipeline

The Gradient-Gap Problem

Protocol Specs

Smart Contracts

ZK Artifacts

Tech Stack

Autonomous
Trading

Signal Architecture

Evolution Engine

Reinforcement Learning Loop

Mac → Oracle Pipeline

System Specs

Signal Stack

Risk Controls

Backtesting

Building trustless
AI infrastructure

NeuralIntelligenceOn-Chain

What we build

PoLChain

Autonomous Trading

Core infrastructure

Zero-Knowledge Proofs

Federated Learning

On-Chain Verification

Evolutionary Strategies

Multi-Signal Stack

Safe Learning Pipeline

Building at the frontier

PoLChain

The Problem

The Solution

System Architecture

ZK Proof Pipeline

The Gradient-Gap Problem

Protocol Specs

Smart Contracts

ZK Artifacts

Tech Stack

AutonomousTrading

Signal Architecture

Evolution Engine

Reinforcement Learning Loop

Mac → Oracle Pipeline

System Specs

Signal Stack

Risk Controls

Backtesting

Building trustlessAI infrastructure

Neural
Intelligence
On-Chain

Autonomous
Trading

Building trustless
AI infrastructure