CLI Reference
Install the package (or run python -m eurekaclaw) to get the eurekaclaw command.
Global Options
| Flag | Description |
|---|---|
| | Enable DEBUG logging |
Commands
prove — Prove a conjecture
eurekaclaw prove "<conjecture>" [OPTIONS]
Arguments:
conjecture — The mathematical conjecture or claim to prove (string)
Options:
| Option | Default | Description |
|---|---|---|
| --domain | | Research domain. Auto-inferred from conjecture if omitted |
| | | Post-run learning mode: |
| --skills | (all) | Pin specific skills by name (repeatable). Pinned skills always appear first in the injection regardless of usage score |
| --gate | | Gate control: |
| --output | | Output directory for artifacts |
Example:
eurekaclaw prove "UCB1 achieves O(sqrt(KT log T)) expected cumulative regret in the stochastic multi-armed bandit setting" \
--domain "multi-armed bandit theory" \
--skills ucb_regret_analysis --skills concentration_inequalities \
--gate human \
--output ./results
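The pinning behaviour described for --skills can be pictured as a two-key sort. The following is a minimal sketch, not EurekaClaw's actual code: the `Skill` record, its `usage_score` field, the `order_for_injection` helper, and the skill name `martingale_bounds` are hypothetical, and ranking the unpinned skills by descending score is an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Skill:
    name: str            # hypothetical record; the real skill schema is not shown here
    usage_score: float


def order_for_injection(skills, pinned_names):
    """Return skills with pinned ones first, then the rest by descending score."""
    pinned_rank = {name: i for i, name in enumerate(pinned_names)}

    def sort_key(skill):
        if skill.name in pinned_rank:
            # Pinned skills sort ahead of everything, in the order they were pinned.
            return (0, pinned_rank[skill.name], 0.0)
        # Unpinned skills follow, highest usage score first (assumed policy).
        return (1, 0, -skill.usage_score)

    return sorted(skills, key=sort_key)


bank = [
    Skill("martingale_bounds", 0.9),
    Skill("ucb_regret_analysis", 0.2),
    Skill("concentration_inequalities", 0.5),
]
ordered = order_for_injection(
    bank, ["ucb_regret_analysis", "concentration_inequalities"]
)
print([s.name for s in ordered])
```

Even though `martingale_bounds` has the highest score in this made-up bank, the two pinned skills come first, which is the behaviour the option description promises.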
explore — Explore a research domain
eurekaclaw explore "<domain>" [OPTIONS]
Arguments:
domain — The research domain to explore (string)
Options:
| Option | Default | Description |
|---|---|---|
| --query | | Specific research question within the domain |
| | | Post-run learning mode: |
| | | Gate control: |
| --output | | Output directory for artifacts |
Example:
eurekaclaw explore "multi-armed bandit theory" \
--query "tight regret bounds for heavy-tailed rewards" --output ./results
from-papers — Generate hypotheses from reference papers
eurekaclaw from-papers <paper_id> [<paper_id> ...] [OPTIONS]
Arguments:
paper_ids — One or more arXiv IDs or Semantic Scholar IDs (variadic)
Options:
| Option | Default | Description |
|---|---|---|
| --domain | (required) | Research domain |
| | | Specific research question or focus within the papers |
| | | Post-run learning mode |
| | (all) | Pin specific skills by name (repeatable). Pinned skills always appear first in the injection regardless of usage score |
| | | Gate control |
| --output | | Output directory |
Example:
eurekaclaw from-papers 1602.01783 2301.00774 \
--domain "bandit algorithms" --output ./results
pause — Pause a running session
eurekaclaw pause <session_id>
Arguments:
session_id — Session ID of the running proof to pause (found in the console header at startup)
Writes a pause.flag file to ~/.eurekaclaw/sessions/<session_id>/. The theory agent detects this flag at the next stage boundary, saves a checkpoint, and exits cleanly with a ProofPausedException. The partial proof state is preserved in ~/.eurekaclaw/sessions/<session_id>/checkpoint.json.
You can also pause by pressing Ctrl+C during a run. EurekaClaw intercepts SIGINT and writes the pause flag instead of raising KeyboardInterrupt, giving the agent time to reach a clean checkpoint boundary.
Example:
# In a separate terminal while a proof is running:
eurekaclaw pause abc12345
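The flag-file handshake described above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the project's implementation: the helper names (`session_dir`, `request_pause`, `pause_requested`, `install_sigint_pause`) are hypothetical, and the flag is assumed to be an empty file whose mere presence signals the request.

```python
import signal
import tempfile
from pathlib import Path


def session_dir(session_id, root=None):
    """Directory that holds a session's pause flag and checkpoint."""
    root = Path(root) if root is not None else Path.home() / ".eurekaclaw" / "sessions"
    return root / session_id


def request_pause(session_id, root=None):
    """Create pause.flag for the session, as `eurekaclaw pause` is described as doing."""
    flag = session_dir(session_id, root) / "pause.flag"
    flag.parent.mkdir(parents=True, exist_ok=True)
    flag.touch()  # assumed: an empty file is enough to signal the request
    return flag


def pause_requested(session_id, root=None):
    """Poll for the flag; an agent would call this at each stage boundary."""
    return (session_dir(session_id, root) / "pause.flag").exists()


def install_sigint_pause(session_id, root=None):
    """Turn Ctrl+C into a pause request instead of a KeyboardInterrupt."""
    def handler(signum, frame):
        request_pause(session_id, root)
    # signal.signal returns the previous handler so the caller can restore it.
    return signal.signal(signal.SIGINT, handler)


# Try it against a scratch directory rather than the real ~/.eurekaclaw tree.
with tempfile.TemporaryDirectory() as scratch:
    before = pause_requested("abc12345", root=scratch)
    flag_path = request_pause("abc12345", root=scratch)
    after = pause_requested("abc12345", root=scratch)
print(before, after)
```

Polling a file at stage boundaries keeps the pause cooperative: the running process decides when it is safe to stop, which is why the checkpoint can be written cleanly.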
resume — Resume a paused session
eurekaclaw resume <session_id>
Arguments:
session_id — Session ID of the paused proof to continue
Loads the checkpoint from ~/.eurekaclaw/sessions/<session_id>/checkpoint.json and re-runs the theory agent starting from the saved stage, with all previously proved lemmas already in TheoryState. Passes the same domain and query as the original session.
Example:
eurekaclaw resume abc12345
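Before resuming, it can be useful to check that a checkpoint exists and to look at what it contains. The sketch below only locates and parses the JSON file at the path given above; `checkpoint_path` and `load_checkpoint` are hypothetical helpers, and the keys written in the demo are placeholders, since the real checkpoint schema is not documented here.

```python
import json
import tempfile
from pathlib import Path


def checkpoint_path(session_id, root=None):
    """Location of a session's checkpoint file."""
    root = Path(root) if root is not None else Path.home() / ".eurekaclaw" / "sessions"
    return root / session_id / "checkpoint.json"


def load_checkpoint(session_id, root=None):
    """Return the parsed checkpoint, or None if no checkpoint file exists."""
    path = checkpoint_path(session_id, root)
    if not path.is_file():
        return None
    with path.open(encoding="utf-8") as fh:
        return json.load(fh)


# Demo with a stand-in checkpoint; these keys are placeholders, not the real schema.
with tempfile.TemporaryDirectory() as scratch:
    target = checkpoint_path("abc12345", root=scratch)
    target.parent.mkdir(parents=True)
    target.write_text(
        json.dumps({"stage": "placeholder", "lemmas": []}), encoding="utf-8"
    )
    found = load_checkpoint("abc12345", root=scratch)
    missing = load_checkpoint("no_such_session", root=scratch)
print(sorted(found), missing)
```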
replay-theory-tail — Replay theory tail stages
eurekaclaw replay-theory-tail <session_id> [OPTIONS]
Arguments:
session_id — Session ID of a completed run
Options:
| Option | Default | Description |
|---|---|---|
| --from | | Stage to restart from: |
Re-runs the final stages of the theory pipeline (Assembler → TheoremCrystallizer → ConsistencyChecker) from a saved theory_state.json without repeating the survey, planning, or lemma proving phases. Useful for quickly iterating on crystallization or consistency-check failures.
Example:
eurekaclaw replay-theory-tail abc12345 --from assembler
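Restarting from a named stage amounts to slicing an ordered stage list. The sketch below follows the Assembler, TheoremCrystallizer, ConsistencyChecker order given above; only `assembler` appears in this reference as a stage identifier, so the other two identifiers and the `stages_from` helper are assumptions made for illustration.

```python
# Tail stages in pipeline order. Only "assembler" is confirmed by the example
# above; the other two identifiers are guesses based on the component names.
TAIL_STAGES = ("assembler", "theorem_crystallizer", "consistency_checker")


def stages_from(start, stages=TAIL_STAGES):
    """Return the stages to re-run, beginning at `start` and keeping pipeline order."""
    if start not in stages:
        raise ValueError(f"unknown tail stage {start!r}; expected one of {list(stages)}")
    return list(stages[stages.index(start):])


print(stages_from("assembler"))

# A stage outside the tail (for example the survey phase) is rejected.
try:
    stages_from("survey")
except ValueError as exc:
    error = str(exc)
print(error)
```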
test-paper-reader — Test PaperReader on a single paper
eurekaclaw test-paper-reader <session_id> <paper_ref> [OPTIONS]
Arguments:
session_id — Session ID of a completed run whose bibliography to use
paper_ref — Paper ID, arXiv ID, or case-insensitive substring of the title
Options:
| Option | Default | Description |
|---|---|---|
| --mode | | Extraction mode: |
| | | Research direction override for extraction prompts |
Exercises PaperReader’s abstract and/or PDF extraction on a single bibliography entry without running the full pipeline.
Example:
eurekaclaw test-paper-reader abc12345 "UCB1" --mode both
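The three ways of writing paper_ref can be illustrated with a small lookup function. This is a sketch, not the PaperReader code: the entry fields (`paper_id`, `arxiv_id`, `title`), the `find_paper` helper, and the choice to try exact ID matches before title substrings are all assumptions, and the bibliography entries are placeholders.

```python
def find_paper(bibliography, paper_ref):
    """Resolve paper_ref by exact ID first, then by case-insensitive title substring."""
    ref = paper_ref.strip()
    for entry in bibliography:
        if ref in (entry.get("paper_id"), entry.get("arxiv_id")):
            return entry
    needle = ref.casefold()
    for entry in bibliography:
        if needle in entry.get("title", "").casefold():
            return entry
    return None


# Placeholder entries; IDs and titles are made up for the demonstration.
bibliography = [
    {"paper_id": "p1", "arxiv_id": "0000.00001", "title": "A Placeholder Title About UCB1 Regret"},
    {"paper_id": "p2", "arxiv_id": "0000.00002", "title": "Another Placeholder on Heavy Tails"},
]

print(find_paper(bibliography, "ucb1")["paper_id"])
print(find_paper(bibliography, "0000.00002")["paper_id"])
print(find_paper(bibliography, "no such paper"))
```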
onboard — Interactive configuration wizard
eurekaclaw onboard [OPTIONS]
Options:
| Option | Description |
|---|---|
| | Write defaults without prompting |
| | Overwrite existing |
| --env-file | Path to the |
Walks you through LLM backend selection, API key setup, search tools, and system behaviour, then writes (or updates) .env.
Example:
eurekaclaw onboard
eurekaclaw onboard --env-file ~/.eurekaclaw/.env
skills — List available skills
eurekaclaw skills
Prints a Rich panel listing all skills in the skill bank with:
Skill name
Tags
Description
Source (seed, distilled, or manual)
eval-session — Evaluate a completed session
eurekaclaw eval-session <session_id>
Arguments:
session_id — Session ID from a previous run (found in run directory name)
Prints an evaluation report with proof quality metrics.
install-skills — Install seed skills
eurekaclaw install-skills [SKILLNAME] [--force]
Arguments:
skillname (optional) — Install a specific skill from clawhub by name
Options:
| Option | Description |
|---|---|
| --force | Overwrite existing skills in |
Without arguments, copies all bundled seed skills from the package to ~/.eurekaclaw/skills/. When a skill name is provided, downloads that skill from clawhub instead.
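The copy step for bundled seed skills, including the effect of --force, might look roughly like the following. This is a sketch under assumptions: `install_seed_skills` is a hypothetical helper, skills are assumed to be files or directories inside a seed folder, and skipping existing entries when --force is absent is inferred from the option description rather than confirmed.

```python
import shutil
import tempfile
from pathlib import Path


def install_seed_skills(seed_dir, target_dir, force=False):
    """Copy each seed skill into target_dir; skip existing entries unless force is set."""
    seed_dir, target_dir = Path(seed_dir), Path(target_dir)
    target_dir.mkdir(parents=True, exist_ok=True)
    copied = []
    for src in sorted(seed_dir.iterdir()):
        dest = target_dir / src.name
        if dest.exists() and not force:
            continue  # keep the user's copy (assumed default behaviour)
        if src.is_dir():
            shutil.copytree(src, dest, dirs_exist_ok=True)
        else:
            shutil.copy2(src, dest)
        copied.append(src.name)
    return copied


# Demo in a scratch directory; "example_skill.md" is a made-up file name.
with tempfile.TemporaryDirectory() as scratch:
    seeds = Path(scratch) / "seeds"
    target = Path(scratch) / "skills"
    seeds.mkdir()
    (seeds / "example_skill.md").write_text("seed version", encoding="utf-8")

    first = install_seed_skills(seeds, target)
    (target / "example_skill.md").write_text("local edit", encoding="utf-8")

    second = install_seed_skills(seeds, target)             # existing file left alone
    kept = (target / "example_skill.md").read_text(encoding="utf-8")

    third = install_seed_skills(seeds, target, force=True)  # existing file overwritten
    replaced = (target / "example_skill.md").read_text(encoding="utf-8")
print(first, second, third)
```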
ui — Launch the browser UI
eurekaclaw ui [OPTIONS]
Options:
| Option | Default | Description |
|---|---|---|
| | | Interface to bind to |
| | | Port to listen on |
| --open-browser | False | Auto-open browser on start |
Example:
eurekaclaw ui --open-browser
Output Artifacts
All three research commands (prove, explore, from-papers) write artifacts to <output>/<session_id>/:
<output>/<session_id>/
├── paper.tex LaTeX source
├── paper.pdf Compiled PDF (requires pdflatex + bibtex)
├── references.bib Bibliography in BibTeX format
├── theory_state.json Full proof state (lemmas, proofs, status)
├── research_brief.json Planning state (directions, selected direction)
└── experiment_result.json Numerical validation results (if run)
Paused sessions also write a checkpoint to ~/.eurekaclaw/sessions/<session_id>/checkpoint.json.
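A quick way to see which of the files listed above a run actually produced is to check the run directory against that list. The sketch below uses the file names from the tree; the `artifact_status` helper is hypothetical, and the demo builds a stand-in run directory rather than reading a real one.

```python
import tempfile
from pathlib import Path

# File names taken from the artifact tree above.
EXPECTED_ARTIFACTS = [
    "paper.tex",
    "paper.pdf",
    "references.bib",
    "theory_state.json",
    "research_brief.json",
    "experiment_result.json",
]


def artifact_status(run_dir):
    """Map each expected artifact name to whether it exists in run_dir."""
    run_dir = Path(run_dir)
    return {name: (run_dir / name).is_file() for name in EXPECTED_ARTIFACTS}


# Demo with a stand-in run directory containing two of the six files.
with tempfile.TemporaryDirectory() as scratch:
    run_dir = Path(scratch) / "abc12345"
    run_dir.mkdir()
    (run_dir / "paper.tex").write_text("% placeholder", encoding="utf-8")
    (run_dir / "theory_state.json").write_text("{}", encoding="utf-8")
    status = artifact_status(run_dir)

missing = [name for name, present in status.items() if not present]
print(missing)
```

A missing paper.pdf alongside a present paper.tex would point at the LaTeX toolchain noted in the tree, and a missing experiment_result.json is expected when no numerical validation was run.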
Theory Review Gate
After the Theory Agent finishes and before the Writer runs, EurekaClaw displays a numbered proof sketch and asks for approval:
──────────────── Proof Sketch Review ────────────────
L1 [✓] arm_pull_count_bound verified
For arm a with mean gap Δ_a ...
L2 [~] regret_decomposition low confidence
Total regret decomposes as ...
L3 [✓] main_theorem verified
UCB1 achieves O(√(KT log T)) regret ...
──────────────────────────────────────────────────────
Does this proof sketch look correct?
y — Proceed to writing
n — Flag the most logically problematic step
→
y / Enter — proceed to the WriterAgent
n — you are asked which step has the most critical logical gap (e.g. L2 or the full lemma ID) and to describe the issue. The TheoryAgent re-runs once with your feedback injected into its task, then shows the updated sketch one more time.
The theory review gate is always shown regardless of --gate mode.
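The control flow of the gate can be summarised in a short function. This is only a sketch of the behaviour described above: `review_gate`, its `ask` and `rerun_theory` callbacks, and the prompt wording are hypothetical, and re-asking on any reply other than y, Enter, or n is an assumption.

```python
def review_gate(sketch, ask, rerun_theory):
    """Return (sketch_to_write_up, reran) after one round of human review.

    ask(prompt) returns the user's reply as a string;
    rerun_theory(step, issue) returns an updated proof sketch.
    """
    while True:
        answer = ask("Does this proof sketch look correct? [y/n] ").strip().lower()
        if answer in ("", "y"):
            return sketch, False       # approved: hand over to the writer
        if answer == "n":
            break                      # flagged: collect feedback below
        # Any other reply: ask again (assumed behaviour).
    step = ask("Which step has the most critical logical gap? ").strip()
    issue = ask("Describe the issue: ").strip()
    # The theory stage re-runs once with the feedback.
    return rerun_theory(step, issue), True


# Scripted demo: the reviewer rejects the sketch and flags step L2.
scripted = iter(["n", "L2", "The decomposition skips a step."])
calls = []


def fake_rerun(step, issue):
    calls.append((step, issue))
    return "updated sketch"


result = review_gate("original sketch", lambda prompt: next(scripted), fake_rerun)
print(result, calls)
```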
Exit Codes
| Code | Meaning |
|---|---|
| | Success — paper generated |
| | Runtime error (see console output) |