GitHub - lewismj/eta: A Lisp System

η (Eta)
A Lisp/Scheme-inspired language with built-in logic programming,
automatic differentiation, neural networks, and causal inference.

Quick Start · Build from Source · Architecture · NaN-Boxing · Bytecode & VM · Compiler · Optimization · Runtime & GC · Finalizers & Guardians · Networking · Message Passing · Modules & Stdlib · Next Steps · Release Notes

Language Guide

Basics · Reverse Mode AAD (Finance Examples) · Logic Programming – Unification and Backtracking · Constraint Logic Programming · Causal Inference & Do-Calculus · Fact Tables · Eigen: Linear Algebra, Statistics · LibTorch: Neural Networks

Featured Examples

Causal Decision Engine for Portfolio Optimisation

What is Eta?

Eta is a Scheme-like language designed for symbolic reasoning, differentiable programming, neural-network training, and causal inference — all from a single S-expression syntax. It compiles to compact bytecode and runs on a stack-based VM implemented in C++ (NaN-boxed values, closures, TCO, call/cc, hygienic macros, module system — see Architecture for the full compilation pipeline).

Headline Features

Domain	What you get	Deep-dive
Scheme Core	Closures, tail-call elimination, first-class continuations (`call/cc`), hygienic `syntax-rules` macros, module system with import filters	Bytecode & VM · Modules
Logic Programming	VM-native structural unification & backtracking — seven dedicated opcodes give you Prolog-style pattern matching without leaving the language	Logic
Constraint Logic Programming	`clp(Z)` integer-interval and `clp(FD)` finite-domain solvers built on the unification layer	CLP
Reverse-Mode AAD	VM-native tape-based automatic differentiation — standard arithmetic is recorded transparently when a `TapeRef` operand is present; zero closure overhead	AAD – Finance Examples
Linear Algebra & Statistics	Eigen-backed multivariate OLS, covariance/correlation matrices, column quantiles over FactTables; plus `std.stats` descriptive stats, CIs, t-tests, and simple OLS over lists	Stats
Neural Networks (libtorch)	Native C++ bindings to PyTorch's backend — tensors, autograd, NN layers, optimizers, and GPU offload from Eta code	LibTorch
Causal Inference	Pearl's do-calculus engine, back-door / front-door adjustment, and end-to-end factor analysis	Causal
Message Passing & Actors	Erlang-style actor model via nng: `spawn` child processes, `send!`/`recv!` over PAIR sockets, `worker-pool` parallel fan-out, REQ/REP, PUB/SUB, SURVEYOR/RESPONDENT, and supervision trees (`one-for-one`, `one-for-all`) — network-transparent across machines	Networking · Message Passing · Supervisors
End-to-End Pipeline	All domains compose: symbolic differentiation → do-calculus identification → logic/CLP validation → libtorch neural estimation → AAD risk → CLP(R)+QP allocation	Portfolio Engine · Causal Primer

The implementation ships as five executables and a VS Code extension:

Bytecode Compiler (etac)
Interpreter (etai)
Interactive REPL (eta_repl)
Language Server (eta_lsp)
Debug Adapter (eta_dap)
VS Code Extension

;; Hello, Eta!
(module hello
  (import std.io)
  (begin
    (println "Hello, world!")

    (defun factorial (n)
      (if (= n 0) 1
          (* n (factorial (- n 1)))))

    (println (factorial 20))))

Quick Start

Download the latest release for your platform, unpack it, and run the installer:

Platform	Archive
Windows x64	`eta-v0.2.0-win-x64.zip`
Linux x86_64	`eta-v0.2.0-linux-x86_64.tar.gz`

# Windows                              # Linux / macOS
cd eta-v0.2.0-win-x64                  cd eta-v0.2.0-linux-x86_64
.\install.cmd                          chmod +x install.sh && ./install.sh

The installer adds bin/ to your PATH, sets ETA_MODULE_PATH, and installs the VS Code extension automatically if VS Code is detected.

Note

Open a new terminal after running the installer for the environment changes to take effect.

Interpret from Source — `etai`

etai compiles a .eta file in-memory and executes it immediately:

$ etai examples/hello.eta
Hello, world!
2432902008176640000

Ahead-of-Time Compilation — `etac` + `etai`

etac compiles .eta source to compact .etac bytecode. etai then loads .etac files directly, skipping all front-end phases (lex, parse, expand, link, analyze, emit) for faster startup:

$ etac examples/hello.eta
compiled examples/hello.eta → examples/hello.etac (3 functions, 1 module(s))

$ etai examples/hello.etac
Hello, world!
2432902008176640000

Key etac flags:

Flag	Effect
`-O`	Enable optimization passes (constant folding, dead code elimination)
`--disasm`	Print human-readable bytecode to stdout (no `.etac` written)
`--no-debug`	Strip source maps for a smaller output file
`-o <path>`	Custom output path (default: `<input>.etac`)

$ etac -O examples/hello.eta -o hello-opt.etac
$ etac --disasm examples/hello.eta

Interactive REPL

$ eta_repl
η> (+ 1 2 3 4 5)
=> 15
η> (exit)

VS Code

The installer automatically sets up the VS Code extension. Configure the paths in settings (Ctrl+, → search Eta):

{
  "eta.lsp.serverPath":     "/path/to/eta-v0.2.0/bin/eta_lsp",
  "eta.dap.executablePath": "/path/to/eta-v0.2.0/bin/eta_dap"
}

Open the examples/ folder, open any .eta file, and press F5 to debug. The extension provides:

Syntax highlighting and live diagnostics (LSP)
Breakpoints & stepping — F10 Step Over · F11 Step In · Shift+F11 Step Out · F5 Continue
Heap Inspector — live memory gauge, per-kind object stats, GC root tree with drill-down (Ctrl+Shift+P → Eta: Show Heap Inspector)
Disassembly View — live bytecode with current-PC marker in the Debug sidebar (Ctrl+Shift+P → Eta: Show Disassembly)
GC Roots Tree — expandable root categories (Stack, Globals, Frames), module-grouped globals, object drill-down
Child Processes — debug sidebar panel listing all spawned actor processes with PID, endpoint, and live/exited status

Tip

See TLDR.md for a step-by-step walkthrough with screenshots, or Quick Start for the full reference.

Build from Source

For contributors or those who want to build from source, see Building from Source — prerequisites, one-script builds, manual CMake steps, and CI details.

Quick version:

# Linux / macOS
./scripts/build-release.sh ./dist/eta-release
cd dist/eta-release && ./install.sh

# Windows (PowerShell)
.\scripts\build-release.ps1 .\dist\eta-release
cd dist\eta-release; .\install.cmd

See Language Guide for a guided tour of the example programs.

Bundle Layout

eta-<version>-<platform>/
├── bin/                        # Executables (and bundled DLLs on Windows)
│   ├── etac(.exe)              # Ahead-of-time bytecode compiler
│   ├── etai(.exe)              # File interpreter (also runs .etac files)
│   ├── eta_repl(.exe)          # Interactive REPL
│   ├── eta_lsp(.exe)           # Language Server (JSON-RPC over stdio)
│   └── eta_dap(.exe)           # Debug Adapter (used by VS Code)
├── lib/                        # Shared libraries (Linux/macOS only, e.g. libtorch)
├── stdlib/                     # Standard library (.eta sources)
│   ├── prelude.eta             # Auto-loaded prelude
│   └── std/                    # core, math, io, collections, logic, clp,
│                               # causal, stats, torch, net, supervisor, …
├── editors/
│   └── eta-lang-<version>.vsix # VS Code extension
├── install.sh                  # Installer (Linux/macOS)
├── install.cmd / install.ps1   # Installer (Windows)
└── quickstart.md

Compilation Pipeline

Every Eta source file flows through six phases before execution. The Driver class orchestrates the full pipeline and owns the runtime state:

flowchart LR
    SRC["Source\n(.eta)"] --> LEX["Lexer"]
    LEX --> PAR["Parser"]
    PAR --> EXP["Expander"]
    EXP --> LNK["Module\nLinker"]
    LNK --> SEM["Semantic\nAnalyzer"]
    SEM --> EMT["Emitter"]
    EMT --> VM["VM\nExecution"]
    EMT --> SER["Serialize\n(.etac)"]
    SER -.->|"etai fast-load"| VM

    style SRC fill:#2d2d2d,stroke:#58a6ff,color:#c9d1d9
    style LEX fill:#1a1a2e,stroke:#58a6ff,color:#c9d1d9
    style PAR fill:#1a1a2e,stroke:#58a6ff,color:#c9d1d9
    style EXP fill:#1a1a2e,stroke:#58a6ff,color:#c9d1d9
    style LNK fill:#16213e,stroke:#79c0ff,color:#c9d1d9
    style SEM fill:#16213e,stroke:#79c0ff,color:#c9d1d9
    style EMT fill:#0f3460,stroke:#56d364,color:#c9d1d9
    style VM  fill:#0f3460,stroke:#56d364,color:#c9d1d9
    style SER fill:#0f3460,stroke:#56d364,color:#c9d1d9

Phase	Input	Output	Header
Lexer	Raw UTF-8 text	Token stream	`lexer.h`
Parser	Tokens	S-expression AST (`SExpr`)	`parser.h`
Expander	`SExpr` trees	Desugared core forms + macros	`expander.h`
Module Linker	Expanded modules	Resolved imports/exports	`module_linker.h`
Semantic Analyzer	Linked modules	Core IR (`Node` graph)	`semantic_analyzer.h`
Emitter	Core IR	`BytecodeFunction`s	`emitter.h`
VM	Bytecode	Runtime values (`LispVal`)	`vm.h`

Note

Every phase reports errors through a unified DiagnosticEngine with span information.

Key Design Highlights

Feature	Detail
NaN-Boxing	All values are 64-bit; doubles pass through unboxed while tagged types (fixnums, chars, symbols, heap pointers) are encoded in the NaN mantissa. → Deep-dive
AOT Compilation	`etac` compiles `.eta` → `.etac` bytecode; `etai` loads `.etac` files directly, skipping all front-end phases. Supports `-O` optimization passes (constant folding, DCE). → Deep-dive
47-bit Fixnums	Integers up to ±70 trillion are stored inline — no heap allocation.
Mark-Sweep GC	Stop-the-world collector with sharded heap, hazard pointers, and a GC callback for auto-triggering on soft-limit. → Deep-dive
Tail-Call Elimination	`TailCall` and `TailApply` opcodes reuse the current stack frame.
First-Class Continuations	`call/cc` captures the full stack + winding stack; `dynamic-wind` is supported.
Hygienic Macros	`syntax-rules` with ellipsis patterns.
Module System	`(module …)` forms with `import`/`export`, `only`, `except`, `rename` filters. → Deep-dive
Arena Allocator	IR nodes are block-allocated in a 16 KB arena for cache locality.
Concurrent Heap	`boost::unordered::concurrent_flat_map` with 16 shards for lock-free reads.
LSP Integration	JSON-RPC language server for real-time diagnostics in any editor.
DAP Integration	Debug Adapter Protocol server (`eta_dap`) enables breakpoints, step-through debugging, call-stack inspection, and REPL-style expression evaluation directly in VS Code.
libtorch Integration	Native C++ bindings to PyTorch's backend for tensors, autograd, neural-network layers, optimizers, and GPU offload. → Deep-dive
nng Networking	Erlang-style actor model: `spawn` processes, `send!`/`recv!`, `worker-pool` for parallel fan-out, REQ/REP, PUB/SUB, SURVEYOR/RESPONDENT — network-transparent over IPC or TCP. → Deep-dive · Actor Model

Documentation

Page	Contents
Quick Start	Installing, running `etai`/`etac`, REPL, modules, VS Code extension (full reference)
Build from Source	Prerequisites, one-script builds, manual CMake, CI, testing
Architecture	Full system diagram, phase-by-phase walkthrough, Core IR node types
NaN-Boxing	64-bit memory layout, bit-field breakdown, encoding/decoding examples
Bytecode & VM	Opcode reference, end-to-end compilation trace, call stack model, TCO
Compiler (`etac`)	AOT bytecode compiler: CLI reference, `.etac` binary format, optimization passes, disassembly
Optimization	IR optimization pipeline architecture, built-in passes, writing custom passes
Runtime & GC	Heap architecture, object kinds, mark-sweep GC, intern table, factory
Finalizers & Guardians	Weak lifecycle hooks for cleanup, resurrection semantics, and guardian queues
Modules & Stdlib	Module syntax, linker phases, import filters, standard library reference
Regex	`std.regex` API, ECMAScript syntax notes, and performance guidance
Language Guide	Guided tour of the language using simple example programs with expected output
Networking Primitives	nng socket API: `nng-socket`, `send!`, `recv!`, `nng-poll`, endpoints, error handling
Message Passing & Actors	Actor model: `spawn`, `worker-pool`, REQ/REP, PUB/SUB, scatter-gather, timeouts
Network & Message Passing Design	Full design doc: architecture, phases, nng rationale
AAD – Finance Examples	Reverse-mode AD walkthrough, xVA sensitivities, European Greeks, SABR vol surface
CLP	Constraint Logic Programming: clp(Z) intervals, clp(FD) finite domains, `clp:solve`
Causal Inference	Do-calculus engine, back-door adjustment, finance factor analysis
Fact Tables	Columnar fact tables with hash-indexed queries, iteration, and fold
Linear Algebra & Statistics	Eigen-backed multivariate stats on FactTables; `std.stats` descriptive stats, CIs, t-tests, OLS
Causal Pipeline Primer	Gentle on-ramp: symbolic diff → do-calculus → logic/CLP → libtorch NN → ATE (precursor to the portfolio engine)
Neural Networks	libtorch integration: tensors, autograd, NN layers, training loops, GPU support
Release Notes	Recent rollout changes and benchmark/gate commands
Next Steps	Roadmap: compiler, GC, testing, logic/CLP upgrades, VS Code improvements, Jupyter kernel

Standard Library

The prelude auto-loads the following modules:

Module	Highlights
`std.core`	`identity`, `compose`, `flip`, `constantly`, `iota`, `assoc-ref`, list utilities
`std.math`	`pi`, `e`, `square`, `gcd`, `lcm`, `expt`, `sum`, `product`
`std.io`	`println`, `eprintln`, `read-line`, port redirection helpers
`std.collections`	`map*`, `filter`, `foldl`, `foldr`, `sort`, `zip`, `range`, vector ops
`std.regex`	`regex:compile`, `regex:match?`, `regex:search`, `regex:find-all`, `regex:replace`, `regex:split`
`std.logic`	`==`, `copy-term`, `naf`, `findall`, `run1`, `run*`, `run-n` — Prolog/miniKanren-style combinators
`std.freeze`	`freeze`, `dif` — suspended goals & structural disequality on attributed vars
`std.db`	`defrel`, `assert`, `retract`, `call-rel`, `tabled` — fact-table-backed relations with SLG-lite tabling
`std.clp`	`clp:domain`, `clp:in-fd`, `clp:solve`, `clp:all-different`, `clp:labeling`, `clp:minimize`/`maximize` — CLP(Z) and CLP(FD)
`std.clpb`	`clp:boolean`, `clp:and/or/xor/imp/eq/not`, `clp:card`, `clp:sat?`, `clp:taut?` — CLP(B) Boolean propagation
`std.clpr`	`clp:real`, `clp:r<=`, `clp:r=`, `clp:r-minimize`/`maximize`, `clp:rq-minimize`/`maximize` — CLP(R) linear + convex QP
`std.causal`	`dag:*`, `do:identify`, `do:estimate-effect` — causal inference engine
`std.fact_table`	`make-fact-table`, `fact-table-insert!`, `fact-table-query`, `fact-table-fold` — columnar fact tables
`std.stats`	Descriptive stats, CIs, t-tests, OLS regression (Eigen-backed)
`std.time`	Wall-clock + monotonic time, UTC/local parts, ISO-8601 formatting
`std.torch`	`tensor`, `forward`, `train-step!`, `sgd`, `adam` — libtorch neural networks
`std.net`	`spawn`, `send!`, `recv!`, `monitor`, `worker-pool`, `with-socket`, `request-reply`, `pub-sub`, `survey` — Erlang-style actors & nng
`std.supervisor`	`one-for-one`, `one-for-all` — supervision trees over actor processes
`std.test`	`assert-equal`, `assert-true`, `run-tests` — lightweight test framework

(module my-app
  (import std.core)
  (import std.collections)
  (import std.io)
  (begin
    (define xs (iota 10))                    ;; (0 1 2 3 4 5 6 7 8 9)
    (println (filter odd? xs))               ;; (1 3 5 7 9)
    (println (foldl + 0 (filter even? xs)))  ;; 20
  ))

Project Structure

eta/
├── CMakeLists.txt              # Top-level build
├── eta/                        # C++ sources
│   ├── core/                   # Shared library: reader + semantics + runtime
│   │   └── src/eta/
│   │       ├── reader/         # Lexer, Parser, Expander, Module Linker
│   │       ├── semantics/      # Semantic Analyzer, Core IR, Emitter, Arena
│   │       ├── runtime/        # NaN-box, VM, Heap, GC, Types, Primitives
│   │       └── diagnostic/     # Unified error reporting
│   ├── compiler/               # etac      (AOT bytecode compiler)
│   ├── interpreter/            # etai + eta_repl (Driver orchestration)
│   ├── lsp/                    # eta_lsp   (Language Server, JSON-RPC over stdio)
│   ├── dap/                    # eta_dap   (Debug Adapter, DAP over stdio)
│   ├── nng/                    # nng networking layer
│   ├── torch/                  # libtorch integration
│   ├── stats/                  # Eigen-backed statistics primitives
│   ├── test/                   # Boost.Test unit tests
│   └── fuzz/                   # Fuzz testing (heap, intern table, nanbox)
├── stdlib/                     # Standard library (.eta sources)
│   ├── prelude.eta             # Auto-loaded prelude
│   └── std/                    # core, math, io, collections, logic, freeze,
│                               # db, clp, clpb, clpr, causal, fact_table,
│                               # stats, time, torch, net, supervisor, test
├── examples/                   # Example programs (see docs/examples.md)
├── editors/vscode/             # VS Code extension (TextMate grammar + LSP/DAP client)
├── scripts/                    # Build & install automation
└── docs/                       # Design documentation

License

See LICENSE for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

What is Eta?

Headline Features

Quick Start

Interpret from Source — `etai`

Ahead-of-Time Compilation — `etac` + `etai`

Interactive REPL

VS Code

Build from Source

Bundle Layout

Compilation Pipeline

Key Design Highlights

Documentation

Standard Library

Project Structure

License

About

Uh oh!

Releases 3

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 232 Commits
.github/workflows		.github/workflows
cmake		cmake
docs		docs
editors/vscode		editors/vscode
eta		eta
examples		examples
scripts		scripts
stdlib		stdlib
.gitattributes		.gitattributes
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
README.md		README.md
TLDR.md		TLDR.md

Folders and files

Latest commit

History

Repository files navigation

What is Eta?

Headline Features

Quick Start

Interpret from Source — etai

Ahead-of-Time Compilation — etac + etai

Interactive REPL

VS Code

Build from Source

Bundle Layout

Compilation Pipeline

Key Design Highlights

Documentation

Standard Library

Project Structure

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Interpret from Source — `etai`

Ahead-of-Time Compilation — `etac` + `etai`

Packages