Lambda-Calculus cooked n-ways

This repository is a simple demonstration of multiple ways to implement variable binding in Haskell as well as a benchmark suite of correctness and performance tests.

This is derived from Lennart Augustsson's unpublished draft paper "Lambda-calculus Cooked Four Ways".

File structure

lib/ DeBruijn/ Par/ LocallyNameless/ Named/ Lennart/ IdInt.lhs IdInt/

bench/ test/

Basic implementation organization

lib/ Imports.hs - rexports common modules IdInt.lhs - Identifiers based on a newtype for Ints - Includes FreshM state monad for generating new IdInts Lambda.lhs
- Lambda calculus parameterized by type of binder/variable (v) - Same type must be used in both locations. - Includes ReadP, show, fv, aeq, Impl.lhs - General Definition of a LambdaImpl structure Misc.lhs Suite.lhs QuickBench.lhs - Tools for generating benchmarks and test suites

Simple

Most direct and traditional implementation based on variable names. Renames bound variables to avoid capture.
Unique

Maintains the invariant that all bound variables are unique. Needs to freshens the binders of terms being substituted to maintain this invariant.
HOAS

Higher-order abstract synatax (uses Haskell functions for lambda calculus functions)
Debruijn

DeBruijn indices that shift during substitution.

DeBrujn-index based implementations

Debruijn index based implementations:

Bound

Uses Kmett's bound library. Nested datatypes ensure that terms stay well-scoped.
Kit

Based on code distributed with this paper https://dl.acm.org/doi/10.1145/3018610.3018613
DeBruijn.Par.F [DB_F]

Parallel substitution version, representing substitutions as functions.
DeBruijn.Par.F [DB_FB]

Parallel substitution version, representing substitutions as functions. Introduces a 'Bind' abstract type to cache substitutions at binders.
DeBruijn.Par.P [DB_P]

Parallel substitution version (with reified substs). Based on https://github.com/sweirich/challenge/blob/canon/debruijn/debruijn1.md
DeBruijn.Par.B [DB_B]

Parallel substitution version with reified substs, but caches a substitution in terms. Uses general the purpose library in Subst Optimized version described here https://github.com/sweirich/challenge/tree/canon/debruijn
DeBruijn.Par.Scoped [Scoped]

Above, but also uses a GADT to enforce that the syntax is well-scoped.

Locally-Nameless implementations

Unbound

Uses the unbound library
UnboundGenerics

Uses the GHC.Generics port of Unbound
Ott/Opt/Par/ParOpt

Uses output of Ott's locally nameless backend
Typed/TypedOpt

Version of above with types to ensure that terms are locally closed

Named representations

SimpleB

Optimizes the "simple" approach by caching the substitution and free variable set at binders. Not at all simple. Took a long time to get this one correct. Actually it isn't correct.
SimpleH

Corrected version of SimpleB.
SimpleM

Version of SimpleH that uses a freshness monad to generate fresh variables.
NominalG

Uses nominal package & generic programming https://hackage.haskell.org/package/nominal

Other

Core

Uses the FV and Substitution functions ripped out of GHC Core (HEAD as of 5/28/20) Like DB_C, this file uses a delayed substitution (e.g. environment) during normalization. Does not add any explicit substitutions to the term. Uses Data.IntMap instead of lists to keep track of the substitution.

Benchmarks

Download the html files to see the Criterion graphs. Or look at the raw results.

Normalization of random lambda terms: rand_bench.html.

These 25 random terms stored in the file random2.lam. They are generated via genScopedLam in Lambda.lhs with size parameter 100000, and so are closed and contain lots of lambdas. Normalizing these terms requires between 26-36 calls to subst. The terms themselves have total depth from 23-60 and binding depth from 13-46.

Conversion to representation: conv_bench.html. How long does it take to convert a parsed named representation to the internal representation of the implementation? alpha-converts the pathological term.
Normalization of pathological lambda term: nf_bench.html. See below.

   bind depth: 25
   depth:      53
   num substs: 119697

Alpha-equivalence of pathological lambda term: aeq_bench.html

Normalization microbenchmark

The microbenchmark is full normalization of the lambda-calculus term: factorial 6 == sum [1..37] + 17 represented with a Scott-encoding of the datatypes. See lennart.lam for the definition of this term.

This "benchmarks" several different representations of variable binding and substitution in the untyped lambda calculus using a single pathological case: computing the normal form of factorial 6 == sum [1..37] + 17. (Spoiler alert, these terms are not equal, so the normal form is the encoding of false).

By full normalization, we mean computing the following partial function that repeatedly performs beta-reduction on the leftmost redex.

  nf x         = x
  nf (\x.e)    = \x. nf e
  nf (e1 e2)   = nf ({e2/x}e1')         when whnf e1 = \x.e1'
                (nf (whnf e1)) (nf e2)       otherwise

  whnf x       = x
  whnf (\x.e)  = \x.e
  whnf (e1 e2) = whnf ({e2/x} e1') when whnf e1 = \x.e1'
                (whnf e1) e2            otherwise

Note: the goal of this operation is to benchmark the substitution function, written above as {e2/x}e1. As a result, even though some lambda calulus implementations may support more efficient ways of computing the normal form of a term (i.e. by normalizing e2 at most once) we are not interested in enabling that computation. Instead, we want the computation to be as close to the implementation above as possible.

Because this function is partial (not all lambda-calculus terms have normal forms), for testing, each implementation also should support a "fueled" version of the nf and whnf functions (called nfi and whnfi, respectively). However, benchmarking uses the unfueled version.

Running the benchmarks

 make timing

Testing the benchmarks

stack test

The directory lams/ contains files of non-normalized lambda-calculus terms. In each case, if the file is test.lam then a matching file test.nf.lam contains the normalized version of the term.

Unit tests:

pathological term (lennart.lam).
random terms with a small number of substitutions during normalization (onesubst, twosubst...)
random terms with a large number of substitutions during normalization (random25, random35,lams100)
constructed terms (capture10, constructed,
terms that reveal a bug in some implementation (tX, tests, regression)

QuickChecks

conversion from/to named representation is identity on lambda terms
freshened version of random lambda term is AEQ
nf on random lambda term matches reference version (DB) (This test is only for impls with a "fueled version" of normalization)

References

repo this is forked from (and Lennart's draft paper)
https://www.schoolofhaskell.com/user/edwardk/bound
https://gitlab.haskell.org/ghc/ghc

Missing implementations

Optimized version of nominal logic

DeBruijn levels

Locally-named implementation

GHC Core type-level substitution

Canonically-named https://link.springer.com/article/10.1007/s10817-011-9229-y

https://arxiv.org/pdf/1111.0085.pdf

https://www.mimuw.edu.pl/~szynwelski/nlambda/doc/ Module supports computations over infinite structures using logical formulas and SMT solving.

Name		Name	Last commit message	Last commit date
Latest commit History 132 Commits
.vscode		.vscode
bench		bench
doc		doc
lams		lams
lib		lib
results		results
tests		tests
.DS_Store		.DS_Store
.gitignore		.gitignore
.hlint.yaml		.hlint.yaml
Makefile		Makefile
README.md		README.md
cabal.project		cabal.project
lennart-lambda.cabal		lennart-lambda.cabal
stack-lts-14.27.yaml		stack-lts-14.27.yaml
stack-lts-15.14.yaml		stack-lts-15.14.yaml
stack.yaml		stack.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Lambda-Calculus cooked n-ways

File structure

Basic implementation organization

Contents

DeBrujn-index based implementations

Locally-Nameless implementations

Named representations

Other

Benchmarks

Normalization microbenchmark

Running the benchmarks

Testing the benchmarks

References

Missing implementations

About

Releases

Packages

Languages

sweirich/lennart-lambda

Folders and files

Latest commit

History

Repository files navigation

Lambda-Calculus cooked n-ways

File structure

Basic implementation organization

Contents

DeBrujn-index based implementations

Locally-Nameless implementations

Named representations

Other

Benchmarks

Normalization microbenchmark

Running the benchmarks

Testing the benchmarks

References

Missing implementations

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages