Macros

code generation: automatically generate code based on templates or other input.
metaprogramming: write code that can manipulate and generate other code at runtime.

Macros are a facility that allows defining syntactic extensions to a programming language. Macros implement the essence of Lisp: code can be used as data, and data can be used as code. Although in Lisp the transformation of code into data (specifically, an AST tree) is obvious as the S-expression syntax is just nested lists, in Stroscot we choose to have more complex syntax and accompanying mental overhead. It should be noted that even McCarthy the inventor of Lisp intended to use an M-expression (mathematical expression) syntax and S-expressions were only a matter of convenience.

Comparison

Macros, micros, and fexprs are all possibilities. Comparing them:

A (C) macro is a function translating text into text. Textual macros suffer from the scoping/name capture problem. Applying a macro double x=x+x to a side-effectful expression like double(i++) executes the side effect twice. They allow unbalanced/incorrect expansions like startLoop = while(true) { and double(+) = +++. One often can’t easily determine the expanded expression due to multiple layers of macros. Furthermore looking for control flow paths is hard.
A (Scheme) macro is a function translating an AST node into an AST node. Macro substitution of an AST reduces away all macros and produces an AST. Scheme macros have a complex hygiene system to make naive macros handle scoping correctly and avoid name capture.
A micro [Kri01] is a function translating from a source AST node and an environment to an expression in the intermediate language. Micro dispatch of an AST reduces away all micros and produces an intermediate representation.
An f-expression (fexpr) [Shu10] is a function translating an AST node and an environment to a value. Fexpr evaluation of an AST reduces away all fexprs and produces a value.

Fexprs are the most powerful: IR expressions and AST nodes are values, hence micros and macros can be implemented in terms of fexprs, but conversely fexprs can’t be emulated. E.g. we can implement Scheme macros as f-expressions by computing the AST and evaluating, defmacro f = \vau $args $env -> eval $env (f $args), modulo some hygiene stuff. But there is no macro corresponding to an eval function evalF = \vau $args env -> \f -> eval f $env that evaluates a value in an environment at runtime.

Another advantage of fexprs is that there are no phases - the full language is always available and immediately executed. For example predicate dispatch of fexprs is possible - you just have to look up the argument values in the environment like with predicate dispatch on applicatives. In contrast macros can only overload on number of arguments because the precise values are not available in the preprocessing phase, and similarly micros have a linking phase for the IR. Hence implementing the compiler using the Futamura projection of specializing an interpreter to a program is only sensible for fexprs.

Terminology

Loosely adapting [Shu10]’s terminology we get the following:

Evaluation is a function from an AST node and a (lexical) environment to produce a value. Values are AST nodes that evaluate to themselves in every environment. A non-value AST node (reducible expression) is usually a variable x or an application f x, but since Stroscot is term-rewriting it can be anything.
A reduction rule maps particular AST nodes and environments to new AST nodes and environments. These AST nodes are of course reducible.
An applicative is a reduction rule that uses each pattern variable once by transforming it to an “argument” by evaluating it using the current environment.
An operative is a non-applicative reduction rule.
A (Scheme) macro is an operative that uses its operands to derive a syntax tree and then returns the result of evaluating the entire tree.
A (Scheme) special form is an operative built into the language, accessible by a reserved symbol.
A (Kernel) fexpr is a non-macro operative that is expressed as a $vau lambda taking unevaluated operands and the environment.

Although it gives each concept its own name, “fexpr” is an unusual word with no prior referent except maybe the old Lisp I fexpr which didn’t take an environment hence couldn’t implement lexical scope. So Stroscot’s terminology follows newLISP and calls fexprs “macros”:

A Stroscot function is a Shutt applicative
A Stroscot macro is a Shutt operative

Stroscot’s macros operate on ASTs like macros in other languages, so it’s clearer to the uninitiated to call them macros. But, they return a value instead of an AST, so they are more powerful than other languages’ macros.

Hygiene

Scheme macros are supposed to be “hygienic” in that they always evaluate expressions in the lexical environment of the macro’s definition site, as opposed to use site. But [Kis02] shows that in fact the environment of the use site is fully accessible through some tricks. The newer syntax-case allows explicit access through datum->syntax.

Formal definition

Fexprs in contrast get an explicit environment. They can do staged lookup, eval $env (eval $env (f $args)), where an expression evaluates to an AST symbol and the AST symbol is looked up, and other weird things. [Shu10] chapter 5 discusses various hygiene-breaking problems and concludes they aren’t too worrisome.

eval is hard to compile, because it makes the full power of an interpreter available. But we can often simplify eval (a + b) to eval a + eval b, reducing the amount of code that is evaluated each loop. If all of the variable lookups are static, we can furthermore optimize the environment to remove all unneeded variables. Hence we can recover macro-level performance on macros. Dynamic lookups need the full environment unfortunately. But dynamic lookups are essentially a REPL or debugging tool, so does not need to be too efficient, and we can warn that they are not optimized.

Fexprs make the equational theory of ASTs trivial, ([Shu10], chapter 15) in that ASTs can be completely deconstructed, so no two ASTS are behaviorally equivalent. But this is good, because it means the programmer’s intent can be fully examined. If (\x. x) y was equivalent to y then many DSL’s would not be possible. The behavior of programs containing fexprs is decidedly nontrivial and quite varied.

In Stroscot, as in Kernel [Shu10], fexprs are functions that take code AST’s and a lexical environment instead of evaluated values. So when you write f a b, and f is an operative, then f has a type like f : Env -> Ast -> .... The Env is an opaque map that might or might not have bindings for a and b, and the AST is fragment like ((Sym 'f') `App` (Sym 'a')) `App` (Sym 'b'). Then f can do arbitrary operations with those, with the full power of the programming language, and in particular f can eval AST fragments with the env it’s given (or with envs from elsewhere).

The main power fexprs give over macros is that there’s no phase distinction. A macro is like an fexpr that builds up a single AST and calls eval at the end. But fexprs can call eval multiple times, and these can depend on the results of previous evaluations, so for example you can lookup a variable name stored in an argument and evaluate that name.

Parsing

Macros consume the syntax tree, so