Algorithmic conversion with surjective pairing

This repository contains the Rocq mechanization of the results from the paper "Algorithmic Conversion with Surjective Pairing: A Syntactic and Untyped Approach".

The documentation is written in org mode, but we recommend reading the generated README.html file instead, since it contains hyperlinks into the generated coqdoc files that let you navigate the development in your web browser. Serve the file through a local web server using python or darkhttpd. Note that if you instead open the file directly in your browser, the proof-toggling functionality of coqdocjs will not work because of the CORS restrictions on JavaScript modules!

# starting a local server with python listening to port 8080
python -m http.server --bind 127.0.0.1 8080 --directory .
# alternatively, starting a local server with darkhttpd listening to port 8080
darkhttpd . --addr 127.0.0.1 --port 8080

You can then access the document in your web browser at http://127.0.0.1:8080.

As mentioned in the paper, while the natural numbers are omitted from the text, they can be found in the mechanization and the specification pdf file.

Quick start

Installing dependencies

You can skip this section if you are compiling the proof scripts from the configured VM.

Inside a fresh opam switch, run the following commands to install the required dependencies.

opam update
opam install -y coq-hammer-tactics coq-stdpp coq-autosubst-ocaml
eval $(opam env)

The termination proof can be brittle because it relies on inversion producing the subproofs of the domain relation in exactly the shape that satisfies the termination checker.

Here are the versions of the Rocq packages that are known to work.

Note that if you don't plan to modify and regenerate the syntax.v file from syntax.sig using autosubst, you can choose not to install coq-autosubst-ocaml. However, you need to be careful not to accidentally run make deepclean, which will delete the auto-generated syntax files. Instead, run make clean if you want to keep the auto-generated files.

Validating the proofs

After the dependencies are installed, run the following command to validate the proof development. The -j2 flag allows make to compile up to 2 files in parallel. In our testing, going beyond -j2 does not reduce the compilation time.

make -j2

It is expected that the compilation process will generate a lot of warnings about notations and the ssreflect library. The compilation is successful as long as the command terminates with the following output.

# ... lots of warnings ...
make[1]: Leaving directory '****'

On a Linux laptop with an Intel 12700K, the compilation takes 67 seconds. If you run the command inside the attached VM and your host machine has an ARM CPU (e.g. an Apple M-series chip), expect it to take around 10x longer, since emulating x86 instructions can be quite slow.

Definitions

Grammar (Fig. 1)

The grammar specification in higher-order abstract syntax (HOAS) can be found in syntax.sig. We use the autosubst-ocaml tool to turn the syntax file into the Rocq file syntax.v, which contains not only the de Bruijn grammar but also definitions of renaming and substitution functions and their associated lemmas.

Typing specification (Fig. 2, 3)

Untyped relations and naming schemes

Due to the many reduction relations used in our proof, we organize each untyped reduction, equality, and subtyping relation inside its own module, where the relation itself is simply named R. Thus, to avoid ambiguity, those modules are never meant to be imported and the relation should always be referred to in its qualified form. For example, the β-reduction relation is defined as the inductive type R in the module RRed. Outside the module, the β-reduction relation is referred to by its fully qualified name RRed.R.

Given a module containing a reduction relation (e.g. Red), a companion module with the suffix s (e.g. Reds) contains the properties of its reflexive and transitive closure. However, instead of defining the reflexive and transitive closure of RRed.R as a new relation RReds.R, we refer to it directly as rtc RRed.R, where rtc is a relation operator that returns the reflexive and transitive closure of its argument.
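As a schematic sketch of this convention (assuming a term type Tm; the constructors of the relation are elided, and rtc together with the lemma rtc_l come from the stdpp library):

```coq
(* Sketch of the naming convention; constructors are elided. *)
Module RRed.
  (* The relation inside each module is always named R;
     here, one-step β-reduction. *)
  Inductive R : Tm -> Tm -> Prop :=
    (* ... one constructor per β-rule, plus congruence rules ... *)
  .
End RRed.

(* Outside the module, the relation is referred to as RRed.R, and its
   reflexive-transitive closure as rtc RRed.R. *)
Goal forall a b c, RRed.R a b -> rtc RRed.R b c -> rtc RRed.R a c.
Proof. intros a b c Hab Hbc. eapply rtc_l; eauto. Qed.
```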

Some relations are standard and therefore not included in the text for concision, but we include them here for completeness.

  1. Normal form predicates (Fig. 5)

    Note that we sometimes use HRed.nf in place of the above definition of whnf, where the former means that the term no longer reduces under weak-head reduction. The two definitions coincide precisely on the set of non-stuck terms.

    Also, instead of defining nf and ne as inductive predicates, we define them as mutually recursive fixpoints. For the definition to be accepted by the termination checker, the inclusion of ne into nf is proven a posteriori as the lemma ne_nf.

  2. Reductions

  3. Strong normalization (Sec. 3.2)

  4. Joinability and Subtyping

    Note that ESub holds when two terms can be related by one-step subtyping after η-reduction. It is not mentioned in the paper but is convenient to have around in the mechanization for automation purposes.

  5. Coquand's algorithm (Sec. 4.1)

    Coquand's algorithm is one of the exceptions to the above naming scheme, and the actual formal definition differs slightly from the presentation in the text. Notably, the algorithmic equality for head normal forms is split into two relations: one that handles the case where both terms are neutral, and one that handles the cases where at least one term is not neutral.

    The relations are all formulated on arbitrary terms. The neutral and normal form restrictions in f0 ∼ f1 are proven a posteriori as lemmas in the mechanization (e.g. coqeq_no_hred).

    Subtyping works similarly, though there is no need to split the relation as the neutral case is handled by equalities.
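The mutually recursive definition of nf and ne mentioned in item 1 can be sketched over a miniature grammar (the grammar and all names here are illustrative, not the ones in the development). The key point is that every recursive call is on a strict subterm, which is why the ne clauses are duplicated inside nf and the inclusion of ne into nf becomes a separate lemma:

```coq
(* A miniature term grammar, for illustration only. *)
Inductive Tm : Type :=
| VarTm (i : nat)
| Abs (a : Tm)
| App (a b : Tm)
| Pair (a b : Tm)
| Proj1 (a : Tm).

(* ne and nf as mutually recursive boolean fixpoints. *)
Fixpoint ne (a : Tm) : bool :=
  match a with
  | VarTm _ => true
  | App b c => ne b && nf c
  | Proj1 b => ne b
  | _ => false
  end
with nf (a : Tm) : bool :=
  match a with
  | Abs b => nf b
  | Pair b c => nf b && nf c
  (* the neutral clauses are repeated so that every recursive call
     is on a subterm, satisfying the termination checker *)
  | VarTm _ => true
  | App b c => ne b && nf c
  | Proj1 b => ne b
  end.

(* The inclusion of neutrals into normal forms, proven a posteriori. *)
Lemma ne_nf a : ne a = true -> nf a = true.
Proof. induction a; simpl; intros; try discriminate; auto. Qed.
```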

The untyped logical predicate (Sec 3.7)

As mentioned in the text, the definition of the logical predicate cannot be written in Rocq directly as it consists of an inductive definition nested in a fixpoint definition over universe levels.

The technique we adopt to encode the logical relation is described in detail in the write-up by Liu and Weirich. To make the code more readable, we specify a module type LogRel that includes the introduction and induction principles that fully characterize the logical predicate. The module LogRelImpl shows how the logical predicate is actually defined, through the inductive definition InterpExt and the fixpoint InterpUniv; the latter is the logical predicate that satisfies the abstract properties we actually need.

By encapsulating the Rocq-specific workarounds behind the module signature, the properties about the logical predicate (found in LogRelFactsImpl) can be implemented purely in terms of the clean interface specified in LogRel.
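Schematically, the encapsulation pattern looks like this (only the shape is shown; the statements of the principles and the implementation bodies are elided):

```coq
(* Sketch of the encapsulation pattern. *)
Module Type LogRel.
  Parameter InterpUniv : nat -> Tm -> (Tm -> Prop) -> Prop.
  (* ... Parameters stating the introduction rules and the induction
     principle that fully characterize InterpUniv ... *)
End LogRel.

Module LogRelImpl : LogRel.
  (* InterpExt is an ordinary inductive definition; InterpUniv is a
     fixpoint on the universe level that ties the two together. *)
  (* ... definitions and proofs of the interface obligations ... *)
End LogRelImpl.

(* Downstream files such as LogRelFactsImpl depend only on LogRel,
   never on the encoding inside LogRelImpl. *)
```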

To avoid introducing propositional and functional extensionality axioms, we add the rule InterpUniv_Conv to our mechanization to ensure that the logical predicate respects extensional equivalence of predicates (the corresponding notation is defined in the mechanization). These artifacts introduced by avoiding the axioms are noted in the development and can be safely ignored.

Executable conversion algorithm (Sec. 4)

The relational definition of algorithmic conversion is not immediately executable. The decidability result (Theorem 4.1) is justified by defining a total function that returns true precisely when its two inputs are convertible.

Here, we give links to the definition of the algorithm and the Bove-Capretta domains used to handle termination checking.

The completeness and soundness of the computable functions with respect to their relational counterparts are not explicitly included in the paper, but they are linked in the mechanization by the following lemmas.

The termination of algorithmic conversion is implied by the above completeness and soundness results.
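The Bove-Capretta technique can be illustrated on a toy example (division by repeated subtraction; entirely schematic and unrelated to the actual definitions): an inductive domain predicate mirrors the recursive call structure of the algorithm, and the function then recurses structurally on the domain proof.

```coq
(* Toy Bove-Capretta development: division by repeated subtraction.
   Each constructor corresponds to one call pattern of the algorithm. *)
Inductive div_dom (n : nat) : nat -> Prop :=
| div_stop m : m < n -> div_dom n m
| div_step m : n <= m -> div_dom n (m - n) -> div_dom n m.

(* The function takes the domain proof as an extra argument and recurses
   on it; the subproof justifying each recursive call is obtained by
   inverting the proof. This is exactly what makes such definitions
   brittle: the inversion must produce subproofs in the shape the
   termination checker expects. *)
(* Fixpoint div (n m : nat) (h : div_dom n m) : nat := ... *)
```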

Properties proven in the paper

Section 2

Lemma 2.1 (context regularity)
wff_mutual
Lemma 2.2 (inversion)
Bind_Inv, Var_Inv, App_Inv, Abs_Inv, Proj1_Inv, Proj2_Inv, Pair_Inv
Lemma 2.3 (subject reduction)
subject_reduction
Lemma 2.4 (type correctness)
regularity

Section 3

Lemma 3.1
RRed.nf_imp
Lemma 3.2
ERed.nf_preservation
Lemma 3.3
LoReds.FromSN_mutual
Lemma 3.4 (no stuck terms)
SN_NoForbid.PApp_imp, SN_NoForbid.PProj_imp, SN_NoForbid.PInd_imp (the P property is defined as SN)
Lemma 3.5 (SN renaming)
sn_renaming
Lemma 3.6 (SN antisubstitution)
sn_unmorphing
Lemma 3.7 (SN inversion)
P_AppInv, P_PairInv, P_ProjInv, P_BindInv, P_SucInv, P_AbsInv, P_IndInv
Lemma 3.8 (sn preservation)

split into two separate lemmas

preservation for parallel η-reduction
epar_sn_preservation
preservation for parallel β-reduction
red_sn_preservation
Lemma 3.9 (restrictive-η and normal form)
NeEPar.R_elim_nf
Lemma 3.10 (η-decomposition)
UniqueNF.η_split. Note: we parameterize the proof over the P predicate as mentioned in Sec. 5.3. The P predicate is instantiated to SN in SN_NoForbid
Lemma 3.11 (η-postponement)
UniqueNF.η_postponement
Corollary 3.1 (strengthened η-postponement)
UniqueNF.η_postponement_strengthened
Corollary 3.2 (η-postponement for normal forms)
rered_standardization'
Lemma 3.12 (confluence for β)
red_confluence
Lemma 3.13 (confluence for η)
ered_confluence
Theorem 3.1 (confluence for βη)
rered_confluence
Lemma 3.14 (transitivity of joinability)
DJoin.transitive
Lemma 3.15 (injectivity of joinability)
DJoin.hne_app_inj, DJoin.hne_proj_inj
Lemma 3.16 (transitivity of one-step subtyping)
Sub1.transitive
Lemma 3.17 (commutativity of one-step subtyping)
Sub1.commutativity0
Lemma 3.18 (one-step subtyping preserves sn)
Sub1.sn_preservation
Corollary 3.3 (transitivity of untyped subtyping)
Sub.transitive
Lemma 3.19 (noconfusion for untyped subtyping)
The Sub.*_noconf lemmas starting with Sub.sne_nat_noconf
Lemma 3.20 (untyped injectivity of type constructors)
Sub.bind_inj, Sub.univ_inj
Lemma 3.21 (adequacy)
LogRelFactsImpl.adequacy
Lemma 3.22 (backward closure)
LogRelFactsImpl.back_clos
Lemma 3.23 (logical predicate cases)
LogRelFactsImpl.case
Lemma 3.24 (logical predicate is preserved by subtyping)
LogRelFactsImpl.sub
Corollary 3.4 (logical predicate is functional)
LogRelFactsImpl.functional
Lemma 3.25 (logical predicate is cumulative)
LogRelFactsImpl.cumulative
Lemma 3.26 (semantic weakening)
weakening_Sem
Lemma 3.27 (semantic substitution)
morphing_SemWt
Lemma 3.28 (structural rules for semantic well-formedness)
SemWff_lookup
Theorem 3.2 (fundamental theorem)
fundamental_theorem
Corollary 3.5 (completeness of reduce-and-compare)
Inlined into proof scripts
Corollary 3.6 (completeness of reduce-and-compare)
synsub_to_usub

Section 4

Lemma 4.1 (Π-subtyping)
Sub_Bind_InvL, Sub_Bind_InvR
Lemma 4.2 (univ-subtyping)
Sub_Univ_InvR
Lemma 4.3 (soundness for algorithmic equality)
coqeq_sound_mutual
Lemma 4.4 (soundness for algorithmic subtyping)
coqleq_sound_mutual
Lemma 4.5 (metric implies domain)
sn_term_metric
Lemma 4.6 (termination of Coquand's algorithm)
check_sub_r (termination is implicit in our mechanization in the sense that we can construct the Bove-Capretta domain from the typing judgment, which we can then feed to the check_sub_r function)
Lemma 4.7 (completeness of Coquand's algorithm)
coqeq_complete'
Lemma 4.8 (completeness of Coquand's algorithmic subtyping)
coqleq_complete'
Lemma 4.9 (completeness of Coquand's algorithmic subtyping)
coqleq_complete_unty, coqleq_complete, coqleq_sound
Theorem 4.1 (type conversion is decidable)
In Rocq, we have to go the extra mile by defining the computable function check_sub_r_wt (a wrapper around check_sub_r) and then proving that it agrees with the inductive subtyping relation in Conv_dec

Section 5

Proposition 5.1
Safe_NoForbid

Validating axiom usage

We claim that our development is axiom-free. To validate that claim, one can use the Print Assumptions command on the theorems and confirm that no axioms are displayed.
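For example, in a Rocq session with the compiled development on the load path (the Require path below is hypothetical; the theorem name is the one listed above):

```coq
(* The module path is hypothetical; adjust it to the development. *)
From DecSyn Require Import soundness.
Print Assumptions fundamental_theorem.
(* An axiom-free theorem prints: Closed under the global context *)
```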

An alternative method is to run coqchk, which can be invoked on all .vo files by running make validate. However, coqchk does not handle module types well and reports their parameters as axioms even though the development never assumes them.

* Theory: Set is predicative

* Theory: Rewrite rules are not allowed

* Axioms:
    DecSyn.logrel.LRFacts.functional
    DecSyn.logrel.LRFacts.Bind_inv_nopf
    DecSyn.logrel.LRFacts.back_clos
    DecSyn.logrel.LRFacts.Bind_nopf
    DecSyn.logrel.LRFacts.adequacy
    DecSyn.logrel.LRFacts.back_closs
    Coq.Logic.FunctionalExtensionality.functional_extensionality_dep
    Coq.Reals.ClassicalDedekindReals.sig_not_dec
    DecSyn.logrel.LRFacts.join
    DecSyn.logrel.LRFacts.case
    DecSyn.logrel.LRFacts.sub
    Coq.Reals.ClassicalDedekindReals.sig_forall_dec
    DecSyn.logrel.LRFacts.cumulative
    DecSyn.logrel.LRFacts.Univ_inv
    DecSyn.logrel.LRFacts.SNe_inv
    DecSyn.logrel.LRFacts.Bind_inv
    Coq.Logic.Eqdep.Eq_rect_eq.eq_rect_eq
    DecSyn.fp_red.NoForbid_FactSN.P_RReds
    DecSyn.fp_red.NoForbid_FactSN.P_EPars
    DecSyn.logrel.LRFacts.Nat_inv

* Constants/Inductives relying on type-in-type: <none>

* Constants/Inductives relying on unsafe (co)fixpoints: <none>

* Inductives whose positivity is assumed: <none>

make[1]: Leaving directory '****'

Again, all the axioms reported above are false positives; when it comes to axiom usage, Print Assumptions is the authoritative check. Still, the output provides the useful information that our development does not rely on any of the dangerous or inconsistent features that would make our theorems trivially true.