Release notes for HOL4, Trindemossen-1

(Released: 25 April 2024)

We are pleased to announce the Trindemossen 1 release of HOL4. We have changed the name (from Kananaskis) because of the kernel change reflected by the new efficient compute tool (see below).

New features
Bugs fixed
New theories
New tools
New examples
Incompatibilities

New features:

The HOL_CONFIG environment variable is now consulted when HOL sessions begin, allowing for a custom hol-config configuration at a non-standard location, or potentially ignoring any present hol-config. If the variable is set, any other hol-config file will be ignored. If the value of HOL_CONFIG is a readable file, it will be used.
There is a new theorem attribute, unlisted, which causes theorems to be saved/stored in the usual fashion but kept somewhat hidden from user-view. Such theorems can be accessed with DB.fetch, and may be passed to other tools through the action of other attributes, but will not appear in the results of DB.find and DB.match, and will not occur as SML bindings in theory files.
Holmake will now look for .hol_preexec files in the hierarchy surrounding its invocation. The contents of such files will be executed by the shell before Holmake begins its work. See the DESCRIPTION manual for more.
Holmake (at least under Poly/ML) now stores most of the products of theory-building in a “dot”-directory .holobjs. For example, if fooScript.sml is compiled, the result in the current directory is the addition of fooTheory.sig only. The files fooTheory.sml, fooTheory.dat, fooTheory.uo and fooTheory.ui are all deposited in the .holobjs directory. This reduces clutter.
Paralleling the existing Excl form for removing specific theorems from a simplifier invocation, there is now an ExclSF form (also taking a string argument) that removes a simpset fragment from the simplifier. For example:
```
     > simp[ExclSF "BOOL"] ([], “(λx. x + 1) (6 + 1)”);
     val it = ([([], “(λx. x + 1) 7”)], fn)
```
where the BOOL fragment includes the treatment of β-reduction.

Bugs fixed:

Fix a failure to define a polymorphic datatype with name a.

New theories:

A theory of “contiguity types”, as discussed in the paper Specifying Message Formats with Contiguity Types, ITP 2021. (DOI: 10.4230/LIPIcs.ITP.2021.30)

Contiguity types express formal languages where later parts of a string may depend on information held earlier in the string. Thus contiguity types capture a class of context-sensitive languages. They are helpful for expressing serialized data containing, for example, variable-length arrays. The soundness of a parameterized matcher is proved.
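To illustrate the kind of dependency meant here, the following is a minimal sketch in Python (not HOL4) of a message whose first byte gives the number of payload bytes that follow. The wire format and the name `parse_length_prefixed` are hypothetical and are not taken from the contiguity-types theory.

```python
from typing import Tuple


def parse_length_prefixed(data: bytes) -> Tuple[bytes, bytes]:
    """Split data into (payload, rest) for a hypothetical one-byte length prefix."""
    if len(data) < 1:
        raise ValueError("missing length byte")
    n = data[0]  # earlier field: declares how long the later field is
    if len(data) < 1 + n:
        raise ValueError("payload shorter than its declared length")
    return data[1:1 + n], data[1 + n:]


payload, rest = parse_length_prefixed(bytes([3, 10, 20, 30, 99]))
print(list(payload), list(rest))
```

The point of the sketch is only that the extent of the payload cannot be determined without first reading the length field, which is the context-sensitivity the paragraph above describes.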
permutes: The theory of permutations for general and finite sets, originally ported from HOL-Light’s Library/permutations.ml.
keccak: Defines the SHA-3 standard family of hash functions, based on the Keccak permutation and sponge construction. Keccak256, which is widely used in Ethereum, is included and was the basis for this work. A rudimentary computable version based on sptrees is included; faster evaluation using cvcompute is left for future work.

New tools:

The linear decision procedures for the reals (REAL_ARITH, REAL_ARITH_TAC and REAL_ASM_ARITH_TAC) have been updated by porting the latest code from HOL-Light. There are two versions: those in the existing RealArith package support only integral-valued coefficients, while those in the new package RealField support rational-valued coefficients (this includes division of reals, e.g. |- x / 2 + x / 2 = x can be proved by RealField.REAL_ARITH). Users can choose between the two versions by explicitly opening RealArith or RealField in their proof scripts. If realLib is opened, maximal backward compatibility is provided by first trying the old solver (now available as RealArith.OLD_REAL_ARITH, etc.) and, if that fails, the new solver. Some existing proofs from HOL-Light can be ported to HOL4 more easily.
New decision procedures for the reals ported from HOL-Light: REAL_FIELD, REAL_FIELD_TAC and REAL_ASM_FIELD_TAC (in the package RealField). These new tools first try RealField.REAL_ARITH and then fall back on new solvers based on the computation of Gröbner bases (from the new package Grobner).
Multiplying large numbers more efficiently:

In src/real there is a new library bitArithLib.sml which improves the performance of large multiplications for the types :num and :real. The library uses arithmetic of bitstrings in combination with the Karatsuba multiplication algorithm. To use the library, it has to be loaded before the functions that should be evaluated are defined.
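For readers unfamiliar with Karatsuba multiplication, the following is a minimal sketch of the algorithm on Python integers. It is not the code of bitArithLib, which works in-logic on bitstrings; the function name `karatsuba` and the small-operand cutoff of 16 are choices made for this illustration only.

```python
def karatsuba(x: int, y: int) -> int:
    """Multiply two non-negative integers using three recursive products."""
    if x < 16 or y < 16:
        return x * y  # small operands: fall back to ordinary multiplication
    half = max(x.bit_length(), y.bit_length()) // 2
    mask = (1 << half) - 1
    x_hi, x_lo = x >> half, x & mask  # x = x_hi * 2**half + x_lo
    y_hi, y_lo = y >> half, y & mask  # y = y_hi * 2**half + y_lo
    low = karatsuba(x_lo, y_lo)
    high = karatsuba(x_hi, y_hi)
    # (x_lo + x_hi) * (y_lo + y_hi) - low - high = x_lo * y_hi + x_hi * y_lo
    mid = karatsuba(x_lo + x_hi, y_lo + y_hi) - low - high
    return (high << (2 * half)) + (mid << half) + low


a, b = 3 ** 200, 7 ** 150
print(karatsuba(a, b) == a * b)
```

The saving comes from replacing the four half-size products of the schoolbook method with three, at the cost of a few extra additions and shifts.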
Fast in-logic computation primitive: A port of the Candle theorem prover’s primitive rule for computation, described in the paper “Fast, Verified Computation for Candle” (ITP 2023), has been added to the kernel. The new compute primitive works on certain operations on a lisp-like datatype of pairs of numbers:
```
     Datatype: cv = Pair cv cv
                  | Num num
     End
```
This datatype and its operations are defined in cvScript.sml, and the compute primitive cv_compute is accessible via the library cv_computeLib.sml (both in src/cv_compute).

There is also new automation that enables the use of cv_compute on functional HOL definitions which do not use the :cv type. In particular, cv_trans translates such definitions into equivalent functions operating over the :cv type. These can then be evaluated using cv_eval, which uses cv_compute internally. Both cv_trans and cv_eval can be found in the new cv_transLib.

Some usage examples are located in examples/cv_compute. See the DESCRIPTION manual for a full description of the functionality offered by cv_compute.

NB. To support cv_compute, the definitions of DIV and MOD over natural numbers num have been given specifications for the case when the second operand is zero. We follow HOL Light and Candle in defining n DIV 0 = 0 and n MOD 0 = n. These changes make DIV and MOD match the way Candle’s compute primitive handles DIV and MOD.
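The convention can be modelled outside HOL as follows. This is a small Python sketch, with the hypothetical names `hol_div` and `hol_mod`, that mirrors only the two equations stated above; it also checks that the usual division identity n = (n DIV m) * m + n MOD m continues to hold when m is zero under this choice.

```python
def hol_div(n: int, m: int) -> int:
    """Natural-number DIV with the convention n DIV 0 = 0."""
    return 0 if m == 0 else n // m


def hol_mod(n: int, m: int) -> int:
    """Natural-number MOD with the convention n MOD 0 = n."""
    return n if m == 0 else n % m


# The division identity holds for every divisor, including zero.
for n in range(50):
    for m in range(50):
        assert n == hol_div(n, m) * m + hol_mod(n, m)

print(hol_div(7, 0), hol_mod(7, 0))
```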
Polarity-aware theorem-search. Extending what is available through DB.find and DB.match, the new DB.polarity_search function allows the user to search for explicitly negative or positive occurrences of the specified pattern. Thanks to Eric Hall for this contribution.
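As background on what positive and negative occurrences mean, here is a minimal Python sketch of the textbook notion for propositional formulas: an occurrence flips sign under a negation and on the left of an implication. The tuple encoding and the name `occurrences` are hypothetical, and whether DB.polarity_search treats every connective exactly this way is an assumption, not something stated in these notes.

```python
def occurrences(formula, target, sign=+1):
    """Yield +1 or -1 for each occurrence of target inside formula."""
    if formula == target:
        yield sign
    op = formula[0]
    if op == "not":
        yield from occurrences(formula[1], target, -sign)
    elif op == "imp":
        yield from occurrences(formula[1], target, -sign)  # antecedent flips
        yield from occurrences(formula[2], target, sign)   # consequent keeps
    elif op in ("and", "or"):
        yield from occurrences(formula[1], target, sign)
        yield from occurrences(formula[2], target, sign)


p, q = ("var", "p"), ("var", "q")
formula = ("imp", p, ("imp", q, p))  # p ==> (q ==> p)
print(list(occurrences(formula, p)))
```

In the example, p occurs once negatively (as the outer antecedent) and once positively (as the final conclusion).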

New examples:

Dependability Analysis: Dependability is an umbrella term encompassing Reliability, Availability and Maintainability. Two widely used dependability modeling techniques have been formalized, namely Reliability Block Diagrams (RBD) and Fault Trees (FT). Both techniques graphically analyze the causes and factors contributing to the functioning and failure of the system under study. Moreover, these dependability techniques are highly recommended by several safety standards, such as IEC61508, ISO26262 and EN50128, for developing safe hardware and software systems.

New recursive datatypes are defined to model RBDs and FTs, providing compositional features for analyzing complex systems with an arbitrary number of components.
```
    Datatype: rbd = series (rbd list)
                  | parallel (rbd list)
                  | atomic (α event)
    End

    Datatype: gate = AND (gate list)
                   | OR (gate list)
                   | NOT gate
                   | atomic (α event)
    End
```
Some case studies are also formalized and placed alongside the dependability theories for illustration, including smart grids, WSN data transport protocols, satellite solar arrays, virtual data centers, oil and gas pipeline systems and an air traffic management system.
large_numberTheory (in examples/probability): various versions of the Law of Large Numbers (LLN) of Probability Theory.

Some LLN theorems (WLLN_uncorrelated and SLLN_uncorrelated) previously in probabilityTheory have been moved to large_numberTheory with unified statements.
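The compositional flavour of the rbd datatype can be sketched in Python as follows. This is not the HOL formalization: there, atomic blocks carry events, whereas here each atomic block carries a reliability value directly, and the standard series and parallel formulas are applied under the assumption that components fail independently. All class and function names are hypothetical.

```python
from dataclasses import dataclass
from math import prod
from typing import List, Union


@dataclass
class Atomic:
    reliability: float  # assumed probability that the component works


@dataclass
class Series:
    blocks: List["RBD"]


@dataclass
class Parallel:
    blocks: List["RBD"]


RBD = Union[Atomic, Series, Parallel]


def reliability(d: RBD) -> float:
    """Reliability of a diagram, assuming independent components."""
    if isinstance(d, Atomic):
        return d.reliability
    parts = [reliability(b) for b in d.blocks]
    if isinstance(d, Series):
        return prod(parts)                     # all blocks must work
    return 1.0 - prod(1.0 - r for r in parts)  # at least one block works


system = Series([Atomic(0.9), Parallel([Atomic(0.8), Atomic(0.8)])])
print(reliability(system))
```

For the example system the result is approximately 0.864, that is 0.9 multiplied by 1 - 0.2 * 0.2.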
Vector and Matrix theories (in examples/vector) translated from HOL-Light’s Multivariate/vectors.ml.
Relevant Logic (in examples/logic/relevant-logic): material contributed by James Taylor, mechanising a number of foundational results for propositional relevant logic. Three proof systems (two Hilbert, one natural deduction) are shown equivalent, and two model theories (the Routley-Meyer ternary-relation Kripke semantics, and Goldblatt’s “cover” semantics) are shown sound and complete with respect to the proof systems.
armv8-memory-model (in examples/arm): a port by Anthony Fox of Viktor Vafeiadis’s Coq formalization of the Armv8 Memory Model, which is based on the official mixed-size Armv8 memory model and associated paper.
p-adic numbers (in examples/padics): a construction of the p-adic numbers by Noah Gorrell. The approach taken defines the prime valuation function ν on first the natural numbers and then the rationals. It then defines the absolute value on ℚ so as to establish a p-metric. Cauchy sequences over these can be constructed and quotiented to construct a new numeric type. The new type adic is polymorphic such that the cardinality of the universe of the argument defines the prime number p of the construction. For types that have infinite or non-prime universes, p is taken to be 2. Thus, :2 adic, :4 adic and :num adic are isomorphic types, but :3 adic is distinct. Addition, multiplication and injection from the rationals are defined.
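The first steps of this construction, the valuation ν and the resulting absolute value on ℚ, can be sketched numerically in Python. This only illustrates the standard definitions; the function names are hypothetical, and the treatment of 0 (where the valuation is not a finite integer) is a choice made for this sketch rather than a description of the HOL theory.

```python
from fractions import Fraction


def nu_nat(p: int, n: int) -> int:
    """Exponent of p in the factorisation of the positive integer n."""
    if p < 2 or n <= 0:
        raise ValueError("need p >= 2 and n > 0")
    k = 0
    while n % p == 0:
        n //= p
        k += 1
    return k


def nu_rat(p: int, q: Fraction) -> int:
    """nu(a/b) = nu(a) - nu(b) for a nonzero rational a/b."""
    if q == 0:
        raise ValueError("valuation of 0 is not a finite integer")
    return nu_nat(p, abs(q.numerator)) - nu_nat(p, q.denominator)


def abs_p(p: int, q: Fraction) -> Fraction:
    """p-adic absolute value: |0| = 0, otherwise p ** (-nu(q))."""
    if q == 0:
        return Fraction(0)
    return Fraction(p) ** (-nu_rat(p, q))


x, y = Fraction(18, 5), Fraction(1, 3)
print(abs_p(3, x), abs_p(3, y), abs_p(3, x + y))
```

The sample values satisfy the ultrametric inequality |x + y| ≤ max(|x|, |y|), the property that makes this absolute value suitable for defining a metric on ℚ.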

Incompatibilities: