ddot — double-precision dot product
Sum of pairwise products of two vectors with configurable strides. The strong test is harder than the April version; the score reflects that. Closure on the remaining 22 is bounded engineering.
Results · Per routine
For every routine on the OWI ladder, this page gives the original Fortran (linked to its source), the Ada spec the factory emitted, the Ada body the factory emitted, and a short paragraph on what shape the result took. Nothing about how — that is on the methodology page. The page grows as new phases land.
The ten kernels any safety-critical numerical stack ultimately sits on top of. Fortran sources are Netlib reference BLAS and LAPACK (Modified BSD).
Sum of pairwise products of two vectors with configurable strides. The strong test is harder than the April version; the score reflects that. Closure on the remaining 22 is bounded engineering.
Vector scalar-multiply-add: y[i] := alpha * x[i] + y[i]
for every selected pair. 15 unproved obligations; bounded closure.
x[i] := alpha * x[i] for every selected element.
25 unproved obligations; bounded closure.
L2 norm of a vector, with overflow-safe rescaling for very large and very small magnitudes. 23 unproved obligations; bounded closure.
y := alpha * op(A) * x + beta * y, with op
selecting transpose-or-not. 46 unproved obligations; bounded
closure.
A := alpha * x * y^T + A. Adds an outer product to
a matrix. This one needed a one-line manual patch on the
emitted spec; the underlying gap in the pipeline is logged
for a structural fix and a clean re-run.
C := alpha * op(A) * op(B) + beta * C. The hardest
test in the set; the score reflects that. Closure on the
remaining 79 is bounded engineering work.
Factors a symmetric positive-definite matrix as A = L · LT (lower-triangular). The body now serves the re-verified Cholesky factorisation against the same tightened audit dgeqrf received; the small dip from the earlier 90.2% figure reflects the harder test.
QR factorisation. The earlier 98.7% figure on this routine was withdrawn after an internal audit found the body satisfied its specification without doing the underlying Householder work. The body now serves a re-verified Householder factorisation against a tightened specification; 87.5% is the honest figure on the harder test.
P · A = L · U. LU factorisation with partial pivoting. Re-verified under the same audit treatment as dgeqrf and dpotrf; the body now serves a factorisation checked against a known-input reconstruction at runtime.
The signal-processing layer that surrounds the perception network in any autonomous-systems deployment. cfftf is rerun under strong postconditions; the three textbook control routines (Kalman, PID, quaternion) sit at the April baseline and are queued.
Cooley-Tukey mixed-radix forward complex FFT (radix 2, 3, 4, 5).
48 unproved obligations; bounded closure. The Fortran is the
original Netlib FFTPACK bundle (1985, public domain); the Ada is
a single consolidated package covering cfftf,
cffti, cfftf1, cffti1, and
the passf* pass functions.
The cuDNN-shaped operations: int8 matmul and conv2d (quantised), plus FP64 conv2d, pooling, activation and normalisation. Unlike Phase 1 and Phase 2, where the Fortran is real published code (Netlib reference BLAS / LAPACK / FFTPACK), the Phase 3 reference Fortran was hand-written as a textbook implementation for the purpose — there is no public "reference int8 matmul in Fortran". The Fortran files are linked per row below; they are short (40–70 lines each), released into the public domain, and intended to be the verifiable mathematical baseline against which the emitted Ada is judged.
Reference: Jacob et al., Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference, arXiv:1712.05877 (2018). Symmetric per-tensor formulation, both input zero-points fixed at zero. 24 unproved obligations; bounded closure.
Reference: same Jacob et al. quantisation scheme applied to direct 2D convolution. Stride 1, no padding (valid). 38 unproved obligations; bounded closure.
Reference: standard FP64 direct convolution; see e.g. Goodfellow, Bengio, Courville, Deep Learning (2016), ch. 9. Lowest Phase 3 discharge; the score reflects the test being harder on the heavier shape. Closure on the remaining 49 is bounded engineering.
Reference: standard CNN max-pooling layer; cuDNN's CUDNN_POOLING_MAX shape. Multi-channel input, configurable window and stride. 36 unproved; bounded closure.
Reference: V. Nair and G. Hinton, Rectified Linear Units
Improve Restricted Boltzmann Machines, ICML 2010.
Element-wise y[i] := max(0, x[i]). 15 unproved;
bounded closure.
Reference: Ba, Kiros, Hinton, Layer Normalization, arXiv:1607.06450 (2016). Per-row mean / variance / affine transform via the learnable scale and shift. 23 unproved; bounded closure.
Three control kernels (Kalman, PID, quaternion) are queued for rerun; they sit on the OWI page at the April baseline. Phase 4 (a wider set of FP inference layers and additional int8 quantised primitives) is queued. The page grows as the work does.