TaylorDiff.jl: Taking Higher-order Derivatives Seriously
Songchen Tan
Introduction
Theory
Implementation
Checkout the code at TaylorDiff.jl!
Results
Scenario 1: Scaling comparison
We compare the efficiency of TaylorDiff.jl to ForwardDiff.jl (nesting first-order derivative), with scalar function $f(x)=\sin(x)$ up to order 10 and a simple MLP $f(x)=W_2\cdot\exp.(W_1\cdot x+b_1)+b_2$ up to order 7: