User:JonskiC/Vector and matrix differentiation

In linear algebra, vector and matrix differentiation, or matrix differentiation for short, is the differentiation of expressions involving matrices and vectors.

Rules of calculation

Linear forms

Let $\mathbf {y} =\mathbf {A} \mathbf {x}$ be given, where $\mathbf {A}$ is an $m\times n$ matrix that does not depend on $\mathbf {x}$ and $\mathbf {x}$ is an $n$-element column vector. Then

{\frac {\partial \mathbf {y} }{\partial \mathbf {x} }}=\mathbf {A}

.
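The rule for linear forms can be checked numerically. The following is a minimal sketch, assuming NumPy is available; the helper `numerical_jacobian`, the matrix sizes, and the random seed are illustrative choices and not part of the original text. It approximates the Jacobian by central differences, with entry $(i, j)$ holding $\partial y_{i}/\partial x_{j}$, and compares it with $\mathbf {A}$.

```python
import numpy as np


def numerical_jacobian(f, x, eps=1e-6):
    """Central-difference Jacobian of f at x; entry (i, j) is d f_i / d x_j."""
    x = np.asarray(x, dtype=float)
    y0 = np.asarray(f(x), dtype=float)
    jac = np.zeros((y0.size, x.size))
    for j in range(x.size):
        step = np.zeros_like(x)
        step[j] = eps
        # Perturb only the j-th component of x in both directions.
        jac[:, j] = (np.asarray(f(x + step)) - np.asarray(f(x - step))) / (2 * eps)
    return jac


rng = np.random.default_rng(0)
A = rng.standard_normal((3, 4))  # m = 3, n = 4 (arbitrary illustrative sizes)
x = rng.standard_normal(4)

J = numerical_jacobian(lambda v: A @ v, x)
jacobian_matches = np.allclose(J, A, atol=1e-6)
print(jacobian_matches)
```

Because the map is linear, the central difference is exact up to floating-point rounding, so the comparison is expected to hold for any choice of `x`.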

Bilinear forms

Let $\mathbf {y} =\mathbf {z} ^{\top }\mathbf {A} \mathbf {x}$, where $\mathbf {z}$ is an $m\times 1$ vector and $\mathbf {A}$ is an $m\times n$ matrix. Furthermore, let $\mathbf {x}$ be an $n\times 1$ vector and let $\mathbf {A}$ be independent of $\mathbf {z}$ and $\mathbf {x}$. Then

{\frac {\partial \mathbf {y} }{\partial \mathbf {z} }}=\mathbf {x} ^{\top }\mathbf {A} ^{\top }

and

{\frac {\partial \mathbf {y} }{\partial \mathbf {x} }}=\mathbf {z} ^{\top }\mathbf {A}

.^[1]
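Both derivatives of the bilinear form can be verified in the same way. This is a sketch under the assumption that NumPy is available; `numerical_gradient`, the dimensions, and the seed are hypothetical illustration choices. The gradients are returned as one-dimensional arrays, which correspond to the row vectors $\mathbf {x} ^{\top }\mathbf {A} ^{\top }$ and $\mathbf {z} ^{\top }\mathbf {A}$ above.

```python
import numpy as np


def numerical_gradient(f, v, eps=1e-6):
    """Central-difference gradient of a scalar function f at v (1-D array)."""
    v = np.asarray(v, dtype=float)
    grad = np.zeros(v.size)
    for k in range(v.size):
        step = np.zeros_like(v)
        step[k] = eps
        grad[k] = (f(v + step) - f(v - step)) / (2 * eps)
    return grad


rng = np.random.default_rng(1)
m, n = 3, 4  # arbitrary illustrative sizes
A = rng.standard_normal((m, n))
z = rng.standard_normal(m)
x = rng.standard_normal(n)

# y = z^T A x, differentiated with respect to z and to x in turn.
grad_z = numerical_gradient(lambda u: u @ A @ x, z)
grad_x = numerical_gradient(lambda u: z @ A @ u, x)

print(np.allclose(grad_z, x @ A.T, atol=1e-6))  # compares with x^T A^T
print(np.allclose(grad_x, z @ A, atol=1e-6))    # compares with z^T A
```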

Quadratic forms

Consider the quadratic form $\mathbf {y} =\mathbf {z} ^{\top }\mathbf {A} \mathbf {z}$, where $\mathbf {A}$ is an $n\times n$ matrix and $\mathbf {z}$ is an $n$-dimensional vector. The derivative of this quadratic form with respect to $\mathbf {z}$ is

{\frac {\partial \mathbf {z} ^{\top }\mathbf {A} \mathbf {z} }{\partial \mathbf {z} }}=\mathbf {z} ^{\top }\left(\mathbf {A} +\mathbf {A} ^{\top }\right)

.

Proof:

By definition,

\mathbf {y} =\sum _{j=1}^{n}\sum _{i=1}^{n}a_{ij}z_{i}z_{j}

.

Differentiating with respect to the $k$-th element $z_{k}$ gives

{\frac {\partial \mathbf {y} }{\partial z_{k}}}=\sum _{j=1}^{n}a_{kj}z_{j}+\sum _{i=1}^{n}a_{ik}z_{i}\quad k=1,\ldots ,n

and hence, finally,

{\frac {\partial \mathbf {y} }{\partial \mathbf {z} }}=\mathbf {z} ^{\top }\mathbf {A} ^{\top }+\mathbf {z} ^{\top }\mathbf {A} =\mathbf {z} ^{\top }\left(\mathbf {A} +\mathbf {A} ^{\top }\right)

.
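As a small worked example of the result just proved, consider the case $n=2$ with a non-symmetric matrix. The specific entries of $\mathbf {A}$ are chosen here only for illustration and do not come from the original text.

```latex
\mathbf{A} = \begin{pmatrix} 1 & 2 \\ 3 & 4 \end{pmatrix},
\qquad
\mathbf{y} = \mathbf{z}^{\top}\mathbf{A}\mathbf{z}
           = z_{1}^{2} + 2 z_{1} z_{2} + 3 z_{2} z_{1} + 4 z_{2}^{2}
           = z_{1}^{2} + 5 z_{1} z_{2} + 4 z_{2}^{2}

\frac{\partial \mathbf{y}}{\partial \mathbf{z}}
  = \begin{pmatrix} 2 z_{1} + 5 z_{2} & 5 z_{1} + 8 z_{2} \end{pmatrix}
  = \begin{pmatrix} z_{1} & z_{2} \end{pmatrix}
    \begin{pmatrix} 2 & 5 \\ 5 & 8 \end{pmatrix}
  = \mathbf{z}^{\top}\left(\mathbf{A} + \mathbf{A}^{\top}\right)
```

Differentiating the expanded polynomial term by term gives the same row vector as the formula, since $\mathbf {A} +\mathbf {A} ^{\top }$ has the entries $2, 5, 5, 8$.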

If $\mathbf {A}$ is a symmetric matrix, then

{\frac {\partial \mathbf {z} ^{\top }\mathbf {A} \mathbf {z} }{\partial \mathbf {z} }}=2\mathbf {z} ^{\top }\mathbf {A}

.
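The general and the symmetric case of the quadratic form can both be checked numerically. The following is a minimal sketch, assuming NumPy is available; `numerical_gradient`, the size `n = 4`, and the symmetrized matrix `S` are illustrative assumptions and not part of the original text.

```python
import numpy as np


def numerical_gradient(f, v, eps=1e-6):
    """Central-difference gradient of a scalar function f at v (1-D array)."""
    v = np.asarray(v, dtype=float)
    grad = np.zeros(v.size)
    for k in range(v.size):
        step = np.zeros_like(v)
        step[k] = eps
        grad[k] = (f(v + step) - f(v - step)) / (2 * eps)
    return grad


rng = np.random.default_rng(2)
n = 4  # arbitrary illustrative size
A = rng.standard_normal((n, n))  # general square matrix, not symmetric
z = rng.standard_normal(n)

# General case: the derivative should equal z^T (A + A^T).
grad_general = numerical_gradient(lambda u: u @ A @ u, z)
general_ok = np.allclose(grad_general, z @ (A + A.T), atol=1e-6)

# Symmetric case: build a symmetric S and compare with 2 z^T S.
S = (A + A.T) / 2
grad_symmetric = numerical_gradient(lambda u: u @ S @ u, z)
symmetric_ok = np.allclose(grad_symmetric, 2 * z @ S, atol=1e-6)

print(general_ok, symmetric_ok)
```

Since `S` is the symmetric part of `A`, both quadratic forms take the same value at every `z`, so the two numerical gradients also agree with each other.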

Derivatives of determinants

There are many rules for differentiating determinants. The simplest concerns the derivative of the determinant of a matrix with respect to the matrix itself. Let $\mathbf {A}$ be a square matrix of order $m$; then the derivative of its determinant is given by