(of
(6.6.2)) The central idea in proving
(6.6.2) is that for any open subset
\(U'\subset U\) with a compact closure
\(\bar U' \subset U\) and
\(\epsilon>0\text{,}\) there exists a
\(\delta >0\) such that for any sufficiently small cube
\(Q\subset U'\) in the sense that its side length are equal and no more than
\(\delta\text{,}\) we have
\begin{equation*}
v(T(Q))\le (1 +\Lambda \epsilon)^{n} \left|\det \left(D T(\bar {\mathbf x})\right)\right| v(Q),
\end{equation*}
where \(\bar {\mathbf x}\) is any point in \(Q\text{,}\) but will be taken as the center of \(Q\text{,}\) and \(\Lambda \ge 1\) depends on \(T, U'\) such that
\begin{equation*}
\Lambda^{-1}\Vert \mathbf u \Vert \le \Vert DT({\mathbf x}) \mathbf u \Vert \le \Lambda \Vert \mathbf u\Vert,
\forall {\mathbf x} \in U', \mathbf u \in \mathbb R^{n}.
\end{equation*}
This is seen by the linear Taylor approximation of \(T({\mathbf x})\text{:}\)
\begin{equation*}
T({\mathbf x})=T(\bar{\mathbf x}) + \left(DT(\bar{\mathbf x}) +C({\mathbf x},\bar {\mathbf x})\right)({\mathbf x}-\bar {\mathbf x}) \text{ for any $\bx, \bar \bx \in U'$,}
\end{equation*}
where \(|C({\mathbf x},\bar {\mathbf x})|\le \epsilon\) as long as \(\Vert {\mathbf x}-\bar {\mathbf x} \Vert \le \delta\) for some \(\delta >0\) which depends on \(T\text{,}\) \(U'\) and \(\epsilon >0\text{.}\) For any cube \(Q\) in \(U'\) with \(\bar {\mathbf x}\) as center and side lengths equal and no more than \(\delta\text{,}\)
\begin{equation*}
\left\{ T(\bar {\mathbf x}) + DT(\bar{\mathbf x})({\mathbf x}-\bar {\mathbf x}): {\mathbf x} \in Q\right\}
\end{equation*}
is a parallelepiped with volume \(|\det \left(D T(\bar {\mathbf x})\right)| v(Q)\text{.}\)
We will use the Taylor expansion above to estimate the volume of \(T(Q)\text{.}\) Itβs easier to decompose \(T=T_{1}\circ T_{2}\) on \(Q\text{,}\) where \(T_{1}\) is the linear map given by the matrix \(\left(D T(\bar {\mathbf x})\right)\text{,}\) and \(T_{2}=T_{1}^{-1}\circ T\text{.}\) Note that the Jacobian matrix of \(T_{2}\) at \(\bar \bx\) equal to the identity. If we apply \(T_{1}^{-1}\) to the Taylor expansion above we get
\begin{equation*}
T_{2}(\bx)=T_{1}^{-1}\circ T(\bar{\mathbf x}) + \left(I+T_{1}^{-1}\circ C({\mathbf x},\bar {\mathbf x})\right)({\mathbf x}-\bar {\mathbf x}).
\end{equation*}
It follows that \(T_{2}(Q)\) is contained in a hypercube \(Q'\) with \(T_{1}^{-1}\circ T(\bar {\mathbf x})\) as center and side lengths equal to the side lengths of \(Q\) multiplied by \((1+\Lambda \epsilon)\text{.}\) Furthermore, \(T(Q)=T_{1}\circ T_{2}(Q)\subset T_{1}(Q')\) and \(v(T_{1}(Q'))=|\det DT(\bar {\mathbf x})| |Q'|\text{.}\) Thus
\begin{equation*}
v(T(Q)) \le \left|\det \left(DT(\bar {\mathbf x})\right)\right| (1+\Lambda \epsilon)^n v (Q).
\end{equation*}
The approach here follows that in the article by
J. Schwartz, The formula for change in variables in a multiple integral, Amer. Math. Monthly 61, (1954), 81--85.
The above estimate also holds for hypercubes whose ratios of side lengths are within
\(1\pm \epsilon\text{,}\) with a somewhat modified constant replacing
\((1+\Lambda \epsilon)^n\) and that constant still approaches
\(1\) as
\(\epsilon \to 0 \; \mbox{---} \) we will keep using
\((1+\Lambda \epsilon)^n\) in the estimate for such hypercubes.
For any closed rectangle
\(S\subset U\) and any
\(\epsilon>0\text{,}\) we can do fine enough partition
\(\cP=\{Q_{\alpha}\}\) of
\(S\) using hypercubes for which the above estimate holds for each
\(Q_{\alpha}\text{.}\) Here we didnβt use
LemmaΒ 6.6.5 to produce sub rectangles
\(Q_{\alpha}\) of
\(S\) with equal side lengths as we want to work with a finite number of sub rectangles in a partition.
It now follows that
\begin{equation*}
v(T(S))=\sum_{\alpha}v(T(Q_{\alpha})) \le (1+\Lambda \epsilon)^n \sum_{\alpha}
\left|\det \left(DT(\bar {\mathbf x_{\alpha}})\right)\right| v (Q_{\alpha})\text{.}
\end{equation*}
The summation above is a Riemann sum for the integral of \(\left|\det \left(DT({\mathbf x})\right)\right|\) on \(S\text{,}\) so as the partition size tends to \(0\text{,}\) we get
\begin{equation*}
v(T(S)) \le (1 +\Lambda \epsilon)^{n}\int_{S} \left|\det \left(D T( {\mathbf x})\right)\right|\, d{\mathbf x}.
\end{equation*}
Since \(\epsilon>0\) is arbitrary, it follows that
\begin{equation*}
v(T(S))\le \int_{S} \left|\det \left(DT( {\mathbf x})\right)\right|\, d{\mathbf x}.
\end{equation*}
And this argument works not only for rectangles, but for all Jordan-measurable set. In fact, for any Jordan-measurable set \(E\) whose closure is in \(U\text{,}\) and any non-negative function \(f\text{,}\) integrable on \(T(E)\text{,}\)
\begin{equation}
\int_{T(E)}f({\mathbf y})\, d{\mathbf y} \le \int_{E} f(T({\mathbf x}))|\det DT({\mathbf x})|\, d{\mathbf x}.\tag{6.6.4}
\end{equation}
Here are some more details for proving
(6.6.4). In defining
\(\int_{T(E)}f({\mathbf y})\, d{\mathbf y}\text{,}\) we may work with partitions
\(\cP\) in the
\(y\)-space such that any of its subrectangle that has non-empty intersection with
\(T(E)\) must be contained in
\(T(U)\text{.}\) Then
\begin{align*}
L(f \chi_{T(E)}, \cP) \amp = \sum_{S_{j}: S_{j}\cap T(E)^{c}\ne \emptyset} m_{S_{j}}(f \chi_{T(E)})v(S_{j})
+ \sum_{S_{j}: S_{j}\subset T(E)} m_{S_{j}}(f \chi_{T(E)})v(S_{j})\\
\amp \le \sum_{S_{j}: S_{j} \subset T(E) }
\int_{T^{-1}(S_{j})} f(T({\mathbf x})) |\det DT({\mathbf x})|\, d{\mathbf x}\\
\amp \le \int_{E} f(T({\mathbf x})) |\det DT({\mathbf x})|\, d{\mathbf x},
\end{align*}
where we have used \(m_{S_{j}}(f \chi_{T(E)})=0\) when \(S_{j}\cap T(E)^{c}\ne \emptyset\text{,}\) \(v(S_{j})\le \int_{T^{-1}(S_{j})} |\det DT({\mathbf x})|\, d{\mathbf x}\) and \(0\le m_{S_{j}}(f \chi_{T(E)}) \le f({\mathbf x})\) for \({\mathbf x}\in T^{-1}(S_{j})\text{,}\) as well as \(\cup_{S_{j}\subset T(E)} T^{-1}(S_{j})\) forming a non-overlapping subset of \(E\text{.}\) It follows that
\begin{equation*}
\int_{T(E)}f({\mathbf y})\, d{\mathbf y} =\sup_{\cP} L(f \chi_{T(E)}, \cP) \le \int_{E} f(T({\mathbf x})) |\det DT({\mathbf x})|\, d{\mathbf x}.
\end{equation*}