The key property used here is the uniform continuity of
\(f\) on
\(R\text{:}\) both sides of
(6.3.1) and
(6.3.2) can be approximated by any Riemann sum with respect to a partition whose size is sufficiently small. More specifically, for any
\(\epsilon >0\text{,}\) there exists
\(\delta >0\) such that for any rectangle
\(S_{1}\) of
\(R_{1}\) and
\(S_{2}\) of
\(R_{2}\) whose side length is less than
\(\delta\text{,}\) we have
\begin{equation*}
\osc (f(\bx, \cdot), S_{2}) \lt \epsilon \text{ for any $\bx \in R_{1}$,}
\end{equation*}
\begin{equation*}
\osc (f(\cdot, \by), S_{1}) \lt \epsilon \text{ for any $\by \in R_{2}$,}
\end{equation*}
and
\begin{equation*}
\osc (f, S_{1}\times S_{2}) \lt \epsilon.
\end{equation*}
It follows that for any partition \(\cP_{1}\) of \(R_{1}\) and \(\cP_{2}\) of \(R_{2}\) whenever \(\lambda (\cP_{1}), \lambda (\cP_{2}) \lt \delta\text{,}\)
\begin{equation*}
U(f(\bx, \by), \cP_{2})-L(f(\bx, \by), \cP_{2}) \lt \epsilon |R_{2}| \text{ for any $\bx \in R_{1}$,}
\end{equation*}
\begin{equation*}
U(f(\bx, \by), \cP_{1})-L(f(\bx, \by), \cP_{1}) \lt \epsilon |R_{1}| \text{ for any $\by \in R_{2}$,}
\end{equation*}
and
\begin{equation*}
U(f(\bx, \by), \cP_{1}\times \cP_{2})-L(f(\bx, \by), \cP_{1}\times \cP_{2}) \lt \epsilon |R_{1}| |R_{2}|.
\end{equation*}
Since \(\int_{R_{1}\times R_{2}} f(\bx, \by)\, dA\) is sandwiched between \(U(f(\bx, \by), \cP_{1}\times \cP_{2})\) and \(L(f(\bx, \by), \cP_{1}\times \cP_{2})\text{,}\) it follows that
\begin{equation*}
\left\vert \int_{R_{1}\times R_{2}} f(\bx, \by)\, dA - U(f(\bx, \by), \cP_{1}\times \cP_{2}) \right \vert
\lt \epsilon |R_{1}| |R_{2}|.
\end{equation*}
Likewise, \(\int_{R_{2}} f(\bx, \by)\, d\by\) is sandwiched between \(U(f(\bx, \by), \cP_{2})\) and \(L(f(\bx, \by), \cP_{2})\text{,}\) it follows that
\begin{equation*}
\left\vert \int_{R_{2}} f(\bx, \by)\, d\by - U(f(\bx, \by), \cP_{2}) \right \vert \lt \epsilon |R_{2}|
\text{ for any $\bx \in R_{1}$,}
\end{equation*}
therefore for any sampling \(\{\bx_{\alpha}\}\) of points corresponding to \(\cP_{1}\) we have
\begin{equation*}
\left\vert \sum_{S_{\alpha}\in \cP_1} \left(\int_{R_{2}} f(\bx_{\alpha}, \by)\, d\by \right) |S_{\alpha}|-
\sum_{S_{\alpha}\in \cP_1} U(f(\bx_{\alpha}, \by), \cP_{2}) |S_{\alpha}| \right \vert \lt\epsilon |R_{1}| |R_{2}|.
\end{equation*}
Observe that \(\sum_{S_{\alpha}\in \cP_1} U(f(\bx_{\alpha}, \by), \cP_{2}) |S_{\alpha}| \) is sandwiched between \(U(f(\bx, \by), \cP_{1}\times \cP_{2})\) and \(L(f(\bx, \by), \cP_{1}\times \cP_{2})\text{,}\) so
\begin{equation*}
\left\vert \sum_{S_{\alpha}\in \cP_1} U(f(\bx_{\alpha}, \by), \cP_{2}) |S_{\alpha}| -
\int_{R_{1}\times R_{2}} f(\bx, \by)\, dA
\right\vert \lt \epsilon |R_{1}| |R_{2}|.
\end{equation*}
It then follows that
\begin{equation*}
\left\vert \sum_{S_{\alpha}\in \cP_1} \left(\int_{R_{2}} f(\bx_{\alpha}, \by)\, d\by \right) |S_{\alpha}|-
\int_{R_{1}\times R_{2}} f(\bx, \by)\, dA
\right\vert \lt 2 \epsilon |R_{1}| |R_{2}|.
\end{equation*}
But \(\sum_{S_{\alpha}\in \cP_1} \left(\int_{R_{2}} f(\bx_{\alpha}, \by)\, d\by \right) |S_{\alpha}|\) is a Riemann sum of the integral \(\int_{R_{1}} \left( \int_{R_{2}} f(\bx, \by)\, d\by \right) \, d\bx\) only subject to \(\lambda (\cP_{1}) \lt \delta\text{,}\) it follows that
\begin{equation}
\left\vert \int_{R_{1}} \left( \int_{R_{2}} f(\bx, \by)\, d\by \right) \, d\bx-
\int_{R_{1}\times R_{2}} f(\bx, \by)\, dA
\right\vert \le 2 \epsilon |R_{1}| |R_{2}|.\tag{6.3.3}
\end{equation}
Since
\(\epsilon >0 \) is arbitrary, it follows from
(6.3.3) that
(6.3.1) holds.
(6.3.2) can be proved in a similar way.