Sophie: eso-midas-doc-13SEPpl1.2-3.mga5 i586

eso-midas-doc-13SEPpl1.2-3.mga5.i586.rpm

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN">
<!--Converted with LaTeX2HTML 98.1p1 release (March 2nd, 1998)
originally by Nikos Drakos (nikos@cbl.leeds.ac.uk), CBLU, University of Leeds
* revised and updated by:  Marcus Hennecke, Ross Moore, Herb Swan
* with significant contributions from:
  Jens Lippmann, Marek Rouchal, Martin Wilck and others -->
<HTML>
<HEAD>
<TITLE>The Modified Gauss-Newton Method.</TITLE>
<META NAME="description" CONTENT="The Modified Gauss-Newton Method.">
<META NAME="keywords" CONTENT="vol1">
<META NAME="resource-type" CONTENT="document">
<META NAME="distribution" CONTENT="global">
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=iso-8859-1">
<LINK REL="STYLESHEET" HREF="vol1.css">
<LINK REL="next" HREF="node131.html">
<LINK REL="previous" HREF="node129.html">
<LINK REL="up" HREF="node128.html">
<LINK REL="next" HREF="node131.html">
</HEAD>
<BODY >
<!--Navigation Panel-->
<A NAME="tex2html2189"
 HREF="node131.html">
<IMG WIDTH="37" HEIGHT="24" ALIGN="BOTTOM" BORDER="0" ALT="next"
 SRC="icons.gif/next_motif.gif"></A> 
<A NAME="tex2html2185"
 HREF="node128.html">
<IMG WIDTH="26" HEIGHT="24" ALIGN="BOTTOM" BORDER="0" ALT="up"
 SRC="icons.gif/up_motif.gif"></A> 
<A NAME="tex2html2179"
 HREF="node129.html">
<IMG WIDTH="63" HEIGHT="24" ALIGN="BOTTOM" BORDER="0" ALT="previous"
 SRC="icons.gif/previous_motif.gif"></A> 
<A NAME="tex2html2187"
 HREF="node1.html">
<IMG WIDTH="65" HEIGHT="24" ALIGN="BOTTOM" BORDER="0" ALT="contents"
 SRC="icons.gif/contents_motif.gif"></A> 
<A NAME="tex2html2188"
 HREF="node216.html">
<IMG WIDTH="43" HEIGHT="24" ALIGN="BOTTOM" BORDER="0" ALT="index"
 SRC="icons.gif/index_motif.gif"></A> 
<BR>
<B> Next:</B> <A NAME="tex2html2190"
 HREF="node131.html">The Quasi-Newton Method.</A>
<B> Up:</B> <A NAME="tex2html2186"
 HREF="node128.html">Outline of the Available</A>
<B> Previous:</B> <A NAME="tex2html2180"
 HREF="node129.html">The Newton-Raphson Method.</A>
<BR>
<BR>
<!--End of Navigation Panel-->

<H2><A NAME="SECTION001312000000000000000">
The Modified Gauss-Newton Method.</A>
</H2>
From a first guess of the
parameters <I>a</I><SUP>(1)</SUP>, a sequence 
<!-- MATH: $a^{(2)},a^{(3)},\ldots$ -->
<IMG
 WIDTH="119" HEIGHT="51" ALIGN="MIDDLE" BORDER="0"
 SRC="img278.gif"
 ALT="$a^{(2)},a^{(3)},\ldots $">
is generated
and is intended to converge to a local minimum of <IMG
 WIDTH="57" HEIGHT="48" ALIGN="MIDDLE" BORDER="0"
 SRC="img279.gif"
 ALT="$\chi^2(a)$">.
At each iteration, one computes
<BR><P></P>
<DIV ALIGN="CENTER">
<!-- MATH: \begin{displaymath}
a^{(k+1)}~=~a^{(k)}~+\alpha^{(k)}~d^{(k)}
\end{displaymath} -->


<IMG
 WIDTH="245" HEIGHT="36"
 SRC="img280.gif"
 ALT="\begin{displaymath}a^{(k+1)}~=~a^{(k)}~+\alpha^{(k)}~d^{(k)}\end{displaymath}">
</DIV>
<BR CLEAR="ALL">
<P></P>
where <I>d</I><SUP>(<I>k</I>)</SUP> is a certain descent direction and 
<!-- MATH: $\alpha^{(k)}$ -->
<IMG
 WIDTH="42" HEIGHT="27" ALIGN="BOTTOM" BORDER="0"
 SRC="img281.gif"
 ALT="$\alpha^{(k)}$">
is a real
coefficient which is chosen such that

<!-- MATH: $\chi^2(a^{(k)}+\alpha^{(k)}~d^{(k)})$ -->
<IMG
 WIDTH="187" HEIGHT="51" ALIGN="MIDDLE" BORDER="0"
 SRC="img282.gif"
 ALT="$\chi^2(a^{(k)}+\alpha^{(k)}~d^{(k)})$">
is approximately minimum. The
direction <I>d</I><SUP>(<I>k</I>)</SUP> is ideally the solution of the Newton equation
<BR><P></P>
<DIV ALIGN="CENTER">
<!-- MATH: \begin{displaymath}
H(a^{(k)})~d^{(k)}~=~-g(a^{(k)})
\end{displaymath} -->


<I>H</I>(<I>a</I><SUP>(<I>k</I>)</SUP>)&nbsp;<I>d</I><SUP>(<I>k</I>)</SUP>&nbsp;=&nbsp;-<I>g</I>(<I>a</I><SUP>(<I>k</I>)</SUP>)
</DIV>
<BR CLEAR="ALL">
<P></P>
which can also be rewritten 
<BR><P></P>
<DIV ALIGN="CENTER">
<!-- MATH: \begin{displaymath}
[J(a^{(k)})^T~J(a^{(k)})~+~B(a^{(k)})]~d^{(k)}~=~-J(a^{(k)})~r(a^{(k)})~~.
\end{displaymath} -->


[<I>J</I>(<I>a</I><SUP>(<I>k</I>)</SUP>)<SUP><I>T</I></SUP>&nbsp;<I>J</I>(<I>a</I><SUP>(<I>k</I>)</SUP>)&nbsp;+&nbsp;<I>B</I>(<I>a</I><SUP>(<I>k</I>)</SUP>)]&nbsp;<I>d</I><SUP>(<I>k</I>)</SUP>&nbsp;=&nbsp;-<I>J</I>(<I>a</I><SUP>(<I>k</I>)</SUP>)&nbsp;<I>r</I>(<I>a</I><SUP>(<I>k</I>)</SUP>)&nbsp;&nbsp;.
</DIV>
<BR CLEAR="ALL">
<P></P>
Neglecting the second derivatives matrix 
<!-- MATH: $B(a^{(k)})$ -->
<I>B</I>(<I>a</I><SUP>(<I>k</I>)</SUP>), we obtain
the ``normal equations'' and the Gauss-Newton direction
<BR><P></P>
<DIV ALIGN="CENTER">
<!-- MATH: \begin{displaymath}
J(a^{(k)})^T~J(a^{(k)})~d^{(k)}~=~-J(a^{(k)})~r(a^{(k)})
\end{displaymath} -->


<I>J</I>(<I>a</I><SUP>(<I>k</I>)</SUP>)<SUP><I>T</I></SUP>&nbsp;<I>J</I>(<I>a</I><SUP>(<I>k</I>)</SUP>)&nbsp;<I>d</I><SUP>(<I>k</I>)</SUP>&nbsp;=&nbsp;-<I>J</I>(<I>a</I><SUP>(<I>k</I>)</SUP>)&nbsp;<I>r</I>(<I>a</I><SUP>(<I>k</I>)</SUP>)
</DIV>
<BR CLEAR="ALL">
<P></P>
This so-called Gauss-Newton method is intended for problems where
<IMG
 WIDTH="73" HEIGHT="44" ALIGN="MIDDLE" BORDER="0"
 SRC="img283.gif"
 ALT="$\Vert B(a)\Vert$">
is small. If the Jacobian
<I>J</I>(<I>a</I>) is singular or near singular or if <IMG
 WIDTH="67" HEIGHT="44" ALIGN="MIDDLE" BORDER="0"
 SRC="img284.gif"
 ALT="$\Vert r(a)\Vert$">
is very large (the
so-called large residuals problem), the Gauss-Newton equation is not a
good approximation of the normal equations and the convergence is not
guaranteed.

<P>
The algorithm implemented here is a modification
of that Gauss-Newton method, that allows convergence even for
rank deficient Jacobians or for large residuals. The Gauss-Newton
direction is computed in  
<!-- MATH: $V_1~=~\Im m~[J(a^{(k)})^T~J(a^{(k)})]$ -->
<IMG
 WIDTH="278" HEIGHT="51" ALIGN="MIDDLE" BORDER="0"
 SRC="img285.gif"
 ALT="$V_1~=~\Im m~[J(a^{(k)})^T~J(a^{(k)})]$">,
the invariant space corresponding to the non-null eigenvalues.
A correction is taken in <I>V</I><SUB>2</SUB>, the orthogonal of <I>V</I><SUB>1</SUB>,
according to the second derivatives if
the decrease of the objective function at the last
iteration is considered too small.
The Hessian matrix is estimated using finite
differences of the gradient.

<P>
This method requires the availability of the derivatives and as
the number of gradient evaluations is almost <I>p</I> at each iteration, it
is recommended for problems with a small number of parameters,
let us say <IMG
 WIDTH="83" HEIGHT="39" ALIGN="MIDDLE" BORDER="0"
 SRC="img286.gif"
 ALT="$p~\leq~10$"><HR>
<!--Navigation Panel-->
<A NAME="tex2html2189"
 HREF="node131.html">
<IMG WIDTH="37" HEIGHT="24" ALIGN="BOTTOM" BORDER="0" ALT="next"
 SRC="icons.gif/next_motif.gif"></A> 
<A NAME="tex2html2185"
 HREF="node128.html">
<IMG WIDTH="26" HEIGHT="24" ALIGN="BOTTOM" BORDER="0" ALT="up"
 SRC="icons.gif/up_motif.gif"></A> 
<A NAME="tex2html2179"
 HREF="node129.html">
<IMG WIDTH="63" HEIGHT="24" ALIGN="BOTTOM" BORDER="0" ALT="previous"
 SRC="icons.gif/previous_motif.gif"></A> 
<A NAME="tex2html2187"
 HREF="node1.html">
<IMG WIDTH="65" HEIGHT="24" ALIGN="BOTTOM" BORDER="0" ALT="contents"
 SRC="icons.gif/contents_motif.gif"></A> 
<A NAME="tex2html2188"
 HREF="node216.html">
<IMG WIDTH="43" HEIGHT="24" ALIGN="BOTTOM" BORDER="0" ALT="index"
 SRC="icons.gif/index_motif.gif"></A> 
<BR>
<B> Next:</B> <A NAME="tex2html2190"
 HREF="node131.html">The Quasi-Newton Method.</A>
<B> Up:</B> <A NAME="tex2html2186"
 HREF="node128.html">Outline of the Available</A>
<B> Previous:</B> <A NAME="tex2html2180"
 HREF="node129.html">The Newton-Raphson Method.</A>
<!--End of Navigation Panel-->
<ADDRESS>
<I>Petra Nass</I>
<BR><I>1999-06-09</I>
</ADDRESS>
</BODY>
</HTML>