File:Arnoldi.f: Difference between revisions

From openpipeflow.org
Jump to navigation Jump to search
mNo edit summary
mNo edit summary
 
(35 intermediate revisions by the same user not shown)
Line 5: Line 5:
The Arnoldi method is a method for calculating the eigenvalues and eigenvectors of a matrix, i.e. for calculating the scalar $\sigma$ and $n$-vectors $\vec{x}$ that satisfy
The Arnoldi method is a method for calculating the eigenvalues and eigenvectors of a matrix, i.e. for calculating the scalar $\sigma$ and $n$-vectors $\vec{x}$ that satisfy
\[
\[
  \sigma\,A = A\,\vec{x}
\sigma\,\vec{x} = A\,\vec{x}
\]
\]
for a given $n\times n$ matrix $A$.
for a given $n\times n$ matrix $A$.


The main advantage of the method is that it only requires calculations of multiplies by $A$ for a given $\vec{x}$ -- it does not need to know $A$ itself.  This means that $A$ need not even be stored, and could correspond to a very complex linear 'action' on $\vec{x}$, e.g. a time integral with initial condition $x$.  The method seeks eigenvectors in $span\{\vec{x},\,A\vec{x},\,A^2\vec{x},...\}.$, but uses orthogonalisation to improve the suitability of the basis.  An extra vector must be stored after each iterationIt is possible to restart without losing information. This has not been implemented.
The main advantage of the method is that it only requires calculations of multiplies by $A$ for a given $\vec{x}$ -- it does not need to know $A$ itself.  This means that $A$ need not even be stored, and could correspond to a very complex linear 'action' on $\vec{x}$, e.g. a time integral with initial condition $\vec{x}$.  Given a starting vector $\vec{x}_0$, the method seeks eigenvectors $\vec{x}$ in $\mathrm{span}\{\vec{x}_0,\,A\vec{x}_0,\,A^2\vec{x}_0,...\}$, but uses Gram-Schmidt orthogonalisation to improve the numerical suitability of this basis setThe set of orthogonalised vectors is called the Krylov-subspaceIn practice we need to limit the size of this space to m vectors, but if this number is reached, it is possible to restart without losing information. Restarts are not implemented here; see the following which can dramatically reduce the number of multiplies by $A$ required.


The method finds the largest eigenvalues and those most separated in the complex plane first.  If $A$ is expected to have many negative eigenvalues of little interest, it may be better to work with $\tilde{A}=\mathrm{e}^A=1+A+\frac{1}{2}A^2+...$, which has no effect on the eigenvectors but exponentiates the eigenvalues, $\tilde{\sigma}=\mathrm{e}^\sigma$.  The negative eigenvalues then correspond to bunched eigenvalues close to the origin.  The Arnoldi method then favours the largest $|\tilde{\sigma}|$, being the $\sigma$ with largest real parts.
===Time Integration and Exponentiation===


Note that time integration corresponds to exponentiation: An eigenvector of a differential equation with a strongly negative eigenvalue decaysUpon time integration, the decay corresponds to multiplication of the initial condition by a value close to zero, i.e. the eigenvalue of the action of the time integral is close to zeroTime integration of a marginal eigenvector corresponds to multiplication by an eigenvalue on the unit circle.
The method finds the eigenvalues most separated in the complex plane first.  If $A$ is expected to have many negative eigenvalues of little interest, it may be better to work with $\tilde{A}=\mathrm{e}^A=1+A+\frac{1}{2!}A^2+...$, i.e. the eigenproblem $\mathrm{e}^\sigma\vec{x}=\mathrm{e}^A\vec{x}$.  This problem
shares the same eigenvectors as the original problem, but its eigenvalues, $\tilde{\sigma}=\mathrm{e}^\sigma$, are often better suited to the Arnoldi method.  The negative eigenvalues $\sigma$, which typically correspond to modes of little interest, now correspond to eigenvalues $\tilde{\sigma}$ bunched close to the origin.  The Arnoldi method favours the $\tilde{\sigma}$ most separated in the complex plane, being the $\sigma$ with largest real parts.
 
Note that for the system $\partial_t \vec{x} = A\,\vec{x}$, time integration corresponds to exponentiation:  
First, for eigenvector $\vec{x}_0$ with eigenvalue $\sigma$ we may write $\sigma\, \vec{x}_0 = A\,\vec{x}_0$. 
Also, taking $\vec{x}_0$ as an intial condition, after a time $T$ it will have grown or decayed to
$\vec{x}_T=\mathrm{e}^{\sigma T}\vec{x}_0 =\vec{x}_0 + \int_0^T A\,\vec{x}\,dt$We may write this as
$\mathrm{e}^{\sigma T}\vec{x}_0 \equiv B\,\vec{x}_0$, where $B$ is the time integration operator.
Then $\vec{x}_0$ is an eigenvector of both $A$ and $B$, where its eigenvalue for $B$ is $\tilde{\sigma}=\mathrm{e}^{\sigma T}$.
We apply the Arnoldi method to $B$, then recover $\sigma$ from $\tilde{\sigma}$.
 
Now let's consider the case of a perturbation $\vec{\delta x}$ linearised about a solution $\vec{x}_0$.  Evolution of the perturbation is given by $\partial_t \vec{\delta x} = A(\vec{x}_0)\,\vec{\delta x}$. Let $\vec{X}(\vec{x})$ be the result of time integration of $\vec{x}$. For this system, the result of $B\,\vec{\delta x}$ for a given $\vec{\delta x}$ may be approximated by $\frac{1}{\epsilon}(\vec{X}(\vec{x}_0+\epsilon\,\vec{\delta x})-\vec{X}(\vec{x}_0))$ for some small value $\epsilon$.  (Although we expect that $\vec{X}(\vec{x}_0)=\vec{x}_0$ for a solution $\vec{x}_0$, numerical accuracy is likely to be better with the given form.) Note that to find the eigenvalues $\tilde{\sigma}=\mathrm{e}^{\sigma T}$ of $B$ with the Arnoldi method, only a routine for time integration of a given initial condition is required.


== How to use the code ==
== How to use the code ==


To download, click the link above.
To download, click the link above.  The Lapack package is also required.


The subroutine <tt>arnold(...)</tt> needs to be passed a subroutine that calculates the dot product of two eigenvectors.  It should look like, for example,
The subroutine <tt>arnold(...)</tt> needs to be passed a subroutine that calculates the dot product of two eigenvectors.  It should look like, for example,
Line 26: Line 37:
     dotprod = sum(a*b)
     dotprod = sum(a*b)
  end function dotprod
  end function dotprod
<tt>arnold(...)</tt> needs to be called repeatedly.  It communicates the status of the computation via the flag <tt>ifail</tt>, which tells the user how many eigenvalues are converged, up to a given tolerance, to multiply a vector by $A$ again, or if the method has failed (e.g. reached maximum number of vector that can be stored.
<tt>arnold(...)</tt> needs to be called repeatedly.  It communicates the status of the computation via the flag <tt>ifail</tt>, which tells the user how many eigenvalues are converged up to a given tolerance, to multiply a vector by $A$ again, or tells the user if the method has failed, e.g. reached maximum number of vector that can be stored.


An example of use of the code:
An example of use of the code:
   ! declare workspace vectors, h, q, b...
   ! declare workspace vectors, h, q, b... - see header of arnoldi.f
   sv = ... ! random initial vector x
   sv = ... ! random initial vector x
   k = 0    ! initialise iteration counter
   k = 0    ! initialise iteration counter
   do while(.true.)
   do while(.true.)
       call arnold(n,k,kmax,ncgd,dotprod,tol,sv,h,q,b,wr,wi,ifail)
       call arnold(n,k,kmax,ncgd,dotprod,tol,sv,h,q,b,wr,wi,ifail)
      if(k>ncgd+2) then
        print*, ' k =', k
        do i = 1, ncgd
            print*, wr(i), wi(i)
        end do
      end if
       if(ifail==-1) then
       if(ifail==-1) then
         print*, ' arnoldi converged!'
         print*, ' arnoldi converged!'
Line 46: Line 51:
         call multA(sv, sv)      ! possibly complicated routine that multiplies sv by A
         call multA(sv, sv)      ! possibly complicated routine that multiplies sv by A
       else if(ifail==1) then
       else if(ifail==1) then
         print*, 'getEigen: WARNING: arnoldi reached max its'
         print*, 'WARNING: arnoldi reached max its'
         exit
         exit
       else if(ifail>=2) then
       else if(ifail>=2) then
         print*, 'arnoldi error'
         print*, 'WARNING: arnoldi error:', ifail
         exit
         exit
       end if   
       end if   
   end do
   end do
On exit, the eigenvectors are stored in columns of <tt>b</tt>, in order corresponding to eigenvalues in <tt>wr, wi</tt>.  If the first eigenvalue is real, the eigenvector occupies the first column of <tt>b</tt> only.  If the next eigenvalue is complex, it will occupy the next two columns of <tt>b</tt>.
On exit, the eigenvectors are stored in columns of <tt>b</tt>, in order corresponding to eigenvalues in <tt>wr</tt> and <tt>wi</tt>.  If the first eigenvalue is real (<tt>wi(1)==0.</tt>), the eigenvector occupies the first column of <tt>b</tt> only.  If the next eigenvalue is complex, the real and imaginary parts will occupy the next two columns of <tt>b</tt>.
 
== Parallel use ==
 
No special adaptations are required for parallel (MPI) use -- let each thread pass its subsection for the vector <tt>sv</tt>, and let the <tt>dotprod</tt> function <tt>allreduce</tt> the result of the dot product.

Latest revision as of 02:22, 21 October 2021

$ \renewcommand{\vec}[1]{ {\bf #1} } \newcommand{\bnabla}{ \vec{\nabla} } \newcommand{\Rey}{Re} \def\vechat#1{ \hat{ \vec{#1} } } \def\mat#1{#1} $

The Arnoldi Method

The Arnoldi method is a method for calculating the eigenvalues and eigenvectors of a matrix, i.e. for calculating the scalar $\sigma$ and $n$-vectors $\vec{x}$ that satisfy \[ \sigma\,\vec{x} = A\,\vec{x} \] for a given $n\times n$ matrix $A$.

The main advantage of the method is that it only requires calculations of multiplies by $A$ for a given $\vec{x}$ -- it does not need to know $A$ itself. This means that $A$ need not even be stored, and could correspond to a very complex linear 'action' on $\vec{x}$, e.g. a time integral with initial condition $\vec{x}$. Given a starting vector $\vec{x}_0$, the method seeks eigenvectors $\vec{x}$ in $\mathrm{span}\{\vec{x}_0,\,A\vec{x}_0,\,A^2\vec{x}_0,...\}$, but uses Gram-Schmidt orthogonalisation to improve the numerical suitability of this basis set. The set of orthogonalised vectors is called the Krylov-subspace. In practice we need to limit the size of this space to m vectors, but if this number is reached, it is possible to restart without losing information. Restarts are not implemented here; see the following which can dramatically reduce the number of multiplies by $A$ required.

Time Integration and Exponentiation

The method finds the eigenvalues most separated in the complex plane first. If $A$ is expected to have many negative eigenvalues of little interest, it may be better to work with $\tilde{A}=\mathrm{e}^A=1+A+\frac{1}{2!}A^2+...$, i.e. the eigenproblem $\mathrm{e}^\sigma\vec{x}=\mathrm{e}^A\vec{x}$. This problem shares the same eigenvectors as the original problem, but its eigenvalues, $\tilde{\sigma}=\mathrm{e}^\sigma$, are often better suited to the Arnoldi method. The negative eigenvalues $\sigma$, which typically correspond to modes of little interest, now correspond to eigenvalues $\tilde{\sigma}$ bunched close to the origin. The Arnoldi method favours the $\tilde{\sigma}$ most separated in the complex plane, being the $\sigma$ with largest real parts.

Note that for the system $\partial_t \vec{x} = A\,\vec{x}$, time integration corresponds to exponentiation: First, for eigenvector $\vec{x}_0$ with eigenvalue $\sigma$ we may write $\sigma\, \vec{x}_0 = A\,\vec{x}_0$. Also, taking $\vec{x}_0$ as an intial condition, after a time $T$ it will have grown or decayed to $\vec{x}_T=\mathrm{e}^{\sigma T}\vec{x}_0 =\vec{x}_0 + \int_0^T A\,\vec{x}\,dt$. We may write this as $\mathrm{e}^{\sigma T}\vec{x}_0 \equiv B\,\vec{x}_0$, where $B$ is the time integration operator. Then $\vec{x}_0$ is an eigenvector of both $A$ and $B$, where its eigenvalue for $B$ is $\tilde{\sigma}=\mathrm{e}^{\sigma T}$. We apply the Arnoldi method to $B$, then recover $\sigma$ from $\tilde{\sigma}$.

Now let's consider the case of a perturbation $\vec{\delta x}$ linearised about a solution $\vec{x}_0$. Evolution of the perturbation is given by $\partial_t \vec{\delta x} = A(\vec{x}_0)\,\vec{\delta x}$. Let $\vec{X}(\vec{x})$ be the result of time integration of $\vec{x}$. For this system, the result of $B\,\vec{\delta x}$ for a given $\vec{\delta x}$ may be approximated by $\frac{1}{\epsilon}(\vec{X}(\vec{x}_0+\epsilon\,\vec{\delta x})-\vec{X}(\vec{x}_0))$ for some small value $\epsilon$. (Although we expect that $\vec{X}(\vec{x}_0)=\vec{x}_0$ for a solution $\vec{x}_0$, numerical accuracy is likely to be better with the given form.) Note that to find the eigenvalues $\tilde{\sigma}=\mathrm{e}^{\sigma T}$ of $B$ with the Arnoldi method, only a routine for time integration of a given initial condition is required.

How to use the code

To download, click the link above. The Lapack package is also required.

The subroutine arnold(...) needs to be passed a subroutine that calculates the dot product of two eigenvectors. It should look like, for example,

double precision function dotprod(n,a,b)
   implicit none
   integer :: n
   double precision :: a(n), b(n)
   dotprod = sum(a*b)
end function dotprod

arnold(...) needs to be called repeatedly. It communicates the status of the computation via the flag ifail, which tells the user how many eigenvalues are converged up to a given tolerance, to multiply a vector by $A$ again, or tells the user if the method has failed, e.g. reached maximum number of vector that can be stored.

An example of use of the code:

  ! declare workspace vectors, h, q, b... - see header of arnoldi.f
  sv = ... ! random initial vector x
  k = 0    ! initialise iteration counter
  do while(.true.)
     call arnold(n,k,kmax,ncgd,dotprod,tol,sv,h,q,b,wr,wi,ifail)
     if(ifail==-1) then
        print*, ' arnoldi converged!'
        exit
     else if(ifail==0) then
        call multA(sv, sv)      ! possibly complicated routine that multiplies sv by A
     else if(ifail==1) then
        print*, 'WARNING: arnoldi reached max its'
        exit
     else if(ifail>=2) then
        print*, 'WARNING: arnoldi error:', ifail
        exit
     end if   
  end do

On exit, the eigenvectors are stored in columns of b, in order corresponding to eigenvalues in wr and wi. If the first eigenvalue is real (wi(1)==0.), the eigenvector occupies the first column of b only. If the next eigenvalue is complex, the real and imaginary parts will occupy the next two columns of b.

Parallel use

No special adaptations are required for parallel (MPI) use -- let each thread pass its subsection for the vector sv, and let the dotprod function allreduce the result of the dot product.

File history

Click on a date/time to view the file as it appeared at that time.

Date/TimeDimensionsUserComment
current02:06, 13 December 2016 (8 KB)Apwillis (talk | contribs)For calculating the eigenvalues of a matrix.

The following 3 pages use this file: