.. _sec_algorithms:

Algorithms of the OpenMPS
=========================

In this section we provide some details on the algorithms used by OSMPS in
order to give the user some understanding of the available convergence
parameters. The reader interested in a broader view of MPSs and their
algorithms should consult Ulrich Schollwoeck's review *The density-matrix
renormalization group in the age of matrix product states*, which is the
standard reference on the subject at the time of writing of this manual
:cite:`Schollwoeck2011`.

.. _sec_mps:

Definitions
-----------

We define a *tensor* as a map from a product of Hilbert spaces to the
complex numbers

.. math::

   T:\mathbb{H}_1\otimes \mathbb{H}_2\otimes \dots \otimes \mathbb{H}_r\to\mathbb{C}\, .

Here :math:`r` is the *rank* of the tensor. If we evaluate the elements of
the tensor :math:`T` in a fixed basis :math:`\left\{|i_j\rangle \right\}`
for each Hilbert space :math:`\mathbb{H}_j`, then equivalent information
is carried in the multidimensional array :math:`T_{i_1\dots i_r}`. We will
also refer to this multidimensional array as a tensor.

The information carried in a tensor does not change if we change the order
in which its indices appear. We will call such a generalized transposition
a *permutation* of the tensor. As an example, the permutations of the
rank-3 tensor :math:`T` are

.. math::

   T_{ijk}=\left[T'\right]_{kij}=\left[T''\right]_{jki}=\left[T'''\right]_{jik}=\left[T''''\right]_{kji}=\left[T'''''\right]_{ikj}\, .

Here, the primes indicate that the tensor differs from its unprimed
counterpart only by a permutation of indices. Similarly, by combining two
such indices together using the Kronecker product we can define an
equivalent tensor of lower rank, a process we call *index fusion*. We
denote the Kronecker product of two indices :math:`a` and :math:`b` using
parentheses as :math:`\left(ab\right)`, and a representation is provided
by

.. math::

   \left(ab\right)=\left(a-1\right)d_b+b\, ,

where :math:`d_b` is the dimension of :math:`\mathbb{H}_b` and :math:`a`
and :math:`b` are both indexed starting from 1. An example of fusion is

.. math::

   T_{ijk}=\left[T'\right]_{i\left(jk\right)}\, .

Here, :math:`T` is a rank-3 tensor of dimension :math:`d_i\times d_j\times d_k`
and :math:`T'` is a matrix of dimension :math:`d_i\times d_jd_k`. We refer
to the inverse operation of fusion, which creates a tensor of higher rank
by splitting a composite index, as *index splitting*.

Just as permutations generalize the notion of matrix transposition,
*tensor contraction* generalizes the notion of matrix multiplication. In a
contraction of two tensors :math:`A` and :math:`B`, some set of indices
:math:`\mathbf{c}_A` and :math:`\mathbf{c}_B` which describe a common
Hilbert space are summed over, and the resulting tensor :math:`C` consists
of products of the elements of :math:`A` and :math:`B` as

.. math::
   :label: eq_tensorcontraction

   C_{\bar{\mathbf{c}}_A\bar{\mathbf{c}}_B}=\sum_{\mathbf{c}}A_{\bar{\mathbf{c}}_A\mathbf{c}}B_{\mathbf{c}\bar{\mathbf{c}}_B}\, .

Here :math:`\bar{\mathbf{c}}_A` denotes the indices of :math:`A` which are
not contracted, and likewise :math:`\bar{\mathbf{c}}_B` denotes the
uncontracted indices of :math:`B`. The rank of :math:`C` is
:math:`r_A+r_B-2n_c`, where :math:`n_c` is the number of indices
contracted (i.e., the number of indices in :math:`\mathbf{c}`) and
:math:`r_A` and :math:`r_B` are the ranks of :math:`A` and :math:`B`,
respectively. In writing Eq. :eq:`eq_tensorcontraction` we have permuted
all of the indices :math:`\mathbf{c}_A` to be contracted to the rightmost
positions in :math:`A` and the indices :math:`\mathbf{c}_B` to the
leftmost positions in :math:`B` for notational simplicity.
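These elementary operations map directly onto dense array primitives. The
following is a minimal NumPy sketch of permutation, fusion, splitting, and
contraction; it is for illustration only and is not part of OSMPS.

.. code-block:: python

   import numpy as np

   # A rank-3 tensor of dimensions d_i x d_j x d_k, evaluated in a fixed basis.
   d_i, d_j, d_k = 2, 3, 4
   T = np.random.rand(d_i, d_j, d_k)

   # Permutation: reorder indices, e.g. T_{ijk} -> T'_{kij}.
   Tp = np.transpose(T, (2, 0, 1))

   # Index fusion: combine j and k into the composite index (jk), giving a
   # d_i x (d_j*d_k) matrix. The row-major reshape realizes
   # (jk) = (j-1)*d_k + k for 1-based indices.
   T_fused = T.reshape(d_i, d_j * d_k)

   # Index splitting: the inverse of fusion.
   T_split = T_fused.reshape(d_i, d_j, d_k)
   assert np.allclose(T_split, T)

   # Contraction: sum two tensors over a common index, here the last index
   # of A against the first index of B, as in the contraction equation.
   A = np.random.rand(2, 3, 5)
   B = np.random.rand(5, 4, 6)
   C = np.tensordot(A, B, axes=([2], [0]))  # rank 3 + 3 - 2*1 = 4
   assert C.shape == (2, 3, 4, 6)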
.. _fig_TD:

.. figure:: ../images/TD.*
   :width: 600px
   :alt: Tensor operations
   :align: center

   *Examples of basic tensor operations in diagrammatic notation. a) A
   rank-3 tensor. b) The conjugate of a rank-3 tensor. c) The contraction
   of two rank-3 tensors over a single index produces a rank-4 tensor.*

At this stage, it is advantageous to develop a graphical notation for
tensors and their operations :cite:`Shi2006`. A tensor is represented
graphically by a box with lines extending upwards from it. The number of
lines is equal to the rank of the tensor, and the order of the indices
from left to right is the same as the order of the lines from left to
right. A contraction of two tensors is represented by a line connecting
them. Finally, the complex conjugate of a tensor is denoted by a box with
lines extending downwards. Some basic tensor operations are shown in
graphical notation in :numref:`fig_TD`.

Following a similar line of reasoning as for contractions above, we may
also *decompose* tensors into contractions of tensors using permutation,
fusion, and any of the well-known matrix decompositions such as the
singular value decomposition (SVD) or the QR decomposition. For example, a
rank-3 tensor :math:`T` can be factorized as

.. math::

   T_{ijk}=\sum_lU_{\left(ij\right)l}S_lV_{lk}\, ,

where :math:`U` and :math:`V` are unitary and :math:`S` is a positive
semidefinite real vector. Such decompositions are of great use in MPS
algorithms.
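As a concrete illustration, the factorization above amounts to fusing
:math:`\left(ij\right)`, taking a matrix SVD, and splitting the fused
index again. The following NumPy sketch is ours, not OSMPS code:

.. code-block:: python

   import numpy as np

   d_i, d_j, d_k = 2, 3, 4
   T = np.random.rand(d_i, d_j, d_k)

   # Fuse (ij) to obtain a (d_i*d_j) x d_k matrix, then take a matrix SVD.
   M = T.reshape(d_i * d_j, d_k)
   U, S, Vh = np.linalg.svd(M, full_matrices=False)

   # U has orthonormal columns, S is a non-negative real vector, and Vh has
   # orthonormal rows. Splitting the fused index of U recovers the
   # factorization T_{ijk} = sum_l U_{(ij)l} S_l V_{lk}.
   T_rebuilt = ((U * S) @ Vh).reshape(d_i, d_j, d_k)
   assert np.allclose(T_rebuilt, T)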
A *tensor network* is now defined as a set of tensors whose indices are
connected in a network pattern, see :numref:`fig_TN`. Let us consider that
some set of the network's indices :math:`\mathbf{c}` are contracted over,
and the complement :math:`\bar{\mathbf{c}}` remain uncontracted. Then,
this network is a decomposition of some tensor
:math:`T_{\bar{\mathbf{c}}}`. The basic idea of tensor network algorithms
utilizing MPSs and their higher dimensional generalizations, such as
projected entangled-pair states (PEPS) :cite:`Verstraete2004`,
:cite:`Verstraete2008` and the multiscale entanglement renormalization
ansatz (MERA) :cite:`Vidal2007`, :cite:`Evenbly2009`, is to represent the
high-rank tensor :math:`c_{i_1\dots i_L}` encoding a many-body
wavefunction in a Fock basis,

.. math::

   |\psi\rangle=\sum_{i_1\dots i_L}c_{i_1\dots i_L}|{i_1\dots i_L}\rangle\, ,

as a tensor network with tensors of small rank. We set the convention that
indices which are contracted over in the tensor network decomposition will
be denoted by Greek indices, and indices which are left uncontracted will
be denoted by Roman indices. The former type of index will be referred to
as a bond index, and the latter as a physical index.

.. _fig_TN:

.. figure:: ../images/TN.*
   :width: 600px
   :alt: 7-site MPS
   :align: center

   *An MPS with 7 sites and open boundary conditions.*

In particular, an MPS imposes a one-dimensional topology on the tensor
network such that all the tensors appearing in the decomposition are
rank-3. The resulting decomposition has the structure shown in
:numref:`fig_TN`. Explicitly, an MPS may be written in the form

.. math::
   :label: eq_mpsdef

   |\psi_{\mathrm{MPS}}\rangle=\sum_{i_1,\dots i_L=1}^{d}\mathrm{Tr}\left(A^{\left[1\right]i_1}\dots A^{\left[L\right]i_L}\right)|i_1\dots i_L\rangle\, .

Here, :math:`i_1\dots i_L` label the :math:`L` distinct sites, each of
which carries a :math:`d`-dimensional Hilbert space. We will call
:math:`d` the *local dimension*. The superscript index in brackets
:math:`\left[j\right]` denotes that this is the tensor of the
:math:`j^{\mathrm{th}}` site, as these tensors are not all the same in
general. Finally, the trace sums over the first dimension of
:math:`A^{\left[1\right]i_1}` and the last dimension of
:math:`A^{\left[L\right]i_L}` concurrently, and is necessary only for
periodic boundary conditions where these dimensions are greater than 1.
All algorithms in OSMPS work only with open boundary conditions.

Obscured within the matrix product of Eq. :eq:`eq_mpsdef` is the size of
the matrix :math:`A^{\left[j\right]i_j}` formed from the tensor
:math:`A^{\left[j\right]}` with its physical index held constant. We will
refer to the left and right dimensions of this matrix as :math:`\chi_{j}`
and :math:`\chi_{j+1}`, and the maximum value of :math:`\chi_j` over all
tensors, the *bond dimension*, will be denoted as :math:`\chi`. The bond
dimension is the parameter which determines the efficiency of an MPS
simulation, and also its dominant computational scaling. From the relation
:math:`S_{\mathrm{vN}} \le \log\chi`, where :math:`S_{\mathrm{vN}}` is the
maximum von Neumann entropy of entanglement over all bipartite splittings,
we also see that :math:`\chi` represents an entanglement cutoff for MPSs.

.. _sec_vmps:

Variational ground state search
-------------------------------

OSMPS implements the two-site variational ground state search algorithm.
In this algorithm, the state which minimizes the energy globally is sought
by minimizing the energy of the state with all but two contiguous sites
held fixed. The position of the two variational sites is then moved
throughout the lattice as in :numref:`fig_vMPS`. The local minimization
begins by contracting together the MPS tensors of the two sites to be
optimized to create a rank-4 tensor,

.. math::

   \Theta_{\alpha\beta}^{ij}=\sum_{\gamma}A^{\left[k\right] i}_{\alpha \gamma}A^{\left[k+1\right]j}_{\gamma\beta}\, .

The Hamiltonian is then projected onto the Hilbert space of :math:`\Theta`
by fixing all other MPS tensors, resulting in the *effective Hamiltonian*
:math:`\hat{H}_{\mathrm{eff}}`. The energy is then minimized locally by
solving the effective Hamiltonian eigenvalue problem

.. math::

   \hat{H}_{\mathrm{eff}}{\Theta}=E{\Theta}

for the lowest-energy eigenpair :math:`\left(E,{\Theta}\right)`. The
effective linear dimension of :math:`\hat{H}_{\mathrm{eff}}` is very
large, but this matrix is also sparse in typical situations. Hence, we use
a sparse iterative eigenvalue solver, the Lanczos algorithm, to determine
the eigenpair corresponding to the lowest energy. In this algorithm there
are two convergence parameters. The first is :math:`n_{\mathrm{l}}`, the
number of iterations which are allowed before the algorithm claims to have
not converged. The second is a stopping tolerance
:math:`\epsilon_{\mathrm{l}}` on an eigenpair, which is determined by the
residual

.. math::

   \Vert \hat{H}_{\mathrm{eff}}\Theta-E\Theta\Vert_2<\epsilon_{\mathrm{l}}\, .
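The structure of this local eigenproblem can be mimicked with SciPy's
ARPACK bindings, which implement a Lanczos-type iteration for Hermitian
operators. The sketch below is purely illustrative; OSMPS uses its own
internal Lanczos solver, and here a dense random symmetric matrix stands
in for the matrix-free environment contraction that produces
:math:`\hat{H}_{\mathrm{eff}}\Theta`:

.. code-block:: python

   import numpy as np
   from scipy.sparse.linalg import LinearOperator, eigsh

   # Dimensions of a two-site block: chi_L * d * d * chi_R.
   chi_l, d, chi_r = 8, 2, 8
   n = chi_l * d * d * chi_r

   # Stand-in for the effective Hamiltonian: any Hermitian matrix-vector
   # product works; a real MPS code would contract the MPO with the
   # left/right environments and Theta instead of storing H_eff densely.
   rng = np.random.default_rng(7)
   H_dense = rng.standard_normal((n, n))
   H_dense = 0.5 * (H_dense + H_dense.T)
   H_eff = LinearOperator((n, n), matvec=lambda v: H_dense @ v)

   # Lowest eigenpair via a Lanczos-type iteration, with a stopping
   # tolerance (epsilon_l) and a cap on iterations (n_l).
   E, Theta = eigsh(H_eff, k=1, which='SA', tol=1e-10, maxiter=300)
   Theta = Theta[:, 0].reshape(chi_l, d, d, chi_r)  # back to rank-4 Theta
   print("local ground-state energy:", E[0])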
After the local minimization has been performed, we must return the state
to its canonical MPS decomposition. We do so by decomposing :math:`\Theta`
into two rank-3 tensors

.. math::

   \sum_{\gamma}A^{\left[k\right] i}_{\alpha \gamma}A^{\left[k+1\right]j}_{\gamma\beta}=\Theta_{\alpha\beta}^{ij}\, .

This decomposition may involve a truncation of the bond dimension, which
here is the dimension of the space indexed by :math:`\gamma`. In OSMPS,
this dimension :math:`\chi` is determined implicitly by

.. math::
   :label: eq_loctrunc

   e_{\mathrm{local}}\equiv 1-\sum_{\alpha=1}^{\chi}S_{\alpha}^2/ \mathbf{S}\cdot\mathbf{S}<\epsilon_{\mathrm{local}}\, ,

where :math:`\mathbf{S}` is the vector of singular values of the matrix

.. math::

   \Theta_{\left(\alpha i\right)\left(j\beta\right)}=\Theta_{\alpha\beta}^{ij}\,,

:math:`\epsilon_{\mathrm{local}}` is a user-supplied local tolerance, and
we will call :math:`e_{\mathrm{local}}` the local truncation error. The
user may also specify a maximum bond dimension :math:`\chi_{\mathrm{max}}`
which takes precedence over the condition Eq. :eq:`eq_loctrunc`. The
complete sequence of local minimization is shown in the blue box in
:numref:`fig_vMPS`.

After the minimization of two sites with indices :math:`\kappa` and
:math:`\kappa+1`, we minimize the sites :math:`\kappa+1` and
:math:`\kappa+2` and continue optimizing pairs of sites further to the
right until we reach the right boundary. At the right boundary, we reverse
direction and optimize pairs of sites with decreasing indices until we
reach the left boundary. Here, we again reverse direction and optimize
pairs of sites until we reach the pair of sites :math:`\kappa` and
:math:`\kappa+1` again. We have now completed an *inner sweep* in which
each pair of contiguous sites has been optimized twice. Convergence of the
variational state to the ground state is ascertained by the variance
condition

.. math::

   \langle \hat{H}^2\rangle-\langle \hat{H}\rangle^2<\epsilon_{\mathrm{v}}L\, .

If convergence is achieved, the program exits; otherwise another inner
sweep is performed. If convergence has not been achieved when the maximum
number of inner sweeps is exhausted, the program tries to decrease the
local tolerance to meet the desired variance tolerance by assuming that
the variance is proportional to the site-integrated local truncation error
and performing a linear extrapolation. The inner sweeping procedure is
then repeated. We call such an iteration an *outer sweep*.

The convergence parameters for this algorithm are set by an object of the
:py:class:`convergence.MPSConvParam` class. In particular, the relevant
parameters and their default values are collected in the table in the
documentation of :py:class:`convergence.MPSConvParam`.

.. _fig_vMPS:

.. figure:: ../images/vMPS.*
   :width: 600px
   :alt: Variational ground state
   :align: center

   *Variational ground state search on 7 sites. The energy of the two
   sites enclosed in the red box is locally minimized by solving the
   effective Hamiltonian eigenproblem with the Lanczos iteration. The
   associated two-site eigenvector is then decomposed to maintain a
   consistent MPS canonical form with local tolerance*
   :math:`\epsilon_{\mathrm{local}}` *and maximum bond dimension*
   :math:`\chi` *as shown in the blue box. The position of the two sites
   being minimized is moved throughout the entire lattice in a sweeping
   motion. After each pair of sites has been optimized twice, convergence
   is checked by considering whether the variance meets the desired
   tolerance.*
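The splitting and truncation rule of Eq. :eq:`eq_loctrunc` discussed above
is compact enough to state in code. The following NumPy sketch
(illustrative, not the OSMPS implementation) splits a two-site tensor
:math:`\Theta` by SVD and keeps the smallest :math:`\chi` compatible with
both :math:`\epsilon_{\mathrm{local}}` and :math:`\chi_{\mathrm{max}}`:

.. code-block:: python

   import numpy as np

   def split_theta(theta, eps_local=1e-10, chi_max=100):
       """Split a rank-4 two-site tensor into two rank-3 MPS tensors,
       truncating the new bond by the local truncation criterion."""
       chi_l, d1, d2, chi_r = theta.shape
       # Fuse (alpha i) and (j beta), then take a matrix SVD.
       M = theta.reshape(chi_l * d1, d2 * chi_r)
       U, S, Vh = np.linalg.svd(M, full_matrices=False)
       # e_local(chi) = 1 - sum_{a<=chi} S_a^2 / (S.S); keep the smallest
       # chi meeting eps_local; chi_max takes precedence over the tolerance.
       err = 1.0 - np.cumsum(S**2) / np.dot(S, S)
       ok = np.nonzero(err < eps_local)[0]
       chi = int(ok[0]) + 1 if ok.size else len(S)
       chi = min(chi, chi_max)
       A = U[:, :chi].reshape(chi_l, d1, chi)                  # left tensor
       B = (S[:chi, None] * Vh[:chi]).reshape(chi, d2, chi_r)  # right tensor
       return A, B, err[chi - 1]

   # Example on a random two-site tensor.
   theta = np.random.rand(8, 2, 2, 8)
   A, B, e_local = split_theta(theta, eps_local=1e-3)
   print(A.shape, B.shape, "local truncation error:", e_local)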
.. _sec_emps:

Variational excited state search: eMPS
--------------------------------------

The algorithm for finding excited states variationally with MPSs, which we
call eMPS :cite:`Wall2012`, uses a process of local minimization and
sweeping similar to the variational ground state search. The difference is
that the local minimization is performed using a projected effective
Hamiltonian which projects the variational state into the space orthogonal
to all previously obtained eigenstates. Hence, the convergence parameters
used for eMPS are identical to those used for the variational ground state
search, collected in the table of :py:class:`convergence.MPSConvParam`.
The eMPS method is used whenever the key ``'n_excited_states'`` in
``parameters`` has a value greater than zero, see Sec.
:ref:`sec_parameters`.

.. _sec_impsinitial:

iMPS as initial ansatz
----------------------

As the MPS methods used in OSMPS are variational, their efficiency is
greatly enhanced by the availability of a good initial guess for the
wavefunction. For the ground state, a good guess can be found by using a
fixed number of iterations of the infinite-size variational ground state
search with MPSs (iMPS), to be discussed in more detail in Sec.
:ref:`sec_imps`. In this method, we begin by considering two sites and
minimize the energy locally using the Lanczos iteration as discussed in
Sec. :ref:`sec_vmps`. We then decompose the two-site wavefunction into two
separate rank-3 tensors and use these tensors as an effective environment
into which two new sites are embedded as in :numref:`fig_iMPS`. The energy
of these two inner sites is minimized with the environment sites held
fixed, and then these sites are absorbed into the environment and a new
pair [#f1]_ of sites is inserted into the center. This process is repeated
until we have a chain of sites of the desired length. We call this process
the *warmup* phase.

The parameters used to control the convergence of the warmup phase are
included as part of the :py:class:`convergence.MPSConvParam` class. In
addition to the Lanczos convergence parameters discussed in Sec.
:ref:`sec_vmps` and the table in :py:class:`convergence.MPSConvParam`, the
warmup-specific parameters are :math:`\chi_{\mathrm{warmup}}`, the maximum
bond dimension allowed during warmup, and
:math:`\epsilon_{\mathrm{warmup}}`, the local truncation error determining
the bond dimension according to Eq. :eq:`eq_loctrunc` during warmup. These
convergence parameters are set in an object of the
:py:class:`convergence.MPSConvParam` class as specified in the table in
its class documentation; an illustrative usage sketch follows at the end
of this section. Note that only the values of the warmup convergence
parameters from the first set of convergence parameters are used. That is
to say, warmup is only used to construct an initial state, and subsequent
refinements use the output of one variational ground state search as the
input to a more refined variational ground state search. Also note that
warmup is only relevant to ground state search and not excited state
search, and so setting values of the warmup convergence parameters for
objects of the :py:class:`convergence.MPSConvParam` class to be used for
eMPS has no effect.

.. _fig_iMPS:

.. figure:: ../images/warmup.*
   :width: 400px
   :alt: Warmup with iMPS
   :align: center

   *The iMPS iteration successively adds sites to the center of a chain,
   performs local minimization of the energy with only these sites, and
   then absorbs these sites into the environment.*

.. [#f1] For finite lattices with an odd number of sites, a single site is
   inserted on the last iteration.
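As a usage illustration only: the snippet below shows how convergence
parameters might be set from the Python frontend. ``MPSPyLib`` is the
OSMPS Python module, but the keyword names shown are assumptions on our
part and must be checked against the table in
:py:class:`convergence.MPSConvParam`:

.. code-block:: python

   # Sketch only -- keyword names are assumptions; consult the table in
   # the convergence.MPSConvParam class documentation for the real names.
   import MPSPyLib as mps

   myConv = mps.MPSConvParam(
       max_bond_dimension=60,   # chi_max for the sweeps (assumed name)
       local_tol=1e-12,         # epsilon_local of the truncation criterion
       variance_tol=1e-10,      # epsilon_v in the variance condition
   )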
.. _sec_imps:

Infinite-size ground state search: iMPS
----------------------------------------

In addition to being used to initialize finite-size simulations, a
variation of the iMPS method presented in Sec. :ref:`sec_impsinitial` can
also be used to find a representation of an infinitely large wavefunction
which is translationally invariant under shifts by some number of sites
:math:`L\ge 2`. The convergence behavior of iMPS is determined by an
object of the :py:class:`convergence.iMPSConvParam` class.

The iMPS minimization is performed by inserting :math:`L` sites at each
iMPS iteration as shown in :numref:`fig_iMPS` and then minimizing the
energy of these :math:`L` sites with the given fixed environment. After
minimization, these sites are absorbed into the environment, :math:`L` new
sites are added, and the minimization is repeated. The iteration has
converged when two successive unit cells are close in some sense. This
sense is measured rigorously by the orthogonality fidelity
:cite:`McCulloch2008`. In OSMPS, we take the stopping condition to be that
the orthogonality fidelity is less than the unit-cell averaged truncation
error as measured by Eq. :eq:`eq_loctrunc` for 10 successive iterations.
If there is no truncation error, then the stopping criterion is that the
orthogonality fidelity is less than :math:`\epsilon_{\mathrm{v}}` for 10
successive iterations, where :math:`\epsilon_{\mathrm{v}}` is the
``'variance_tol'`` in :py:class:`tools.MPSConvergenceParameters`. This
convergence condition on the orthogonality fidelity indicates that the
differences between successive iterations are due only to truncation
arising from a finite bond dimension. A maximum number of iterations may
also be specified as ``'max_num_imps_iter'``. The variance is not used to
determine convergence of a unit cell to its minimum. Rather, a fixed
number of sweeps, specified by ``'min_num_sweeps'``, is used to converge.
Hence, the relevant convergence parameters for an iMPS simulation are the
parameters of :py:class:`convergence.iMPSConvParam` collected in the table
in its class documentation.

.. _sec_krylov:

Krylov-based time evolution: tMPS
---------------------------------

The Krylov-based tMPS algorithm :cite:`Wall2012` has its own class of
convergence parameters called :py:class:`convergence.KrylovConvParam`; its
parameters are collected in the class description. While many of these
parameters have the same names as those in
:py:class:`convergence.MPSConvParam`, they have different interpretations.
The Lanczos procedure now refers to the Lanczos method for approximating
the action of the matrix exponential, and so the stopping criterion is
that the difference between our variational state and the true state acted
on by the matrix exponential is less than ``'lanczos_tol'`` in the 2-norm.
Neither the action of an operator :math:`\hat{H}` on a state
:math:`|\psi\rangle` nor a sum of states :math:`\sum_kc_k|\phi_k\rangle`
can be represented exactly as an MPS with a given fixed bond dimension,
and so both of these operations are performed with variational algorithms
as discussed in Ref. :cite:`Wall2012`. The associated convergence
parameters for these two variational algorithms are also given in the
class description of :py:class:`convergence.KrylovConvParam`.
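The essence of the Lanczos approximation to the propagator fits in a few
lines of dense linear algebra. In the sketch below (illustrative only; in
tMPS the Krylov vectors are MPSs and the operations are the variational
ones discussed above), an :math:`m`-dimensional Krylov basis is built and
the small projected tridiagonal matrix is exponentiated:

.. code-block:: python

   import numpy as np
   from scipy.linalg import expm

   def krylov_expm(H, psi, dt, m=20):
       """Approximate exp(-1j*dt*H) @ psi in an m-dimensional Krylov
       space built by the Lanczos recurrence (H must be Hermitian)."""
       n = psi.size
       V = np.zeros((n, m), dtype=complex)   # orthonormal Krylov vectors
       T = np.zeros((m, m))                  # projected tridiagonal matrix
       beta = np.linalg.norm(psi)
       V[:, 0] = psi / beta
       for j in range(m):
           w = H @ V[:, j]
           T[j, j] = np.real(np.vdot(V[:, j], w))
           w -= T[j, j] * V[:, j]
           if j > 0:
               w -= T[j, j - 1] * V[:, j - 1]
           if j + 1 < m:
               T[j + 1, j] = T[j, j + 1] = np.linalg.norm(w)
               if T[j + 1, j] < 1e-14:       # invariant subspace found
                   m = j + 1
                   break
               V[:, j + 1] = w / T[j + 1, j]
       # Exponentiate the small matrix and map back to the full space.
       u = expm(-1j * dt * T[:m, :m])[:, 0]
       return beta * (V[:, :m] @ u)

   # Example: evolve a random state under a random Hermitian H.
   rng = np.random.default_rng(0)
   A = rng.standard_normal((64, 64)) + 1j * rng.standard_normal((64, 64))
   H = 0.5 * (A + A.conj().T)
   psi = rng.standard_normal(64) + 0j
   psi /= np.linalg.norm(psi)
   exact = expm(-1j * 0.1 * H) @ psi
   approx = krylov_expm(H, psi, 0.1, m=20)
   print("error:", np.linalg.norm(exact - approx))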
.. _sec_tebd:

Time-Evolving Block Decimation (TEBD)
-------------------------------------

The TEBD algorithm uses the Sornborger-Stewart decomposition
:cite:`Sornborger1999` instead of the more common Trotter decomposition.
As for the Trotter decomposition, this method is only valid for
nearest-neighbor Hamiltonians built from site and bond rules. At present,
it uses the Krylov subspace method to apply the exponential of the
two-site Hamiltonian to the state. The convergence parameters are
described in detail in :py:class:`convergence.TEBDConvParam`.

.. _sec_tdvp:

Time-Dependent Variational Principle (TDVP)
-------------------------------------------

The Time-Dependent Variational Principle is the second algorithm after the
Krylov method to support long-range interactions represented in the MPO.
Its convergence parameters are specified in
:py:class:`convergence.TDVPConvParam`. Details on the algorithm can be
found in :cite:`Haegeman2016`.

.. _sec_lrk:

Local Runge-Kutta (LRK)
-----------------------

The Local Runge-Kutta algorithm :cite:`Zaletel2015` generates an MPO
representation of the propagator which can be applied to the state. The
details on the convergence parameters are given in
:py:class:`convergence.LRKConvParam`.
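Finally, to make the notion of applying the exponential of a two-site
Hamiltonian from Sec. :ref:`sec_tebd` concrete, the sketch below performs
one second-order even-odd splitting step on a full state vector. It is
illustrative only: it uses a plain Trotter ordering rather than the
Sornborger-Stewart decomposition, a transverse-field Ising bond
Hamiltonian chosen by us, and no MPS structure:

.. code-block:: python

   import numpy as np
   from scipy.linalg import expm

   # Illustrative nearest-neighbor model on L qubits; boundary field
   # terms are treated crudely since this is only a demonstration.
   L, dt = 6, 0.05
   X = np.array([[0., 1.], [1., 0.]])
   Z = np.array([[1., 0.], [0., -1.]])
   I2 = np.eye(2)
   h_bond = -np.kron(Z, Z) - 0.5 * (np.kron(X, I2) + np.kron(I2, X))

   def apply_two_site_gate(psi, gate, j, L):
       """Apply a 4x4 gate to sites (j, j+1) of a full state vector."""
       psi = psi.reshape(2**j, 4, 2**(L - j - 2))
       psi = np.einsum('ab,ibj->iaj', gate, psi)
       return psi.reshape(-1)

   # Second-order splitting: half step on odd bonds, full step on even
   # bonds, half step on odd bonds.
   U_half = expm(-1j * 0.5 * dt * h_bond)
   U_full = expm(-1j * dt * h_bond)

   psi = np.zeros(2**L, dtype=complex)
   psi[0] = 1.0                           # start from |00...0>
   for j in range(1, L - 1, 2):
       psi = apply_two_site_gate(psi, U_half, j, L)
   for j in range(0, L - 1, 2):
       psi = apply_two_site_gate(psi, U_full, j, L)
   for j in range(1, L - 1, 2):
       psi = apply_two_site_gate(psi, U_half, j, L)
   print("norm preserved:", np.linalg.norm(psi))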