.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "examples/hdf5.py"
.. LINE NUMBERS ARE GIVEN BELOW.

.. only:: html

    .. note::
        :class: sphx-glr-download-link-note

        :ref:`Go to the end <sphx_glr_download_examples_hdf5.py>`
        to download the full example code.

.. rst-class:: sphx-glr-example-title

.. _sphx_glr_examples_hdf5.py:

Out-of-core Operations via HDF5
===================================

In-core operations are generally faster and convenient, but they have
limited scalability: if we distribute our operations across several machines,
we can make use of more memory and parallelize computations.
This is particularly relevant for sketched methods, in the cases where
linear operators have intractable sizes and/or linop evaluations take a
long time.

`HDF5 <https://www.h5py.org/>`_ is a popular way to store numerical data
persistently. From Python, it looks mostly like a NumPy array, but it is
stored in disk, and it can be partitioned across multiple sub-files in the
filesystem. This allows us to work with very large arrays while satisfying
our memory constraints. Also, the different sub-files can be processed
by different machines independently, with the resulting speedup.

In this example we illustrate the functionality provided in :mod:`skerch.hdf5`
in order to facilitate out-of-core operations. We first create a distributed
HDF5 numerical array, and then simulate multiple independent processes to
populate it with data. Finally, we test its correctness. Note that ``skerch``
also privdes access to some of the HDF5 functionality via CLI, see
:ref:`Command Line Interface`.

.. GENERATED FROM PYTHON SOURCE LINES 27-37

.. code-block:: Python


    import os
    import tempfile

    import torch

    from skerch.hdf5 import DistributedHDF5Tensor
    from skerch.measurements import GaussianNoiseLinOp
    from skerch.utils import torch_dtype_as_str


.. GENERATED FROM PYTHON SOURCE LINES 38-46

##############################################################################

Setup
-----

We start creating a matrix-free linear operator, which could be of very
large dimensionality and hence require distributed memory and/or
computation.

.. GENERATED FROM PYTHON SOURCE LINES 47-57

.. code-block:: Python


    SEED = 9876531
    SHAPE = (1000, 2000)
    DEVICE = "cuda" if torch.cuda.is_available() else "cpu"
    DTYPE = torch.complex128
    BLOCKSIZE = 100

    mop = GaussianNoiseLinOp(SHAPE, SEED, blocksize=BLOCKSIZE, by_row=True)


.. GENERATED FROM PYTHON SOURCE LINES 58-62

Now we create a distributed HDF5 database, and write chunks of the linear
operator onto the respective HDF5 chunks. Since the chunks are indidivual
files and the linear operator is seed-reproducible, this loop
can be distributed across independent process and machines:

.. GENERATED FROM PYTHON SOURCE LINES 63-82

.. code-block:: Python


    tmpdir = tempfile.TemporaryDirectory()
    h5_pth, h5_subpaths, h5_begs_ends = DistributedHDF5Tensor.create(
        os.path.join(tmpdir.name, mop.__class__.__name__ + "_{}"),
        SHAPE,
        BLOCKSIZE,
        torch_dtype_as_str(DTYPE),
    )
    h5_map = dict(zip(h5_begs_ends, h5_subpaths))

    for block, idxs in mop.get_blocks(DTYPE, DEVICE):
        # each one of these iterations could be in a parallel process/machine
        subpath = h5_map[(idxs.start, idxs.stop)]
        data, flags, h5 = DistributedHDF5Tensor.load(subpath)
        data[:] = block.cpu()
        flags[:] = "OK"
        h5.close()


.. GENERATED FROM PYTHON SOURCE LINES 83-89

We can now test our HDF5 database, verifying that all flags have been
set to OK and the contents match our linear operator.
The exact moment in which we load the data from disk to memory is
when calling ``data[:]``. The ``data`` reference is just a pointer to the
filesystem and can be used to efficiently access portions of the array
via ``data[idxs...]``, without having to load the whole array at once.

.. GENERATED FROM PYTHON SOURCE LINES 90-100

.. code-block:: Python


    data, flags, h5 = DistributedHDF5Tensor.load(h5_pth)
    is_ok = bool((flags.asstr()[:] == "OK").all())
    same_data = bool((data[:] == mop.to_matrix(DTYPE, "cpu")).all())
    h5.close()

    print("All flags set to OK:", is_ok)
    print("HDF5 data matches linear operator:", same_data)


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    All flags set to OK: True
    HDF5 data matches linear operator: True


.. GENERATED FROM PYTHON SOURCE LINES 101-107

Once we are done with the (potentially concurrent) writing, we may want
to merge all individual chunks into a single, monolithic HDF5 file.
This may be useful to e.g. prevent OS issues from trying to open too many
files at once. The following line of code merges our HDF5 database into
one file under the same name. It also deletes the chunk files in the process,
so memory never blows up:

.. GENERATED FROM PYTHON SOURCE LINES 108-116

.. code-block:: Python


    print("Number of files prior to merging:", len(os.listdir(tmpdir.name)))
    DistributedHDF5Tensor.merge(
        h5_pth, check_success_flag="OK", delete_subfiles_while_merging=True
    )
    print("Number of files after merging:", len(os.listdir(tmpdir.name)))
    tmpdir.cleanup()


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    Number of files prior to merging: 11
    Number of files after merging: 1


.. rst-class:: sphx-glr-timing

   **Total running time of the script:** (0 minutes 0.806 seconds)


.. _sphx_glr_download_examples_hdf5.py:

.. only:: html

  .. container:: sphx-glr-footer sphx-glr-footer-example

    .. container:: sphx-glr-download sphx-glr-download-jupyter

      :download:`Download Jupyter notebook: hdf5.ipynb <hdf5.ipynb>`

    .. container:: sphx-glr-download sphx-glr-download-python

      :download:`Download Python source code: hdf5.py <hdf5.py>`

    .. container:: sphx-glr-download sphx-glr-download-zip

      :download:`Download zipped: hdf5.zip <hdf5.zip>`


.. only:: html

 .. rst-class:: sphx-glr-signature

    `Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_