
ptychography's Introduction


This repository collects implementations for ptychography that result from the work of the Ptychography 4.0 project.

Installation

The short version:

$ virtualenv -p python3.9 ~/ptychography-venv/
$ source ~/ptychography-venv/bin/activate
(ptychography-venv) $ pip install ptychography40

Please see our documentation for details!

Applications

  • Scalable, parallel implementation of the Single Side Band method that is suitable for live data processing.

Please see the algorithms section of our documentation for details!

Ptychography 4.0 is evolving rapidly and prioritizes features following user demand and contributions. In the future we'd like to implement more ptychography methods. If you'd like to influence the direction this project is taking, or if you'd like to contribute, please contact us via the GitHub issue tracker.

License

Ptychography 4.0 is licensed under GPLv3.

ptychography's People

Contributors

heidemeissner, lesan20, pre-commit-ci[bot], simeonehrig, sk1p, sniper2k, uellue, w-markus


ptychography's Issues

Numba code coverage

In particular #60 has a lot of Numba code. We should have a separate Numba coverage job like LiberTEM. :-)

Rename the source folder?

Right now, the package name is ptychography40, but the folder is named ptychography and it is imported as import ptychography, right?

Should we rename everything to be ptychography40?

Release version 0.1 with SSB and stitching

Since the code for SSB and for stitching is pretty mature and we are about to publish a paper regarding live ptychography based on this SSB implementation, it is now time for the first release.

  • Handle #31
  • we should merge over the CI configuration of the LiberTEM repo
  • including the release scripts etc.
  • the Azure pipeline definition needs to be cleaned up a bit:
    • distributed Docker tests are not needed (yet?)
    • no Node.js tests either
  • then we can configure an Azure project for Ptychography 4.0 on dev.azure.com
  • we also need to enable the codecov.io integration for this repo
  • Edit by @uellue: Create Zenodo sandbox and production initial deposition
  • Edit by @uellue: Set up github.io page
  • Edit by @uellue: Create PyPI sandbox and production initial project?
  • We then need to make some secrets available to the pipeline:
    • Note @uellue: I have changed the variable names from LT_* to PTY_* in azure-pipelines.yml
    • for example, the deploy_docs job needs to have access to a github deploy key
    • and we need to generate access tokens for test.pypi.org (libertem_bot user?)
    • and for pypi.org
    • and for sandbox.zenodo.org
    • and for zenodo.org
  • we can then also add links on https://github.com/liberTEM/libertem.github.io (it's very simple, just change the readme.md)
  • [N/A] for tests w/ data access, need to add a second azure pipeline, possibly with its own runner (not sure the other runner can/should be made available to different projects) → moved to #51

Dependencies

  • Release LiberTEM and LiberTEM-live with prerequisites for #35
  • Merge #35

Before (using a release candidate package)

  • Clean up references*.bib
  • Review open issues and pull requests
  • Handle deprecations: search the code base for DeprecationWarnings that are supposed to be removed in this release.
  • Full documentation review and update, including link check using sphinx-build -b linkcheck "docs/source" "docs/build/html"
  • Review and approval by co-authors for first release
  • Run complete test suite, including slow tests that are deactivated by default and tests that require sample files. Also run tests that require a GPU.
  • Update the expected version in notes on changes, i.e. from 0.3.0.dev0 to 0.3.0 when releasing version 0.3.0.
  • Update and review change log in docs/source/changelog.rst, merging snippets in docs/source/changelog/*/ as appropriate.
  • Update the JSON files in the packaging/ folder with author and project information
  • Create a release candidate using scripts/release. See scripts/release --help for details.
  • Confirm that wheel and tar.gz are built for the release candidate on GitHub: https://github.com/Ptychography-4-0/ptychography/releases
  • Confirm that a new version with the most recent release candidate is created in the Zenodo.org sandbox (TODO fix link: https://example.com/fixme) and is ready for submission.
  • Install release candidate packages in a clean environment, for example:
    python -m pip install -i https://test.pypi.org/simple/ --extra-index-url https://pypi.org/simple 'ptychography==0.1.0rc0'
  • Make sure you have test files for all supported algorithms available
  • Run all examples and applications using the test files
    • Are parameters recognized correctly, as far as implemented?
    • Any bad default values?
    • Does the file open correctly?
    • Does the result change when the input parameters are changed?
    • All display channels present and looking reasonable?
    • Reasonable performance?
  • Run all examples and applications on Windows
  • Confirm that pull requests and issues are handled as intended, i.e. milestoned and merged into the appropriate branch.

After releasing on GitHub

  • Confirm that all release packages are built and release notes are up-to-date
  • Install release package
  • Confirm correct version info
  • Confirm package upload to PyPI
  • Publish new version on zenodo.org
  • Update documentation with new links, if necessary
    • Add zenodo badge for the new release to Changelog page
  • Send announcement message on mailing list
  • Bump version in master branch to next .dev0

Alpaka EPIE version

Update the Alpaka version and get further backends running. Currently, it runs at FZJ and DESY on CUDA and CPU.

Then do benchmark tests.

Data sets

Hi,
where should we store the ptycho data sets? The data from DESY is too big: GitHub has a limit of 25 MB, and we are in the GB range.

Cheers,
Heide

Achim's EPIE implementation

  • Give access to Achim
  • Add Achim's EPIE implementation to the repository and evaluate its performance.
  • Compare it to the DESY EPIE implementation.
  • Link it to LiberTEM: easier since this is Python.

Collect Numba coverage

In #35 the coverage is poor because most of the work is now done within a Numba-compiled function, which only generates coverage data if it is run with Numba compilation disabled.

For that reason we should run a separate CI pipeline with Numba compilation disabled on selected tests, like in LiberTEM, to collect coverage.
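
A minimal sketch of how such a coverage run could look (illustrative only: the kernel below is a placeholder, and NUMBA_DISABLE_JIT is the standard Numba switch that makes decorated functions run as plain Python so coverage tools can trace them):

    import os
    os.environ["NUMBA_DISABLE_JIT"] = "1"  # must be set before Numba compiles anything

    import numba
    import numpy as np

    @numba.njit
    def frame_sums(frames):
        # placeholder kernel standing in for the real Numba-compiled code
        out = np.zeros(len(frames))
        for i in range(len(frames)):
            out[i] = frames[i].sum()
        return out

    # With NUMBA_DISABLE_JIT=1 this executes as plain Python, so every line
    # inside the loop shows up in the coverage report.
    print(frame_sums(np.ones((4, 8, 8))))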

Documentation: instruct users to install LiberTEM ?!

When following the SSB example in ssb.html, libertem is of course imported.

However, it is not mentioned in the installation instructions.
I may take care of this, but I'm not sure where/how to add it:

  • in the current installation text of Ptychography 4.0,
  • as a separate page, or
  • just a sentence and a link pointing to the LiberTEM repository.

Best wishes
Markus

UDFs for use of EPIE

Make LiberTEM ready for ptycho data (input and reconstruction output) using UDFs.
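
As a starting point, a minimal sketch of what feeding diffraction data through LiberTEM's UDF interface could look like. The class name FramePrepUDF and the per-position intensity buffer are placeholders, not the planned EPIE interface:

    import numpy as np
    from libertem.udf import UDF

    class FramePrepUDF(UDF):
        # Placeholder: collects one value per scan position. A real EPIE UDF
        # would hand the preprocessed frames and scan positions to the solver.
        def get_result_buffers(self):
            return {
                'intensity': self.buffer(kind='nav', dtype=np.float64),
            }

        def process_frame(self, frame):
            self.results.intensity[:] = np.sum(frame)

    # usage sketch: result = ctx.run_udf(dataset=ds, udf=FramePrepUDF())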

Efficient distributed forward model

Solvers calculate the next sample vector by evaluating the error and/or the local gradient of the forward model with respect to the measured data.

In LiberTEM, the data and computation can be distributed, parallelized and serialized as desired. Using this approach for solvers requires a forward model that can be distributed in the same fashion, so that parts of the sample can be evaluated separately and the new sample vector (or its delta) can be merged from the partial results.
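
A schematic sketch of the intended structure (assumptions only, not the project's solver): each partition of scan positions evaluates its part of the forward model independently, and a merge step combines the partial errors and gradients into one update, mirroring LiberTEM's partition/merge execution model. forward and gradient are placeholder callables.

    import numpy as np

    def evaluate_partition(sample, probe, positions, frames, forward, gradient):
        # partial error and gradient contribution for one partition of the data
        error = 0.0
        grad = np.zeros_like(sample)
        for pos, frame in zip(positions, frames):
            model = forward(sample, probe, pos)          # simulated diffraction pattern
            residual = model - frame
            error += float(np.sum(residual ** 2))
            grad += gradient(sample, probe, pos, residual)
        return error, grad

    def merge_partitions(partial_results):
        # sum the partial errors and gradients from all partitions
        total_error = sum(e for e, _ in partial_results)
        total_grad = sum(g for _, g in partial_results)
        return total_error, total_grad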

Implement stitching procedure

Implement the stitching procedure from Nashed et al. 2014. This is required at the end of a reconstruction that is performed individually for more than one subset of the diffraction data: the phase shift differs for each subset, and the geometrical positions have to be adjusted to form one complete object.
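
One ingredient of such a procedure, sketched under the assumption that each subset carries an arbitrary global phase: before stitching, the phase of a new patch can be aligned to an already placed reference patch via their overlap region.

    import numpy as np

    def align_global_phase(ref_patch, new_patch, overlap):
        # Estimate the global phase offset of new_patch relative to ref_patch
        # from the overlap region and remove it (least-squares optimal choice).
        ratio = ref_patch[overlap] * np.conj(new_patch[overlap])
        phase = np.angle(np.sum(ratio))
        return new_patch * np.exp(1j * phase)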

Data distribution

Distribute the diffraction images so that the corresponding raster positions are neighboring (see the sketch after this list). This will enable:

  • multi-GPU calculations, and
  • adding diffraction images dynamically for reconstruction
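
An illustrative sketch (not the planned implementation) of assigning scan positions to spatial tiles, so that neighboring raster positions end up in the same partition, e.g. one partition per GPU:

    import numpy as np

    def tile_partition(positions, tiles_y, tiles_x):
        # Map each (y, x) raster position to a tile index; positions within
        # a tile are spatially neighboring and can be processed together.
        positions = np.asarray(positions, dtype=float)
        mins = positions.min(axis=0)
        spans = positions.max(axis=0) - mins + 1e-9
        ty = np.minimum((tiles_y * (positions[:, 0] - mins[0]) / spans[0]).astype(int), tiles_y - 1)
        tx = np.minimum((tiles_x * (positions[:, 1] - mins[1]) / spans[1]).astype(int), tiles_x - 1)
        return ty * tiles_x + tx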

Distributing the software

About

At the moment, LiberTEM is distributed via pip. For this project, pip might not be the right solution, because we have additional needs for the Alpaka backend:

  • dependencies on non-Python packages, like Boost and CUDA
  • system-dependent parameters at build and install time, like accelerator backends or GPU architecture [1]

[1] Automatic detection at build time is not a good idea because a typical HPC workflow is to install packages on the login node (which has no GPUs) and allocate GPUs afterwards.

Prerequisites

Develop a dummy Alpaka backend function with a Python binding, #10

Potential candidates

  • pip
  • conda

Document `if __name__ == "__main__": ...` in SSB example (was: Error in SSB example)

When starting the SSB example, upon creating the context:

ctx = lt.Context()

I get numerous copies of:

Task exception was never retrieved
future: <Task finished coro=<_wrap_awaitable() done, defined at /home/cri/Software/conda/miniconda/miniconda3/envs/ppp4_py37/lib/python3.7/asyncio/tasks.py:623> exception=RuntimeError('\n        An attempt has been made to start a new process before the\n        current process has finished its bootstrapping phase.\n\n        This probably means that you are not using fork to start your\n        child processes and you have forgotten to use the proper idiom\n        in the main module:\n\n            if __name__ == \'__main__\':\n                freeze_support()\n                ...\n\n        The "freeze_support()" line can be omitted if the program\n        is not going to be frozen to produce an executable.')>
Traceback (most recent call last):
  File "/home/cri/Software/conda/miniconda/miniconda3/envs/ppp4_py37/lib/python3.7/asyncio/tasks.py", line 630, in _wrap_awaitable
    return (yield from awaitable.__await__())
  File "/home/cri/Software/conda/miniconda/miniconda3/envs/ppp4_py37/lib/python3.7/site-packages/distributed/core.py", line 285, in _
    await self.start()
  File "/home/cri/Software/conda/miniconda/miniconda3/envs/ppp4_py37/lib/python3.7/site-packages/distributed/nanny.py", line 298, in start
    response = await self.instantiate()
  File "/home/cri/Software/conda/miniconda/miniconda3/envs/ppp4_py37/lib/python3.7/site-packages/distributed/nanny.py", line 381, in instantiate
    result = await self.process.start()
  File "/home/cri/Software/conda/miniconda/miniconda3/envs/ppp4_py37/lib/python3.7/site-packages/distributed/nanny.py", line 578, in start
    await self.process.start()
  File "/home/cri/Software/conda/miniconda/miniconda3/envs/ppp4_py37/lib/python3.7/site-packages/distributed/process.py", line 33, in _call_and_set_future
    res = func(*args, **kwargs)
  File "/home/cri/Software/conda/miniconda/miniconda3/envs/ppp4_py37/lib/python3.7/site-packages/distributed/process.py", line 203, in _start
    process.start()
  File "/home/cri/Software/conda/miniconda/miniconda3/envs/ppp4_py37/lib/python3.7/multiprocessing/process.py", line 112, in start
    self._popen = self._Popen(self)
  File "/home/cri/Software/conda/miniconda/miniconda3/envs/ppp4_py37/lib/python3.7/multiprocessing/context.py", line 284, in _Popen
    return Popen(process_obj)
  File "/home/cri/Software/conda/miniconda/miniconda3/envs/ppp4_py37/lib/python3.7/multiprocessing/popen_spawn_posix.py", line 32, in __init__
    super().__init__(process_obj)
  File "/home/cri/Software/conda/miniconda/miniconda3/envs/ppp4_py37/lib/python3.7/multiprocessing/popen_fork.py", line 20, in __init__
    self._launch(process_obj)
  File "/home/cri/Software/conda/miniconda/miniconda3/envs/ppp4_py37/lib/python3.7/multiprocessing/popen_spawn_posix.py", line 42, in _launch
    prep_data = spawn.get_preparation_data(process_obj._name)
  File "/home/cri/Software/conda/miniconda/miniconda3/envs/ppp4_py37/lib/python3.7/multiprocessing/spawn.py", line 143, in get_preparation_data
    _check_not_importing_main()
  File "/home/cri/Software/conda/miniconda/miniconda3/envs/ppp4_py37/lib/python3.7/multiprocessing/spawn.py", line 136, in _check_not_importing_main
    is not going to be frozen to produce an executable.''')
RuntimeError: 
        An attempt has been made to start a new process before the
        current process has finished its bootstrapping phase.

        This probably means that you are not using fork to start your
        child processes and you have forgotten to use the proper idiom
        in the main module:

            if __name__ == '__main__':
                freeze_support()
                ...

        The "freeze_support()" line can be omitted if the program
        is not going to be frozen to produce an executable.

The software runs on an HPE ProLiant DL385 Gen10 (2x EPYC 7F72, 512 GB RAM) under Debian Linux, testing distribution.
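
The workaround that the issue title asks to document, as a minimal sketch: the Dask-based executor spawns worker processes (visible in the traceback above), so when the example is run as a script, the Context must be created behind the __main__ guard.

    import libertem.api as lt

    def main():
        ctx = lt.Context()
        # ... load the data set and run the SSB UDF here ...

    if __name__ == "__main__":
        main()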

3D tomography

To the best of our knowledge, tomographic ptychography is always reconstructed by

  • reconstructing the 2D measurements acquired for the same angle, and
  • then fusing the results by a filtered backprojection.

Wolfgang had the idea of trying a real 3D reconstruction by inverting all data together, i.e. using polar coordinate shifts instead of Cartesian ones.

From a mathematical point of view this seems feasible. The benefit would be fewer required data points and thus fewer artefacts. The drawback is that the amount of data that has to be processed in each single iteration increases by orders of magnitude, since all data is now pushed through at once. This is a task for WP3 once the algorithm is adapted => WP1

Poster for Helmholtz Imaging Platform matchmaking workshop

Ptychography 4.0 – An Information and Data Science Pilot Project: Data Infrastructure and Applications

--> Poster: PtychoHIP_FZJ_HZB_HZDR-DiWe-0.2-web.pdf <--

Interest in projects

  • Applications of ptychography
  • Implementations of ptychography
  • Solving large-scale optimization problems
  • Distributed high-performance data processing
  • Advanced imaging
  • Data management, data logistics

See related

Contact

We look forward to collaborating on new project ideas! Contact us to arrange a follow-up discussion for details.

Alexander Clausen [email protected] (FZ Jülich, LiberTEM, data logistics)
Simeon Ehrig [email protected] (HZDR, Alpaka, implementation)
Heide Meißner [email protected] (HZDR, Alpaka, application)
Knut Müller-Caspary [email protected] (FZ Jülich, electron microscopy)
Dieter Weber [email protected] (FZ Jülich, LiberTEM, application, presenting author)
Markus Wollgarten [email protected] (HZB, electron microscopy)

Check meaningful SSB parameters

SSB should detect undersampling, emit a warning, and help the user find good illumination parameters for a given scan resolution.
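
A hedged sketch of such a check, assuming that SSB transfers spatial frequencies up to twice the probe semi-convergence angle divided by the wavelength, so the Nyquist-limited scan step follows from that band limit. Names and the exact criterion are illustrative:

    import warnings

    def check_ssb_sampling(scan_step, wavelength, semiconv):
        # all lengths in metres, semi-convergence angle in radians
        q_max = 2 * semiconv / wavelength          # assumed SSB transfer limit
        nyquist_step = 1 / (2 * q_max)
        if scan_step > nyquist_step:
            warnings.warn(
                f"Scan step {scan_step:.3e} m exceeds the Nyquist limit "
                f"{nyquist_step:.3e} m for this illumination: the "
                "reconstruction will be undersampled."
            )
        return nyquist_step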

Use of coarse reconstruction as initial object

Test the following: given a subset (or the whole object) that is roughly reconstructed using a coarse distribution of raster points, what happens when this image is used as the starting value for a smaller subset with finer rastering? What do we lose in comparison with a reconstruction that uses all diffraction images?
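
A small sketch of the kind of helper this test would need (illustrative; the interpolation order and the handling of the complex object are assumptions): upsample the coarse reconstruction to the finer grid and use it as the initial object.

    import numpy as np
    from scipy.ndimage import zoom

    def coarse_to_initial(coarse_obj, fine_shape):
        # Interpolate real and imaginary parts separately to keep the
        # complex-valued object intact.
        factors = [f / c for f, c in zip(fine_shape, coarse_obj.shape)]
        return (zoom(coarse_obj.real, factors, order=1)
                + 1j * zoom(coarse_obj.imag, factors, order=1))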

Add and evaluate reco methods

First step: add reconstruction methods from WP2 plus interfaces to LiberTEM.
Second step: evaluate with respect to:

  • applicability and limitations of the algorithms
  • performance analysis (parallelization, scalability)
  • accuracy (e.g. robustness to noise)
  • convergence to correct / unique solution (expected)
  • applicability to interactive measurements ("online reconstruction")?
  • data formats and data handling, e.g. ZeroMQ; check with WP1 colleagues

Test of GPU version for CDI

The reconstruction algorithm RAAR (Relaxed Averaged Alternating Reflections), used in Jena for coherent diffraction imaging, exists in Python and in PyTorch. PyTorch is beneficial when using GPUs, since some optimizations are applied automatically. In Jena, differences between the results were observed even though both versions were started with the same seed, but only for experimental data sets, and no obvious difference in the code was visible. That means the accuracy needs to be tested.
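
One way such an accuracy test could start, sketched with placeholder arithmetic (the real comparison would run the actual RAAR iterations): feed identical seeded input through the same operations in NumPy (float64) and PyTorch (complex64, as typical on GPU) and quantify the numerical drift.

    import numpy as np
    import torch

    rng = np.random.default_rng(0)
    field = rng.standard_normal((256, 256)) + 1j * rng.standard_normal((256, 256))

    # stand-in for one projection step; the real test would use the RAAR update
    np_result = np.fft.ifft2(np.abs(np.fft.fft2(field)))
    t_field = torch.from_numpy(field).to(torch.complex64)
    torch_result = torch.fft.ifft2(torch.abs(torch.fft.fft2(t_field))).numpy()

    print("max abs difference:", np.max(np.abs(np_result - torch_result)))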
