Warning
This pipeline is under development. For the legacy EASI-FISH Nextflow pipeline see https://github.com/JaneliaSciComp/multifish
JaneliaSciComp/easifish is a bioimage analysis pipeline that reconstructs large microscopy image volumes. It ingests raw images in CZI format from Zeiss Lightsheet microscopes, computes tile stitching, and outputs a multi-resolution image pyramid in OME-ZARR or N5 format. The pipeline also supports registration to a reference round as well as segmentation of selected rounds.
- Stitch the image tiles from the source CZI using either Saalfeld stitcher or BigStitcher
- Register low resolution moving rounds with respect to the corresponding low resolution fixed round using Bigstream.
- Register high resolution moving rounds with respect to the corresponding low resolution fixed round using Bigstream.
- Apply deformation to the specified scale of the moving image using Bigstream.
- Generate multiscale pyramid for the warped moving image.
- Generate the segmentation for the selected round(s) using Cellpose.
- Extract the spots from the stitched images using either RS-FISH or FISHSPOT
Note If you are new to Nextflow and nf-core, please refer to this page on how to set-up Nextflow. Make sure to test your setup with
-profile testbefore running the workflow on actual data.
First, prepare a samplesheet with your input data that looks as follows:
samplesheet.csv:
id,filename,pattern
LHA3_R3_tiny,LHA3_R3_small.czi,LHA3_R3_small.czi
LHA3_R3_tiny,LHA3_R3_tiny.mvl,
LHA3_R5_tiny,LHA3_R5_small.czi,LHA3_R5_small.czi
LHA3_R5_tiny,LHA3_R5_tiny.mvl,Each row represents a file in the input data set. The identifier (id) groups files together into acquisition rounds. In the example above, each acquisition is a single CZI file containing all of the tiles and channels, and an MVL file containing the acquisition metadata (e.g. stage coordinates for each tile.)
Note: If you use BigStitcher for the registration the MVL is not required, for Saalfeld stitcher you must have it. Also with BigStitcher the samplesheet.csv may have the BDV project file (dataset.xml), and then you can skip the stitching step and only run create-container and fuse.
Now, you can run the pipeline using:
nextflow run JaneliaSciComp/easifish \
-profile <docker/singularity/.../institute> \
--input samplesheet.csv \
--outdir <OUTDIR>:::warning
Please provide pipeline parameters via the CLI or Nextflow -params-file option. Custom config files including those
provided by the -c Nextflow option can be used to provide any configuration except for parameters;
see docs.
:::
For more details and further functionality, please refer to the usage documentation and the parameter documentation.
To see the results of an example test run with a full size dataset refer to the results tab on the nf-core website pipeline page. For more details about the output files and reports, please refer to the output documentation.
The stitching-spark tools used in this pipeline were developed by the Saalfeld Lab at Janelia Research Campus.
The modules for running Spark clusters on Nextflow were originally prototyped by Cristian Goina.
The JaneliaSciComp/easifish pipeline was originally constructed by Konrad Rokicki.
The workflow diagram is based on the SVG source from the cutandrun pipeline.
If you would like to contribute to this pipeline, please see the contributing guidelines.
For further information or help, don't hesitate to get in touch on the Slack #easifish channel (you can join with this invite).
If you use JaneliaSciComp/easifish for your analysis, please cite the EASI-FISH article as follows:
Yuhan Wang, Mark Eddison, Greg Fleishman, Martin Weigert, Shengjin Xu, Fredrick E. Henry, Tim Wang, Andrew L. Lemire, Uwe Schmidt, Hui Yang, Konrad Rokicki, Cristian Goina, Karel Svoboda, Eugene W. Myers, Stephan Saalfeld, Wyatt Korff, Scott M. Sternson, Paul W. Tillberg. Expansion-Assisted Iterative-FISH defines lateral hypothalamus spatio-molecular organization. Cell. 2021 Dec 22;184(26):6361-6377.e24. doi: 10.1016/j.cell.2021.11.024. PubMed PMID: 34875226.
An extensive list of references for the tools used by the pipeline can be found in the CITATIONS.md file.
This pipeline uses code and infrastructure developed and maintained by the nf-core community, reused here under the MIT license.
The nf-core framework for community-curated bioinformatics pipelines.
Philip Ewels, Alexander Peltzer, Sven Fillinger, Harshil Patel, Johannes Alneberg, Andreas Wilm, Maxime Ulysse Garcia, Paolo Di Tommaso & Sven Nahnsen.
Nat Biotechnol. 2020 Feb 13. doi: 10.1038/s41587-020-0439-x.
