Skip to content

Latest commit

 

History

History
30 lines (17 loc) · 824 Bytes

File metadata and controls

30 lines (17 loc) · 824 Bytes

Firefly

Dependable pipeline for processing 16S amplicon data

Pipeline

Custom parameters stored in config.yaml

Preprocessing

  • Clip off primers with cutadapt
  • Check quality with fastqc, using multiqc to compile report
  • Quality filter with dada2::filterAndTrim using max expected error

Dada2 pipeline

  • Learn error with whole dataset: learn_errors.R
  • De-replicate and infer sequences: infer_seqs.R
  • Remove bimeras, assign taxonomy (including species): assign_taxonomy.R

Make tree

  • Align seqs with ssu-align
  • Make tree with FastTree

Note to potential users

This code is made available primarily for reproducability and is not intended as a general use tool. That being said if you want to borrow ideas on how to use Snakemake to run R scripts then by all means, borrow away!