Welcome to Poirot¶

the pipeline is running at Clinical Genomics Uppsala to call variants from short-read illumina WGS data from rare disease patients.

You can find the github repository at https://github.com/clinical-genomics-uppsala/poirot_rd_wgs/

This snakemake pipeline is built using module system from Hydra Genetics to process paired-end .fastq.gz files from Illumina whole genome sequencing.

We call it Poirot after the fictive detective, Hercule Poirot, created by Agatha Christie (one of the developers is a big Christie nerd and thought a detective's name was perfect since the pipeline should detect variants). We also have a whole exome pipeline, named after Poirot's side-kick Hastings.

Poirot uses the following hydra genetics modules:

Hydra-genetics¶

We are an organization/community with the goal of making snakemake pipeline development easier, faster, a bit more structured and of higher quality.

We do this by providing snakemake modules that can be combined to create a complete analysis or included in already existing pipelines. All modules are subjected to extensive testing to make sure that new releases doesn't unexpectedly break existing pipeline or deviate from guidelines and best practices on how to write code.

Snakemake¶

Poirot and Hydra-genetics are snakemake bases pipeline/tools. The Snakemake workflow management system is a tool to create reproducible and scalable data analyses. Workflows are described via a human readable, Python based language. They can be seamlessly scaled to server, cluster, grid and cloud environments, without the need to modify the workflow definition. Finally, Snakemake workflows can entail a description of required software, which will be automatically deployed to any execution environment.

If Snakemake is new to you a good place to start is doing the snakemake tutorial since this will help you setting Poirot up.