-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: qc rules #16
feat: qc rules #16
Conversation
…map + corresponding config/config.yml and profile/config.yml
workflow/Snakefile
Outdated
shell: | ||
"cat {input.samples} {input.mapping} >> QC-report.txt" | ||
"tar zcvf {input}.tar.gz {input} 2> {log}" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should this read tar zcvf {ouput} {input}
? Also, &> {log}
because there is no output stream (stdout and stderr) for which we do not want to gather in our log, right?
…hed qc rules from snakefile into their own .smk
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
essentially ready, only minor changes, before moving on.
.test/config-simple/config.yml
Outdated
@@ -17,6 +17,8 @@ repo: "/~https://github.com/snakemake-workflows/transriptome-differential-expressi | |||
transcriptome: "/lustre/miifs01/project/m2_zdvhpc/transcriptome_data/GCA_917627325.4_PGI_CHIRRI_v4_genomic.fa" | |||
# Annotation GFF/GTF (absolute path) | |||
annotation: "/lustre/miifs01/project/m2_zdvhpc/transcriptome_data/GCA_917627325.4_PGI_CHIRRI_v4_genomic.gff" | |||
#gtf is needed by qualimap | |||
gtf: "/lustre/miifs01/project/m2_zdvhpc/transcriptome_data/GCA_917627325.4_PGI_CHIRRI_v4_genomic.gtf" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
please set a TODO-comment stating that we ought to evaluate just using the gtf version then: Basically both gff and gtf contain the same information and we should not require both, if one is sufficient.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is an artifact from previous versions which is only present in the /.test/config-simple/config.yml. Qualimap has since been removed from the workflow as running it with our .gtf file only returns warnings.
config/config.yml
Outdated
@@ -19,6 +19,9 @@ transcriptome: "/lustre/miifs01/project/m2_zdvhpc/transcriptome_data/GCA_9176273 | |||
annotation: "/lustre/miifs01/project/m2_zdvhpc/transcriptome_data/GCA_917627325.4_PGI_CHIRRI_v4_genomic.gff" | |||
# these samples ought to contain all samples comprising of the | |||
|
|||
#NanoPlot QC with sequencing summary. leave as None for QC with .fastq files | |||
summary: None |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this implicates the split between summary and non-summary evaluation. However, did we not want to have both, eventually?
workflow/Snakefile
Outdated
@@ -3,6 +3,13 @@ from os import path | |||
from collections import OrderedDict | |||
import sys | |||
|
|||
<<<<<<< HEAD |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
looks like an unresolved merge conflict (also see the lines below)
workflow/Snakefile
Outdated
log: | ||
"logs/NanoPlot/{sample}.log", | ||
resources: | ||
## max of 39 for our SLURM partition | ||
cpus_per_task= min(8,39) #problem with max(len(input.fastq),39) | ||
cpus_per_task=min(8, 39), #problem with max(len(input.fastq),39) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
cpus_per_task=min(len(input.fastq), config["max_cpus_in_default_partition/PC/
Server"])
vielleicht den Parameter anders nennen? ;-)
config/config.yml
Outdated
@@ -19,6 +19,9 @@ transcriptome: "/lustre/miifs01/project/m2_zdvhpc/transcriptome_data/GCA_9176273 | |||
annotation: "/lustre/miifs01/project/m2_zdvhpc/transcriptome_data/GCA_917627325.4_PGI_CHIRRI_v4_genomic.gff" | |||
# these samples ought to contain all samples comprising of the | |||
|
|||
# OPTIONAL; NanoPlot QC with summary from sequencer. leave as "None" for regular QC with .fastq files |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Optional: NanoPlot QC summary using a summary file from the sequencer. If this file is not supplied, put the parameter to "None". It will then to an independent QC run per FASTQ input.
|
||
|
||
if config["summary"] == "None": | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
no empty line after if statement
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
linting with snakefmt adds empty lines.
"tar zcvf {output} {input} 2> {log}" | ||
|
||
else: | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
no empty line here
"NanoPlot -t {resources.cpus_per_task} --tsv_stats -f svg " | ||
"--fastq {input} -o {output} 2> {log}" | ||
|
||
rule compress_nplot: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
should be a local rule, shouldn't it?
Added NanoPlot and Qualimap rules.