Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Nanoplot results #393

Open
QuentinPerriere opened this issue Dec 11, 2024 · 3 comments
Open

Nanoplot results #393

QuentinPerriere opened this issue Dec 11, 2024 · 3 comments

Comments

@QuentinPerriere
Copy link

QuentinPerriere commented Dec 11, 2024

Thank you again for this tool,

I tested again NanoPlot on my data, and it showed that only 657 reads had a Phred score greater than Q10 ( I initially have 51653 reads in this fastq.gz file)
However, when I ran Fastp with the command fastp -i input_path -o fastp_output_path -A -G --qualified_quality_phred 10 --thread 8 followed by Seqtk using seqtk sample -s100 fastp_output_path 7000 | gzip > seqtk_output_path, I successfully extracted and generated a file containing 7000 reads with Q >10.

I use the latest version 1.42.0 of nanoplot

@wdecoster
Copy link
Owner

Yes, there are two ways to average quality scores across a read (see my blog post), and I would argue that NanoPlot does this correctly.

@QuentinPerriere
Copy link
Author

QuentinPerriere commented Dec 11, 2024

Yes, there are two ways to average quality scores across a read (see my blog post), and I would argue that NanoPlot does this correctly.

thank you for responding. but what I should do in this case ? fastp and Seqtk results are not reflected in the QC report generated by NanoPlot. This inconsistency is causing confusion in my upcoming presentation, as the QC metrics do not align with the actual quality of my filtered data.
I only need to know these metrics because it will determine my filtration parameters ( score phred and number of reads ) : number_of_reads', 'number_of_bases', 'median_read_length', 'mean_read_length', 'Reads >Q10:' 'Reads >Q15

@wdecoster
Copy link
Owner

You must decide if you want to use metrics from fastp, seqtk, or NanoPlot.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants