Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

command-line plugin idea: contig gather #2564

Open
bluegenes opened this issue Apr 11, 2023 · 2 comments
Open

command-line plugin idea: contig gather #2564

bluegenes opened this issue Apr 11, 2023 · 2 comments
Labels
enhancement plugin a plugin for sourmash!

Comments

@bluegenes
Copy link
Contributor

bluegenes commented Apr 11, 2023

We have a working contig-level gather implementation in charcoal, but there are applications for this besides contamination detection and removal. It would be neat to write a sourmash plugin that would run a contig-level gather workflow on any query + database.

Briefly, the charcoal approach is:

  1. sketch the query as a whole
  2. prefetch against the full database
  3. using prefetch matches as a new, reduced database, go through each contig and run gather to find the best match per contig
  4. summarize/ report
@bluegenes bluegenes added enhancement plugin a plugin for sourmash! labels Apr 11, 2023
@ctb
Copy link
Contributor

ctb commented Apr 29, 2023

ref #2592

@ctb
Copy link
Contributor

ctb commented Sep 23, 2023

note that multigather is effectively contig gather ref #2328 (comment)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement plugin a plugin for sourmash!
Projects
None yet
Development

No branches or pull requests

2 participants