Skip to content

some questions about ANI and compare #2226

Open
@ctb

Description

while working on #2225 and making sure that plot will work with both similarity and distance matrices, I ran a few comparisons with the 64 genomes from podar-ref (sequences here).

When I run:

sourmash compare -o cmp podar-ref/*.fa.sig -k 21 
sourmash plot cmp

I get:
Screen Shot 2022-08-20 at 10 59 28 AM

good so far!

when I add ANI,

sourmash compare -o cmp podar-ref/*.fa.sig -k 21 --ani
sourmash plot

I get:
Screen Shot 2022-08-20 at 11 00 25 AM

which is a lot busier! I think this reflects the fact that translation to ANI involves a logarithmic transformation step from the Jaccard, and so even really low similarities etc pop up into view with the ANI.

Is this an ok way to think about things, @bluegenes @dkoslicki? Is there more going on?

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions