Skip to content

Commit

Permalink
Merge pull request #32 from PacificBiosciences/feature/20240827
Browse files Browse the repository at this point in the history
doc and data sync
  • Loading branch information
holtjma authored Aug 27, 2024
2 parents fa580ee + a095643 commit 0a6e2ae
Show file tree
Hide file tree
Showing 4 changed files with 13 additions and 0 deletions.
8 changes: 8 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,11 @@
# v0.14.1
## Changes
- Added a new output folder through the `--output-debug` option: `cyp2d6_igv_custom`. This folder contains an XML file describing an IGV session as well as the supporting data files to visualize full length alignments through the two constructed CYP2D6 haplotypes. For details, see the updated user docs.
- Released an updated database file: `data/v0.14.1/pbstarphase_20240826.json.gz`

## Fixed
- Fixed some off-by-one errors in the coordinates for miscellaneous extra regions in CYP2D6. These caused tiny overlaps in the output BAM file described above, but otherwise did not have an impact on diplotyping accuracy.

# v0.14.0
## Changes
- HLA allele labeling has been updated to improve 4th-field accuracy: When two potential definitions are compared, we now restrict the initial comparison to _only_ the shared regions of the two haplotype sequence definitions (this is often different, especially for DNA sequences). In the event of a tie, we revert to the full-length allele definitions.
Expand Down
Binary file added data/v0.14.1/pbstarphase_20240826.json.gz
Binary file not shown.
5 changes: 5 additions & 0 deletions docs/user_guide.md
Original file line number Diff line number Diff line change
Expand Up @@ -300,6 +300,11 @@ The outputs contained in this folder are subject to change as the algorithms evo
Here is a brief list of some of the current debug outputs:

* `consensus_{GENE}.fa` - Contains the full consensus sequences generated for a given `{GENE}`. Currently, this is only for HLA genes and _CYP2D6_.
* `cyp2d6_igv_custom` - Folder containing an IGV session and supporting data files for viewing the full-length reads aligned to the two CYP2D6 haplotypes for this dataset. Allows for manual detection of missed variants and inspection of direct evidence of copy-number in the reads. An example loaded session file should look similar to [example_session.png](../images/example_session.png). Exact formatting may vary depending on IGV version and user settings.
* `custom_igv_session.xml` - Contains the IGV session information that loads the following files. This is what most users should access first.
* `custom_alignments.bam` - Contains the full-length reads from the input BAM file re-aligned to the custom reference file.
* `custom_reference.fa` - Contains a single contig composed of the two haplotypes that were identified by StarPhase. The haplotypes are separated by a buffer region ("N"s).
* `custom_regions.bed` - Contains the region annotations for the custom reference. Consensus sequences are labeled using the same format as the other debug consensus files (e.g., `{consensus_index}_{consensus_label}`). For most consensus regions, we do not expect to see variants because they should be captured by the consensus algorithm (exceptions include homo-polymer variation). Un-labeled regions are directly copied from the reference genome and are more likely to contain variants.
* `cyp2d6_link_graph.svg` - A graphical representation of the connections present between CYP2D6 consensus segments.
* `debug_consensus.bam` - Contains debug mappings for the alignment-based genes
* _CYP2D6_ - Contains mapped substrings from the reads that were used to generate CYP2D6 consensus sequences. The haplotype tag (HP) indicates which consensus the sequence was a part of. Useful for visualizing how the consensus ran and whether there are potential errors.
Expand Down
Binary file added images/example_session.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

0 comments on commit 0a6e2ae

Please sign in to comment.