Skip to content

Align target sequence to selected transcript at the protein level #80

@bencap

Description

@bencap

When mapping protein-level variants, the mapper currently assumes the target sequence aligns to the reference starting at a fixed offset with no gaps. This produces incorrect mappings when the target protein doesn't align cleanly to the reference — for example, when there are internal gaps or the target covers a non-contiguous region of the reference.

This change should introduce a protein-to-protein alignment step (via BLAT) between the target sequence and the selected transcript's reference protein. Variant positions will then be mapped through the actual alignment subranges rather than a simple offset, correctly handling gaps and partial coverage.

Motivation:
Improves accuracy of protein variant coordinate mapping for targets whose relationship to the reference protein is more complex than a simple positional shift.

Metadata

Metadata

Assignees

Labels

app: mapperTask implementation touches the mappertype: bugSomething isn't workingtype: enhancementNew feature or request

Type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions