feat: implement fast_typing.wdl workflow for #55#75
Merged
Conversation
Add comprehensive test suite for fast typing workflow (issue #55). Tests verify: - Workflow file structure and location in builtin/ - Import of shared tasks from tasks/ directory - Linear pipeline without alignment stage - Standard inputs (query_sequences, allele_database, profiles_table) - Standardized JSON output format - Strategy metadata (strategy: "fast", alignment_used: false) - MinHash-only allele calling and profile lookup - Edge cases (empty queries, partial profiles) All tests fail as expected (RED phase). Total: 27 failing tests covering file structure, imports, pipeline logic, output format, and execution. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
- Create torchbase/workflows/builtin/fast_typing.wdl with MinHash-based typing pipeline - Implement shared tasks: minhash.wdl (sketch_sequences, compare_sketches, call_alleles) - Implement profile_lookup.wdl task for profile matching and result generation - Pipeline: sketch query → compare → call alleles → lookup profile - Output includes standardized JSON with method.strategy='fast' and alignment_used=false - Supports multi-scheme torches with scheme-prefixed locus names - All 27 structural and validation tests pass - Syntax validated via miniwdl check Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
db051a7 to
82ad93c
Compare
- Accept updated profile_lookup.wdl from main - Update fast_typing.wdl to use new output name and alignment_used parameter
- Allow 'alignment_used' parameter in fast workflow (it's metadata, not a task call) - Check specifically for alignment.wdl import, not just presence of word 'alignment' - Check for align_and_call task name, not generic 'align' substring
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Implements the fast typing strategy workflow that performs allelic typing using MinHash-based similarity only, skipping alignment entirely. This is the fastest strategy suitable for high-quality assemblies or when speed is critical.
fast_typing.wdlworkflow intorchbase/workflows/builtin/minhash.wdlandprofile_lookup.wdlAcceptance Criteria Met
torchbase/workflows/builtin/fast_typing.wdlexistsTest Results
Implementation Details
Co-authored-by: Claude Sonnet 4.5 noreply@anthropic.com