Skip to content

Could the read name modification be improved? #52

Description

@TomSmithCGAT

Hello,

Is there are reason why the read name information needs to be concatenated here?

read.name.replace(" ", "_").replace("/", "").replace("\\", "")

I ask because it breaks downstream tools like umi_tools dedup which use the read names to identify read pairs. Would it be tolerable to add the 'rbc:` string to the end of the first element of the space-delimited read name?

Happy to issue a PR if that would work too. I note this would be a non-backward compatibile change if any other downstream tool/code is dependent upon the current read naming convention.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions