Trimming repeated nucleotides #96

Open
adamcatto opened this issue Apr 29, 2024 · 2 comments

@adamcatto

Is there an option in miRge to trim runs of a nucleotide that is repeated at least k times? For example, ACGT[A* >= k]TGCA would be trimmed to ACGTTGCA. I know this can be done before running the miRge pipeline, but it would be nice to have it as an argument in the miRge run script.
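
For reference, the pre-processing described above can be sketched in a few lines of Python. This is only an illustration of the idea, not part of miRge: the function name `trim_homopolymer_runs` and the choice of alphabet (`ACGTN`) are assumptions made here, and the sketch operates on a bare sequence string rather than on FASTQ records.

```python
import re


def trim_homopolymer_runs(seq: str, k: int) -> str:
    """Remove every run of a single nucleotide that is at least k bases long.

    Hypothetical helper for illustration; not a miRge function.
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    # ([ACGTN]) captures one base; \1{k-1,} requires at least k-1 more copies
    # of that same base, so the whole match is a run of length >= k.
    pattern = re.compile(r"([ACGTN])\1{%d,}" % (k - 1))
    # Single pass: runs that only form after a deletion are not re-scanned.
    return pattern.sub("", seq)


read = "ACGT" + "A" * 6 + "TGCA"
print(trim_homopolymer_runs(read, k=5))
```

With FASTQ input, the matching positions in the quality string would need to be removed as well so that sequence and quality stay the same length.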

@arunhpatil
Collaborator

Hi @adamcatto,

Thank you for your suggestion. I don't see how this would improve the current pipeline, or what its overall benefits would be. We don't currently have an option for removing internal repeated nucleotides. (Please expect delays in my replies due to travel; I will be back on May 04 EST.)

Thank you,
Arun.

@adamcatto
Author

I think some reads may contain runs of identical nucleotides that are technical artifacts and should be removed. In any case, I have forked the repository and added an option to remove runs of repeated nucleotides ≥ a given length. You can view the changes here if that sounds interesting: adamcatto@8709dfe
