-
Notifications
You must be signed in to change notification settings - Fork 93
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MACS2 peaks beyond chromosome #1050
Comments
In looking at the CellRanger peak calling, it looks like there is a peak called at the very end of chr23. Is it possible that the CallPeaks function might have extended this peak with the default ext.size = 200 such that it would extend beyond the boundary of the chromosome? If so, is there any way to remove this peak (and any others extending beyond their chromosome) so it does not interfere with downstream analysis? |
So macs2 did in fact extend a peak at the end of the chromosome beyond the chromosome, and removing that single peak by taking the CallPeaks() output and setting that row to NULL worked to resolve the issue. This might be an enhancement suggestion, in case there is any way to include a limitation in the CallPeaks() function to clip any peaks that would extend beyond the annotated chromosome. |
@t-and thanks for looking into it, I agree it would be good to have an automatic way of preventing this but it's difficult to do since we don't currently require any information about the chromosome lengths when running |
The optional parameter would definitely be helpful, since I'd think most users would likely have some sort of BSgenome or other genome attached to use as a reference for where the chromosome should end. Thanks for being so attentive to the issues posted on here! If it's okay, I am going to close this issue now that the actual error is resolved. |
I'll just leave this open until we have something added to Signac to handle this properly |
Hi @timoast,
I am running into an issue I have seen on here, but cannot seem to identify a solution. I am currently trying to analyze an snATAC dataset from zebrafish. While running AddMotifs, I get the error:
Based on issue #860, it looks like this is because I have peaks that extend beyond the chromosome.
Interestingly, I was able to run AddMotifs without error on the peaks from my CellRanger pre-processed data - this error only occurred after I re-did the peak-calling with macs2.
I know you mentioned in #860 that the issue may be that the version of the genome is different between the data and the BSgenome object, however both the BSgenome and data use the same assembly, GRCz11. Are there any other potential causes for this issue, or is there any way around it?
Thank you for this great tool and your support,
Troy
The text was updated successfully, but these errors were encountered: