-
Hello, I'm currently testing out MSFragger with a 4 protein dataset (4 glycoproteins known to be in the sample, and nothing else should be in there), and tried using a human proteome fasta file (from uniprot) with reversed decoys appended. I was able to generate search results via MSFragger, but when I try and use philosopher's implementation of peptideprophet, I get messages saying that the mixture model fails quality test for charges +1 to +7. Here's the command: philosopher peptideprophet --database uniprot-reviewed_yes+taxonomy_9606_W_DECOYS_v3.fasta --nonparam --decoy DECOY_ --masswidth 2000 --decoyprobs 4ProteinMix_01_pickpeak_v3.pepXML Here's the output: file 1: C:\Users\newot\ms_fragger\MSFragger_20170103\4ProteinMix_01_pickpeak_v3.pepXML
Using Decoy Label "DECOY_". INFO: Processing standard MixtureModel ... I'm going to guess that most 'non-decoy' peptide sequences are going to have matches with spectra due to chance or overlap in fragment masses with peptides from the 4 'correct' proteins (that are actually in the sample). Thus, the probability of getting a high score from MSFragger given that you have a 'non-decoy' peptide should be roughly equivalent to that for peptides from decoy proteins. Should philosopher's peptideprophet be able to identify the peptides tiny subset of non-decoy proteins (4) that should have very high scores? Thanks, |
Beta Was this translation helpful? Give feedback.
Replies: 4 comments
-
Hey @whitleyo ; Could you give me a little more information about your experiment ? It seems to me that you have a database with +- 20k decoys and only 4 targets, is that correct ? |
Beta Was this translation helpful? Give feedback.
-
Hello, I basically downloaded a FASTA file from this search http://www.uniprot.org/uniprot/?query=reviewed:yes%20taxonomy:9606 (reviewed and human), which includes about 20k sequences, and appended decoys via a python script I wrote and tested. So the database has 20k FASTAs and 20k reversed decoys. The MS data I am using come from a 4-protein mixture of 4 glycoproteins (2 of them are bovine, 2 are human) (see PDX006031 on pride for the raw data), and proteins underwent CID fragmentation. The search I did had a precursor mass tolerance window of 2000 Da since these proteins are glycosylated and I wanted to see if MS-Fragger + Philosopher could pick up peptides with large mass shifts (from glycosylation). The fragment mass tolerance window was 20ppm. The thinking behind that was to hopefully identify precursors that faithfully matched up to theoretical fragment spectra but would not be within the precursor mass tolerance. |
Beta Was this translation helpful? Give feedback.
-
Looking again through some reviews (e.g. Allen et al. 2013) and looking at the spectral data through some other software (glycopat), the MS2 spectra should be dominated by precursor peptides with mass shifts of varying magnitude (due to CID predominantly breaking bonds between sugars of the glycan groups). |
Beta Was this translation helpful? Give feedback.
-
The PeptideProphet version you have inside Philosopher is pretty much the same you find in TPP v5.01, the few adjustments I do have no affect over the results. I don't see why it shouldn't find your proteins, regarding that you have a properly set parameter file for the search and the correct set of parameters for PeptideProphet. I suggest you to get in touch with the developers that are maintaining the Prophets code and ask them. |
Beta Was this translation helpful? Give feedback.
The PeptideProphet version you have inside Philosopher is pretty much the same you find in TPP v5.01, the few adjustments I do have no affect over the results. I don't see why it shouldn't find your proteins, regarding that you have a properly set parameter file for the search and the correct set of parameters for PeptideProphet. I suggest you to get in touch with the developers that are maintaining the Prophets code and ask them.