Replies: 8 comments 21 replies
-
For AP-MS, intensities from IonQuant, it is better to use topN approach.
But there are two questions:
1. if you are running philosopher pipeline, how do you execute IonQuant? In FragPIpe, IonQuant rewrites reprint.tsv tables generated by philosopher.
2. Fengchao, what intensities are you writing in reprint.int.tsv? Ideally, we want topN. Do we alays calculate topN now, even if MaxLFQ is checked?
Alexey
From: Tobias Kockmann ***@***.***>
Sent: Friday, December 10, 2021 8:36 AM
To: Nesvilab/philosopher ***@***.***>
Cc: Subscribed ***@***.***>
Subject: [Nesvilab/philosopher] REPRINT analysis (Discussion #298)
External Email - Use Caution
Dear Philosopher developers,
we are trying to setup a std. data analysis pipeline for AP-MS and proximity labeling (BioID, APEX, ...) experiments using Philosopher. The pipeline runs on Linux and follows
https://github.com/Nesvilab/philosopher/wiki/REPRINT-Analysis
but instead of using Comet, we decided for MSFragger as a search engine. We are also running IonQuant on top in order to have the option to forward labelfree quant results (int).
Questions:
Is there anything that one needs to keep in mind when moving from spc data to int data with respect to philosopher parameter choices? We would like to pass the reprint.spc.tsv and reprint.int.tsv to the REPRINT server for interaction scoring or use a local SAINTexpress instance. I am for instance unsure if IonQuant normalization should be applied or not, or if mbr should be restricted somehow.
Best,
Tobi
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub<#298>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AIIMM62BBLNIQ4J2X77XQGLUQH62LANCNFSM5JZE5SKQ>.
Triage notifications on the go with GitHub Mobile for iOS<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675> or Android<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
**********************************************************
Electronic Mail is not secure, may not be read every day, and should not be used for urgent or sensitive issues
|
Beta Was this translation helpful? Give feedback.
-
Ok, that’s good. And with top-N, I think I usually use N=6 for AP-MS.
This being said, I often get better results with spectral counts still when looking at SAINT results. Or at least different from when using the intensity data from the same run. The two seem to be complementary, with my personal preference for spectral count-based analysis.
From: Fengchao ***@***.***>
Sent: Friday, December 10, 2021 9:56 AM
To: Nesvilab/philosopher ***@***.***>
Cc: Nesvizhskii, Alexey ***@***.***>; Comment ***@***.***>
Subject: Re: [Nesvilab/philosopher] REPRINT analysis (Discussion #298)
External Email - Use Caution
1. Fengchao, what intensities are you writing in reprint.int.tsv? Ideally, we want topN. Do we alays calculate topN now, even if MaxLFQ is checked?
If using the latest IonQuant, the intensity in reprint.int.tsv is always from topN. Currently, we write both topN and MaxLFQ if MaxLFQ is checked, and only topN if MaxLFQ is unchecked.
Best,
Fengchao
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub<#298 (reply in thread)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AIIMM64WLIZWL7KPEH4E2QLUQIIJJANCNFSM5JZE5SKQ>.
Triage notifications on the go with GitHub Mobile for iOS<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675> or Android<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
**********************************************************
Electronic Mail is not secure, may not be read every day, and should not be used for urgent or sensitive issues
|
Beta Was this translation helpful? Give feedback.
-
I fear we have to go two steps backwards! We have mayor problems running the base REPRINT workflow on Linux using the philosopher CLI (so let's focus on the spc data first!). I have attached the philosopher.yaml and the std.out/std.error in two zip archives. Once the
But the
In addition, we see some parsing errors for our FASTA-formated spike-in sequences (header fields are not separated!):
The spike-in entries look like:
and were inserted by Can you give us a hint why the REPRINT pipeline fails? |
Beta Was this translation helpful? Give feedback.
-
This seem to be philosopher issue. So Felipe needs to investigate
From: Tobias Kockmann ***@***.***>
Sent: Thursday, December 16, 2021 5:34 AM
To: Nesvilab/philosopher ***@***.***>
Cc: Nesvizhskii, Alexey ***@***.***>; Mention ***@***.***>
Subject: Re: [Nesvilab/philosopher] REPRINT analysis (Discussion #298)
External Email - Use Caution
Dear @anesvi<https://github.com/anesvi> and @fcyu<https://github.com/fcyu>,
I fear we have to go two steps backwards! We have mayor problems running the base REPRINT workflow on Linux using the philosopher CLI (so let's focus on the spc data first!). I have attached the philosopher.yaml and the std.out/std.error in two zip archives.
Once the philosopher pipeline is done, we get the following picture:
***@***.***:/scratch/FRAGPIPE/WU272904$ ls -l
total 333456
-rw-rw-rw-+ 1 bfabric SG_Employees 26973252 Dec 15 17:56 2021-12-15-decoys-reviewed-contam-UP000005640.fas
-rw-rw-rw-+ 1 bfabric SG_Employees 74942921 Dec 16 10:03 2021-12-15-decoys-reviewed-contam-UP000005640.fas.1.pepindex
-rw-rw-rw-+ 1 bfabric SG_Employees 38279511 Dec 16 10:03 2021-12-15-decoys-reviewed-contam-UP000005640.fas.2_-1207187027.peptide_idx_dict
-rw-rw-rw-+ 1 bfabric SG_Employees 74942988 Dec 16 10:03 2021-12-15-decoys-reviewed-contam-UP000005640.fas.2.pepindex
-rw-rw-rw-+ 1 bfabric SG_Employees 1319427 Dec 16 10:27 combined_peptide.tsv
-rw-rw-rw-+ 1 bfabric SG_Employees 117323437 Dec 16 10:26 combined.pep.xml
-rw-rw-rw-+ 1 bfabric SG_Employees 247684 Dec 16 10:28 combined_protein.tsv
-rw-rw-rw-+ 1 bfabric SG_Employees 7127018 Dec 16 10:26 combined.prot.xml
drwxr-xr-x+ 1 bfabric SG_Employees 774 Dec 16 10:27 CONTROL_1
drwxr-xr-x+ 1 bfabric SG_Employees 774 Dec 16 10:27 CONTROL_4
drwxr-xr-x+ 1 bfabric SG_Employees 774 Dec 16 10:27 CONTROL_5
drwxr-xr-x+ 1 bfabric SG_Employees 854 Dec 16 10:27 dCas9_PIPOX_2
drwxr-xr-x+ 1 bfabric SG_Employees 814 Dec 16 10:27 dCas9_PIPOX_3
drwxr-xr-x+ 1 bfabric SG_Employees 814 Dec 16 10:27 dCas9_PIPOX_6
-rw-rw-rw-+ 1 bfabric SG_Employees 878 Dec 16 10:08 Makefile
drwxr-xr-x+ 1 bfabric SG_Employees 734 Dec 16 10:27 P3F_7
drwxr-xr-x+ 1 bfabric SG_Employees 734 Dec 16 10:27 P3F_8
drwxr-xr-x+ 1 bfabric SG_Employees 734 Dec 16 10:27 P3F_9
-rw-rw-rw-+ 1 bfabric SG_Employees 7775 Dec 16 10:03 philosopher.1.txt
-rw-rw-rw-+ 1 bfabric SG_Employees 2834 Dec 16 10:03 philosopher.2.txt
-rw-rw-rw-+ 1 bfabric SG_Employees 6116 Dec 15 17:56 philosopher.yaml
-rw-rw-rw-+ 1 bfabric SG_Employees 90756 Dec 16 10:28 reprint.int.tsv
-rw-rw-rw-+ 1 bfabric SG_Employees 34860 Dec 16 10:28 reprint.spc.tsv
-rw-r--r--+ 1 bfabric SG_Employees 118810 Dec 16 10:43 WU272904.tmux-buffer.txt
-rw-r--r--+ 1 bfabric SG_Employees 6909 Dec 15 17:56 WU272904.yaml
But the reprint.spc.tsv is only initialised (it contains only zero entries):
***@***.***:/scratch/FRAGPIPE/WU272904$ head reprint.spc.tsv
PROTID GENEID PROTLEN CONTROL_1_SPC CONTROL_4_SPC CONTROL_5_SPC P3F_7_SPC P3F_8_SPC P3F_9_SPC dCas9_PIPOX_2_SPC dCas9_PIPOX_3_SPC dCas9_PIPOX_6_SPC
na na na CONTROL CONTROL CONTROL P3F P3F P3F dCas9_PIPOX dCas9_PIPOX dCas9_PIPOX
A1X283 SH3PXD2B 911 0 0 0 0 0 0 0 0 0
A5YKK6 CNOT1 2376 0 0 0 0 0 0 0 0 0
A6NHR9 SMCHD1 2005 0 0 0 0 0 0 0 0 0
O00148 DDX39A 427 0 0 0 0 0 0 0 0 0
O00264 PGRMC1 195 0 0 0 0 0 0 0 0 0
O00267 SUPT5H 1087 0 0 0 0 0 0 0 0 0
O00567 NOP56 594 0 0 0 0 0 0 0 0 0
O14497 ARID1A 2285 0 0 0 0 0 0 0 0 0
In addition, we see some parsing errors for our FASTA-formated spike-in sequences (header fields are not separated!):
Q9Y4L1 HYOU1 999 0 0 0 0 0 0 0 0 0
Q7Z739 YTHDF3 585 0 0 0 0 0 0 0 0 0
Q9Y5A9 YTHDF2 579 0 0 0 0 0 0 0 0 0
Q9Y618 NCOR2 2514 0 0 0 0 0 0 0 0 0
up|P3F|P3F NoneDescription1 836 0 0 0 0 0 0 0 0 0
up|dCas9|dCas9 NoneDescription2 1403 0 0 0 0 0 0 0 0 0
O60315 ZEB2 1214 0 0 0 0 0 0 0 0 0
O75396 SEC22B 215 0 0 0 0 0 0 0 0 0
P02675 FGB 491 0 0 0 0 0 0 0 0 0
P08246 ELANE 267 0 0 0 0 0 0 0 0 0
The spike-in entries look like:
***@***.***:/scratch/FRAGPIPE/WU272904$ tail 2021-12-15-decoys-reviewed-contam-UP000005640.fas
...
up|dCas9|dCas9 NoneDescription2
MDYKDHDGDYKDHDIDYKDDDDKMAPKKKRKVGRGMDKKYSIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDVDAIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENIIHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGD
up|P3F|P3F NoneDescription1
MTTLAGAVPRMMRPGPGQNYPRSGFPLEVSTPLGQGRVNQLGGVFINGRPLPNHIRHKIVEMAHHGIRPCVISRQLRVSHGCVSKILCRYQETGSIRPGAIGGSKPKQVTTPDVEKKIEEYKRENPGMFSWEIRDKLLKDAVCDRNTVPSVSSISRILRSKFGKGEEEEADLERKEAEESEKKAKHSIDGILSERASAPQSDEGSDIDSEPDLPLKRKQRRSRTTFTAEQLEELERAFERTHYPDIYTREELAQRAKLTEARVQVWFSNRRARWRKQAGANQLMAFNHLIPGGFPPTAMPTLPTYQLSETSYQPTSIPQAVSDPSSTVHRPQPLPPSTVHQSTIPSNPDSSSAYCLPSTRHGFSSYTDSFVPPSGPSNPMNPTIGNGLSPQNSIRHNLSLHSKFIRVQNEGTGKSSWWMLNPEGGKSGKSPRRRAASMDNNSKFAKSRSRAAKKKASLQSGQEGAGDSPGSQFSKWPASPGSHSNDDFDNWSTFRPRTSSNASTISGRLSPIMTEQDDLGEGDVHSMVYPPSAAKMASTLPSLSEISNPENMENLLDNLNLLSSPTSLTVSTQSSPGTMMQQTPCYSFAPPNTSLNSPSPNYQKYTYGQSSMSPLPQMPIQTLQDNKSSYGGMSQYNCAPGLLKELLTSDSPPHNDIMTPVDPGVAQPNSRVLGQNVMMGPNSVMSTYGSQASHNKMMNPSSHTHPGHAQQTSAVNGRPLPHTVSTMPHTSGMNRLTQVKTPVQVPLPHPMQMSALGGYSSVSSCNGYGRMGLLHQEKLPSDLDGMFIERLDCDMESIIRNDLMDGDTLDFNFDNVLPNQSFPHSVKTTTHSWVSG
and were inserted by philosopher database ... --add
Can you give as a hint why the REPRINT pipeline fails?
philosopher.yaml.zip<https://github.com/Nesvilab/philosopher/files/7726424/philosopher.yaml.zip>
WU272904.tmux-buffer.txt.zip<https://github.com/Nesvilab/philosopher/files/7726425/WU272904.tmux-buffer.txt.zip>
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub<#298 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AIIMM6YICJ65I6H77D4PZ73URG6DHANCNFSM5JZE5SKQ>.
Triage notifications on the go with GitHub Mobile for iOS<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675> or Android<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
**********************************************************
Electronic Mail is not secure, may not be read every day, and should not be used for urgent or sensitive issues
|
Beta Was this translation helpful? Give feedback.
-
@prvst btw, our current 'FragPipe' versions are: |
Beta Was this translation helpful? Give feedback.
-
Out of curiosity I also tested the same raw files and FASTA DB using FragPipe GUI v17.1 using the default workflow template and the |
Beta Was this translation helpful? Give feedback.
-
For now can just enable it in the Validation tab (there is a checkbox)
From: Fengchao ***@***.***>
Sent: Friday, December 17, 2021 8:58 AM
To: Nesvilab/philosopher ***@***.***>
Cc: Nesvizhskii, Alexey ***@***.***>; Mention ***@***.***>
Subject: Re: [Nesvilab/philosopher] REPRINT analysis (Discussion #298)
External Email - Use Caution
I think I know why the reprint.spec.tsv is missing. We disabled “generate protein level report” in LFQ-MBR workflow because Philosopher needs lots of memory for that if there are many experiments. I am supposed to let IonQuant to generate it, but I forgot. Sorry about that. Will fix this in the next release.
As to the format, I will investigate and get back to you soon.
Best,
Fengchao
—
Reply to this email directly, view it on GitHub<#298 (reply in thread)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AIIMM6ZW4CY3RQ4ZZWA7VV3URM6WXANCNFSM5JZE5SKQ>.
Triage notifications on the go with GitHub Mobile for iOS<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675> or Android<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
You are receiving this because you were mentioned.Message ID: ***@***.******@***.***>>
**********************************************************
Electronic Mail is not secure, may not be read every day, and should not be used for urgent or sensitive issues
|
Beta Was this translation helpful? Give feedback.
-
Thanks, Need to fix that, so all files have GENEID and PoroteinID
For the intensity file, LENGTH is not needed because SAINT does not use it with intensity data
From: Tobias Kockmann ***@***.***>
Sent: Friday, December 17, 2021 7:04 AM
To: Nesvilab/philosopher ***@***.***>
Cc: Nesvizhskii, Alexey ***@***.***>; Mention ***@***.***>
Subject: Re: [Nesvilab/philosopher] REPRINT analysis (Discussion #298)
External Email - Use Caution
In addition: reprint.spc.tsv contains the columns, PROTID, GENEID, PROTLEN. reprint.int.tsv written by the LFQ-MBR lacks GENEID and PROTLEN. And reprint.int.tsv written by default contains PROTID and GENEID. So 3 different version of a reprint file layout.
—
Reply to this email directly, view it on GitHub<#298 (reply in thread)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AIIMM65QVHCHS4JQKOTZMKDURMRJPANCNFSM5JZE5SKQ>.
Triage notifications on the go with GitHub Mobile for iOS<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675> or Android<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
You are receiving this because you were mentioned.Message ID: ***@***.******@***.***>>
**********************************************************
Electronic Mail is not secure, may not be read every day, and should not be used for urgent or sensitive issues
|
Beta Was this translation helpful? Give feedback.
-
Dear Philosopher developers,
we are trying to setup a std. data analysis pipeline for AP-MS and proximity labeling (BioID, APEX, ...) experiments using Philosopher. The pipeline runs on Linux and follows
https://github.com/Nesvilab/philosopher/wiki/REPRINT-Analysis
but instead of using Comet, we decided for MSFragger as a search engine. We are also running IonQuant on top in order to have the option to forward labelfree quant results (int).
Questions:
Is there anything that one needs to keep in mind when moving from spc data to int data with respect to philosopher parameter choices? We would like to pass the reprint.spc.tsv and reprint.int.tsv to the REPRINT server for interaction scoring or use a local SAINTexpress instance. I am for instance unsure if IonQuant normalization should be applied or not, or if mbr should be restricted somehow.
Best,
Tobi
Beta Was this translation helpful? Give feedback.
All reactions