-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathrelease_notes.html
1176 lines (1176 loc) · 48.6 KB
/
release_notes.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<html>
<head>
<title>X! tandem relase notes</title>
<style>
td {color: #444444;
font-weight: normal;
font-family: Verdana, Arial, Helvetica;
font-size: 10pt;}
li {color: #444444;
font-weight: normal;
font-family: Verdana, Arial, Helvetica;
font-size: 10pt;
padding-bottom: .5em;}
.td_b {color: black;
font-weight: bold;
font-family: Verdana, Arial, Helvetica;
font-size: 10pt;}
</style>
</head>
<body>
<table border="0" cellpadding="3" cellspacing="3">
<tr>
<td width="600" align="left" valign="top" class="td_b">X! tandem release notes</td>
</tr>
<tr>
<td width="600" align="left" valign="top">X! tandem will be released periodically,
with the version numbering system formulated from the date of release. The
changes made to the system on each release are detailed in the list below.
Releases are listed with the most recent release on top.
<hr>
</td>
</tr>
</table>
<table border="0" cellpadding="3" cellspacing="3" ID="Table5">
<tr>
<td class="td_b" bgcolor="#dd6622"><span class="top_label">Latest release: 2007.07.01</span></td>
</tr>
<tr>
<td width="600" align="left" valign="top">This release adds compatibility with 64 bit floating point
data in mzXML or mzData formats.
</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>An new override of the dtohl method in the saxhandler.cpp file
was added to deal with 64 bit floats.
</LI>
</ol>
</td>
</tr>
</table>
<table border="0" cellpadding="3" cellspacing="3" ID="Table5">
<tr>
<td class="td_b" bgcolor="#dd6622"><span class="top_label">2007.04.01</span></td>
</tr>
<tr>
<td width="600" align="left" valign="top">This release is the first version to support amino acid residue
polymorphism annotation. A file containing known coding mutations can be specified and the
search engine will check each specified version of those modified residues.
</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>Including SAPS required modification of the classes <i>mprocess</i> and
<i>mscore</i> as well as the addition of a new classs <i>mscoresap</i>, which
is specified in the mscorepam.h file. The new class follows the
same pattern as the other state machines for tracking sequence modifications. A class
that reads the XML-formatted SAPS annotation information has also
been added, <i>saxsaphandler</i>, which follows the same pattern as
the other BIOML processing classes.
</LI>
<LI>This version (and all subsequent ones) will use the preprocessor commands associated with
the compiler make processor to specify the platform being compiled. Previous
versions required the commenting out undesired options in the stdafx.h file. This
change includes the PLUGGABLE_SCORING preprocessor definition in mscore.h that
is necessary to alter the peptide scoring portion of the code.
</LI>
<LI>The Mac OSX version of the executable binaries is statically linked to
the most recent version of the XML parser expat. Unlike previous versions
it will not be necessary to have expat installed on the computer used
to run the search engine. The Linux/Unix builds are now the only
platforms onwhich dynamic linking is necessary.
</LI>
</ol>
</td>
</tr>
</table>
<table border="0" cellpadding="3" cellspacing="3" ID="Table5">
<tr>
<td class="td_b" bgcolor="#dd6622"><span class="top_label">2007.01.01</span></td>
</tr>
<tr>
<td width="600" align="left" valign="top">This release is the first release to support rho-diagrams
for the determination of expectation value thresholds. It also has a minor, but important
change to the interpretation of results to make the results of refinement rounds more
consistent. This release is also the first to support the Mac OS X 10.4 version for Intel
processors. Support for non-Intel processors will be discontinued as of the next release.
</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>The output file has a new output value in the "performance parameters" group,
for example:
<pre>
<group label="performance parameters" type="parameters">
<note label="quality values">117 34 22 10 5 2 1 0 0 1 0 1 0 0 0 0 0 0 0 0</note>
</group></pre>
This change was necessary to support the use of rho-diagrams in the GPM display software.
</LI>
<LI>The proteins reported as possible correct identifications have been changed somewhat.
In previous versions, it was possible for a protein to be reported as identified even if
it did not have any qualifying peptides that were found to have the specified enzymatic
cleavage: a protein could have peptides found only during the cleavage-at-every site round
or the point-mutation round. This behavior has been changed so that a protein must have
at least one significant peptide found to have the specified enzymatic cleavage.
</LI>
<LI>The implementation of point mutation detection has been altered so that if a particular
possible point mutation has been explained by any set of potential modifications, it will not
be included as a possible solution in the output.
</LI>
<li>The stdafx.h file has been altered to add in new preprocessor statements that
deal with the different versions of Mac OS X. These new statements are #OSX_TIGER
and #OSX_INTEL. To compile with OS X 10.4 on a PPC computer, uncomment both #OSX
and #OSX_TIGER. To compile with OS X 10.4 on an Intel computer, uncomment only
#OSX_INTEL.
</li>
</ol>
</td>
</tr>
</table>
<table border="0" cellpadding="3" cellspacing="3" ID="Table5">
<tr>
<td class="td_b" bgcolor="#dd6622"><span class="top_label">2006.09.15</span></td>
</tr>
<tr>
<td width="600" align="left" valign="top">This release of X! Tandem/P3 contains several
minor changes to maintain cross-platform compatibility.
</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>
Code that uses iterator math to determine the limits of a calculation
have been altered so that the iterators are not incremented passed the
end of an STL container. Incrementing iterators passed the end of a
container generates a run-time error when compiled with Microsoft Visual
C++ 2005. The precompiler variable _CRT_SECURE_NO_DEPRECATE has been
defined for the Visual C++ compiler, to prevent the generation of
unnecessary compiler warnings for the use of C string functions, such as "strcpy".
</LI>
<LI>
The SAXTandemInputHandler::characters method has been updated to
improve its performance and to handle escape characters correctly (suggested by Brendan Maclean).
</LI>
<LI>
The documentation and precompiler defines in stdafx.h that provide cross-platform compatibility
for 64-bit integer types have been updated.
</LI>
</ol>
</td>
</tr>
</table>
<table border="0" cellpadding="3" cellspacing="3">
<tr>
<td class="td_b" bgcolor="#dd6622"><span class="top_label">2006.06.01</span></td>
</tr>
<tr>
<td width="600" align="left" valign="top">This release of X! Tandem contains two
fixes to improve compatibility with Linux and one adjustment to be compatible
with X! Hunter.
</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>
On at least some Linux platforms, astersiks (*) in FASTA file were not being
processed properly. This has been corrected by Brendan Maclean.
</LI>
<LI>
Some mzXML files could produce memory problems, when combined with some
spectrum processing parameters because a vector was not being cleared between
processing individual spectra. This problem did not affect the results of the
search, but it could cause memory paging when using a large file.
</LI>
<LI>
The taxonomy xml file has always contained <i>type</i> specification, which was
not used by X! Tandem. Now X! Tandem enforces that FASTA or FASTA.PRO files
must be specificed with <i>type="peptide"</i>.
</LI>
</ol>
</td>
</tr>
</table>
<table border="0" cellpadding="3" cellspacing="3">
<tr>
<td class="td_b" bgcolor="#dd6622"><span class="top_label">2006.04.01</span></td>
</tr>
<tr>
<td width="600" align="left" valign="top"><p>This release of X! Tandem includes a
number of additions to the system API. These changes are mainly for
programmers, allowing for greater customization of searches. Some of these
features have been present in previous versions of X! Tandem, but have been
either undocumented or unsufficiently well tested.</p>
<p>This version of the X! Tandem code also merges the code for X! P3. The P3
executable can be compiled by uncommenting the preprocessor variable X_P3 in
"stdafx.h". Adding in the P3 code was done by creating several
classes that are extensions of the normal Tandem classes and using a small
number of preprocessor directives. These new classes have the prefix <b>p3</b>.</p>
</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top">Added parameters:
<ol>
<LI>
<a href="http://www.thegpm.org/TANDEM/api/olp.html">output, log path</a>
</LI>
<LI>
<a href="http://www.thegpm.org/TANDEM/api/om.html">output, message</a>
</LI>
<LI>
<a href="http://www.thegpm.org/TANDEM/api/oosc.html">output, one sequence copy</a>
</LI>
<LI>
<a href="http://www.thegpm.org/TANDEM/api/osp.html">output, sequence path</a>
</LI>
<LI>
<a href="http://www.thegpm.org/TANDEM/api/refmm.html">refine, modification mass</a>
</LI>
<LI>
<a href="http://www.thegpm.org/TANDEM/api/rsp.html">refine, sequence path</a>
</LI>
<LI>
<a href="http://www.thegpm.org/TANDEM/api/rtp.html">refine, tic percent</a>
</LI>
<LI>
<a href="http://www.thegpm.org/TANDEM/api/scp.html">scoring, cyclic permuation</a>
</LI>
<LI>
<a href="http://www.thegpm.org/TANDEM/api/sir.html">scoring, include reverse</a>
</LI>
<LI>
<a href="http://www.thegpm.org/TANDEM/api/spsbs.html">spectrum, sequence batch size</a>
</LI>
</ol>
</td>
</tr>
</table>
<table border="0" cellpadding="3" cellspacing="3">
<tr>
<td class="td_b" bgcolor="#dd6622"><span class="top_label">2006.04.01</span></td>
</tr>
<tr>
<td width="600" align="left" valign="top"><p>This release of X! Tandem includes a
number of additions to the system API. These changes are mainly for
programmers, allowing for greater customization of searches. Some of these
features have been present in previous versions of X! Tandem, but have been
either undocumented or unsufficiently well tested.</p>
<p>This version of the X! Tandem code also merges the code for X! P3. The P3
executable can be compiled by uncommenting the preprocessor variable X_P3 in
"stdafx.h". Adding in the P3 code was done by creating several
classes that are extensions of the normal Tandem classes and using a small
number of preprocessor directives. These new classes have the prefix <b>p3</b>.</p>
</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top">Added parameters:
<ol>
<LI>
<a href="http://www.thegpm.org/TANDEM/api/olp.html">output, log path</a>
</LI>
<LI>
<a href="http://www.thegpm.org/TANDEM/api/om.html">output, message</a>
</LI>
<LI>
<a href="http://www.thegpm.org/TANDEM/api/oosc.html">output, one sequence copy</a>
</LI>
<LI>
<a href="http://www.thegpm.org/TANDEM/api/osp.html">output, sequence path</a>
</LI>
<LI>
<a href="http://www.thegpm.org/TANDEM/api/refmm.html">refine, modification mass</a>
</LI>
<LI>
<a href="http://www.thegpm.org/TANDEM/api/rsp.html">refine, sequence path</a>
</LI>
<LI>
<a href="http://www.thegpm.org/TANDEM/api/rtp.html">refine, tic percent</a>
</LI>
<LI>
<a href="http://www.thegpm.org/TANDEM/api/scp.html">scoring, cyclic permutation</a>
</LI>
<LI>
<a href="http://www.thegpm.org/TANDEM/api/sir.html">scoring, include reverse</a>
</LI>
<LI>
<a href="http://www.thegpm.org/TANDEM/api/spsbs.html">spectrum, sequence batch size</a>
</LI>
</ol>
</td>
</tr>
</table>
<table border="0" cellpadding="3" cellspacing="3">
<tr>
<td class="td_b" bgcolor="#dd6622"><span class="top_label">2006.02.01</span></td>
</tr>
<tr>
<td width="600" align="left" valign="top">This release of X! Tandem contains
several small fixes in response to error reports.
</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>
The C-ion mass calculation has been improved for electron-capture ion source
identifications, suggested by David Fenyo.
</LI>
<LI>
A problem relating to an include file that caused compilation difficulties for
some versions of GCC on some version of Linux has been fixed.
</LI>
<li>
The calculation of parent ion mass difference has been improved, to provide
better consistency for very accurate (< 1 ppm) parent ion mass
determinations.
</li>
<li>
The "semi" cleavage state machine has been adjusted for better
performance.
</li>
</ol>
</td>
</tr>
</table>
<table border="0" cellpadding="3" cellspacing="3">
<tr>
<td class="td_b" bgcolor="#dd6622"><span class="top_label">2005.12.01</span></td>
</tr>
<tr>
<td width="600" align="left" valign="top">This release of X! Tandem adds several
new features, as well as improving the XML standards compatibility of the
system.
</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>
An improved handling of hex encoded binary information in mzXML and mzData
files, for 64-bit processors, added by Steven Wiley.
</LI>
<LI>
Addition of testing for N-terminal glutamic acid cyclization, suggested by Oleg
Krohkin.
</LI>
<li>
Addition of "semi" enzymatic cleavage (specific enzyme cleavage at
one end of a peptide and non-specific cleavage at the other), suggested by Matt
Monroe.
</li>
<li>
An improved system for detecting XML file types, suggested by Steven Wiley.
</li>
<li>
Support for variant methods of expressing parent ion charge in mzData v. 1.05,
added by Fredrik Levander.
</li>
</ol>
</td>
</tr>
</table>
<table border="0" cellpadding="3" cellspacing="3">
<tr>
<td class="td_b" bgcolor="#dd6622"><span class="top_label">2005.10.01</span></td>
</tr>
<tr>
<td width="600" align="left" valign="top">This release of X! Tandem adds several
new features, as well as improving on some of the existing features. It
contains a number of engineering architectural changes meant to allow simpler
access to some of the key algorithms in the system.
</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>
An improved version of the state machine that lists all of the possible
potential modification states of a peptide sequence was written by Brendan
Maclean. This version is both more thorough and faster than the previous code.
</LI>
<LI>
The capability of using chemical average masses for fragment ion mass
calculations was added by Brendan Maclean.
</LI>
<li>
A simplification of the mprocess class that allows for a "pluggin"
approach to adding new refinement modules was designed and implemented by Rob
Craig.
</li>
<li>
An improved routine for correcting for isotope peaks and multiple observations
of similar masses was made by Patrick Lacasse.
</li>
<li>
An additional state machine using cyclic peptide sequence permutation to
compensate for small sequence collections and for large mass peptides was
added. This feature is based on a suggestions made separately by Tom Blackwell
& David States and Patrick Lacasse.</li>
<li>
An improved sorting method to improve the consistency of homologous sequence
assignments was added by Rob Craig.
</li>
</ol>
</td>
</tr>
</table>
<table border="0" cellpadding="3" cellspacing="3">
<tr>
<td class="td_b" bgcolor="#dd6622"><span class="top_label">2005.08.15</span></td>
</tr>
<tr>
<td width="600" align="left" valign="top">The changes in this release are aimed at
increasing XML compliance and high accuracy mass calculation consistency.
</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>
The handlers for GAML spectra, taxonomy files and input parameter files have to
changed to using expat, rather than custom routines.
</LI>
<LI>
A more flexible mass calculation class has been added to improve molecular mass
consistency for high accuracy calculations.
</LI>
<li>
The input spectrum file type detection method has been improved by adding the
possibility of forcing it to select one file type. This forcing is done using
the input parameter "spectrum, path type" parameters, which can have
the values: dta, pkl, mgf, gaml, mzxml or mzdata. If this parameter is missing
or of zero length, the normal file type detection scheme is used.
</li>
</ol>
</td>
</tr>
<tr>
<td class="td_b" bgcolor="#dd6622"><span class="top_label">2005.06.01</span></td>
</tr>
<tr>
<td width="600" align="left" valign="top">This version corrects an issue that could
arise in large MudPIT data sets with large numbers of redundant
identifications. The calculation of the protein expectation value in previous
versions was susceptible to floating point overflows when making this
calculation, resulting in unpredictable values.
</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b" bgcolor="#dd6622">2005.03.21</td>
</tr>
<tr>
<td width="600" align="left" valign="top">This release adds the ability to process <a href="http://sashimi.sourceforge.net/software_glossolalia.html">
mzxml</a> and <a href="http://psidev.sourceforge.net/ms/#mzdata">mzdata</a>
file formats using <a href="http://expat.sourceforge.net/">eXpat</a> library of
functions. Most of the changes in this release were initially made by Patrick
Lacasse (Université Laval, Dept. of Medicine, supported by Genome Québec) with
the final version and optimizations made by Brendan MacLean, from the Fred
Hutchinson Cancer Research Center. Also, the ability to define the amino acid
residue masses has been added allowing users to change the default masses when
doing N15 experiments for example.
</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>
New classes have been added to allow the processing of mzxml and mzdata file
formats.<br />
Two of the new classes are publicly derived from loadspectrum, a custom class
specific to Tandem. Two others are publicly derived from the xml parser class
SAXSpectraHandler which is imported from the expat library of functions. The
xml parser classes use the expat functions exclusively to parse the input in
order to load it into the traditional Tandem spectra data members.
</LI>
<LI>
base64.cpp and base64.h have been added to allow b64_decode_mio() function
calls, which are needed to decode the spectra in mzxml and mzdata spectra
files.
</LI>
<LI>
Included in the src folder is the libexpat.lib which is required to compile new
versions of the executable on Windows. Linux and OSX machines should have the
required libraries as part of the core operating system.
</LI>
<LI>
A new function has been added to msequtilites that allows amino acid residue
masses to be defined by an xml input file. If the parameter 'protein, modified
residue mass file' is defined in the input.xml, the masses are taken from the
file defined by that parameter. An example of the format can be viewed <a href="http://human.thegpm.org/tandem/methods/n15.xml">
here</a>.
</LI>
</ol>
</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b" bgcolor="#dd6622">2005.02.01</td>
</tr>
<tr>
<td width="600" align="left" valign="top">This release contains modifications
necessary to insert new types of peptide scoring systems as well as to deal
effectively with high accuracy parent ion measurements, which are now available
in some types of mass spectrometers. Most of the changes in this release were
made by Brendan MacLean, from the Fred Hutchinson Cancer Research Center.
</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>
Several new classes have been added, to make the scoring system
"pluggable", i.e., it is now much easier to alter the scoring system
used, for the purposes of bioinformatics investigations. These changes are
mainly of interest to informatics professionals and they should not affect the
normal operation of the software for users.
</LI>
<LI>
The calculation of parent ion mass has been changed, taking more care as to the
mass of added groups and correctly accounting for electron masses.
</LI>
<LI>
Better statistical methods have been added to deal with the small number of
possible peptides generated from a list of protein sequences that have a very
high accurately determined parent ion mass.
</LI>
</ol>
</td>
</tr>
</table>
<table border="0" cellpadding="3" cellspacing="3">
<tr>
<td width="600" align="left" valign="top" class="td_b" bgcolor="#dd6622">2004.11.15.3</td>
</tr>
<tr>
<td width="600" align="left" valign="top">This release adds in several features
that were originally scheduled to appear in the 2004.11.15 release, but which
were pushed back from the initial release. The 2004.11.15.2 version was not
generally released.</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>
Spectra that are interpreted as being caused by a prompt neutral loss now have
the prompt loss specified in the appropriate <aa> node in the output.
</LI>
<LI>
Correction of an issue with the OS X version that resulted in improper reading
of ".pro" sequence files. Initially, the ".pro" format was
to have both little endian and big endian versions, however this became too
confusing to maintain. The current plan is to only use the little endian format
and to compensate for this on-the-fly in the OS X version.
</LI>
<LI>
The maximum parent ion charge to be used can now be specified using the
"spectrum, maximum parent charge" parameter. This parameter has a
default value of 4. This change was made necessary because of high charge
states being called by some MS peak assignment software, which caused spurious
assignments.
</LI>
<LI>
The first round of refinement (finding partially cleaved peptides) has been
extended, so that it possible to repeat it with different sets of modifications
and motifs. These additional refinement rounds are specified by adding
parameters using the following format:
<ul>
<li>
Round 1: "refine, potential modification mass"<br>
"refine, potential modification motif"
</li>
<li>
Round 2: "refine, potential modification mass 1"<br>
"refine, potential modification motif 1"
</li>
<li>
Round 3: "refine, potential modification mass 2"<br>
"refine, potential modification motif 2"
</li>
</ul>
This will continue until both of the next pair of parameters are either missing
or neither contain an ampersand (@).
</LI>
</ol>
</td>
</tr>
</table>
<table border="0" cellpadding="3" cellspacing="3">
<tr>
<td width="600" align="left" valign="top" class="td_b" bgcolor="#dd6622">2004.11.15</td>
</tr>
<tr>
<td width="600" align="left" valign="top">This is a maintenance release, to correct
one issue identified in the 2004.09.01 release.</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>
An error that resulted in the incorrect interpretation of some motifs was
corrected.
</LI>
<LI>
Addition of spectrum prefiltering to remove repeated spectra from the initial
set of mass spectra. This feature compares spectra using a dot product
calculation and removes spectra that have vector representations that point in
the same direction. The most intense spectrum out of a set of repeated spectra
is kept and used for analysis. This type of filtering can remove up to 90% of
spectra from a MudPit-style run, making data analysis and interpretation
easier.
</LI>
</ol>
</td>
</tr>
</table>
<table border="0" cellpadding="3" cellspacing="3">
<tr>
<td width="600" align="left" valign="top" class="td_b" bgcolor="#dd6622">2004.09.01</td>
</tr>
<tr>
<td width="600" align="left" valign="top">This is a maintenance release, to correct
one issue identified in the 2004.08.01 release.</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>
A possible floating-point overrun error that could lead to 0.0 expectation
values for high scoring peptides was detected from GPMDB submissions has been
corrected.
</LI>
</ol>
</td>
</tr>
</table>
<table border="0" cellpadding="3" cellspacing="3">
<tr>
<td width="600" align="left" valign="top" class="td_b" bgcolor="#dd6622">2004.08.01</td>
</tr>
<tr>
<td width="600" align="left" valign="top">This is a maintenance release, to correct
several issues identified in the 2004.07.15 release.</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>
An error that reduced the score for triply-charged ions was corrected.
</LI>
<LI>
Quantitation information was added to the output XML file.
</LI>
</ol>
</td>
</tr>
</table>
<table border="0" cellpadding="3" cellspacing="3" ID="Table4">
<tr>
<td width="600" align="left" valign="top" class="td_b" bgcolor="#dd6622">2004.07.15</td>
</tr>
<tr>
<td width="600" align="left" valign="top">This is a major release of TANDEM,
sufficiently different from previous releases to merit a major revision number:
this release will be referred to as TANDEM 2.</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>
The memory management throughout the program has been analyzed and altered to
minimize the amount of memory used per spectrum. This effort has reduced the
amount of memory used in single threaded operation by as much as 60%: the
improvement for double threaded operation may be as much as 80%.
</LI>
<LI>
The threading model has been changed to allow for the use of multiple
processeors in the refinement process. TANDEM 1 separated work between the
threads by dividing up the sequences to search, so that each thread would only
search a subset of the sequences in a FASTA file. TANDEM 2 divides up the mass
spectra between threads, so that each thread searches a subset of the mass
spectra. This change makes it easier to divide up the refinement job, but means
that running more than one thread on a single processor will degrade the
performance of the software. For best performance, it is now important to keep
the number of threads and the number of processors the same.
</LI>
<LI>
The refinement process has been improved in accuracy by applying a logical
filter after each step of refinement. This means that once a refinement step is
completed, the new results obtained from the refinement are examined and it the
new results are not significantly better than those obtained from a simpler
search, they are discarded and the simpler results retained. This filtering
significantly reduces the complexity of analyzing results when there may be a
variety of similar modification patterns or point mutations that explain a
particular spectrum.
</LI>
<LI>
Validation of results using reversed sequence databases has been built-in to
the search process. This validation may be turned on or off, using the new
input parameter "<i>scoring, include reverse</i>" (values = yes|no).
This validation process tabulates the number of unique high probability hits
from the reversed sequence search and places them in the output file, along
with estimates of the false positive rate based on TANDEM's stochastic
histogramming technique and the estimate derived from the reversed sequence
process. NOTE: When this validation method is used, twice as many sequences
must be processed (both forward and reversed), which may require significantly
more processing time.
</LI>
<LI>
Numerous small optimizations have been made, particularly for loading and
reporting the results for very large collections of mass spectra.
</LI>
</ol>
</td>
</tr>
</table>
<table border="0" cellpadding="3" cellspacing="3" ID="Table4">
<tr>
<td width="600" align="left" valign="top" class="td_b" bgcolor="#dd6622">2004.06.01</td>
</tr>
<tr>
<td width="600" align="left" valign="top">This release adds three new
functionalities to X! TANDEM. These new functions make it possible to modify
protein sequences in new ways.</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>
The ability to specify modifications based on sequence motifs was added to both
the normal search and refinement steps. A comma separated list of motifs in
slightly modified PROSITE format can be used to only modify specific residues.
An example of this format is:<BR>
204@[N!]{P}[ST]{P} - which says a motif that has an N, followed by any residue
except P, followed by an S or a T, followed by any residue except P is
specified. Modify the residue in the group containing the exclamation point (in
this case the N) by adding 204 Da.
<BR>
The peptides containing this motif are checked both with and without this
modification, so it is interpreted the same way that a "potential" modification
is interpretted. The rules for creating these motifs are:
<ul>
<li>
Square brackets "[]" indicate any of the residues contained is
possible;</li>
<li>
French brackets "{}" indicate that any of the residues contained is
forbidden;</li>
<li>
A bare letter is interpreted as if it was in square brackets and can be
modified with an exclamation point, e.g. 16@[M!] is the same as 16M!;</li>
<li>
An exclamation point indicates the position of modification;</li>
<li>
The letter "X" indicates any residue;</li>
and
<li>
Round brackets "()" indicate a count, e.g. "X(10)" means
ten X's in a row;</li>
<li>
All other characters are ignored, e.g. 80@[ST!]PX[KR] is the same as
80@[ST!]-P-X-[KR].</li>
</ul>
</LI>
<LI>
The ability to specify prompt neutral losses for potential modifications
(including motifs) has been added. This neutral loss is specified by adding a
colon followed by the mass corresponding to the loss. For example:<BR>
80@S specified phosporylation without loss, while 80:-98@S specifies the
neutral loss of the phosphate group.
</LI>
<LI>
The ability to specify on a sequence by sequence level specific fixed
modifications of residues by the residue number has been added. This capability
cannot be exploited currently because of a lack of sequence lists that contain
this type of information. However, an appropriately translated version of a
database such as SWISSPROT could be used to provide this information.
</LI>
</ol>
</td>
</tr>
</table>
<table border="0" cellpadding="3" cellspacing="3" ID="Table4">
<tr>
<td width="600" align="left" valign="top" class="td_b" bgcolor="#dd6622">2004.05.01</td>
</tr>
<tr>
<td width="600" align="left" valign="top">This release is a maintenance release. It
should improve memory usage for very long sequence lists, but other wise should
be neutral.</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>
The mechanism for storing sequences in the mprocess class has been changed.
Previously, a copy of each protein sequence was stored with each peptide model
associated with a spectrum. Now, a master list of protein sequences is kept and
only a lookup number is stored with each peptide model. This change improves
memory management for very large pools of redundant proteins or very long lists
of spectra.</LI>
</ol>
</td>
</tr>
</table>
<table border="0" cellpadding="3" cellspacing="3" ID="Table4">
<tr>
<td width="600" align="left" valign="top" class="td_b" bgcolor="#dd6622">2004.04.10</td>
</tr>
<tr>
<td width="600" align="left" valign="top">This release corrects an interpretation
problem introduced in the 2004.04.01 release. This problem results in an
overemphasis on peptides found in the refinement steps.</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>
The refinement processing was returned to its previous state, so that only the
best scoring peptides from the refinement process are reported.</LI>
</ol>
</td>
</tr>
</table>
<table border="0" cellpadding="3" cellspacing="3" ID="Table4">
<tr>
<td width="600" align="left" valign="top" class="td_b" bgcolor="#dd6622">2004.04.01</td>
</tr>
<tr>
<td width="600" align="left" valign="top">This release is the result of an effort
to reduce memory usage by TANDEM. This effort has resulted in a 70% reduction
in memory usage, when using large data files.</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>
GAML spectra, such as those in output xml files, can now be used as input data.</LI>
<LI>
The length of scoring histogram arrays have been altered to improve memory
usage.
</LI>
<LI>
Several instances of temporary copies of data have been removed and other data
structures cleared as soon as possible after use.</LI>
</ol>
</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">Corrected problems</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>
A behavior that resulted in the lost of the last character in sequences in some
FASTA files has been corrected.</LI>
<LI>
A compatibility issue resulting from various choices for the size of the size_t
STL variable on unix platforms has been corrected, so that most unix platforms
should compile without modifying the linux version of the code.</LI>
</ol>
</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">Known problems</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>
No problems known at time of release</LI>
</ol>
<hr>
</td>
</tr>
</table>
<table border="0" cellpadding="3" cellspacing="3" ID="Table4">
<tr>
<td width="600" align="left" valign="top" class="td_b" bgcolor="#dd6622">2004.03.01</td>
</tr>
<tr>
<td width="600" align="left" valign="top">This release fixes a number of
compatibility issues and unexpected behaviors in Tandem and associated
formating files.</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>
A new state machine was added to perform N- and C- terminal partial
modifications. Previous versions used these modifications as complete
modifications only.</LI>
<LI>
The optimization for the minimum number of residues considered was removed and
replaced with the constant value of 4. The prior optimization did not produce a
significant improvement in speed, but it did cause occasional problems with
large neutral losses.</LI>
<LI>
The xslt and css files have been updated to conform more closely to
specification, making them compatible with the FireFox browser.</LI>
</ol>
</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">Corrected problems</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>
A behavior that allowed the occasional consideration of peptides with too many
missed cleavage sites was fixed.</LI>
<LI>
A compatibility issue for starting threads on some unix platforms has been
corrected, so that most unix platforms should compile without modifying the
linux version of the code.</LI>
</ol>
</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">Known problems</td>
</tr>
<tr>
<td width="600" align="left" valign="top"><ol>
<LI>
No problems known at time of release</LI>
</ol>
<hr>
</td>
</tr>
</table>
<table border="0" cellpadding="3" cellspacing="3" ID="Table3">
<tr>
<td width="600" align="left" valign="top" class="td_b" bgcolor="#dd6622">2004.02.01</td>
</tr>
<tr>
<td width="600" align="left" valign="top">This release introduces the capability of
detecting point mutations in protein sequences</td>
</tr>
<tr>
<td width="600" align="left" valign="top" class="td_b">System level changes</td>