1: \begin{thebibliography}{10}
2:
3: \bibitem{agmml90}
4: S.~F. Altschul, W.~Gish, W.~Miller, E.~W. Myers, and D.~J. Lipman.
5: \newblock Basic local alignment search tool.
6: \newblock {\em Journal of Molecular Biology}, 215:403--410, 1990.
7:
8: \bibitem{amszzml97}
9: S.~F. Altschul, T.~L. Madden, A.~A. Sch{\"{a}}ffer, J.~Zhang, Z.~Zhang,
10: W.~Miller, and D.~J. Lipman.
11: \newblock Gapped {BLAST} and {PSI}-{BLAST}: a new generation of protein
12: database search programs.
13: \newblock {\em Nucleic Acids Research}, 25:3389--3402, 1997.
14:
15: \bibitem{bttp01}
16: A.~Bahr, J.~D. Thompson, J.-C. Thierry, and O.~Poch.
17: \newblock {BA}li{BASE} ({B}enchmark {A}lignment data{BASE}): enhancements for
18: repeats, transmembrane sequences and circular permutations.
19: \newblock {\em Nucleic Acids Research}, 29:323--326, 2001.
20:
21: \bibitem{bc04}
22: V.~C. Barbosa and L.~C.~D. Campos.
23: \newblock A novel evolutionary formulation of the maximum independent set
24: problem.
25: \newblock {\em Journal of Combinatorial Optimization}, 8:419--437, 2004.
26:
27: \bibitem{bcg94}
28: S.~A. Benner, M.~A. Cohen, and G.~H. Gonnet.
29: \newblock Amino acid substitution during functionally constrained divergent
30: evolution of protein sequences.
31: \newblock {\em Protein Engineering}, 7:1323--1332, 1994.
32:
33: \bibitem{bc01}
34: J.~D. Blake and F.~E. Cohen.
35: \newblock Pairwise sequence alignment below the twilight zone.
36: \newblock {\em Journal of Molecular Biology}, 307:721--735, 2001.
37:
38: \bibitem{bjeg98}
39: A.~Brazma, I.~Jonassen, I.~Eidhammer, and D.~Gilbert.
40: \newblock Approaches to the automatic discovery of patterns in biosequences.
41: \newblock {\em Journal of Computational Biology}, 5:279--305, 1998.
42:
43: \bibitem{b02}
44: T.~A. Brown.
45: \newblock {\em Genomes 2}.
46: \newblock Wiley-Liss, Oxford, UK, 2002.
47:
48: \bibitem{bw84}
49: T.~H. Byers and M.~S. Waterman.
50: \newblock Determining all optimal and near-optimal solutions when solving
51: shortest path problems by dynamic programming.
52: \newblock {\em Operations Research}, 32:1381--1384, 1984.
53:
54: \bibitem{ctrv02}
55: N.~Cannata, S.~Toppo, C.~Romualdi, and G.~Valle.
56: \newblock Simplifying amino acid alphabets by means of a branch and bound
57: algorithm and substitution matrices.
58: \newblock {\em Bioinformatics}, 18:1102--1108, 2002.
59:
60: \bibitem{c98}
61: K.-M. Chao.
62: \newblock On computing all suboptimal alignments.
63: \newblock {\em Journal of Information Sciences}, 105:189--207, 1998.
64:
65: \bibitem{clrs01}
66: T.~H. Cormen, C.~E. Leiserson, R.~L. Rivest, and C.~Stein.
67: \newblock {\em Introduction to Algorithms}.
68: \newblock The MIT Press, Cambridge, MA, second edition, 2001.
69:
70: \bibitem{dso78}
71: M.~O. Dayhoff, R.~M. Schwartz, and B.~C. Orcutt.
72: \newblock A model of evolutionary change in proteins.
73: \newblock In M.~O. Dayhoff, editor, {\em Atlas of Protein Sequence and
74: Structure}, volume 5, supplement 3, pages 345--352. National Biomedical
75: Research Foundation, Washington, DC, 1978.
76:
77: \bibitem{dhs84}
78: J.~Devereux, P.~Haeberli, and O.~Smithies.
79: \newblock A comprehensive set of sequence analysis programs for the {VAX}.
80: \newblock {\em Nucleic Acids Research}, 12:387--395, 1984.
81:
82: \bibitem{d81}
83: R.~F. Doolittle.
84: \newblock Similar amino acid sequences: chance or common ancestry?
85: \newblock {\em Science}, 214:149--159, 1981.
86:
87: \bibitem{dt01}
88: Z.~Doszt{\'{a}}nyi and A.~E. Torda.
89: \newblock Amino acid similarity matrices based on force fields.
90: \newblock {\em Bioinformatics}, 17:686--699, 2001.
91:
92: \bibitem{ejt04}
93: I.~Eidhammer, I.~Jonassen, and W.~R. Taylor.
94: \newblock {\em Protein Bioinformatics}.
95: \newblock John Wiley \& Sons, Chichester, UK, 2004.
96:
97: \bibitem{fjd85}
98: D.-F. Feng, M.~S. Johnson, and R.~F. Doolittle.
99: \newblock Aligning amino acid sequences: comparison of commonly used methods.
100: \newblock {\em Journal of Molecular Evolution}, 21:112--125, 1985.
101:
102: \bibitem{f66}
103: W.~M. Fitch.
104: \newblock An improved method of testing for evolutionary homology.
105: \newblock {\em Journal of Molecular Biology}, 16:9--16, 1966.
106:
107: \bibitem{gcg91}
108: {Genetics Computer Group}.
109: \newblock Program manual for the {GCG} package, version 7, April 1991.
110:
111: \bibitem{gcb92}
112: G.~H. Gonnet, M.~A. Cohen, and S.~A. Benner.
113: \newblock Exhaustive matching of the entire protein sequence database.
114: \newblock {\em Science}, 256:1443--1445, 1992.
115:
116: \bibitem{g99}
117: O.~Gotoh.
118: \newblock Multiple sequence alignment: algorithms and applications.
119: \newblock {\em Advances in Biophysics}, 36:159--206, 1999.
120:
121: \bibitem{g74}
122: R.~Granthram.
123: \newblock Amino acid difference formula to help explain protein evolution.
124: \newblock {\em Science}, 185:862--864, 1974.
125:
126: \bibitem{gb02}
127: R.~E. Green and S.~E. Brenner.
128: \newblock Bootstrapping and normalization for enhanced evaluations of pairwise
129: sequence comparison.
130: \newblock {\em Proceedings of the IEEE}, 90:1834--1847, 2002.
131:
132: \bibitem{gb86}
133: M.~Gribskov and R.~R. Burgess.
134: \newblock Sigma factors from {E}.\ coli, {B}.\ subtilis, phage {SP}01, and
135: phage {T}4 are homologous proteins.
136: \newblock {\em Nucleic Acids Research}, 14:6745--6763, 1986.
137:
138: \bibitem{g97}
139: D.~Gusfield.
140: \newblock {\em Algorithms on Strings, Trees, and Sequences}.
141: \newblock Cambridge University Press, Cambridge, UK, 1997.
142:
143: \bibitem{hh92}
144: S.~Henikoff and J.~G. Henikoff.
145: \newblock Amino acid substitution matrices from protein blocks.
146: \newblock {\em Proceedings of the National Academy of Sciences USA},
147: 89:10915--10919, 1992.
148:
149: \bibitem{hh93}
150: S.~Henikoff and J.~G. Henikoff.
151: \newblock Performance evaluation of amino acid substitution matrices.
152: \newblock {\em Proteins: Structure, Function, and Genetics}, 17:49--61, 1993.
153:
154: \bibitem{hh00}
155: S.~Henikoff and J.~G. Henikoff.
156: \newblock Amino acid substitution matrices.
157: \newblock In P.~Bork, editor, {\em Advances in Protein Chemistry}, volume 54,
158: Analysis of Amino Acid Sequences, pages 73--97. Academic Press, 2000.
159:
160: \bibitem{hr02}
161: I.~Holmes and G.~M. Rubin.
162: \newblock An expectation maximization algorithm for training hidden
163: substitution models.
164: \newblock {\em Journal of Molecular Biology}, 317:753--764, 2002.
165:
166: \bibitem{i97}
167: T.~R. Ioerger.
168: \newblock The context-dependence of amino acid properties.
169: \newblock In {\em Proceedings of the Fifth International Conference on
170: Intelligent Systems for Molecular Biology}, pages 157--166, 1997.
171:
172: \bibitem{jz81}
173: M.~A. Jim{\'{e}}nez-Monta{\~{n}}o and L.~Zamora-Cortina.
174: \newblock Evolutionary model for the generation of amino acid sequences and its
175: application to the study of fragments of mammal-hemoglobin chains.
176: \newblock In {\em Proceedings of the Seventh International Biophysics
177: Congress}, 1981.
178:
179: \bibitem{jo93}
180: M.~S. Johnson and J.~P. Overington.
181: \newblock A structural basis for sequence comparisons. {A}n evaluation of
182: scoring methodologies.
183: \newblock {\em Journal of Molecular Biology}, 233:716--738, 1993.
184:
185: \bibitem{jtt92}
186: D.~T. Jones, W.~R. Taylor, and J.~M. Thornton.
187: \newblock The rapid generation of mutation data matrices from protein
188: sequences.
189: \newblock {\em Computer Applications in the Biosciences}, 8:275--282, 1992.
190:
191: \bibitem{jl00}
192: J.~S. Jung and B.~Lee.
193: \newblock Use of residue pairs in protein sequence-sequence and
194: sequence-structure alignments.
195: \newblock {\em Protein Engineering}, 9:1576--1588, 2000.
196:
197: \bibitem{k96}
198: T.~M. Klingler.
199: \newblock {\em Structural Inference from Correlations in Biological Sequences}.
200: \newblock PhD thesis, Program in Medical Informatics, Stanford University,
201: 1996.
202:
203: \bibitem{kgb04}
204: C.~Kosiol, N.~Goldman, and N.~H. Buttimore.
205: \newblock A new criterion and method for amino acid classification.
206: \newblock {\em Journal of Theoretical Biology}, 228:97--106, 2004.
207:
208: \bibitem{ls02}
209: T.~Lassmann and E.~L.~L. Sonnhammer.
210: \newblock Quality assessment of multiple alignment programs.
211: \newblock {\em FEBS Letters}, 529:126--130, 2002.
212:
213: \bibitem{ltptp01}
214: O.~Lecompte, J.~D. Thompson, F.~Plewniak, J.-C Thierry, and O.~Poch.
215: \newblock Multiple alignment of complete sequences ({MACS}) in the post-genomic
216: era.
217: \newblock {\em Gene}, 270:17--30, 2001.
218:
219: \bibitem{lrg86}
220: J.~M. Levin, B.~Robson, and J.~Garnier.
221: \newblock An algorithm for secondary structure determination in proteins based
222: on sequence similarity.
223: \newblock {\em FEBS Letters}, 205:303--308, 1986.
224:
225: \bibitem{l03}
226: B.~Lewin.
227: \newblock {\em Genes VIII}.
228: \newblock Prentice Hall, Upper Saddle River, NJ, 2003.
229:
230: \bibitem{lfww03}
231: T~.P. Li, K.~Fan, J.~Wang, and W.~Wang.
232: \newblock Reduction of protein sequence complexity by residue grouping.
233: \newblock {\em Protein Engineering}, 16:323--330, 2003.
234:
235: \bibitem{lmt01}
236: K.~Lin, A.~C.~W. May, and W.~R. Taylor.
237: \newblock Amino acid substitution matrices from an artificial neural network
238: model.
239: \newblock {\em Journal of Computational Biology}, 8:471--481, 2001.
240:
241: \bibitem{lp85}
242: D.~J. Lipman and W.~R. Pearson.
243: \newblock Rapid and sensitive protein similarity searches.
244: \newblock {\em Science}, 227:1435--1441, 1985.
245:
246: \bibitem{lb93}
247: C.~D. Livingstone and G.~J. Barton.
248: \newblock Protein sequence alignments: a strategy for the hierarchical analysis
249: of residue conservation.
250: \newblock {\em Computer Applications in the Biosciences}, 9:745--756, 1993.
251:
252: \bibitem{m99}
253: A.~C.~W. May.
254: \newblock Towards more meaningful hierarchical classification of amino acid
255: scoring matrices.
256: \newblock {\em Protein Engineering}, 12:707--712, 1999.
257:
258: \bibitem{m71}
259: A.~D. McLachlan.
260: \newblock Tests for comparing related amino-acid sequences. {C}ytochrome
261: \textit{c} and cytochrome \textit{c}551.
262: \newblock {\em Journal of Molecular Biology}, 61:409--424, 1971.
263:
264: \bibitem{m96}
265: M.~Mitchell.
266: \newblock {\em An Introduction to Genetic Algorithms}.
267: \newblock The MIT Press, Cambridge, MA, 1996.
268:
269: \bibitem{mmy79}
270: T.~Miyata, S.~Miyazawa, and T.~Yasunaga.
271: \newblock Two types of amino acid substitutions in protein evolution.
272: \newblock {\em Journal of Molecular Evolution}, 12:219--236, 1979.
273:
274: \bibitem{m95}
275: G.~Mocz.
276: \newblock Fuzzy cluster analysis of simple physicochemical properties of amino
277: acids for recognizing secondary structure in proteins.
278: \newblock {\em Protein Science}, 4:1178--1187, 1995.
279:
280: \bibitem{msv02}
281: T.~M{\"{u}}ller, R.~Spang, and M.~Vingron.
282: \newblock Estimating amino acid substitution models: a comparison of
283: {D}ayhoff's estimator, the resolvent approach and a maximum likelihood
284: method.
285: \newblock {\em Molecular Biology and Evolution}, 19:8--13, 2002.
286:
287: \bibitem{mv00}
288: T.~M{\"{u}}ller and M.~Vingron.
289: \newblock Modeling amino acid replacement.
290: \newblock {\em Journal of Computational Biology}, 7:761--776, 2000.
291:
292: \bibitem{nb94}
293: D.~Naor and D.~L. Brutlag.
294: \newblock On near-optimal alignments of biological sequences.
295: \newblock {\em Journal of Computational Biology}, 1:349--366, 1994.
296:
297: \bibitem{nfjwn96}
298: D.~Naor, D.~Fischer, R.~L. Jernigan, H.~J. Wolfson, and R.~Nussinov.
299: \newblock Amino acid pair interchanges at spatially conserved locations.
300: \newblock {\em Journal of Molecular Biology}, 256:924--938, 1996.
301:
302: \bibitem{nw70}
303: S.~B. Needleman and C.~D. Wunsch.
304: \newblock A general method applicable to the search for similarities in the
305: amino acid sequence of two proteins.
306: \newblock {\em Journal of Molecular Biology}, 48:443--453, 1970.
307:
308: \bibitem{n02}
309: C.~Notredame.
310: \newblock Recent progress in multiple sequence alignment: a survey.
311: \newblock {\em Pharmacogenomics}, 3:131--144, 2002.
312:
313: \bibitem{pl88}
314: W.~R. Pearson and D.~J. Lipman.
315: \newblock Improved tools for biological sequence comparison.
316: \newblock {\em Proceedings of the National Academy of Sciences USA},
317: 85:2444--2448, 1988.
318:
319: \bibitem{p00}
320: P.~A. Pevzner.
321: \newblock {\em Computational Molecular Biology}.
322: \newblock The MIT Press, Cambridge, MA, 2000.
323:
324: \bibitem{r87}
325: J.~K.~M. Rao.
326: \newblock New scoring matrix for amino acid residue exchanges based on residue
327: characteristic physical parameters.
328: \newblock {\em International Journal of Peptide and Protein Research},
329: 29:276--281, 1987.
330:
331: \bibitem{rfpgp00}
332: I.~Rigoutsos, A.~Floratos, L.~Parida, Y.~Gao, and D.~Pratt.
333: \newblock The emergence of pattern discovery techniques in computational
334: biology.
335: \newblock {\em Metabolic Engineering}, 2:159--177, 2000.
336:
337: \bibitem{rddh88}
338: J.~L. Risler, M.~O. Delorme, H.~Delacroix, and A.~Henaut.
339: \newblock Amino acid substitutions in structurally related proteins. a pattern
340: recognition approach.
341: \newblock {\em Journal of Molecular Biology}, 204:1019--1029, 1988.
342:
343: \bibitem{rssbs97}
344: R.~B. Russell, M.~A.~S. Saqi, R.~A. Sayle, P.~A. Bates, and M.~J.~E. Sternberg.
345: \newblock Recognition of analogous and homologous protein folds: analysis of
346: sequence and structure conservation.
347: \newblock {\em Journal of Molecular Biology}, 269:423--439, 1997.
348:
349: \bibitem{sw03}
350: M.-F. Sagot and Y.~Wakabayashi.
351: \newblock Pattern inference under many guises.
352: \newblock In B.~A. Reed and C.~L. Sales, editors, {\em Recent Advances in
353: Algorithms and Combinatorics}, pages 245--287. Springer-Verlag, New York, NY,
354: 2003.
355:
356: \bibitem{sm97}
357: J.~Setubal and J.~Meidanis.
358: \newblock {\em Introduction to Computational Molecular Biology}.
359: \newblock PWS Publishing Company, Boston, MA, 1997.
360:
361: \bibitem{ss90}
362: R.~F. Smith and T.~F. Smith.
363: \newblock Automatic generation of primary sequence patterns from sets of
364: related protein sequences.
365: \newblock {\em Proceedings of the National Academy of Sciences USA},
366: 87:118--122, 1990.
367:
368: \bibitem{sw81}
369: T.~F. Smith and M.~S. Waterman.
370: \newblock Identification of common molecular subsequences.
371: \newblock {\em Journal of Molecular Biology}, 147:195--197, 1981.
372:
373: \bibitem{s66}
374: P.~H. Sneath.
375: \newblock Relations between chemical structure and biological activity in
376: peptides.
377: \newblock {\em Journal of Theoretical Biology}, 12:157--195, 1966.
378:
379: \bibitem{s96}
380: L.~E. Stanfel.
381: \newblock A new approach to clustering the amino acids.
382: \newblock {\em Journal of Theoretical Biology}, 183:195--205, 1996.
383:
384: \bibitem{t86}
385: W.~R. Taylor.
386: \newblock The classification of amino acid conservation.
387: \newblock {\em Journal of Theoretical Biology}, 119:205--218, 1986.
388:
389: \bibitem{t99}
390: W.~R. Taylor.
391: \newblock The properties of amino acids in sequences.
392: \newblock In M.~J. Bishop, editor, {\em Genetic Databases}, pages 81--103.
393: Academic Press, London, UK, 1999.
394:
395: \bibitem{tpp99a}
396: J.~D. Thompson, F.~Plewniak, and O.~Poch.
397: \newblock {BA}li{BASE}: a benchmark alignment database for the evaluation of
398: multiple alignment programs.
399: \newblock {\em Bioinformatics}, 15:87--88, 1999.
400:
401: \bibitem{tpp99b}
402: J.~D. Thompson, F.~Plewniak, and O.~Poch.
403: \newblock A comprehensive comparison of multiple sequence alignment programs.
404: \newblock {\em Nucleic Acids Research}, 27:2682--2690, 1999.
405:
406: \bibitem{v02}
407: W.~S.~J. Valdar.
408: \newblock Scoring residue conservation.
409: \newblock {\em Proteins: Structure, Function, and Genetics}, 48:227--241, 2002.
410:
411: \bibitem{vms99}
412: A.~Vanet, L.~Marsan, and M.-F. Sagot.
413: \newblock Promoter sequences and algorithmical methods for identifying them.
414: \newblock {\em Research in Microbiology}, 150:779--799, 1999.
415:
416: \bibitem{vst03}
417: S.~Veerassamy, A.~Smith, and E.~R.~M. Tillier.
418: \newblock A transition probability model for amino acid substitutions from
419: blocks.
420: \newblock {\em Journal of Computational Biology}, 10:997--1010, 2003.
421:
422: \bibitem{vb01}
423: M.~S. Venkatarajan and W.~Braun.
424: \newblock New quantitative descriptors of amino acids based on multidimensional
425: scaling of a large number of physical-chemical properties.
426: \newblock {\em Journal of Molecular Modeling}, 7:445--453, 2001.
427:
428: \bibitem{vw94}
429: M.~Vingron and M.~S. Waterman.
430: \newblock Sequence alignment and penalty choice. review of concepts, case
431: studies and implications.
432: \newblock {\em Journal of Molecular Biology}, 235:1--12, 1994.
433:
434: \bibitem{vvp01}
435: D.~Voet, J.~G. Voet, and C.~W. Pratt.
436: \newblock {\em Fundamentals of Biochemistry}.
437: \newblock John Wiley \& Sons, New York, NY, 2001.
438:
439: \bibitem{vea95}
440: G.~Vogt, T.~Etzold, and P.~Argos.
441: \newblock An assessment of amino acid exchange matrices in aligning protein
442: sequences: the twilight zone revisited.
443: \newblock {\em Journal of Molecular Biology}, 249:816--831, 1995.
444:
445: \bibitem{w83}
446: M.~S. Waterman.
447: \newblock Sequence alignments in the neighborhood of the optimum with general
448: application to dynamic programming.
449: \newblock {\em Proceedings of the National Academy of Sciences USA},
450: 80:3123--3124, 1983.
451:
452: \bibitem{wb96}
453: T.~D. Wu and D.~L. Brutlag.
454: \newblock Discovering empirically conserved amino acid substitution groups in
455: databases of protein families.
456: \newblock In {\em Proceedings of the Fourth International Conference on
457: Intelligent Systems for Molecular Biology}, pages 230--240, 1996.
458:
459: \bibitem{xm04}
460: W.~Xu and D.~P. Miranker.
461: \newblock A metric model of amino acid substitution.
462: \newblock {\em Bioinformatics}, 20:1214--1221, 2004.
463:
464: \end{thebibliography}
465: