1
|
- MS Computer Science Thesis Defense
|
2
|
- Phylogenetics
- Neural Networks
- SOTA
- KomPhy
- Results
- Birth-death trees
- Uniform trees
- Comparison with other algorithms
- Parallel Implementation
|
3
|
|
4
|
|
5
|
- Neural networks were inspired by the idea that neurons in the brain
  are very simple computational devices which, when linked together in
  interesting ways, are able to learn.
- Each neuron performs a simple calculation on its input and generates
output.
- Neuron output flows from neuron to neuron until an output neuron is
reached.
- Can produce approximate solutions to complex problems very quickly.
|
6
|
- State Space Search
- NP-Hard (Graham and Foulds, 1982)
- Phylogenetics is a process of inference from uncertain and incomplete
evidence.
- Heuristics and approximate solutions are welcome for large data sets.
|
7
|
- Each output neuron has an associated weight vector.
- Each component of the weight vector modulates a component of the input
vector.
|
8
|
- A particular input from an input set is mapped onto the input neurons.
- As each new input is presented to the network an output neuron becomes
associated with it (wins).
- The winning neuron is made more likely to win again given the same
input.
- In this way the network learns to recognize classes of input by
associating each neuron with a subset of the input set.
|
9
|
- The competitive neural network has two components.
- Si is a member of the input set
- Ri is the response of output neuron i: the dot product of its weight
  vector with the input
- Maxnet: equivalent to argmax( Ri )
|
10
|
- Once a winning neuron has been determined the learning rule is applied
to it.
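The slide does not spell the rule out; a common form of the competitive learning rule, consistent with the later slides' behavior ("becomes more like S1"), moves the winner a fraction α of the way toward the input. This sketch assumes that form.

```python
import numpy as np

def update_winner(weights, s, win, alpha=0.1):
    """Competitive learning rule (standard form, assumed here):
    W_win <- W_win + alpha * (s - W_win), i.e. the winning weight
    vector moves a fraction alpha of the way toward the input."""
    weights[win] += alpha * (s - weights[win])

W = np.array([[1.0, 0.0],
              [0.0, 1.0]])
s = np.array([0.0, 0.8])
update_winner(W, s, win=1, alpha=0.5)
print(W[1])  # halfway from [0, 1] to [0, 0.8]: [0.  0.9]
```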
|
11
|
- The weight vectors associated with each neuron can be thought of as
points in a weight space.
- Since the inputs are of the same dimensionality as the weights, they
  can also be mapped into weight space.
|
12
|
- Element 1 of input set S is presented to the network.
- The distance (dot product) between each weight vector and S1 is
  calculated.
|
13
|
- The winning weight vector is found to be closer to S1 and the
  learning rule is applied to it.
|
14
|
- The winning weight vector is found to be closer to S1 and the
  learning rule is applied to it.
- The winner becomes more like S1.
|
15
|
- S2 is presented to the network.
- The winning weight vector is found to be closer to S2 and the
  learning rule is applied to it.
|
16
|
- S2 is presented to the network.
- The winning weight vector is found to be closer to S2 and the
  learning rule is applied to it.
- The winner becomes more like S2.
|
17
|
- S3 is presented to the network.
- The winning weight vector is found to be closer to S3 and the
  learning rule is applied to it.
|
18
|
- S3 is presented to the network.
- The winning weight vector is found to be closer to S3 and the
  learning rule is applied to it.
- The winner becomes more like S3.
|
19
|
- S4 is presented to the network.
- The winning weight vector is found to be closer to S4 and the
  learning rule is applied to it.
|
20
|
- S4 is presented to the network.
- The winning weight vector is found to be closer to S4 and the
  learning rule is applied to it.
- The winner becomes more like S4.
|
21
|
- S5 is presented to the network.
- The winning weight vector is found to be closer to S5 and the
  learning rule is applied to it.
|
22
|
- S5 is presented to the network.
- The winning weight vector is found to be closer to S5 and the
  learning rule is applied to it.
- The winner becomes more like S5.
|
23
|
- S6 is presented to the network.
- The winning weight vector is found to be closer to S6 and the
  learning rule is applied to it.
|
24
|
- S6 is presented to the network.
- The winning weight vector is found to be closer to S6 and the
  learning rule is applied to it.
- The winner becomes more like S6.
|
25
|
- S7 is presented to the network.
- The winning weight vector is found to be closer to S7 and the
  learning rule is applied to it.
|
26
|
- S7 is presented to the network.
- The winning weight vector is found to be closer to S7 and the
  learning rule is applied to it.
- The winner becomes more like S7.
|
27
|
- S6 is presented to the network.
- The winning weight vector is found to be closer to S6 and the
  learning rule is applied to it.
- The winner becomes more like S6.
|
28
|
- The set of input vectors is typically presented to the network numerous
times. Each cycle of presentations is called an epoch.
- Various methods can be used to determine when to finish training the
network and output the partitions:
- Root Mean Squared (RMS) error between weights and sequences.
- Fixed number of epochs.
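Putting the pieces together, a minimal epoch loop with an RMS stopping test might look like the sketch below; the learning-rule form, default parameters, and all names are assumptions, not KomPhy's code.

```python
import numpy as np

def train(weights, inputs, alpha=0.1, max_epochs=100, tol=1e-4):
    """One epoch presents every input once; training stops when the RMS
    weight change over an epoch drops below tol, or after max_epochs."""
    for _ in range(max_epochs):
        before = weights.copy()
        for s in inputs:
            win = np.argmax(weights @ s)                # winner by dot product
            weights[win] += alpha * (s - weights[win])  # move winner toward s
        if np.sqrt(np.mean((weights - before) ** 2)) < tol:
            break
    return weights

# Two inputs at opposite corners; weights start slightly biased so each
# neuron captures one input.
W = np.array([[0.9, 0.1],
              [0.1, 0.9]])
S = np.array([[1.0, 0.0],
              [0.0, 1.0]])
W = train(W, S)  # W[0] ends near [1, 0], W[1] near [0, 1]
```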
|
29
|
- The number of epochs required for the network to converge depends on α
in the learning rule and the distances between input vectors.
- The smaller α is, the more training cycles are needed for the
  neurons to reach a stable point in weight space.
- Large α values can cause the network to converge quickly, but if
  they are too large the neurons may get stuck in an unstable cycle.
|
30
|
- In 1997, J. Dopazo and J. M. Carazo published a paper in the Journal
  of Molecular Evolution describing a neural network approach to
  phylogeny reconstruction they called SOTA.
- SOTA employs a Self-Organizing Map (SOM) to reconstruct phylogenies.
- SOMs use some of the same concepts as competitive neural networks.
|
31
|
- The main difference between a SOM and a competitive network is that in
the SOM numerous neurons are embedded in a large mesh.
- The neurons are divided up into neighborhoods (many local maxnets) and
  go through cycles of inhibition and excitation.
- The mesh determines the excitatory and inhibitory relationships.
- After some number of cycles the neurons congregate in the weight space
in such a way that they map out areas of high probability.
|
32
|
- Dopazo and Carazo's paper describes the reconstruction of two
  phylogenies, both of which are real data sets for which the true tree
  is unknown.
- SOTA and related SOMs were adapted to perform protein sequence
  clustering and function prediction, but none of the follow-up papers
  deal with phylogenetic reconstruction specifically [2][3].
- There is a need for more analysis of neural network approaches to
phylogeny reconstruction.
|
33
|
- Top-down divide-and-conquer algorithm
- Constructive (could be faulted for not providing alternative trees)
- The algorithm partitions the input sequences into two at each recursive
step
- The partitioning is done by a two node Kohonen competitive neural
network
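The outer recursion can be sketched as below; `bipartition` stands in for the two-neuron Kohonen network, and the toy first-character split is purely illustrative.

```python
def build_tree(seqs, bipartition):
    """Top-down divide-and-conquer sketch: bipartition(seqs) is assumed
    to return two subsets chosen by a two-neuron competitive network
    (hypothetical signature). Leaves are individual sequences."""
    if len(seqs) == 1:
        return seqs[0]
    left, right = bipartition(seqs)
    if not left or not right:          # degenerate split: fall back to halving
        mid = len(seqs) // 2
        left, right = seqs[:mid], seqs[mid:]
    return (build_tree(left, bipartition), build_tree(right, bipartition))

# Toy stand-in for the network: split on the first character.
split = lambda xs: ([q for q in xs if q[0] == 'A'],
                    [q for q in xs if q[0] != 'A'])

print(build_tree(["AAC", "AAG", "TTG", "TTC"], split))
# (('AAC', 'AAG'), ('TTG', 'TTC'))
```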
|
34
|
- Distances between weights and input sequences can be calculated using
available phylogenetic methods
- These methods must be modified to accept real-valued distances between
  characters
- Euclidean distance provides a good approximation for parsimony
- Jukes-Cantor, Tamura-Nei, and F84 were adapted for KomPhy
- Unfortunately the initial weight-to-input distances are too great for
  F84 and Tamura-Nei
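For reference, the classical (discrete-character) Jukes-Cantor correction looks like this; KomPhy's real-valued adaptation is not shown in the slides, so this sketch only illustrates the underlying formula.

```python
import math

def jukes_cantor(a, b):
    """Classical Jukes-Cantor distance between two aligned DNA
    sequences: d = -(3/4) ln(1 - (4/3) p), where p is the observed
    fraction of differing sites."""
    p = sum(x != y for x, y in zip(a, b)) / len(a)
    return -0.75 * math.log(1.0 - (4.0 / 3.0) * p)

# One mismatch in eight sites: p = 0.125
print(round(jukes_cantor("AACCGGTT", "AACCGGTA"), 4))  # 0.1367
```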
|
35
|
- Each base is represented by an apex of this regular tetrahedron.
- Weight components transition smoothly from base to base as points within
this volume.
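One way to realize this encoding is sketched below; the particular vertex coordinates and the base-to-apex assignment are arbitrary choices, since the slides do not specify them.

```python
import numpy as np

# One standard set of regular-tetrahedron vertices; the base-to-apex
# assignment below is an arbitrary (assumed) choice.
BASE = {'A': np.array([ 1.0,  1.0,  1.0]),
        'C': np.array([ 1.0, -1.0, -1.0]),
        'G': np.array([-1.0,  1.0, -1.0]),
        'T': np.array([-1.0, -1.0,  1.0])}

def encode(seq):
    """Encode a DNA sequence as a (len, 3) array of tetrahedron apexes;
    weight components can then drift smoothly between bases as points
    inside the tetrahedron's volume."""
    return np.stack([BASE[b] for b in seq])

# Every apex pair is equidistant (2*sqrt(2)), so no substitution is
# geometrically privileged.
print(round(float(np.linalg.norm(BASE['A'] - BASE['C'])), 3))  # 2.828
```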
|
36
|
|
37
|
|
38
|
|
39
|
|
40
|
|
41
|
|
42
|
- Assumes that speciation events result in two populations which are
  equally able to generate new speciation events; that is, subtrees
  tend to be balanced.
- This is a poor assumption, since subtree sizes depend as much on the
  selection bias of the researcher as on the evolutionary history.
|
43
|
|
44
|
|
45
|
|
46
|
|
47
|
|
48
|
|
49
|
|
50
|
|
51
|
|
52
|
|
53
|
|
54
|
|
55
|
|
56
|
|
57
|
|
58
|
- Joshua Lederberg (DENDRAL):
- It is easy to sympathize with computer scientists who are willing to
rely on neural-network approaches that ignore, may even defeat, insight
into causal relationships.
- (Foreword, AI and Molecular Biology, 1992)
- One of the most difficult steps in development of an expert system is
the exploitation of domain wizards. Therein may lie the greatest
hazards from the proliferation of expert systems: for much of that
expert knowledge is fallible.
- (Afterword, AI and Mol. Biol. 1992)
|
59
|
- KomPhy provides a point of comparison to the previous neural network
approach.
- about as accurate as SOTA (worse on uniform trees and better on
birth-death trees).
- faster than SOTA.
- Not as good or as fast as existing algorithms such as neighbor-joining
- Both neural network approaches show a strong bias towards generating
  balanced trees, so they do poorly on uniform trees.
|
60
|
- Tamura-Nei and F84 distances were used with KomPhy but failed to
  converge the network because the weights and inputs are initially too
  far apart.
- Other sophisticated distances (such as K2P) should be tried.
- KomPhy supports the creation of n-furcating trees, with neurons
  dynamically added at each recursive step. Experiments are needed to
  determine the utility of this feature.
- Branches could be labeled with lengths derived from the distances
between neurons at the previous recursive step.
|