Accurate Inference of Phylogenetic Relationships from Multilocus Data
(Georgia Institute of Technology, 2010-03-09) Accurate inference of the phylogenetic relationships of species and understanding their relationships with gene trees are two central themes in molecular and evolutionary biology. Traditionally, a species tree is inferred by ...
The Aha! Moment: From Data to Insight
(Georgia Institute of Technology, 2014-02-07) The amount of data in the world is increasing at incredible rates. Large-scale data has potential to transform almost every aspect of our world, from science to business; for this potential to be realized, we must turn ...
An analytical GPU performance model and a dynamic compilation system for CPU/GPU systems
Automating Topology Aware Task Mapping on Large Supercomputers
(Georgia Institute of Technology, 2010-03-30) Parallel computing is entering the era of petascale machines. This era brings enormous computing power to us and new challenges to harness this power efficiently. Machines with hundreds of thousands of processors already ...
Blocked Plane Rotations for Band Reduction and Sparse SVD
(Georgia Institute of Technology, 2009-08-26) With the success of Basic Linear Algebra Subprograms (BLAS) in using the memory efficiently, the algorithms with vector operations (BLAS2) have given way to algorithms with matrix operations (BLAS3). In some cases, BLAS3 ...
Composite Objective Optimization and Learning for Massive Datasets
(Georgia Institute of Technology, 2010-09-03) Composite objective optimization is concerned with the problem of minimizing a two-term objective function which consists of an empirical loss function and a regularization function. Applications with massive datasets often ...
Coordinate Sampling for Sublinear Optimization and Nearest Neighbor Search
(Georgia Institute of Technology, 2011-04-22) I will describe randomized approximation algorithms for some classical problems of machine learning, where the algorithms have provable bounds that hold with high probability. Some of our algorithms are sublinear, that is, ...
Cyber Games
(Georgia Institute of Technology, 2013-02-19) Over the last few years I have been working on game-theoretic models of security, with a particular emphasis on issues salient in cyber security. In this talk I will give an overview of some of this work. I will first spend ...
Dependable direct solutions for linear systems using a little extra precision
(Georgia Institute of Technology, 2009-08-21) Solving a square linear system Ax=b often is considered a black box. It's supposed to "just work," and failures often are blamed on the original data or subtleties of floating-point. Now that we have an abundance of cheap ...
Discovery of Mechanisms from Mathematical Modeling of DNA Microarray Data: Computational Prediction and Experimental Verification
(Georgia Institute of Technology, 2010-02-16) Future discovery and control in biology and medicine will come from the mathematical modeling of large-scale molecular biological data, such as DNA microarray data, just as Kepler discovered the laws of planetary motion ...
Efficient High-Order Discontinuous Galerkin Methods for Fluid Flow Simulations
The Exascale: Why and How
(Georgia Institute of Technology, 2011-02-11) Sustained floating-point computation rates on real applications, as tracked by the ACM Gordon Bell Prize, increased by three orders of magnitude from 1988 (1 Gigaflop/s) to 1998 (1 Teraflop/s), and by another three orders ...
Extending Hadoop to Support Binary-Input Applications
(Georgia Institute of Technology, 2012-10-19) Many data-intensive applications naturally take multiple inputs, which is not well supported by some popular MapReduce implementations, such as Hadoop. In this talk, we present an extension of Hadoop to better support such ...
Fast Algorithms for Querying and Mining Large Graphs
(Georgia Institute of Technology, 2010-03-16) Graphs appear in a wide range of settings and have posed a wealth of fascinating problems. In this talk, I will present our recent work on (1) querying (e.g., given a social network, how to measure the closeness between ...
Graphical Models for the Internet
(Georgia Institute of Technology, 2011-04-29) In this talk I will present algorithms for performing large-scale inference using Latent Dirichlet Allocation and a novel Cluster-Topic model to estimate user preferences and to group stories into coherent, topically ...
Gravity's Strongest Grip: A Computational Challenge
(Georgia Institute of Technology, 2010-10-22) Gravitational physics is entering a new era driven by observation that will begin once gravitational-wave interferometers make their first detections. In the universe, gravitational waves are produced during violent events ...
High-performance-computing challenges for heart simulations
(Georgia Institute of Technology, 2012-08-31) The heart is an electromechanical system in which, under normal conditions, electrical waves propagate in a coordinated manner to initiate an efficient contraction. In pathologic states, propagation can destabilize and ...
How much (execution) time and energy does my algorithm cost?
(Georgia Institute of Technology, 2012-08-24) When designing an algorithm or performance-tuning code, is time-efficiency (e.g., operations per second) the same as energy-efficiency (e.g., operations per Joule)? Why or why not? To answer these questions, we posit a ...
The Joy of PCA
(Georgia Institute of Technology, 2010-09-17) Principal Component Analysis is the most widely used technique for high-dimensional or large data. For typical applications (nearest neighbor, clustering, learning), it is not hard to build examples on which PCA "fails." ...
Load-Balanced Bonded Force Calculations on Anton
(Georgia Institute of Technology, 2010-03-15) Spiral (www.spiral.net) is a program and hardware design generation system for linear transforms such as the discrete Fourier transform, discrete cosine transforms, filters, and others. We are currently extending Spiral ...