<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>SMARTech Collection: Computational Science and Engineering Technical Reports</title>
    <link>http://smartech.gatech.edu/handle/1853/14334</link>
    <description>CSE supports interdisciplinary research and education in computer science and applied mathematics</description>
    <textInput>
      <title>The Collection's search engine</title>
      <description>Search the Channel</description>
      <name>search</name>
      <link>http://smartech.gatech.edu/simple-search</link>
    </textInput>
    <item>
      <title>Toward Faster Nonnegative Matrix Factorization: A New Algorithm and Comparisons</title>
      <link>http://smartech.gatech.edu/handle/1853/25538</link>
      <description>Title: Toward Faster Nonnegative Matrix Factorization: A New Algorithm and Comparisons
&lt;br/&gt;
&lt;br/&gt;Authors: Kim, Jingu; Park, Haesun
&lt;br/&gt;
&lt;br/&gt;Abstract: Nonnegative Matrix Factorization (NMF) is a dimension reduction method that has been widely used for&#xD;
various tasks including text mining, pattern analysis, clustering, and cancer class discovery. The mathematical&#xD;
formulation for NMF appears as a non-convex optimization problem, and various types of algorithms have been&#xD;
devised to solve the problem. The alternating nonnegative least squares (ANLS) framework is a block coordinate&#xD;
descent approach for solving NMF, which was recently shown to be theoretically sound and empirically efficient.&#xD;
In this paper, we present a novel algorithm for NMF based on the ANLS framework. Our new algorithm builds&#xD;
upon the block principal pivoting method for the nonnegativity constrained least squares problem that overcomes&#xD;
some limitations of active set methods. We introduce ideas to efficiently extend the block principal pivoting&#xD;
method within the context of NMF computation. Our algorithm inherits the convergence theory of the ANLS&#xD;
framework and can easily be extended to other constrained NMF formulations. Comparisons of algorithms using&#xD;
datasets that are from real life applications as well as those artificially generated show that the proposed new&#xD;
algorithm outperforms existing ones in computational speed.</description>
      <pubDate>Mon, 29 Oct 2007 22:58:59 GMT</pubDate>
    </item>
    <item>
      <title>SNARE: Spatio-temporal Network-level Automatic Reputation Engine</title>
      <link>http://smartech.gatech.edu/handle/1853/25135</link>
      <description>Title: SNARE: Spatio-temporal Network-level Automatic Reputation Engine
&lt;br/&gt;
&lt;br/&gt;Authors: Feamster, Nick; Gray, Alexander G.; Krasser, Sven; Syed, Nadeem Ahmed
&lt;br/&gt;
&lt;br/&gt;Abstract: Current spam filtering techniques classify email based on&#xD;
content and IP reputation blacklists or whitelists. Unfortunately,&#xD;
spammers can alter spam content to evade content based&#xD;
filters, and spammers continually change the IP addresses&#xD;
from which they send spam. Previous work has suggested&#xD;
that filters based on network-level behavior might be&#xD;
more efficient and robust, by making decisions based on how&#xD;
messages are sent, as opposed to what is being sent or who&#xD;
is sending them.&#xD;
This paper presents a technique to identify spammers&#xD;
based on features that exploit the network-level spatio temporal&#xD;
behavior of email senders to differentiate the spamming&#xD;
IPs from legitimate senders. Our behavioral classifier&#xD;
has two benefits: (1) it is early (i.e., it can automatically&#xD;
detect spam without seeing a large amount of email from&#xD;
a sending IP address-sometimes even upon seeing only a&#xD;
single packet); (2) it is evasion-resistant (i.e., it is based on&#xD;
spatial and temporal features that are difficult for a sender&#xD;
to change). We build classifiers based on these features using&#xD;
two different machine learning methods, support vector&#xD;
machine and decision trees, and we study the efficacy&#xD;
of these classifiers using labeled data from a deployed commercial&#xD;
spam-filtering system. Surprisingly, using only features&#xD;
from a single IP packet header (i.e., without looking at&#xD;
packet contents), our classifier can identify spammers with&#xD;
about 93% accuracy and a reasonably low false-positive rate&#xD;
(about 7%). After looking at a single message spammer&#xD;
identification accuracy improves to more than 94% with a&#xD;
false rate of just over 5%. These suggest an effective sender&#xD;
reputation mechanism.</description>
      <pubDate>Mon, 29 Oct 2007 22:58:59 GMT</pubDate>
    </item>
    <item>
      <title>Sparse Nonnegative Matrix Factorization for Clustering</title>
      <link>http://smartech.gatech.edu/handle/1853/20058</link>
      <description>Title: Sparse Nonnegative Matrix Factorization for Clustering
&lt;br/&gt;
&lt;br/&gt;Authors: Kim, Jingu; Park, Haesun
&lt;br/&gt;
&lt;br/&gt;Abstract: Properties of Nonnegative Matrix Factorization (NMF) as a clustering method are studied by relating&#xD;
its formulation to other methods such as K-means clustering. We show how interpreting the objective&#xD;
function of K-means as that of a lower rank approximation with special constraints allows comparisons&#xD;
between the constraints of NMF and K-means and provides the insight that some constraints can be&#xD;
relaxed from K-means to achieve NMF formulation. By introducing sparsity constraints on the coefficient&#xD;
matrix factor in NMF objective function, we in term can view NMF as a clustering method. We tested&#xD;
sparse NMF as a clustering method, and our experimental results with synthetic and text data shows&#xD;
that sparse NMF does not simply provide an alternative to K-means, but rather gives much better and&#xD;
consistent solutions to the clustering problem. In addition, the consistency of solutions further explains&#xD;
how NMF can be used to determine the unknown number of clusters from data. We also tested with a&#xD;
recently proposed clustering algorithm, Affinity Propagation, and achieved comparable results. A fast&#xD;
alternating nonnegative least squares algorithm was used to obtain NMF and sparse NMF.</description>
      <pubDate>Mon, 29 Oct 2007 22:58:59 GMT</pubDate>
    </item>
    <item>
      <title>Non-Negative Matrix Factorization Based on Alternating Non-Negativity Constrained Least Squares and Active Set Method</title>
      <link>http://smartech.gatech.edu/handle/1853/14463</link>
      <description>Title: Non-Negative Matrix Factorization Based on Alternating Non-Negativity Constrained Least Squares and Active Set Method
&lt;br/&gt;
&lt;br/&gt;Authors: Kim, Hyunsoo; Park, Haesun</description>
      <pubDate>Sun, 29 Oct 2006 22:58:59 GMT</pubDate>
    </item>
  </channel>
</rss>

