<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-7160049826242650176</id><updated>2011-07-31T18:49:11.764+08:00</updated><title type='text'>Frank's aMMAI</title><subtitle type='html'>R97922058 Che-Han Chang (Frank)&lt;br&gt;
Communication and Multimedia Laboratory&lt;br&gt;
Dept. of Computer Science and Information Engineering&lt;br&gt;
National Taiwan University</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>22</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-7160049826242650176.post-1020217640020717525</id><published>2009-06-19T20:57:00.002+08:00</published><updated>2009-06-19T21:13:54.029+08:00</updated><title type='text'>[Reading] Support Vector Learning for Ordinal Regression</title><content type='html'>"Learning to rank" is automatically creating a ranking function that assigns scores to instances, then rank the instances by using the scores.&lt;br /&gt;&lt;br /&gt;This paper formalizes learning to rank as a problem of binary classification, and uses SVM (support vector machine) to learn the binary classifier. This formulation minimizes pair-wise 0-1 loss.&lt;br /&gt;&lt;br /&gt;The learned ranking function can be viewed as (1)Ranking function: given an example, output its ranking score. (2)Classifier: given a pair of instances, output their relative ranking.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7160049826242650176-1020217640020717525?l=frankcch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/1020217640020717525/comments/default' title='張貼意見'/><link rel='replies' type='text/html' href='http://frankcch.blogspot.com/2009/06/reading-support-vector-learning-for.html#comment-form' title='0 個意見'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/1020217640020717525'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/1020217640020717525'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/2009/06/reading-support-vector-learning-for.html' title='[Reading] Support Vector Learning for Ordinal Regression'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7160049826242650176.post-6481145140763678341</id><published>2009-06-19T20:51:00.002+08:00</published><updated>2009-06-19T20:56:56.915+08:00</updated><title type='text'>[Reading] The Structure and Function of Complex Networks</title><content type='html'>This paper reviews recent work on the structure and function of networked systems such as the Internet, the World Wide Web, social networks, networks of citations between papers, and many others. The study of networks, in the form of mathematical graph theory, is one of the fundamental pillars of discrete mathematics. Networks have also been studied extensively in the social sciences.&lt;br /&gt;&lt;br /&gt;This paper mainly describes three parts:&lt;br /&gt;(1) Empirical studies of the structure of networks, including social networks, information networks, technological networks and biological networks.&lt;br /&gt;(2) Some of the common properties that are observed in many of these networks, how they are measured, and why they are believed to be important for the functioning of networked systems.&lt;br /&gt;(3) The mathematical modeling of networks, including random graph models and their generalizations, exponential random graphs, and Markov graphs, the small-world model and its variations, and models of growing graphs including preferential attachment models and their many variations.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7160049826242650176-6481145140763678341?l=frankcch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/6481145140763678341/comments/default' title='張貼意見'/><link rel='replies' type='text/html' href='http://frankcch.blogspot.com/2009/06/reading-structure-and-function-of.html#comment-form' title='0 個意見'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/6481145140763678341'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/6481145140763678341'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/2009/06/reading-structure-and-function-of.html' title='[Reading] The Structure and Function of Complex Networks'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7160049826242650176.post-4700343223771025787</id><published>2009-06-19T17:57:00.001+08:00</published><updated>2009-06-19T18:32:53.138+08:00</updated><title type='text'>[Reading] Lazy Snapping</title><content type='html'>This paper presents "Lazy Snapping", an interactive image cutout tool, also a novel coarse-to-fine UI design for image cutout. The task in image cutout is in specifying which parts of the image are "foreground" (the part you want to cut out) and which belong to the background.&lt;br /&gt;&lt;br /&gt;Lazy Snapping consists of two steps, both are formulated as a graph cut problem:&lt;br /&gt;(1) a quick object marking step&lt;br /&gt;Object marking (at a coarse scale) specifies the object of interest by a few marking lines. This step is intuitive and quick for object context specification. An efficient graph cut algorithm is proposed by employing pre-computed over-segmentation so that the marking UI can provide instant visual feedback for users.&lt;br /&gt;(2) a simple boundary editing step&lt;br /&gt;Boundary editing (at a finer scale or on the zoomed-in image) allows the user to edit the object boundary by simply clicking and dragging polygon vertices, and use the polygon locations as soft constraints to improve snapping results around ambiguous or low contrast edges. This step is easy and efficient for accurate boundary control.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7160049826242650176-4700343223771025787?l=frankcch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/4700343223771025787/comments/default' title='張貼意見'/><link rel='replies' type='text/html' href='http://frankcch.blogspot.com/2009/06/reading-lazy-snapping.html#comment-form' title='0 個意見'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/4700343223771025787'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/4700343223771025787'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/2009/06/reading-lazy-snapping.html' title='[Reading] Lazy Snapping'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7160049826242650176.post-1246703557590133487</id><published>2009-06-05T22:47:00.004+08:00</published><updated>2009-06-06T11:28:12.818+08:00</updated><title type='text'>[Reading] Learning Low-Level Vision</title><content type='html'>This paper presents a learning-based method for low-level vision problems - estimating  underlying scenes from images, which is a combination themes of scene estimation and statistical learning. The estimates of underlying scenes are important for various tasks in image analysis, database search, and robotics.&lt;br /&gt;&lt;br /&gt;This approach is called VISTA - Vision by Image/Scene TrAining. It is as follows: one specifies prior probabilities on scenes by generating typical examples, creating a synthetic world of scenes and rendered images. It break the images and scenes into a Markov network, and learn the parameters of the network from the training data by applying belief propagation in the Markov network.&lt;br /&gt;&lt;br /&gt;Solving a Markov network involves a learning phase, where the parameters of the network connections are learned from training data, and an inference phase, when the scene corresponding to particular image data is estimated.&lt;br /&gt;&lt;br /&gt;This paper applies VISTA to the "super-resolution" problem (estimating high frequency details from a low-resolution image), showing good results.&lt;br /&gt;&lt;br /&gt;I think the important thing in this paper is that the power of the VISTA approach lies in the large training database, allowing rich prior probabilities, the selection of scene candidates, which focuses the computation on scenes that render to the image, and the bayesian belief propagation, which allows efficient inference.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7160049826242650176-1246703557590133487?l=frankcch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/1246703557590133487/comments/default' title='張貼意見'/><link rel='replies' type='text/html' href='http://frankcch.blogspot.com/2009/06/reading-learning-low-level-vision.html#comment-form' title='0 個意見'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/1246703557590133487'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/1246703557590133487'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/2009/06/reading-learning-low-level-vision.html' title='[Reading] Learning Low-Level Vision'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7160049826242650176.post-8119467593365332901</id><published>2009-06-05T20:10:00.002+08:00</published><updated>2009-06-05T21:48:34.723+08:00</updated><title type='text'>[Reading] An Introduction to Graphical Models</title><content type='html'>This technical report gives an introduction to graphical models. it says that graphical models are a marriage between probability theory and graph theory.&lt;br /&gt;&lt;br /&gt;graphical models provide a natural tool for dealing with (1) uncertainty and (2) complexity. In particular, graph theory provides the notion of modularity, i.e. a complex system is built by combining simpler parts. Probability theory provides the glue whereby the parts are combined, ensuring that the system as a whole is consistent, and providing ways to interface models to data.&lt;br /&gt;&lt;br /&gt;This graphical model framework provides a way to view several systems as instances of a common underlying formalism, ex: mixture models, factor analysis, hidden Markovmodels, Kalman filters. Probabilistic graphical models are graphs in which nodes represent random variables, and the arcs represent conditional independence assumptions. They provide a compact representation of joint probability distributions.&lt;br /&gt;&lt;br /&gt;There are two main kinds of graphical models: undirected and directed. In a directed graphical model (a Bayesian network), an arc from A to B can be informally interpreted as indicating that &lt;span style="font-weight:bold;"&gt;A causes B&lt;/span&gt;. We can see that the conditional independence relationships allow us to represent the joint more compactly.&lt;br /&gt;&lt;br /&gt;Talking about inference, its goal is to estimate the values of hidden nodes, given the values of the observed nodes. In particular, we can use the conditional independence assumptions encoded in the graph to speed up exact inference. The key idea of the variable elimination algorithm (and many others) is to "push" the sums in as far as possible. Also, if we wish to compute several marginals at the same time, we can use dynamic programming to avoid the redundant computation that would be involved if we used variable elimination repeatedly. The reason why we use approximate inference is that the running time of these exact algorithms is exponential in the size of the largest cluster , and minimizing it is NP-hard. When it is large, it is necessary to use approximate inference.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7160049826242650176-8119467593365332901?l=frankcch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/8119467593365332901/comments/default' title='張貼意見'/><link rel='replies' type='text/html' href='http://frankcch.blogspot.com/2009/06/reading-introduction-to-graphical.html#comment-form' title='0 個意見'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/8119467593365332901'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/8119467593365332901'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/2009/06/reading-introduction-to-graphical.html' title='[Reading] An Introduction to Graphical Models'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7160049826242650176.post-2027140482231213956</id><published>2009-06-02T22:38:00.000+08:00</published><updated>2009-06-02T22:39:39.748+08:00</updated><title type='text'>[Reading] Rapid Object Detection using a Boosted Cascade of Simple Features</title><content type='html'>This paper describes a machine learning approach for visual object detection which is capable of processing images extremely rapidly and achieving high detection rates. It uses haar features for weak learners, by using the "integral image"&lt;br /&gt;technique, those features can be computed very very quickly. It uses adaboost as learning algorithm. It selects a small number of most important features from a larger set and yields extremely efficient, discriminative classifiers. It propose a "cascade" framework for providing efficiently distiguishing between face and nonface. Overall, this paper propose an approach for object detection which minimizes computation time while achieving high detection accuracy.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7160049826242650176-2027140482231213956?l=frankcch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/2027140482231213956/comments/default' title='張貼意見'/><link rel='replies' type='text/html' href='http://frankcch.blogspot.com/2009/06/reading-rapid-object-detection-using.html#comment-form' title='0 個意見'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/2027140482231213956'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/2027140482231213956'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/2009/06/reading-rapid-object-detection-using.html' title='[Reading] Rapid Object Detection using a Boosted Cascade of Simple Features'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7160049826242650176.post-1722682348853496767</id><published>2009-06-02T22:07:00.003+08:00</published><updated>2009-06-02T22:09:43.104+08:00</updated><title type='text'>[Reading] Normalized Cuts and Image Segmentation</title><content type='html'>The approach of this paper aims at extracting the global impression of an image and provides a hierarchical description of it. It is most related to the graph theoretic formulation of grouping. By treating the grouping problem (image segmentation) as a graph partitioning problem, this paper proposed the &lt;span style="font-weight:bold;"&gt;normalized cut criteria&lt;/span&gt; for segmenting the graph.&lt;br /&gt;&lt;br /&gt;Normalized cut is an unbiased measure of disassociation between subgroups of a graph and it has the nice property that minimizing normalized cut leads directly to maximizing the normalized association, which is an unbiased measure for&lt;br /&gt;total association within the subgroups. it also avoids the problem that unnatural bias for partitioning out small sets of points.&lt;br /&gt;&lt;br /&gt;minimizing normalized cut exactly is NP-complete. This paper shows that, when it embed the normalized cut problem in the real value domain, an approximate&lt;br /&gt;discrete solution can be found efficiently. it is formulated as a &lt;br /&gt;generalized eigenvalue problem.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7160049826242650176-1722682348853496767?l=frankcch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/1722682348853496767/comments/default' title='張貼意見'/><link rel='replies' type='text/html' href='http://frankcch.blogspot.com/2009/06/reading-normalized-cuts-and-image.html#comment-form' title='0 個意見'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/1722682348853496767'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/1722682348853496767'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/2009/06/reading-normalized-cuts-and-image.html' title='[Reading] Normalized Cuts and Image Segmentation'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7160049826242650176.post-2131892285626883858</id><published>2009-05-05T23:28:00.001+08:00</published><updated>2009-05-05T23:29:49.218+08:00</updated><title type='text'>[Reading] On Spectral Clustering: Analysis and an algorithm</title><content type='html'>Spectrual clustering methods are algorithms that cluster points using eigenvectors of matrices derived from the data. Essentially, it is K means in the eigenvector space of the affinity matrix.&lt;br /&gt;&lt;br /&gt;This paper present a simple spectral clustering algorithm and analyze it. It provides a theoretical analysis unlike previous works are empirical.&lt;br /&gt;&lt;br /&gt;This method provides 4 elasticities for user to control the clustering:&lt;br /&gt;(1) Affinity matrix construction (usually Gaussian kernel)&lt;br /&gt;(2) Choice of scaling factor (it can be done by search over and pick value that gives the tightest clusters)&lt;br /&gt;(3) Choice of k, the number of clusters&lt;br /&gt;(4) Choice of clustering method&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7160049826242650176-2131892285626883858?l=frankcch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/2131892285626883858/comments/default' title='張貼意見'/><link rel='replies' type='text/html' href='http://frankcch.blogspot.com/2009/05/reading-on-spectral-clustering-analysis.html#comment-form' title='1 個意見'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/2131892285626883858'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/2131892285626883858'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/2009/05/reading-on-spectral-clustering-analysis.html' title='[Reading] On Spectral Clustering: Analysis and an algorithm'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7160049826242650176.post-3084550503505212599</id><published>2009-04-30T00:47:00.001+08:00</published><updated>2009-04-30T00:49:04.839+08:00</updated><title type='text'>[Reading] Names and Faces in the News Abstract</title><content type='html'>This paper propose a dataset which is more realistic than usual face recognition datasets, because it contains faces captured "in the wild" in a variety of configurations with respect to the camera, taking a variety of expressions, and under illumination of widely varying color. Each face image is associated with a set of names, automatically extracted from the associated caption. Many, but not all such sets contain the correct name.&lt;br /&gt;&lt;br /&gt;This paper shows quite good &lt;span style="font-weight:bold;"&gt;face clustering&lt;span style="font-style:italic;"&gt;&lt;/span&gt;&lt;/span&gt; is possible for this dataset which has inaccurately and ambiguously labelled face images. The approach used in this paper is focus on adopting the kPCA/LDA methodology, rather than on building a multi-class classifier to do face recognition.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7160049826242650176-3084550503505212599?l=frankcch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/3084550503505212599/comments/default' title='張貼意見'/><link rel='replies' type='text/html' href='http://frankcch.blogspot.com/2009/04/reading-names-and-faces-in-news.html#comment-form' title='0 個意見'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/3084550503505212599'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/3084550503505212599'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/2009/04/reading-names-and-faces-in-news.html' title='[Reading] Names and Faces in the News Abstract'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7160049826242650176.post-7596761901547647855</id><published>2009-04-29T23:41:00.002+08:00</published><updated>2009-04-29T23:43:23.980+08:00</updated><title type='text'>[Reading] Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary</title><content type='html'>This paper describe a model of object recognition as machine translation. 3 issues are addressed in this paper:&lt;br /&gt;(1) What counts as an object?&lt;br /&gt;(2) Which objects are easy to recognise?&lt;br /&gt;(3) Which objects are indistinguishable using our features?&lt;br /&gt;&lt;br /&gt;By viewing the object recognition problem as machine translation (i.e. recognition is a process of annotating image regions with words). This paper attack these 3 questions with the following answers respectively:&lt;br /&gt;(1) All words count as objects.&lt;br /&gt;(2) Words that can be reliably attached to image regions are easy to recognise and those that cannot, are not.&lt;br /&gt;(3) Words that are predicted with about the same posterior probability given any image group - such objects are indistinguishable given the current feature set.&lt;br /&gt;&lt;br /&gt;For training this model, first, segment images into regions. Second, classify regions into region types using a variety of features. Last, learn a mapping between region types and keywords supplied with the images using EM.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7160049826242650176-7596761901547647855?l=frankcch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/7596761901547647855/comments/default' title='張貼意見'/><link rel='replies' type='text/html' href='http://frankcch.blogspot.com/2009/04/reading-object-recognition-as-machine.html#comment-form' title='0 個意見'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/7596761901547647855'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/7596761901547647855'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/2009/04/reading-object-recognition-as-machine.html' title='[Reading] Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7160049826242650176.post-5672998539858438793</id><published>2009-04-29T22:36:00.000+08:00</published><updated>2009-04-29T22:38:54.558+08:00</updated><title type='text'>[Reading] Algorithms for Fast Vector Quantization</title><content type='html'>Finding the nearest neighbor (NN) is a problem of significant importance in many applications. One important application is vector quantization, a technique used in the compression of speech and images. If one is willing to relax the requirement of finding the true NN, this paper shows that it is possible to achieve significant improvements in running time and at only a very small loss in the performance of the vector quantizer.&lt;br /&gt;&lt;br /&gt;This paper present an empirical study of 3 NN algorithms on a number of data distributions, and in dimensions varying from 8 to 16.&lt;br /&gt;(1) Standard k-d tree algorithm, which has been enhanced to use incremental distance calculation.&lt;br /&gt;(2) Priority k-d tree search, a further improvement that orders search by the proximity of the k-d cell to the query point.&lt;br /&gt;(3) A neighborhood graph search algorithm, based on a simple greedy search.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7160049826242650176-5672998539858438793?l=frankcch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/5672998539858438793/comments/default' title='張貼意見'/><link rel='replies' type='text/html' href='http://frankcch.blogspot.com/2009/04/reading-algorithms-for-fast-vector.html#comment-form' title='1 個意見'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/5672998539858438793'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/5672998539858438793'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/2009/04/reading-algorithms-for-fast-vector.html' title='[Reading] Algorithms for Fast Vector Quantization'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7160049826242650176.post-1708304541131308996</id><published>2009-04-01T11:00:00.000+08:00</published><updated>2009-04-01T11:01:47.258+08:00</updated><title type='text'>[Reading] Latent Dirichlet Allocation</title><content type='html'>latent Dirichlet allocation (LDA) is a generative probabilistic model of a corpus that allows sets of observations to be explained by unobserved groups which explain why some parts of the data are similar. The basic idea is that the documents are represented as random mixtures over latent topics, where a topic is characterized by a distribution over words. It assumes that words are generated by topics and that those topics are infinitely exchangeable within a document, i.e. axchangeability assumption.&lt;br /&gt;&lt;br /&gt;It uses variational EM to estimate parameters. Also, it introduces Dirichlet smoothing to avoid the "zero frequency problem" called smoothed LDA. Exact inference is intractable for LDA, but any or a large suite of approximate inference algorithms for inference and parameter estimation can be used with the LDA framework. It Can be viewed as a dimensionality reduction technique.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7160049826242650176-1708304541131308996?l=frankcch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/1708304541131308996/comments/default' title='張貼意見'/><link rel='replies' type='text/html' href='http://frankcch.blogspot.com/2009/04/reading-latent-dirichlet-allocation.html#comment-form' title='0 個意見'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/1708304541131308996'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/1708304541131308996'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/2009/04/reading-latent-dirichlet-allocation.html' title='[Reading] Latent Dirichlet Allocation'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7160049826242650176.post-8960973959913410795</id><published>2009-03-31T22:26:00.003+08:00</published><updated>2009-04-01T11:01:56.656+08:00</updated><title type='text'>[Reading] Probabilistic Latent Semantic Indexing</title><content type='html'>pLSA is a novel approach to automated document indexing and information retrieval. It models each word in a document as a sample from a mixture model. Each word is generated from a single topic, different words in the document may be generated from different topics. Each document is represented as a list of mixing proportions for the mixture components.&lt;br /&gt;&lt;br /&gt;pLSA is based on the likelihood principle and uses a statistical model called aspect model to define a proper generative model of the data, and directly minimizes word perplexity, so it has a better statistical foundation than LSA. Also, pLSA outperforms LSA in the experiments. pLSA uses EM algorithm to identify latent classes. It is capable of dealing with polysemy and synonymy.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7160049826242650176-8960973959913410795?l=frankcch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/8960973959913410795/comments/default' title='張貼意見'/><link rel='replies' type='text/html' href='http://frankcch.blogspot.com/2009/03/reading-probabilistic-latent-semantic.html#comment-form' title='0 個意見'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/8960973959913410795'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/8960973959913410795'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/2009/03/reading-probabilistic-latent-semantic.html' title='[Reading] Probabilistic Latent Semantic Indexing'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7160049826242650176.post-1003804777474578843</id><published>2009-03-25T23:54:00.003+08:00</published><updated>2009-03-26T00:04:56.480+08:00</updated><title type='text'>[Reading] Shape Matching and Object Recognition Using Shape Contexts</title><content type='html'>This paper propose a robust and simple algorithm for finding correspondences and measure the similarity between shapes and exploit it for object recognition. This approach is a 3-stage process: (1) Find correspondences between points on shapes, (2) Estimate transformation, and (3) Measure similarity. In order to solve the correspondence problem, it propose a descriptor named shape context. Shape context records the distribution of relative positions of points. the estimation use regularized thin plate spline as transformation model. Shape distance is a weighted sum of shape context distance, appearance distance and bending energy. Results are presented for handwritten digits, 3D objects, silhouettes and trademarks.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7160049826242650176-1003804777474578843?l=frankcch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/1003804777474578843/comments/default' title='張貼意見'/><link rel='replies' type='text/html' href='http://frankcch.blogspot.com/2009/03/reading-shape-matching-and-object.html#comment-form' title='0 個意見'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/1003804777474578843'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/1003804777474578843'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/2009/03/reading-shape-matching-and-object.html' title='[Reading] Shape Matching and Object Recognition Using Shape Contexts'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7160049826242650176.post-4605452083662137875</id><published>2009-03-25T23:49:00.003+08:00</published><updated>2009-03-25T23:53:56.351+08:00</updated><title type='text'>[Reading] Contour and Texture Analysis for Image Segmentation</title><content type='html'>This paper propose a general algorithm for partitioning grayscale images into disjoint regions of coherent brightness and texture. It uses texture features for segmentation. A texture descriptor is a vector of filter bank outputs. Textons are found by clustering. Affinities are given by similarities of texton histograms over windows given by the "local scale" of the texture. Having get a locall measure, it use the spectral graph theoretic framework of normalized cuts to find partitions.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7160049826242650176-4605452083662137875?l=frankcch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/4605452083662137875/comments/default' title='張貼意見'/><link rel='replies' type='text/html' href='http://frankcch.blogspot.com/2009/03/reading-contour-and-texture-analysis.html#comment-form' title='0 個意見'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/4605452083662137875'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/4605452083662137875'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/2009/03/reading-contour-and-texture-analysis.html' title='[Reading] Contour and Texture Analysis for Image Segmentation'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7160049826242650176.post-645700781326240677</id><published>2009-03-10T14:07:00.003+08:00</published><updated>2009-03-10T14:09:58.063+08:00</updated><title type='text'>[Reading] Nonlinear Dimensionality Reduction by Locally Linear Embedding</title><content type='html'>This paper introduce locally linear embedding (LLE), an unsupervised learning algorithm that&lt;br /&gt;computes low-dimensional, neighborhood-preserving embeddings of high-dimensional inputs.&lt;br /&gt;LLE recovers global nonlinear structure from locally linear fits by exploiting the local symmetries of linear reconstructions, thus LLE is able to learn the global structure of nonlinear manifolds.&lt;br /&gt;&lt;br /&gt;LLE maps high-dimensional data into a &lt;span style="font-weight: bold;"&gt;single global coordinate system&lt;/span&gt; of lower dimensionality. It constructs a neighborhood-preserving mapping based on reconstructing the constrained weights. By minimize the reconstruction errors, these weights reflect intrinsic geometric properties of the data that are invariant to rotations, rescalings, and translations.&lt;br /&gt;&lt;br /&gt;This approach eliminates the need to estimate pairwise distances between widely separated data points. It also avoids the need to solve large dynamic programming problems.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7160049826242650176-645700781326240677?l=frankcch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/645700781326240677/comments/default' title='張貼意見'/><link rel='replies' type='text/html' href='http://frankcch.blogspot.com/2009/03/this-paper-introduce-locally-linear.html#comment-form' title='0 個意見'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/645700781326240677'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/645700781326240677'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/2009/03/this-paper-introduce-locally-linear.html' title='[Reading] Nonlinear Dimensionality Reduction by Locally Linear Embedding'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7160049826242650176.post-1240754558433336240</id><published>2009-03-10T13:15:00.000+08:00</published><updated>2009-03-10T13:16:19.188+08:00</updated><title type='text'>[Reading] Eigenfaces for Recognition</title><content type='html'>Eigenfaces are a set of eigenvectors used in the computer vision problem of human face recognition. The eigenvectors of the covariance matrix associate to a large set of normalized pictures of faces are called eigenfaces. They are derived from the covariance matrix of the probability distribution of the high-dimensional vector space of possible faces of human beings. This approach is an example of principal components analysis.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7160049826242650176-1240754558433336240?l=frankcch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/1240754558433336240/comments/default' title='張貼意見'/><link rel='replies' type='text/html' href='http://frankcch.blogspot.com/2009/03/reading-eigenfaces-for-recognition.html#comment-form' title='0 個意見'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/1240754558433336240'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/1240754558433336240'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/2009/03/reading-eigenfaces-for-recognition.html' title='[Reading] Eigenfaces for Recognition'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7160049826242650176.post-2537889971326990765</id><published>2009-03-09T20:32:00.003+08:00</published><updated>2009-03-09T20:34:21.070+08:00</updated><title type='text'>[Reading] Scale &amp; Affine Invariant Interest Point Detectors</title><content type='html'>This paper propose a novel approach for detecting interest points especially invariant to scale and affine transformaitons. Scale invariant detector computes a multi-scale representation for the Harris interest point detector and then selects points at which a local measure (Laplacian) is maximal over scales, such approach combines the Harris detector with the Laplacian-based scale selection. It extends the scale invariant detector to affine invariance by estimating the affine shape of a point neighborhood. This method modifies location, scale and shape of every point neighborhood and converges to affine invariant points.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7160049826242650176-2537889971326990765?l=frankcch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/2537889971326990765/comments/default' title='張貼意見'/><link rel='replies' type='text/html' href='http://frankcch.blogspot.com/2009/03/this-paper-propose-novel-approach-for.html#comment-form' title='0 個意見'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/2537889971326990765'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/2537889971326990765'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/2009/03/this-paper-propose-novel-approach-for.html' title='[Reading] Scale &amp; Affine Invariant Interest Point Detectors'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7160049826242650176.post-6180982388179162740</id><published>2009-03-09T20:01:00.004+08:00</published><updated>2009-03-09T20:05:58.816+08:00</updated><title type='text'>[Reading] Distinctive Image Features from Scale-Invariant Keypoints</title><content type='html'>This paper presents a method named SIFT for extracting distinctive invariant features (named SIFT) from images that providing a basis for object and scene recognition. SIFT is a carefully designed procedure with empirically determined parameters for the invariant and distinctive features.&lt;br /&gt;&lt;br /&gt;SIFT has the following four stages (the first two is as a detector, the last two is as a descriptor):&lt;br /&gt;(1) Scale-space extrema detection&lt;br /&gt;Use a DOG function to identify potential interest points that are invariant to scale.&lt;br /&gt;(2) Keypoint localization&lt;br /&gt;Detailed fitting for sub-pixel accuracy and further selection based on stability.&lt;br /&gt;(3) Orientation assignment&lt;br /&gt;In short it is based on gradient directions, so the feature are orientation invariant.&lt;br /&gt;(4) Keypoint descriptor&lt;br /&gt;Create array of orientation histograms.&lt;br /&gt;&lt;br /&gt;The SIFT keypoints are invariant to image scale and rotation and robust across a substantial range of affine distortion, addition of noise, and change in illumination.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7160049826242650176-6180982388179162740?l=frankcch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/6180982388179162740/comments/default' title='張貼意見'/><link rel='replies' type='text/html' href='http://frankcch.blogspot.com/2009/03/reading-distinctive-image-features-from.html#comment-form' title='0 個意見'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/6180982388179162740'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/6180982388179162740'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/2009/03/reading-distinctive-image-features-from.html' title='[Reading] Distinctive Image Features from Scale-Invariant Keypoints'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7160049826242650176.post-3763466926464951773</id><published>2009-02-21T22:51:00.006+08:00</published><updated>2009-02-22T15:56:49.339+08:00</updated><title type='text'>[Reading] Image Retrieval: Ideas, Influences, and Trends of the New Age</title><content type='html'>I choose to summarize from &lt;span style="font-weight: bold;"&gt;3. IMAGE RETRIEVAL TECHNIQUES: ADDRESSING THE CORE PROBLEM&lt;/span&gt; (p.14) to &lt;span style="font-weight: bold;"&gt;3.2. Image Similarity Using Visual Signature&lt;/span&gt; (p.30).&lt;br /&gt;&lt;br /&gt;CBIR technology amounts to 2 problems: (a) the design of image description (signature), and (b) the similarity measure between two image descriptions. In the recent years, the design of features and the signatures constructed by these features have much progress. Besides, using machine learning techniques in CBIR has become more popular and also important.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Signature Extraction&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Feature extraction is the first step, after we extract features from an image, we need to do signature construction using these features. There are 2 ways to do signature construction: (a) using segmentation as first step, and (b) segmentation-free approach.&lt;br /&gt;&lt;br /&gt;To acquire a region-based signature, image segmentation is needed. Several methods proposed to do segmentation on medical images. Segmentation-based approach may have the problem that result is too sensitive to segmentation quality, so several methods tried to solve these problems.&lt;br /&gt;&lt;br /&gt;Computing global feature is efficient, but it is insensitive to location. So a better way (also a trend) is to compute local features then summarize them. Several types of local features are discussed such as color, texture, shape, spatial modeling and interesting points.&lt;br /&gt;&lt;br /&gt;When the # of features are very large for us to choose, we can use machine learning techniques to do feature selection.&lt;br /&gt;&lt;br /&gt;For constructing region-based signature, several methods proposed to do signature construction. Lots of them have a connection with histograms.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Similarity&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;There are 3 types of signatures: (a) region-based signature, (b) feature vector, and (c) summary of local feature vectors. Different types of signatures have different similarity measures. For (a), the definition of distance between “set of vectors” is crucial. For (b), several recent efforts have been made to measure the distance on a manifold, because using geodesic as distance measure is more reasonable. For (c),  codebooks and probability density functions have been used as signatures.&lt;br /&gt;&lt;br /&gt;For region-based signature, there are basically 2 formulations to compute similarity, one is using the sum of weighted pair-wise distance as formulation, different constraints lead to different design of weights. Another approach is using Hausdorff distance. Recently, several improvements have been made including feature tuning, weight computation, robustness against inaccurate segmentation and speeding-up retrieval.&lt;br /&gt;&lt;br /&gt;For feature vector, computation of similarity is performed nonlinearly along the manifold, typical methods are locally-linear embedding (LLE), isomapping, and multidimensional scaling.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7160049826242650176-3763466926464951773?l=frankcch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/3763466926464951773/comments/default' title='張貼意見'/><link rel='replies' type='text/html' href='http://frankcch.blogspot.com/2009/02/reading-image-retrieval-ideas.html#comment-form' title='0 個意見'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/3763466926464951773'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/3763466926464951773'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/2009/02/reading-image-retrieval-ideas.html' title='[Reading] Image Retrieval: Ideas, Influences, and Trends of the New Age'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7160049826242650176.post-7393736183576868520</id><published>2009-02-21T18:07:00.000+08:00</published><updated>2009-02-21T21:21:56.102+08:00</updated><title type='text'>[Reading] How to give a good research talk</title><content type='html'>This artical gives suggestions about giving a presentation of 30-60 minutes. Because it says that "make what is useful for you, and ignore the rest", I only summarize the parts useful to me.&lt;br /&gt;First, use examples is important. Always remember to illustrate an idea (theorem, definiiton, ...) WITH an example.&lt;br /&gt;Second, treat the more important aspects in more detail than others. Also, don't read your slides, talk ABOUT what's on it.&lt;br /&gt;Last, avoid too much introduction such as previous work. Also, sometimes give outline of your talk is not appropriate.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7160049826242650176-7393736183576868520?l=frankcch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/7393736183576868520/comments/default' title='張貼意見'/><link rel='replies' type='text/html' href='http://frankcch.blogspot.com/2009/02/this-artical-gives-suggestions-about.html#comment-form' title='0 個意見'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/7393736183576868520'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/7393736183576868520'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/2009/02/this-artical-gives-suggestions-about.html' title='[Reading] How to give a good research talk'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7160049826242650176.post-6644269561685475771</id><published>2009-02-21T16:36:00.000+08:00</published><updated>2009-02-21T16:43:25.071+08:00</updated><title type='text'>[Reading] How to Read a Paper</title><content type='html'>This paper propose a 3-pass method for reading papers.&lt;br /&gt;(1) The 1st pass (5~10min) gives you a general idea by answering the 5 Cs to yourself.&lt;br /&gt;(2) The 2nd pass (1hr) lets you grasp the content but not detail. At this stage you should be able to do summarization!&lt;br /&gt;(3) The 3rd pass (4~5hr) helps you understand the paper in depth. The key is to attempt to "virtually re-implement it".&lt;br /&gt;&lt;br /&gt;This paper also describe how to use the proposed method to do a survey by 3 steps.&lt;br /&gt;(1) Use search engine and read "RELATED WORK".&lt;br /&gt;(2) Find key citations and key researchers's recent publication.&lt;br /&gt;(3) Quickly scan the top conferences' recent papers.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7160049826242650176-6644269561685475771?l=frankcch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://frankcch.blogspot.com/feeds/6644269561685475771/comments/default' title='張貼意見'/><link rel='replies' type='text/html' href='http://frankcch.blogspot.com/2009/02/reading-how-to-read-paper.html#comment-form' title='0 個意見'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/6644269561685475771'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7160049826242650176/posts/default/6644269561685475771'/><link rel='alternate' type='text/html' href='http://frankcch.blogspot.com/2009/02/reading-how-to-read-paper.html' title='[Reading] How to Read a Paper'/><author><name>frankcch</name><uri>http://www.blogger.com/profile/14401810428992454951</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry></feed>
