Title: Embedding and Similarity Search for Point Sets under Translation

Speaker:  Dave Mount
University of Maryland

Date: Friday, August 15, 2008
Time 11:00-12:00PM
Venue: Room 3501, HKUST

Abstract:
Pattern matching in point sets is a well studied problem with numerous applications. 
We assume that the point sets may contain outliers (missing or spurious points) 
and are subject to an unknown translation.  The distance between any two point 
sets is defined to be the minimum size of their symmetric difference over all 
translations of one set relative to the other. We consider the problem in the 
context of similarity search. In particular, there is large database of point 
sets to be preprocessed, so that given any query point set, the closest matches 
in the database can be computed efficiently. Our approach is based on showing 
that there is a algorithm that computes a translation-invariant embedding of 
any point set of size at most n into the L_1 metric, so that with high probability, 
distances are subject to a distortion that is O(log2 n). The algorithm is randomized, 
and the distortion bounds holds with a probability bound that can be set by the user. 
(This is joint work with Minkyoung Cho.)