Tata Institute of Fundamental Research

Testing the Manifold Hypothesis

Seminar
Speaker: Hariharan Narayanan (Universithy of Washington Department of Statistics and Department of Mathematics Padelford Hall, Room C-301 Seattle, WA 98105 United States of America)
Organiser: Jaikumar Radhakrishnan
Date: Monday, 4 Aug 2014, 14:30 to 15:30
Venue: AG-69

(Scan to add to calendar)
Abstract:  Abstract: We are confronted with very high dimensional data sets. As a result, methods of dealing with high dimensional data have become prominent. One geometrically motivated approach for analyzing data is called manifold learning. The underlying hypothesis of this subfield of machine learning is that high dimensional data tend to lie near a low dimensional manifold. However, the basic question of understanding when data lies near a manifold is poorly understood. I will describe joint work with Charles Fefferman and Sanjoy Mitter on developing a provably correct algorithm to test this hypothesis using i.i.d samples from an arbitrary distribution supported in the unit ball in a Hilbert space.