NSF Grant awarded for Dr. Gautam Das and Dr. Nan Zhang

Sep. 01, 2008

Dr. Nan ZhangDr. Gautam Das
Dr. Nan Zhang
Dr. Gautam Das

Professors Dr. Gautam Das and Dr. Nan Zhang's proposal for reserach on "Data Analytics over Hidden Databases" has been awarded with NSF Grant. The grant amount is $120,001 for the period of 18 months starting from 9/1/2008. Dr. Das is th Principal Investigator and Dr. Zhang is the Co-PI.

Abstract:
Structured hidden databases are widely prevalent on the Web. They provide restricted form-like search interfaces that allow users to execute search queries by specifying desired attribute values of the sought-after tuples, and the system responds by returning a few (e.g., top-k) tuples that satisfy the selection conditions, sorted by a suitable ranking function. Although search interfaces for hidden databases are designed with focused search queries in mind, for certain applications it may be advantageous to infer more aggregated views of the data from the returned results of search queries. Such aggregated information will facilitate learning data distributions or building mining models, which can then be used to power and optimize a multitude of emerging data analytical applications.

This research involves developing effective techniques for performing data analytics, especially sampling, over hidden structured databases via their public interfaces. The outcomes include efficient algorithms for sampling hidden databases with a heterogeneous mix of data types, achievability results for sampling different types of search interfaces, and a prototypical toolset which demonstrates the sampling of real-world hidden databases. The research results of this project have broader impact on the nation’s higher education system and high-tech industries. The ability to pose high-level analytical queries over hidden databases is needed by knowledge workers in a wide variety of corporations, governments, and security agencies. Parts of this project will be integrated into teaching and carried out by students as part of advanced class projects, which will potentially attract motivated students to pursue doctoral degrees.