Design and Development of a Data Management System for Uncertain Data
Sponsor: National Science Foundation
This material is based upon work supported by the National Science Foundation
under Grant No. IIS-0534702.
Graduate Students:
Rohit Jain
Chris Mayfield
Sarvjeet Singh
Yinian Qi
Yicheng Tu
Project Summary
Current database management systems require all data to be modeled in terms of precise values.
However, there are many application domains where data values are imprecise or uncertain. Examples
of such data include measurements from sensors, locations of moving objects, and experimental data. For these
applications there is a need to develop a database management system that treats uncertain data as a
first-class data type.
The project aims to develop a comprehensive database management system for storing and querying uncertain,
or imprecise, data. It encompasses the creation of a comprehensive model for uncertain
data based upon the relational model, the extension of SQL to support probabilistic queries over uncertain
data, techniques for efficient and accurate evaluation of probabilistic queries, and the development of a
prototype system. The specific optimization issues addressed include indexing, join algorithms, and query
optimization for uncertain data.
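
As an illustration only, the idea of a probabilistic query over uncertain data can be sketched in a few lines of Python. This sketch is not the project's data model or its SQL extension. It assumes each uncertain value is known only to lie in an interval, with a uniform distribution over that interval, and every name in it (UniformUncertain, prob_in_range, threshold_range_query, the sensor ids) is hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class UniformUncertain:
    """Hypothetical uncertain value: lies in [low, high], assumed uniform."""
    low: float
    high: float

    def prob_in_range(self, a: float, b: float) -> float:
        """Probability that the true value falls in the query range [a, b]."""
        if self.high == self.low:
            # Degenerate interval: behaves like an ordinary precise value.
            return 1.0 if a <= self.low <= b else 0.0
        # Length of the overlap between [low, high] and [a, b], floored at 0.
        overlap = min(self.high, b) - max(self.low, a)
        return max(0.0, overlap) / (self.high - self.low)


def threshold_range_query(readings, a, b, threshold):
    """Return (id, probability) pairs whose probability of lying in
    [a, b] is at least the given threshold."""
    results = []
    for key, value in readings.items():
        p = value.prob_in_range(a, b)
        if p >= threshold:
            results.append((key, p))
    return results


# Hypothetical sensor readings, each reported as an interval.
readings = {
    "s1": UniformUncertain(18.0, 22.0),
    "s2": UniformUncertain(21.0, 25.0),
    "s3": UniformUncertain(30.0, 34.0),
}

# "Which sensors read between 20 and 24 with probability at least 0.5?"
result = threshold_range_query(readings, 20.0, 24.0, 0.5)
print(result)  # [('s1', 0.5), ('s2', 0.75)]
```

Unlike a query over precise values, the answer attaches a probability to each result, and a threshold decides which results are returned. A linear scan is used here for simplicity; the indexing and join techniques mentioned above target exactly this kind of evaluation at scale.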
The prototype will be developed as an extension of the open-source PostgreSQL database
management system. A realistic moving-objects application is targeted for testing the prototype. In
addition, targeted collaboration with experts in biology and chemistry will serve to validate the
applicability of the results.
The project is expected to have a significant impact on application domains that are in need of an uncertain
data management system, and also on the database community. The project is expected to provide a single model for
multiple types of uncertainty, and to develop indexing, join, and query optimization techniques for
uncertain data.
This project will integrate research and education through student participation in research projects, course
development, and research seminars. The results from this research will be included in course projects and
will be disseminated via peer-reviewed publications in journals and conferences, web pages, conference
presentations, and prototype dissemination.
Goals, Objectives, and Targeted Activities
The goal of the project is to develop a novel data management system
for handling uncertain, or imprecise, data.
Publications
Disclaimer
Any opinions, findings and conclusions or recommendations expressed in this
material are those of the author(s) and do not necessarily reflect
the views of the National Science Foundation
(NSF).
Last Modified by Sunil Prabhakar
on 22nd February, 2006.