|
TITLE: | What is special about mining spatial and spatio-temporal datasets? |
|
PRESENTER: | Shashi Shekhar : Biography , Homepage , Picture |
|
AFFILIATION: | Computer Science Department, University of Minnesota. |
|
URL: | http://www.cs.umn.edu/~shekhar |
SLIDES:
Classical data mining techniques often perform poorly when applied to spatial and spatio-temporal data sets because of the many reasons. First, these dataset are embedded in continuous space, whereas classical datasets (e.g. transactions) are often discrete. Second, patterns are often local where as classical data mining techniques often focus on global patterns. Finally, one of the common assumptions in classical statistical analysis is that data samples are independently generated. When it comes to the analysis of spatial and spatio-temporal data, however, the assumption about the independence of samples is generally false because such data tends to be highly self correlated. For example, people with similar characteristics, occupation and background tend to cluster together in the same neighborhoods. In spatial statistics this tendency is called autocorrelation. Ignoring autocorrelation when analyzing data with spatial and spatio-temporal characteristics may produce hypotheses or models that are inaccurate or inconsistent with the data set.
Thus new methods are needed to analyze spatial and spatio-temporal data to interesting, useful and non-trivial patterns. This talk surveys some of the new methods including those for discovering interactions (e.g. co-locations , co-occurrences, tele-connections), detecting spatial outliers and location prediction along with emerging ideas on spatio-temporal pattern mining.
KEYWORDS: Spatial, Spatio-temporal, Auto-correlation, Data Mining.
ACKNOWLEDGMENTS: This work was supported in part by the National Science Foundation, the U.S. Department of Defense, the National Aeronautics and Space Administration the Federal Highway Authority, and the University of Minnesota (e.g. Center for Transportation Studies).
NOTE: Some of the results discussed in this talk appeared in the following publications: