|
|
 |
Multimedia and Stream Data Mining
Faculty: Christos Faloutsos
Student: Jia-Yu Pan
We provide tools for the following questions:
- Given a video stream, find interesting patterns, including either
visual, auditory or text-level patterns, which lead to applications
like segmentation, clustering and rule discovery.
- Given a general stream (e.g. motion capture data), find patterns
such as correlated attributes which will benefit segmentation and rules
discovery.
Several research topics are related to our work.
Here, we make a partial list of those with our current proposed methods
and results on them.
- Dimensionality reduction: ICA (or Fastmap) for dimensionality
reduction
- Similarity search (distance function): Dimensionality reduction
as the first step.
- Multimodal classification: Combine classification decisions from
multiple classifiers.
- Rule discovery: For example,
- Using "geoplot" we find rules such as: Words for describing
``storms developed in the ocean'': ``Typhoon'', used in West
Pacific;``Monsoon'', used in Indian Ocean; ``Hurricane'', used in North
Atlantic.
- Using "videocubes" we find multimodal rules like: ``White
ceiling associates with human speech, blue sky associates with soft
music.''
- Efficiency consideration: The ideal result is a fast computation
and online algorithm for streaming data (O(1) storage space, O(1)
amortized time complexity).
Currently, we have proposed several tools in
achieving the goals outlined above. They are
- AutoSplit: Sparse coding and blind hidden variable separation.
Cluster attributes using weights specified by bases. Cluster data items
base on their similar values on the hidden variables.
- VideoCube: Automatic and ``natural'' visual/auditory feature
extraction, considering both spatial and temporal information.
- VideoGraph: Dimensionality reduction using FastMap. Segmentation
by thresholding a marginal cost measurement.
- Geoplot: Similarity function between Geo-footprints and rule
discovery. Involving the name-entity extraction and a gazetteer.
- FastCARS: Temporal correlation awared sampling method on data
streams.
Publications
* Jia-Yu Pan, Srinivasan Seshan, and Christos
Faloutsos. FastCARS: Fast, Correlation-Aware Sampling for Network Data
Mining. In Proceedings of IEEE GlobeCOM 2002 - Global Internet
Symposium, 2002.
* Jia-Yu Pan and Christos Faloutsos.
"Geoplot": Spatial Data Mining on Video Libraries. In Proceedings of
the Eleventh International Conference on Information and Knowledge
Management (CIKM 2002), 2002.
* Jia-Yu Pan and Christos Faloutsos.
VideoCube: a novel tool for video mining and classification. In
Proceedings of the Fifth International Conference on Asian Digital
Libraries (ICADL 2002), 2002.
* Jia-Yu Pan and Christos Faloutsos.
VideoGraph: A New Tool for Video Mining and Visualization. In
Proceedings of the First ACM+IEEE Joint Conference on Digital Libraries
(JCDL 2001), 2001.
|
|