Spacetime Texture Dataset
Konstantinos G. Derpanis and
Richard P. Wildes
Contact email kosta at cse dot yorku dot ca
Department of Computer Science and Engineering
and Centre for Vision Research
York University, Toronto, ON, Canada
Overview
This webpage provides a spacetime texture database.
The database contains a total of 610 spacetime texture samples. The videos were obtained
from various sources, including a Canon HF10
camcorder and the BBC Motion Gallery; the videos
vary widely in their resolution, temporal extents and
capture frame rates. Owing to the diversity within
and across the video sources, the videos generally
contain significant differences in scene appearance,
scale, illumination conditions and camera viewpoint.
Also, for the stochastic patterns, variations in the
underlying physical processes ensure large intra-class
variations.
For details of our technical approach to spacetime structure grouping, see our project page.
Database Organization
The figure below illustrates the overall organization of the York University Vision Lab (YUVL) spacetime texture database.
The data set is partitioned in two ways: (i) basiclevel
and (ii) subordinate-level, by analogy to terminology
for capturing the hierarchical nature of human
categorical perception [3]. The basic-level partition, is based on the
number of spacetime orientations present in a given
pattern; for a detailed description of these categories,
see Section 2.1 of [2]. For the multi-dominant category, samples
were limited to two superimposed structures
(e.g., rain over a stationary background). Note, the
basic-level partition is not arbitrary, it follows from the
systematic enumeration of dynamic patterns based
on their spacetime oriented structure presented in
Sec. 2.1 of [2]. Furthermore, each of these categories have
been central to research regarding the representation
of image-related dynamics, typically considered on a
case-by-case basis. To evaluate the ability
to make finer categorical distinctions, several of the
basic-level categories were further partitioned into
subordinate-levels. This partition
is based on the particular spacetime
orientations present in a given pattern. Beyond
the basic-level categorization of an unconstrained oriented
pattern (i.e., unstructured), no further subdivision
based on pattern dynamics is possible. Underconstrained
cases arise naturally as the aperture
problem and pure temporal variation (i.e., flicker).
Dominant oriented patterns can be distinguished by
whether there is a single orientation that describes
a pattern (e.g., motion) or a narrow range of spacetime
orientations distributed about a given orientation. Note
that further distinctions might be made based, for
example, on the velocity of motion (e.g., stationary
vs. rightward motion vs. leftward motion). In the
case of multi-dominant oriented patterns, the initial
choice of restricting patterns to two components to
populate the database precludes further meaningful
parsing. The heterogeneous and isotropic basic category
was partitioned into wavy fluid and those more
generally stochastic in nature. Further parsing within
the heterogeneous and isotropic basic-level category
is possible; however,
since this more granular partition
is considered in the UCLA data set [4], in the YUVL data
set only a two-way subdivision of the heterogeneous
and isotropic basic-level category is considered.
Download
Spacetime Texture Database (2.6GB download)
Related Papers
[1] K.G. Derpanis and R.P. Wildes, Dynamic Texture Recognition based on Distributions of Spacetime Oriented Structure, In Proceedings of the IEEE Conference Computer Vision and Pattern Recognition (CVPR), 2010.
[2] K.G. Derpanis and R.P. Wildes, Spacetime Texture Representation and Recognition Based on a Spatiotemporal Orientation Analysis, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), to appear.
[3] E. Rosch and C. Mervis, Family resemblances: Studies in the internal structure of categories. Cognitive Psychology, vol. 7, pp. 573-605, 1975.
[4] P. Saisan, G. Doretto, Y. Wu, and S. Soatto, Dynamic texture recognition, in CVPR, 2001, pp. II:58-63.
Last updated: October 6, 2011.
|