Automated Derivation of Behavior Vocabularies for Autonomous Humanoid Motion. Jenkins, O. C. & Mataric, M. J. In Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, pages 225--232, 2003.
doi  abstract   bibtex   
In this paper we address the problem of automatically deriving vocabularies of motion modules from human motion data, taking advantage of the underlying spatio-temporal structure in motion. We approach this problem with a data-driven methodology for modularizing a motion stream (or time-series of human motion) into a vocabulary of parameterized primitive motion modules and a set of metalevel behaviors characterizing extended combinations of the primitives. Central to this methodology is the discovery of spatio-temporal structure in a motion stream. We estimate this structure by extending an existing nonlinear dimension reduction technique, Isomap, to handle motion data with spatial and temporal dependencies. The motion vocabularies derived by our methodology provide a substrate of autonomous behavior and can be used in a variety of applications. We demonstrate the utility of derived vocabularies for the application of synthesizing new humanoid motion that is structurally similar to the original demonstrated motion.
@InProceedings{Jenkins2003,
  author    = {Jenkins, O. C. and Mataric, M. J.},
  title     = {Automated Derivation of Behavior Vocabularies for Autonomous Humanoid Motion},
  booktitle = {Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems},
  year      = {2003},
  pages     = {225--232},
  abstract  = {In this paper we address the problem of automatically deriving vocabularies of motion modules from human motion
data, taking advantage of the underlying spatio-temporal
structure in motion. We approach this problem with a
data-driven methodology for modularizing a motion stream
(or time-series of human motion) into a vocabulary of parameterized primitive motion modules and a set of metalevel behaviors characterizing extended combinations of the
primitives. Central to this methodology is the discovery of
spatio-temporal structure in a motion stream. We estimate
this structure by extending an existing nonlinear dimension
reduction technique, Isomap, to handle motion data with
spatial and temporal dependencies. The motion vocabularies derived by our methodology provide a substrate of autonomous behavior and can be used in a variety of applications. We demonstrate the utility of derived vocabularies for
the application of synthesizing new humanoid motion that
is structurally similar to the original demonstrated motion.},
  acmid     = {860612},
  doi       = {10.1145/860575.860612},
  groups    = {Lit Review 2013-09},
  isbn      = {1-58113-683-8},
  keywords  = {autonomous humanoid agents, humanoid robotics, kinematic motion segmentation, motion primitives, motion vocabularies, spectral dimension reduction},
  location  = {Melbourne, Australia},
  numpages  = {8},
  review    = {Want to auto-label human movement data. Uses spatial isomap.

Spatial isomap
1) bin spatial points together ("determine a local neighbourhood of nearby points") via k-nn
(don't quite follow the details here)
- basically, want to create a distance matrix D between data points and some neighbouring point
- this allows the isomap to be performed on this feature space similarity matrix D instead of some input space covariance matrix C (as is the case for PCA)
 - similar idea to Kernal PCA, where we only need the dot product to compute our similarity, as oppose to converting the entire input data space
 - they add temporal information by taking the Euclidean distance and warping the D matrix based on temporal neighbours
 - they like it because PCA only rotates the data, whereas isomap seems to do it more intelligently

After isomap warping, the segmentation is done by greedy algorithm:
1. current segment is the first frame
2. calculate distance between centroid from the current segment boundary (so t-1 frame) to every subsequent frame
3. pick first local maximum in centroid distance function

(so not online I guess) - results not reported



Jenkins and Matari\'{c} \cite{Jenkins2002, Jenkins2003} segments by performed by calculating the distance from the first frame's Cartesian centroid of a given DOF to all the subseqent frames, and segmenting on the first local maximum. Labeling and learning is done by performing dimensionality reduction using spatial Isomap by binning spatial data points together via \emph{k}-NN, computing the shortest distance between all point pairs using Dijkstra's algorithm, then preforming PCA. Although Isomap and PCA have the same general steps, Isomap is performed in the feature space, which is a higher dimensional space when compared to the input space, allowing for non-linearity in the input space. PCA works optimally if the input data is linear, or near linear. making Isomap a more flexible algorithm. Temporal information is accounted for by adding temporal-based weights to the distance metric. Similar activities are combined together by sweep-and-prune clustering. The focus of these paper is movement learning, and not segmentation, so no segmentation accuracy was reported.},
  timestamp = {2013.09.30},
}

Downloads: 0