G. Caridakis, K. Karpouzis, A. Drosopoulos, S. Kollias |
Non parametric, self organizing, scalable modeling of spatiotemporal inputs: The Sign Language Paradigm |
Neural Networks, Elsevier, Volume 36, December, 2012, pp. 157-166. |
ABSTRACT
|
Modeling and recognizing spatiotemporal, as opposed to static input, is a challenging task since it incorporates input dynamics as part of the problem. The vast majority of existing methods tackle the problem as an extension of the static counterpart, using dynamics, such as input derivatives, at feature level and adopting artificial intelligence and machine learning techniques originally designed for solving problems that do not specifically address the temporal aspect. The proposed approach deals with temporal and spatial aspects of the spatiotemporal domain in a discriminative as well as coupling manner. Self Organizing Maps (SOM) model the spatial aspect of the problem and Markov models its temporal counterpart. Incorporation of adjacency, both in training and classification, enhances the overall architecture with robustness and adaptability. The proposed scheme is validated both theoretically, through an error propagation study, and experimentally, on the recognition of individual signs, performed by different, native Greek Sign Language users. Results illustrate the architecture's superiority when compared to Hidden Markov Model techniques and variations both in terms of classification performance and computational cost.
|
05 October , 2012 |
G. Caridakis, K. Karpouzis, A. Drosopoulos, S. Kollias, "Non parametric, self organizing, scalable modeling of spatiotemporal inputs: The Sign Language Paradigm", Neural Networks, Elsevier, Volume 36, December, 2012, pp. 157-166. |
[ PDF] [
BibTex] [
Print] [
Back] |