Elastic Functional Coding of Riemannian Trajectories

Visual observations of dynamic phenomena, such as human actions, are often represented as sequences of smoothly-varying features. In cases where the feature spaces can be structured as Riemannian manifolds, the corresponding representations become trajectories on manifolds. Analysis of these traject...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on pattern analysis and machine intelligence 2017-05, Vol.39 (5), p.922-936
Hauptverfasser: Anirudh, Rushil, Turaga, Pavan, Jingyong Su, Srivastava, Anuj
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 936
container_issue 5
container_start_page 922
container_title IEEE transactions on pattern analysis and machine intelligence
container_volume 39
creator Anirudh, Rushil
Turaga, Pavan
Jingyong Su
Srivastava, Anuj
description Visual observations of dynamic phenomena, such as human actions, are often represented as sequences of smoothly-varying features. In cases where the feature spaces can be structured as Riemannian manifolds, the corresponding representations become trajectories on manifolds. Analysis of these trajectories is challenging due to non-linearity of underlying spaces and high-dimensionality of trajectories. In vision problems, given the nature of physical systems involved, these phenomena are better characterized on a low-dimensional manifold compared to the space of Riemannian trajectories. For instance, if one does not impose physical constraints of the human body, in data involving human action analysis, the resulting representation space will have highly redundant features. Learning an effective, low-dimensional embedding for action representations will have a huge impact in the areas of search and retrieval, visualization, learning, and recognition. Traditional manifold learning addresses this problem for static points in the euclidean space, but its extension to Riemannian trajectories is non-trivial and remains unexplored. The difficulty lies in inherent non-linearity of the domain and temporal variability of actions that can distort any traditional metric between trajectories. To overcome these issues, we use the framework based on transported square-root velocity fields (TSRVF); this framework has several desirable properties, including a rate-invariant metric and vector space representations. We propose to learn an embedding such that each action trajectory is mapped to a single point in a low-dimensional euclidean space, and the trajectories that differ only in temporal rates map to the same point. We utilize the TSRVF representation, and accompanying statistical summaries of Riemannian trajectories, to extend existing coding methods such as PCA, KSVD and Label Consistent KSVD to Riemannian trajectories or more generally to Riemannian functions. We show that such coding efficiently captures trajectories in applications such as action recognition, stroke rehabilitation, visual speech recognition, clustering and diverse sequence sampling. Using this framework, we obtain state-of-the-art recognition results, while reducing the dimensionality/ complexity by a factor of 100-250x. Since these mappings and codes are invertible, they can also be used to interactively-visualize Riemannian trajectories and synthesize actions.
doi_str_mv 10.1109/TPAMI.2016.2564409
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_ieee_primary_7466117</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>7466117</ieee_id><sourcerecordid>1861613131</sourcerecordid><originalsourceid>FETCH-LOGICAL-c395t-affad4b48f48859e76d132cf6a8e60b42a75cc7889478034ec43c5c66506c2af3</originalsourceid><addsrcrecordid>eNpdkM1KAzEURoMoto6-gIIMuHEzNf-TLKW0WqgoUtchzSSSMjOpyczCt3dqaxdyF3dxz_fBPQBcIzhBCMqH1dvjy2KCIeITzDilUJ6AMZJEFoQReQrGwwUXQmAxAhcpbSBElEFyDkZYIES4lGNAZ7VOnTf5vG9N50Or63waKt9-5sHl7942um29bvNV1BtruhC9TZfgzOk62avDzsDHfLaaPhfL16fF9HFZGCJZV2jndEXXVDgqBJO25BUi2DiuheVwTbEumTGlEJKWAhJqDSWGGc4Z5AZrRzJwv-_dxvDV29Spxidj61q3NvRJIcERR2Q3Gbj7h25CH4dvksKopJQwPpjJAN5TJoaUonVqG32j47dCUO2cql-naudUHZwOodtDdb9ubHWM_EkcgJs94K21x3NJOUeoJD9SH3kB</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2174435619</pqid></control><display><type>article</type><title>Elastic Functional Coding of Riemannian Trajectories</title><source>IEEE Electronic Library (IEL)</source><creator>Anirudh, Rushil ; Turaga, Pavan ; Jingyong Su ; Srivastava, Anuj</creator><creatorcontrib>Anirudh, Rushil ; Turaga, Pavan ; Jingyong Su ; Srivastava, Anuj</creatorcontrib><description>Visual observations of dynamic phenomena, such as human actions, are often represented as sequences of smoothly-varying features. In cases where the feature spaces can be structured as Riemannian manifolds, the corresponding representations become trajectories on manifolds. Analysis of these trajectories is challenging due to non-linearity of underlying spaces and high-dimensionality of trajectories. In vision problems, given the nature of physical systems involved, these phenomena are better characterized on a low-dimensional manifold compared to the space of Riemannian trajectories. For instance, if one does not impose physical constraints of the human body, in data involving human action analysis, the resulting representation space will have highly redundant features. Learning an effective, low-dimensional embedding for action representations will have a huge impact in the areas of search and retrieval, visualization, learning, and recognition. Traditional manifold learning addresses this problem for static points in the euclidean space, but its extension to Riemannian trajectories is non-trivial and remains unexplored. The difficulty lies in inherent non-linearity of the domain and temporal variability of actions that can distort any traditional metric between trajectories. To overcome these issues, we use the framework based on transported square-root velocity fields (TSRVF); this framework has several desirable properties, including a rate-invariant metric and vector space representations. We propose to learn an embedding such that each action trajectory is mapped to a single point in a low-dimensional euclidean space, and the trajectories that differ only in temporal rates map to the same point. We utilize the TSRVF representation, and accompanying statistical summaries of Riemannian trajectories, to extend existing coding methods such as PCA, KSVD and Label Consistent KSVD to Riemannian trajectories or more generally to Riemannian functions. We show that such coding efficiently captures trajectories in applications such as action recognition, stroke rehabilitation, visual speech recognition, clustering and diverse sequence sampling. Using this framework, we obtain state-of-the-art recognition results, while reducing the dimensionality/ complexity by a factor of 100-250x. Since these mappings and codes are invertible, they can also be used to interactively-visualize Riemannian trajectories and synthesize actions.</description><identifier>ISSN: 0162-8828</identifier><identifier>EISSN: 1939-3539</identifier><identifier>EISSN: 2160-9292</identifier><identifier>DOI: 10.1109/TPAMI.2016.2564409</identifier><identifier>PMID: 28113699</identifier><identifier>CODEN: ITPIDJ</identifier><language>eng</language><publisher>United States: IEEE</publisher><subject>activity recognition ; Clustering ; Coding ; dimensionality reduction ; Embedding ; Encoding ; Euclidean geometry ; Euclidean space ; Linearity ; Machine learning ; Manifolds ; Manifolds (mathematics) ; Measurement ; Nonlinearity ; Principal component analysis ; Rehabilitation ; Representations ; Riemann manifold ; Riemannian geometry ; Speech recognition ; State of the art ; Trajectory ; Trajectory analysis ; Velocity distribution ; Visual observation ; Visualization</subject><ispartof>IEEE transactions on pattern analysis and machine intelligence, 2017-05, Vol.39 (5), p.922-936</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2017</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c395t-affad4b48f48859e76d132cf6a8e60b42a75cc7889478034ec43c5c66506c2af3</citedby><cites>FETCH-LOGICAL-c395t-affad4b48f48859e76d132cf6a8e60b42a75cc7889478034ec43c5c66506c2af3</cites><orcidid>0000-0002-4186-3502</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/7466117$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,778,782,794,27911,27912,54745</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/7466117$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/28113699$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Anirudh, Rushil</creatorcontrib><creatorcontrib>Turaga, Pavan</creatorcontrib><creatorcontrib>Jingyong Su</creatorcontrib><creatorcontrib>Srivastava, Anuj</creatorcontrib><title>Elastic Functional Coding of Riemannian Trajectories</title><title>IEEE transactions on pattern analysis and machine intelligence</title><addtitle>TPAMI</addtitle><addtitle>IEEE Trans Pattern Anal Mach Intell</addtitle><description>Visual observations of dynamic phenomena, such as human actions, are often represented as sequences of smoothly-varying features. In cases where the feature spaces can be structured as Riemannian manifolds, the corresponding representations become trajectories on manifolds. Analysis of these trajectories is challenging due to non-linearity of underlying spaces and high-dimensionality of trajectories. In vision problems, given the nature of physical systems involved, these phenomena are better characterized on a low-dimensional manifold compared to the space of Riemannian trajectories. For instance, if one does not impose physical constraints of the human body, in data involving human action analysis, the resulting representation space will have highly redundant features. Learning an effective, low-dimensional embedding for action representations will have a huge impact in the areas of search and retrieval, visualization, learning, and recognition. Traditional manifold learning addresses this problem for static points in the euclidean space, but its extension to Riemannian trajectories is non-trivial and remains unexplored. The difficulty lies in inherent non-linearity of the domain and temporal variability of actions that can distort any traditional metric between trajectories. To overcome these issues, we use the framework based on transported square-root velocity fields (TSRVF); this framework has several desirable properties, including a rate-invariant metric and vector space representations. We propose to learn an embedding such that each action trajectory is mapped to a single point in a low-dimensional euclidean space, and the trajectories that differ only in temporal rates map to the same point. We utilize the TSRVF representation, and accompanying statistical summaries of Riemannian trajectories, to extend existing coding methods such as PCA, KSVD and Label Consistent KSVD to Riemannian trajectories or more generally to Riemannian functions. We show that such coding efficiently captures trajectories in applications such as action recognition, stroke rehabilitation, visual speech recognition, clustering and diverse sequence sampling. Using this framework, we obtain state-of-the-art recognition results, while reducing the dimensionality/ complexity by a factor of 100-250x. Since these mappings and codes are invertible, they can also be used to interactively-visualize Riemannian trajectories and synthesize actions.</description><subject>activity recognition</subject><subject>Clustering</subject><subject>Coding</subject><subject>dimensionality reduction</subject><subject>Embedding</subject><subject>Encoding</subject><subject>Euclidean geometry</subject><subject>Euclidean space</subject><subject>Linearity</subject><subject>Machine learning</subject><subject>Manifolds</subject><subject>Manifolds (mathematics)</subject><subject>Measurement</subject><subject>Nonlinearity</subject><subject>Principal component analysis</subject><subject>Rehabilitation</subject><subject>Representations</subject><subject>Riemann manifold</subject><subject>Riemannian geometry</subject><subject>Speech recognition</subject><subject>State of the art</subject><subject>Trajectory</subject><subject>Trajectory analysis</subject><subject>Velocity distribution</subject><subject>Visual observation</subject><subject>Visualization</subject><issn>0162-8828</issn><issn>1939-3539</issn><issn>2160-9292</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpdkM1KAzEURoMoto6-gIIMuHEzNf-TLKW0WqgoUtchzSSSMjOpyczCt3dqaxdyF3dxz_fBPQBcIzhBCMqH1dvjy2KCIeITzDilUJ6AMZJEFoQReQrGwwUXQmAxAhcpbSBElEFyDkZYIES4lGNAZ7VOnTf5vG9N50Or63waKt9-5sHl7942um29bvNV1BtruhC9TZfgzOk62avDzsDHfLaaPhfL16fF9HFZGCJZV2jndEXXVDgqBJO25BUi2DiuheVwTbEumTGlEJKWAhJqDSWGGc4Z5AZrRzJwv-_dxvDV29Spxidj61q3NvRJIcERR2Q3Gbj7h25CH4dvksKopJQwPpjJAN5TJoaUonVqG32j47dCUO2cql-naudUHZwOodtDdb9ubHWM_EkcgJs94K21x3NJOUeoJD9SH3kB</recordid><startdate>20170501</startdate><enddate>20170501</enddate><creator>Anirudh, Rushil</creator><creator>Turaga, Pavan</creator><creator>Jingyong Su</creator><creator>Srivastava, Anuj</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-4186-3502</orcidid></search><sort><creationdate>20170501</creationdate><title>Elastic Functional Coding of Riemannian Trajectories</title><author>Anirudh, Rushil ; Turaga, Pavan ; Jingyong Su ; Srivastava, Anuj</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c395t-affad4b48f48859e76d132cf6a8e60b42a75cc7889478034ec43c5c66506c2af3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>activity recognition</topic><topic>Clustering</topic><topic>Coding</topic><topic>dimensionality reduction</topic><topic>Embedding</topic><topic>Encoding</topic><topic>Euclidean geometry</topic><topic>Euclidean space</topic><topic>Linearity</topic><topic>Machine learning</topic><topic>Manifolds</topic><topic>Manifolds (mathematics)</topic><topic>Measurement</topic><topic>Nonlinearity</topic><topic>Principal component analysis</topic><topic>Rehabilitation</topic><topic>Representations</topic><topic>Riemann manifold</topic><topic>Riemannian geometry</topic><topic>Speech recognition</topic><topic>State of the art</topic><topic>Trajectory</topic><topic>Trajectory analysis</topic><topic>Velocity distribution</topic><topic>Visual observation</topic><topic>Visualization</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Anirudh, Rushil</creatorcontrib><creatorcontrib>Turaga, Pavan</creatorcontrib><creatorcontrib>Jingyong Su</creatorcontrib><creatorcontrib>Srivastava, Anuj</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>MEDLINE - Academic</collection><jtitle>IEEE transactions on pattern analysis and machine intelligence</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Anirudh, Rushil</au><au>Turaga, Pavan</au><au>Jingyong Su</au><au>Srivastava, Anuj</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Elastic Functional Coding of Riemannian Trajectories</atitle><jtitle>IEEE transactions on pattern analysis and machine intelligence</jtitle><stitle>TPAMI</stitle><addtitle>IEEE Trans Pattern Anal Mach Intell</addtitle><date>2017-05-01</date><risdate>2017</risdate><volume>39</volume><issue>5</issue><spage>922</spage><epage>936</epage><pages>922-936</pages><issn>0162-8828</issn><eissn>1939-3539</eissn><eissn>2160-9292</eissn><coden>ITPIDJ</coden><abstract>Visual observations of dynamic phenomena, such as human actions, are often represented as sequences of smoothly-varying features. In cases where the feature spaces can be structured as Riemannian manifolds, the corresponding representations become trajectories on manifolds. Analysis of these trajectories is challenging due to non-linearity of underlying spaces and high-dimensionality of trajectories. In vision problems, given the nature of physical systems involved, these phenomena are better characterized on a low-dimensional manifold compared to the space of Riemannian trajectories. For instance, if one does not impose physical constraints of the human body, in data involving human action analysis, the resulting representation space will have highly redundant features. Learning an effective, low-dimensional embedding for action representations will have a huge impact in the areas of search and retrieval, visualization, learning, and recognition. Traditional manifold learning addresses this problem for static points in the euclidean space, but its extension to Riemannian trajectories is non-trivial and remains unexplored. The difficulty lies in inherent non-linearity of the domain and temporal variability of actions that can distort any traditional metric between trajectories. To overcome these issues, we use the framework based on transported square-root velocity fields (TSRVF); this framework has several desirable properties, including a rate-invariant metric and vector space representations. We propose to learn an embedding such that each action trajectory is mapped to a single point in a low-dimensional euclidean space, and the trajectories that differ only in temporal rates map to the same point. We utilize the TSRVF representation, and accompanying statistical summaries of Riemannian trajectories, to extend existing coding methods such as PCA, KSVD and Label Consistent KSVD to Riemannian trajectories or more generally to Riemannian functions. We show that such coding efficiently captures trajectories in applications such as action recognition, stroke rehabilitation, visual speech recognition, clustering and diverse sequence sampling. Using this framework, we obtain state-of-the-art recognition results, while reducing the dimensionality/ complexity by a factor of 100-250x. Since these mappings and codes are invertible, they can also be used to interactively-visualize Riemannian trajectories and synthesize actions.</abstract><cop>United States</cop><pub>IEEE</pub><pmid>28113699</pmid><doi>10.1109/TPAMI.2016.2564409</doi><tpages>15</tpages><orcidid>https://orcid.org/0000-0002-4186-3502</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 0162-8828
ispartof IEEE transactions on pattern analysis and machine intelligence, 2017-05, Vol.39 (5), p.922-936
issn 0162-8828
1939-3539
2160-9292
language eng
recordid cdi_ieee_primary_7466117
source IEEE Electronic Library (IEL)
subjects activity recognition
Clustering
Coding
dimensionality reduction
Embedding
Encoding
Euclidean geometry
Euclidean space
Linearity
Machine learning
Manifolds
Manifolds (mathematics)
Measurement
Nonlinearity
Principal component analysis
Rehabilitation
Representations
Riemann manifold
Riemannian geometry
Speech recognition
State of the art
Trajectory
Trajectory analysis
Velocity distribution
Visual observation
Visualization
title Elastic Functional Coding of Riemannian Trajectories
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T19%3A50%3A25IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Elastic%20Functional%20Coding%20of%20Riemannian%20Trajectories&rft.jtitle=IEEE%20transactions%20on%20pattern%20analysis%20and%20machine%20intelligence&rft.au=Anirudh,%20Rushil&rft.date=2017-05-01&rft.volume=39&rft.issue=5&rft.spage=922&rft.epage=936&rft.pages=922-936&rft.issn=0162-8828&rft.eissn=1939-3539&rft.coden=ITPIDJ&rft_id=info:doi/10.1109/TPAMI.2016.2564409&rft_dat=%3Cproquest_RIE%3E1861613131%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2174435619&rft_id=info:pmid/28113699&rft_ieee_id=7466117&rfr_iscdi=true