SwePub
onr:"swepub:oai:DiVA.org:liu-121606"
 

Deep Semantic Pyramids for Human Attributes and Action Recognition

Khan, Fahad Shahbaz (author)
Linköpings universitet, Computer Vision, Faculty of Science and Engineering
Rao, Muhammad Anwer (author)
Department of Information and Computer Science, Aalto University School of Science, Aalto, Finland
van de Weijer, Joost (author)
Computer Vision Center, CS Department, Universitat Autònoma de Barcelona, Barcelona, Spain
Felsberg, Michael (author)
Linköpings universitet, Computer Vision, Faculty of Science and Engineering
Laaksonen, Jorma (author)
Department of Information and Computer Science, Aalto University School of Science, Aalto, Finland
2015-06-09
2015
English.
In: Image Analysis. - Cham : Springer. - 9783319196657 - 9783319196640 ; , pp. 341-353
  • Conference paper (peer-reviewed)
Abstract
  • Describing persons and their actions is a challenging problem due to variations in pose, scale and viewpoint in real-world images. Recently, the semantic pyramids approach [1] for pose normalization has been shown to provide excellent results for gender and action recognition. The performance of the semantic pyramids approach relies on robust image description and is therefore limited by its use of shallow local features. In the context of object recognition [2] and object detection [3], convolutional neural networks (CNNs), or deep features, have been shown to improve performance over conventional shallow features. We propose deep semantic pyramids for human attributes and action recognition. The method works by constructing spatial pyramids based on CNNs of different part locations. These pyramids are then combined to obtain a single semantic representation. We validate our approach on the Berkeley and 27 Human Attributes datasets for attribute classification. For action recognition, we perform experiments on two challenging datasets: Willow and PASCAL VOC 2010. The proposed deep semantic pyramids provide a significant gain of 17.2%, 13.9%, 24.3% and 22.6% compared to the standard shallow semantic pyramids on the Berkeley, 27 Human Attributes, Willow and PASCAL VOC 2010 datasets, respectively. Our results also show that deep semantic pyramids outperform conventional CNNs based on the full bounding box of the person. Finally, we compare our approach with state-of-the-art methods and show a gain in performance compared to the best methods in the literature.
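
The abstract describes the method only at a high level: a spatial pyramid is built per body part and the pyramids are combined into one representation. The following is a minimal sketch of that idea, not the authors' implementation. Everything specific in it is an assumption made for illustration: the part names (`full_body`, `upper_body`, `face`), max pooling within each pyramid cell, the pyramid levels `(1, 2)`, combination by plain concatenation, and the toy feature map standing in for real CNN activations.

```python
from typing import Dict, List, Sequence

# A feature map is a grid of channel vectors: rows x cols x channels.
FeatureMap = List[List[List[float]]]


def max_pool_region(fmap: FeatureMap, r0: int, r1: int, c0: int, c1: int) -> List[float]:
    """Channel-wise maximum over the grid cells in rows [r0, r1) and cols [c0, c1)."""
    channels = len(fmap[0][0])
    pooled = [float("-inf")] * channels
    for r in range(r0, r1):
        for c in range(c0, c1):
            for k in range(channels):
                pooled[k] = max(pooled[k], fmap[r][c][k])
    return pooled


def spatial_pyramid(fmap: FeatureMap, levels: Sequence[int] = (1, 2)) -> List[float]:
    """Pool the map over an n x n grid for each level n and concatenate the results.

    Assumes every level n is no larger than the map's height and width.
    """
    rows, cols = len(fmap), len(fmap[0])
    descriptor: List[float] = []
    for n in levels:
        for i in range(n):
            for j in range(n):
                r0, r1 = i * rows // n, (i + 1) * rows // n
                c0, c1 = j * cols // n, (j + 1) * cols // n
                descriptor.extend(max_pool_region(fmap, r0, r1, c0, c1))
    return descriptor


def semantic_pyramid(part_maps: Dict[str, FeatureMap],
                     part_order: Sequence[str],
                     levels: Sequence[int] = (1, 2)) -> List[float]:
    """Concatenate the per-part spatial pyramids in a fixed part order (an assumption)."""
    representation: List[float] = []
    for part in part_order:
        representation.extend(spatial_pyramid(part_maps[part], levels))
    return representation


# Toy 2 x 2 map with 2 channels, reused for every hypothetical part.
toy = [[[1.0, 0.0], [2.0, 5.0]],
       [[3.0, 1.0], [0.0, 4.0]]]
parts = {"full_body": toy, "upper_body": toy, "face": toy}
rep = semantic_pyramid(parts, ["full_body", "upper_body", "face"])
print(len(rep))  # 3 parts x (1 + 4) cells x 2 channels = 30
```

In practice the per-part maps would come from a CNN applied to each detected part region, and the combined vector would be passed to a classifier; neither step is specified in this record, so both are left out of the sketch.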

Subject headings

ENGINEERING AND TECHNOLOGY  -- Electrical Engineering, Electronic Engineering, Information Engineering -- Robotics (hsv//eng)
ENGINEERING AND TECHNOLOGY  -- Electrical Engineering, Electronic Engineering, Information Engineering -- Computer Systems (hsv//eng)

Keywords

Action recognition; Human attributes; Semantic pyramids

Publication and content type

ref (subject category)
kon (subject category)
