Factors of Transferability for a Generic ConvNet Representation
- Azizpour, Hossein, 1985- (author)
- KTH, Computer Vision and Robotics, CVAP, Computer Vision
- Sharif Razavian, Ali, 1985- (author)
- KTH, Computer Vision and Robotics, CVAP
- Sullivan, Josephine (author)
- KTH, Computer Vision and Robotics, CVAP
- Maki, Atsuto (author)
- KTH, Computer Vision and Robotics, CVAP
- Carlsson, Stefan (author)
- KTH, Computer Vision and Robotics, CVAP
- IEEE Computer Society Digital Library, 2016
- 2016
- English.
In: IEEE Transactions on Pattern Analysis and Machine Intelligence. IEEE Computer Society Digital Library. ISSN 0162-8828, 1939-3539. 38:9, pp. 1790-1802
- Related links:
- http://ieeexplore.ie...
- https://urn.kb.se/re...
- https://doi.org/10.1...
Abstract
- Evidence is mounting that Convolutional Networks (ConvNets) are the most effective representation learning method for visual recognition tasks. In the common scenario, a ConvNet is trained on a large labeled dataset (source), and the feed-forward unit activations of the trained network at a certain layer are used as a generic representation of an input image for a task with a relatively smaller training set (target). Recent studies have shown this form of representation transfer to be suitable for a wide range of target visual recognition tasks. This paper introduces and investigates several factors affecting the transferability of such representations. These include parameters for training the source ConvNet, such as its architecture and the distribution of the training data, as well as parameters of feature extraction, such as the layer of the trained ConvNet and dimensionality reduction. By optimizing these factors, we show that significant improvements can be achieved on 17 visual recognition tasks. We further show that these tasks can be categorically ordered by their similarity to the source task, such that a correlation is observed between task performance and similarity to the source task with respect to the proposed factors.
Subject headings
- NATURVETENSKAP -- Data- och informationsvetenskap -- Datorseende och robotik (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences -- Computer Vision and Robotics (hsv//eng)
Keywords
- Computer Science (Datalogi)
Publication and content type
- ref (subject category)
- art (subject category)