Search: id:"swepub:oai:DiVA.org:lnu-89056"
Document image classification using SEMCON
- Kastrati, Zenun, 1984- (author)
- Gjøvik University College, Norway
- Imran, Ali Shariq (author)
- Gjøvik University College, Norway
- IEEE, 2015
- 2015
- English.
In: 2015 20th Symposium on Signal Processing, Images and Computer Vision (STSIVA). IEEE. ISBN 9781467394611, 9781467394604; pp. 1-6
- Related links:
https://urn.kb.se/re...
https://doi.org/10.1...
Abstract
- In this paper, we propose a new semantic and context-based document image classification framework. The framework is composed of two main modules. The first is the text analysis module (TAM), which processes document images and extracts the words they contain; the second is SEMCON, a semantic and contextual objective metric. From the list of words extracted by TAM, SEMCON identifies the noun terms, assigns contextual and semantic meaning to them, and then uses those terms to classify documents. The scope of this paper is limited to the proposed framework and to testing the approach on a limited test dataset. Our preliminary results are very promising and suggest that the proposed framework can be used effectively to classify document images.
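The two-stage flow described in the abstract (words extracted from the image, then noun terms used for classification) can be illustrated with a toy sketch. This is not the SEMCON metric itself, whose scoring is not given in this record: the noun lexicon, the class vocabularies, and the function names below are all hypothetical, and a simple overlap count stands in for the semantic and contextual scoring. The input list plays the role of TAM output.

```python
from collections import Counter

# Hypothetical stand-in for a part-of-speech tagger: a tiny noun lexicon.
NOUNS = {"invoice", "payment", "amount", "tax",
         "patient", "diagnosis", "treatment", "dose"}

# Hypothetical class vocabularies; a real system would learn or curate these.
CLASS_TERMS = {
    "finance": {"invoice", "payment", "amount", "tax"},
    "medical": {"patient", "diagnosis", "treatment", "dose"},
}


def extract_noun_terms(words):
    """Keep only the words found in the toy noun lexicon, lower-cased."""
    return [w.lower() for w in words if w.lower() in NOUNS]


def classify(words):
    """Score each class by how often its terms occur among the noun terms.

    Returns (label, scores); label is None when no class term was found.
    The overlap count is an assumption, not the SEMCON scoring.
    """
    counts = Counter(extract_noun_terms(words))
    scores = {
        label: sum(counts[term] for term in terms)
        for label, terms in CLASS_TERMS.items()
    }
    best = max(scores, key=scores.get)
    return (best if scores[best] > 0 else None), scores


# Words as they might come out of an OCR step (assumed input).
ocr_words = ["The", "invoice", "amount", "includes", "tax", "for", "the", "patient"]
label, scores = classify(ocr_words)
print(label, scores)
```

In this sketch the noun filter and the class scoring are kept as separate functions, mirroring the separation between word extraction and term-based classification that the abstract describes.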
Subject terms
- NATURAL SCIENCES -- Computer and Information Sciences -- Computer Sciences (hsv//eng)
Keywords
- document image processing; image classification; text analysis; SEMCON; contextual based document image classification framework; text analysis module; TAM; semantic and contextual objective metric; Semantics; Feature extraction; Databases; Text analysis; Context; Optical character recognition software; Visualization
- Computer Science
Publication and content type
- ref (subject category)
- kon (subject category)