SwePub
Sök i LIBRIS databas

  Utökad sökning

id:"swepub:oai:DiVA.org:lnu-92079"
 

Sökning: id:"swepub:oai:DiVA.org:lnu-92079" > Text-Independent Sp...

Text-Independent Speaker ID Employing 2D-CNN for Automatic Video Lecture Categorization in a MOOC Setting

Imran, Ali Shariq (författare)
Norwegian University of Science and Technology, Norway
Kastrati, Zenun, 1984- (författare)
Linnéuniversitetet,Institutionen för datavetenskap och medieteknik (DM)
Svendsen, Torbjorn Karl (författare)
Norwegian University of Science and Technology, Norway
visa fler...
Kurti, Arianit, 1977- (författare)
Linnéuniversitetet,Institutionen för datavetenskap och medieteknik (DM)
visa färre...
 (creator_code:org_t)
IEEE Press, 2019
2019
Engelska.
Ingår i: 2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI). - : IEEE Press. - 9781728137988 - 9781728137995 ; , s. 273-277
  • Konferensbidrag (refereegranskat)
Abstract Ämnesord
Stäng  
  • A new form of distance and blended education has hit the market in recent years with the advent of massive open online courses (MOOCs) which have brought many opportunities to the educational sector. Consequently, the availability of learning content to vast demographics of people and across locations has opened up a plethora of possibilities for everyone to gain new knowledge through MOOCs. This poses an immense issue to the content providers as the amount of manual effort required to structure properly and to organize the content automatically for millions of video lectures daily become incredibly challenging. This paper, therefore, addresses this issue as a small part of our proposed personalized content management system by exploiting the voice pattern of the lecturer for identification and for classifying video lectures to the right speaker category. The use of Mel frequency Cepstral coefficients (MFCC) as 2D input features maps to 2D-CNN has shown promising results in contrast to machine learning and deep learning classifiers - making text-independent speaker identification plausible in MOOC setting for automatic video lecture categorization. It will not only help categorize educational videos efficiently for easy search and retrieval but will also promote effective utilization of micro-lectures and multimedia video learning objects (MLO).

Ämnesord

NATURVETENSKAP  -- Data- och informationsvetenskap -- Datavetenskap (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences -- Computer Sciences (hsv//eng)

Nyckelord

speaker identification;speaker classification;DNN;CNN;eLearning;videos lectures;MOOCs;MFCC
Computer Science
Datavetenskap

Publikations- och innehållstyp

ref (ämneskategori)
kon (ämneskategori)

Hitta via bibliotek

Till lärosätets databas

Sök utanför SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy