Search: onr:"swepub:oai:research.chalmers.se:fb1d7843-0103-46ab-b425-2045a72b1346" >
A New AI Evaluation...
A New AI Evaluation Cosmos: Ready to Play the Game?
-
- Hernandez-Orallo, J. (author)
- Universitat Politecnica de Valencia (UPV),Polytechnic University of Valencia (UPV)
-
- Baroni, M. (author)
- Universita degli Studi di Trento,University of Trento
-
- Bieger, J. (author)
- Reykjavik University
-
show more...
-
- Chmait, N. (author)
- Monash University
-
- Dowe, D. L. (author)
- Monash University
-
- Hofmann, K. (author)
- Microsoft Research
-
- Martinez-Plumed, F. (author)
- Universitat Politecnica de Valencia (UPV),Polytechnic University of Valencia (UPV)
-
- Strannegård, Claes, 1962 (author)
- Chalmers tekniska högskola,Chalmers University of Technology
-
- Thorissons, K. R. (author)
- Reykjavik University,Icelandic Institute of Intelligent Machines
-
show less...
-
(creator_code:org_t)
- 2017-10-02
- 2017
- English.
-
In: AI Magazine. - : Wiley. - 0738-4602 .- 2371-9621. ; 38:3, s. 66-69
- Related links:
-
https://ojs.aaai.org...
-
show more...
-
https://doi.org/10.1...
-
https://research.cha...
-
show less...
Abstract
Subject headings
Close
- We report on a series of new platforms and events dealing with AI evaluation that may change the way in which AI systems are compared and their progress is measured. The introduction of a more diverse and challenging set of tasks in these platforms can feed AI research in the years to come, shaping the notion of success and the directions of the field. However, the playground of tasks and challenges presented there may misdirect the field without some meaningful structure and systematic guidelines for its organization and use. Anticipating this issue, we also report on several initiatives and workshops that are putting the focus on analyzing the similarity and dependencies between tasks, their difficulty, what capabilities they really measure and ultimately on elaborating new concepts and tools that can arrange tasks and benchmarks into a meaningful taxonomy.
Subject headings
- NATURVETENSKAP -- Data- och informationsvetenskap -- Datavetenskap (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences -- Computer Sciences (hsv//eng)
Publication and Content Type
- art (subject category)
- ref (subject category)
Find in a library
To the university's database