SwePub

CoordinateCleaner: Standardized cleaning of occurrence records from biological collection databases

Zizka, Alexander, 1986 (author)
Gothenburg University, Department of Biological and Environmental Sciences
Silvestro, Daniele (author)
Gothenburg University, Department of Biological and Environmental Sciences
Andermann, Tobias (author)
Gothenburg University, Department of Biological and Environmental Sciences
Azevedo, Josué (author)
Gothenburg University, Department of Biological and Environmental Sciences
Ritter, Camila (author)
Gothenburg University, Department of Biological and Environmental Sciences
Edler, Daniel (author)
Gothenburg University, Department of Biological and Environmental Sciences
Farooq, Harith, 1986 (author)
Gothenburg University, Department of Biological and Environmental Sciences
Herdean, Andrei, 1984 (author)
Gothenburg University, Department of Biological and Environmental Sciences
Ariza, María (author)
Gothenburg University, Department of Biological and Environmental Sciences
Scharn, Ruud (author)
Gothenburg University, Department of Biological and Environmental Sciences
Svantesson, Sten (author)
Gothenburg University, Department of Biological and Environmental Sciences
Wengström, Niklas, 1969 (author)
Gothenburg University, Department of Biological and Environmental Sciences
Zizka, V. (author)
Antonelli, Alexandre, 1978 (author)
Gothenburg University, Department of Biological and Environmental Sciences
2019-02-24
2019
English.
In: Methods in Ecology and Evolution. - : Wiley. - 2041-210X. ; 10:5, pp. 744-751
  • Journal article (peer-reviewed)
Abstract
  • Species occurrence records from online databases are an indispensable resource in ecological, biogeographical and palaeontological research. However, issues with data quality, especially incorrect geo-referencing or dating, can diminish their usefulness. Manual cleaning is time-consuming, error-prone, difficult to reproduce and limited to known geographical areas and taxonomic groups, making it impractical for datasets with thousands or millions of records. Here, we present CoordinateCleaner, an R package to scan datasets of species occurrence records for geo-referencing and dating imprecisions and data entry errors in a standardized and reproducible way. CoordinateCleaner is tailored to problems common in biological and palaeontological databases and can handle datasets with millions of records. The software includes (a) functions to flag potentially problematic coordinate records based on geographical gazetteers, (b) a global database of 9,691 geo-referenced biodiversity institutions to identify records that are likely from horticulture or captivity, (c) novel algorithms to identify datasets with rasterized data, conversion errors and strong decimal rounding and (d) spatio-temporal tests for fossils. We describe the individual functions available in CoordinateCleaner and demonstrate them on more than 90 million occurrences of flowering plants from the Global Biodiversity Information Facility (GBIF) and 19,000 fossil occurrences from the Palaeobiology Database (PBDB). We find that in GBIF more than 3.4 million records (3.7%) are potentially problematic and that 179 of the tested contributing datasets (18.5%) might be biased by rasterized coordinates. In PBDB, 1,205 records (6.3%) are potentially problematic. All cleaning functions and the biodiversity institution database are open-source and available within the CoordinateCleaner R package.
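
To make the idea of flagging "data entry errors" in coordinates concrete, here is a minimal sketch in Python. It is not the CoordinateCleaner API (the package is written in R), and the record does not describe the package's individual tests. The `Occurrence` type, the `flag_record` function, the three checks chosen (out-of-range values, coordinates at or near 0/0, identical absolute latitude and longitude) and the `zero_buffer` default are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Occurrence:
    """Hypothetical occurrence record with decimal-degree coordinates."""
    species: str
    lat: float
    lon: float


def flag_record(rec: Occurrence, zero_buffer: float = 0.5) -> list[str]:
    """Return the names of the illustrative checks that a record fails.

    An empty list means none of the checks flagged the record.
    """
    flags: list[str] = []

    # Coordinates outside the valid range cannot be tested further.
    if not (-90.0 <= rec.lat <= 90.0 and -180.0 <= rec.lon <= 180.0):
        flags.append("invalid")
        return flags

    # Records at or very near 0/0 often come from missing values
    # that were stored as zeros (assumed buffer, in degrees).
    if abs(rec.lat) <= zero_buffer and abs(rec.lon) <= zero_buffer:
        flags.append("zero")

    # Identical absolute latitude and longitude can indicate a
    # copy-and-paste or data entry error.
    if abs(rec.lat) == abs(rec.lon):
        flags.append("equal")

    return flags


if __name__ == "__main__":
    records = [
        Occurrence("Species A", 57.7, 11.97),
        Occurrence("Species B", 0.0, 0.0),
        Occurrence("Species C", 95.0, 10.0),
    ]
    for rec in records:
        print(rec.species, flag_record(rec))
```

Flagged records are only candidates for removal; whether a flag reflects a real error depends on the dataset and the research question.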

Subject headings

NATURAL SCIENCES -- Earth and Related Environmental Sciences -- Environmental Sciences (hsv//eng)

Keyword

biodiversity institutions
data quality
fossils
GBIF
geo-referencing
palaeobiology database (PBDB)
R
big data
diversity
Environmental Sciences & Ecology

Publication and Content Type

ref (subject category)
art (subject category)
