GBIF KOS
Biodiversity Knowledge Organization System
The GBIF Knowledge Organization System (KOS) task group (Catapano et al., 2011) provided recommendations for the uptake of KOS technology by GBIF. The KOS report, as well as some of the previous task group reports on metadata (Jones et al., 2009) and persistent identifiers (Cryer et al., 2010; Richards et al., 2011), recommended building on existing persistent identifiers, or establishing new ones, for each vocabulary term and concept. These reports further recommended reusing existing terms and concepts wherever possible.
Biodiversity Information Standards (TDWG) maintains standards for biodiversity data. Many of these standards have in the past been expressed using the XML Schema language (XSD). With the advance of the semantic web, there is growing interest within TDWG in expressing vocabularies as RDF resources. We have proposed a Vocabulary Management Task Group (VoMaG) to develop best practices and guidelines for maintaining RDF vocabularies of terms and concepts from biodiversity informatics. Members of the task group would contribute to the evaluation of these best practices. Previous task groups of this sort have produced a technical report summarizing their results.
One of the tasks for the Vocabulary Management Task Group would be to evaluate software tools, including ISOcat and Semantic MediaWiki, for the collaborative development and maintenance of vocabularies of basic concepts (declared here to be reused by other resources). We expect that these resources will be maintained by a small group of people and used by a much larger group.
One use for these basic concepts is as a repository of terms for the data sharing profiles in use by the GBIF network. These data sharing profiles include the Darwin Core "extensions" and the "vocabularies" of controlled values that are declared for some of the terms included in the "extensions". The overall plan is that the terms included in the "extensions" and in the "vocabularies" would be drawn from the basic concepts declared by the RDF vocabularies.
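The relationship between basic concepts and the profiles that draw on them can be illustrated with a minimal sketch. It is not the task group's design or any GBIF software: the `ConceptRegistry` class, the `build_extension` function, and the extension name are hypothetical, and an in-memory dictionary stands in for an RDF vocabulary. The two term identifiers are existing Darwin Core term URIs; the labels are illustrative.

```python
from dataclasses import dataclass

DWC = "http://rs.tdwg.org/dwc/terms/"  # Darwin Core term namespace
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"


@dataclass(frozen=True)
class Concept:
    """A basic concept identified by a persistent URI."""
    uri: str
    label: str  # illustrative label, not an official definition


class ConceptRegistry:
    """Hypothetical in-memory stand-in for a shared RDF vocabulary."""

    def __init__(self):
        self._by_uri = {}

    def declare(self, concept):
        # Reuse an existing declaration rather than creating a duplicate.
        return self._by_uri.setdefault(concept.uri, concept)

    def resolve(self, uri):
        return self._by_uri[uri]

    def __contains__(self, uri):
        return uri in self._by_uri


def build_extension(name, term_uris, registry):
    """Assemble an extension only from concepts already declared."""
    missing = [u for u in term_uris if u not in registry]
    if missing:
        raise ValueError(f"undeclared terms in {name}: {missing}")
    return {"name": name, "terms": [registry.resolve(u) for u in term_uris]}


def to_turtle(concept):
    """Serialize a concept's label as a single Turtle statement."""
    return f'<{concept.uri}> <{RDFS_LABEL}> "{concept.label}"@en .'


registry = ConceptRegistry()
registry.declare(Concept(DWC + "scientificName", "Scientific Name"))
registry.declare(Concept(DWC + "basisOfRecord", "Basis of Record"))

extension = build_extension(
    "ExampleOccurrenceExtension",  # hypothetical extension name
    [DWC + "scientificName", DWC + "basisOfRecord"],
    registry,
)
for term in extension["terms"]:
    print(to_turtle(term))
```

In this sketch an extension cannot introduce a term on its own: a term must first be declared once in the registry under its persistent identifier, and a second declaration of the same URI returns the existing concept.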
We have proposed that the task group convene at the GBIF Community site [1]. Members of the task group would start by creating a user profile at the GBIF Community site and joining the Vocabulary Management group. The ISOcat demo [2] and the Semantic Wiki [3] (this site) should also be open for users to start signing up.
[1] http://community.gbif.org/pg/groups/21382/vocabulary-management/
[2] http://kos.gbif.org/isocat/interface/
[3] http://kos.gbif.org/wiki/
The annual TDWG conference (http://www.tdwg.org/) could provide an opportunity for some of the task group members to meet. However, most of the work would consist of contributions to the discussions and to the evaluation of the software tools.
References
Catapano T, Hobern D, Lapp H, Morris RA, Morrison N, Noy N, Schildhauer M, and Thau D (2011). Recommendations for the use of knowledge organization systems by GBIF. Released on 4 February 2011. Global Biodiversity Information Facility (GBIF), Copenhagen. Available at http://www.gbif.org/orc/?doc_id=2942&l=en, verified 26 March 2012.
Cryer P, Hyam R, Miller C, Nicolson N, O Tuama E, Page R, Rees J, Riccardi G, Richards K, and White R (2010). Adoption of persistent identifiers for biodiversity informatics: Recommendations of the GBIF LSID GUID task group, 6 November 2009. Global Biodiversity Information Facility (GBIF), Copenhagen. Available at http://www.gbif.org/orc/?doc_id=2956&l=en, verified 26 March 2012.
Endresen DTF, Ó Tuama É, and Remsen D (2012a). Vocabulary Management Task Group Charter: A Task Group of the TAG Interest Group. [Technical Report] Available at http://community.gbif.org/pg/blog/read/21387/
Endresen DTF, Ó Tuama É, and Remsen D (2012b). Biodiversity Knowledge Organization System: Proposed Architecture. [Technical Report] Available at http://community.gbif.org/pg/file/read/21582/
Harman KT, Hyam R, and Remsen DP (2009). Vocabularies - Managing Them. Proceedings of TDWG 2009. Available at http://www.tdwg.org/proceedings/article/view/605, verified 26 March 2012.
Jones MB, Bertrand N, Holetschek J, Hutchison V, Ko BC-J, Suarez-Mayorga A, Meaux M, Ulate W, Watts D, Robertson T, and O Tuama E (2009). Report of the GBIF metadata implementation framework task group (MIFTG). September 15, 2009. Global Biodiversity Information Facility (GBIF), Copenhagen. Available at http://imsgbif.gbif.org/CMS_NEW/get_file.php?FILE=2d85d0e8c76408129024c09aa072d6, verified 26 March 2012.
Richards K, White R, Nicolson N, and Pyle R (2011). A beginner's guide to persistent identifiers, version 1.0. Released on 9 February 2011. Global Biodiversity Information Facility (GBIF), Copenhagen. Available at http://www.gbif.org/orc/?doc_id=2428, verified 26 March 2012.
Smith VS, Rycroft SD, Harman KT, Scott B, and Roberts D (2009). Scratchpads: a data-publishing framework to build, share and manage information on the diversity of life. BMC Bioinformatics 10 (Suppl 14) p. S6. DOI:10.1186/1471-2105-10-S14-S6. Available at http://www.biomedcentral.com/1471-2105/10/S14/S6, verified 26 March 2012.
Wieczorek J, Bloom D, Guralnick R, Blum S, Döring M, Giovanni R, Robertson T, and Vieglais D (2012). Darwin Core: An Evolving Community-Developed Biodiversity Data Standard. PLoS ONE 7(1): e29715. DOI:10.1371/journal.pone.0029715