Difference between revisions of "Hackathon 2013/Citations"

From TaxonWorks Wiki
Jump to: navigation, search
(URLs of taxonomic significance: Tweaked section)
(Members)
Line 25: Line 25:
  
 
== Members ==
 
== Members ==
* Mike Maehr
 
 
* Beth Frank
 
* Beth Frank
 
* Gaurav Vaidya
 
* Gaurav Vaidya
 +
* Mike Maehr
  
 
== Datasets we can play with ==
 
== Datasets we can play with ==

Revision as of 13:50, 30 September 2013

This pitch covers making sure that citations can flow easily in and out of TaxonWorks. It will do this in three ways:

  1. Finding an existing data model which can hold information citations with a standard format for representing it.
  2. Writing code to read this in and out in Ruby, possibly just using a standard library [1].
  3. Writing code to resolve microcitations. A working example of this has been created by Rod Page that links generic names from Nomenclator Zoologicus with references from the Biodiversity Heritage Library.
    • Micro-citations should be differentiated from verbatim references. A micro-citation is just an author and date (as seen in a full taxon name). A verbatim reference is the full reference as it appears in documentation that hasn't be broken into normalized pieces yet. Both versions need to be tracked and searchable. We also need a way to get a listing out of TW of both versions, so they can be normalized into full sources.
    • Lists of journals.
    • BHL API to convert BHL URLs into references
  4. Finding out more about a particular citation
    • Getting abstract, keywords from PubMed
    • Pulling in citation from BHL
    • ImpactStory information
  5. Designing a user interface to make it easy to resolve microcitations
    • Autocompletion
    • Journal name identification
    • Searching on Google Scholar/Wikipedia for books/authors

Contents

Terms

  • Citation: An individual, unnormalized use of a source.
  • Source: Something you want to credit in providing that data.
    • A person can be a source.
    • TW needs sources to be private or public.
  • Global source: a common pool of sources. These should be published and non-private.

Members

  • Beth Frank
  • Gaurav Vaidya
  • Mike Maehr

Datasets we can play with

  • ITIS
  • GNUB
  • UCD

Input/output formats

  • BibTeX
  • RIS?

URLs and identifiers of taxonomic significance

It should be noted that there will be multiple identifiers associated with a single source.

  • ISBN/ISSN
  • BHL URLs
  • PubMed ID/URLs
  • DOI ID/URLs
  • Handle ID/URLs?
  • Mendeley/Zotero/EndNote ID/URLs

APIs available

  • The BHL API can be used to get bibliographic information on BHL pages and publications.
  • The Mendeley API can be used to get abstract, keywords, and search.