Crossref Metadata
Crossref Metadata
Crossref is a non-for-profit membership organisation working on making scholarly communications better. It is an official Digital Object Identifier (DOI) Registration Agency of the International DOI Foundation. They provide metadata for every DOI that is registered with Crossref.
Crossref Members send Crossref scholarly metadata on research which is collated and standardised into the Crossref metadata dataset. This dataset is made available through services and tools for manuscript tracking, searching, bibliographic management, library systems, author profiling, specialist subject databases, scholarly sharing networks . - source: Crossref Metadata and schema details.
This table is created as part of the ONIX workflow. The Master Crossref Metadata table is created by Academic Observatory workflows and contains the entirety of the Crossref Metadata dataset. The ONIX workflow creates a subset of the Crossref Metadata by matching on the ISBNs of the workflow's publisher by querying the master table. This date-sharded table is placed in the crossref dataset in multiple crossref_metadataYYYYMMDD tables.
Dataset Name
crossref
Table Name
crossref_metadata
Table Type
Sharded
Average Runtime
2 min
Average Download Size
null
Harvest Type
Query
Each Run Includes All Data
Table Schema
DOI
STRING
NULLABLE
DOI of the work.
ISBN
STRING
REPEATED
ISSN
STRING
REPEATED
URL
STRING
NULLABLE
URL form of the work's DOI.
alternative_id
STRING
REPEATED
Other identifiers for the work provided by the depositing member
abstract
STRING
NULLABLE
Abstract as a JSON string or a JATS XML snippet encoded into a JSON string.
author
RECORD
REPEATED
author.ORCID
STRING
NULLABLE
URL-form of an ORCID identifier
author.affiliation
RECORD
REPEATED
author.affiliation.acronym
STRING
REPEATED
author.affiliation.name
STRING
NULLABLE
author.affiliation.id
RECORD
REPEATED
author.affiliation.id.id
STRING
NULLABLE
author.affiliation.id.id_type
STRING
NULLABLE
author.affiliation.id.asserted_by
STRING
NULLABLE
author.affiliation.place
STRING
REPEATED
author.affiliation.department
STRING
REPEATED
author.authenticated_orcid
BOOLEAN
NULLABLE
If true, record owner asserts that the ORCID user completed ORCID OAuth authentication.
author.family
STRING
NULLABLE
author.given
STRING
NULLABLE
author.name
STRING
NULLABLE
author.sequence
STRING
NULLABLE
author.suffix
STRING
NULLABLE
clinical_trial_number
RECORD
REPEATED
clinical_trial_number.clinical_trial_number
STRING
NULLABLE
Identifier of the clinical trial.
clinical_trial_number.registry
STRING
NULLABLE
DOI of the clinical trial regsitry that assigned the trial number.
clinical_trial_number.type
STRING
NULLABLE
One of preResults, results or postResults
container_title
STRING
REPEATED
Full titles of the containing work (usually a book or journal)
funder
RECORD
REPEATED
funder.DOI
STRING
NULLABLE
Optional Open Funder Registry DOI uniquely identifing the funding body (http://www.crossref.org/fundingdata/registry.html)
funder.award
STRING
REPEATED
Award number(s) for awards given by the funding body.
funder.doi_asserted_by
STRING
NULLABLE
Either crossref or publisher
funder.name
STRING
NULLABLE
Funding body primary name
group_title
STRING
NULLABLE
Group title for posted content.
is_referenced_by_count
INTEGER
NULLABLE
Count of inbound references deposited with Crossref.
issn_type
RECORD
REPEATED
List of ISSNs with ISSN type information
issn_type.type
STRING
NULLABLE
ISSN type, can either be print ISSN or electronic ISSN.
issn_type.value
STRING
NULLABLE
ISSN value
issue
STRING
NULLABLE
Issue number of an article's journal.
published_print
RECORD
NULLABLE
published_print.date_parts
INTEGER
REPEATED
issued
RECORD
NULLABLE
Earliest of published-print and published-online
issued.date_parts
INTEGER
REPEATED
Contains an ordered array of year, month, day of month. Only year is required. Note that the field contains a nested array, e.g. [ [ 2006, 5, 19 ] ] to conform to citeproc JSON dates
license
RECORD
REPEATED
license.URL
STRING
NULLABLE
Link to a web page describing this license
license.content_version
STRING
NULLABLE
Either vor (version of record,) am (accepted manuscript,) tdm (text and data mining) or unspecified.
license.delay_in_days
INTEGER
NULLABLE
Number of days between the publication date of the work and the start date of this license.
license.start
RECORD
NULLABLE
Date on which this license begins to take effect
license.start.date_parts
INTEGER
REPEATED
Contains an ordered array of year, month, day of month. Only year is required. Note that the field contains a nested array, e.g. [ [ 2006, 5, 19 ] ] to conform to citeproc JSON dates
license.start.date_time
TIMESTAMP
NULLABLE
ISO 8601 date time.
license.start.timestamp
INTEGER
NULLABLE
Seconds since UNIX epoch.
link
RECORD
REPEATED
URLs to full-text locations.
link.URL
STRING
NULLABLE
Direct link to a full-text download location.
link.content_type
STRING
NULLABLE
Content type (or MIME type) of the full-text object.
link.content_version
STRING
NULLABLE
Either vor (version of record,) am (accepted manuscript) or unspecified.
link.intended_application
STRING
NULLABLE
Either text-mining, similarity-checking or unspecified.
member
INTEGER
NULLABLE
Member identifier of the form http://id.crossref.org/member/MEMBER_ID
page
STRING
NULLABLE
Pages numbers of an article within its journal.
prefix
STRING
NULLABLE
DOI prefix identifier of the form http://id.crossref.org/prefix/DOI_PREFIX.
published
RECORD
NULLABLE
Date on which content was published.
published.date_parts
INTEGER
REPEATED
Contains an ordered array of year, month, day of month. Only year is required. Note that the field contains a nested array, e.g. [ [ 2006, 5, 19 ] ] to conform to citeproc JSON dates
publisher
STRING
NULLABLE
Name of work's publisher.
publisher_location
STRING
NULLABLE
Location of work's publisher
references_count
INTEGER
NULLABLE
Count of outbound references deposited with Crossref
short_container_title
STRING
REPEATED
Abbreviated titles of the containing work.
subject
STRING
REPEATED
Subject category names, a controlled vocabulary from Sci-Val. Available for most journal articles
title
STRING
REPEATED
Work titles, including translated titles.
type
STRING
NULLABLE
Enumeration, one of the type ids from https://api.crossref.org/v1/types.
volume
STRING
NULLABLE
Volume number of an article's journal.
Last updated