Crossref Events

When someone links their data online, or mentions research on a social media site, we capture that event and make it available for anyone to use in their own way. We provide the unprocessed data—you decide how to use it.

Before the expansion of the Internet, most discussion about scholarly content stayed within scholarly content, with articles citing each other. With the growth of online platforms for discussion, publication and social media, we have seen discussions extend into new, non-traditional venues. Crossref Event Data captures this activity and acts as a hub for the storage and distribution of this data. An event may be a citation in a dataset or patent, a mention in a news article, Wikipedia page or on a blog, or discussion and comment on social media.

When someone links their data online, or mentions research on, for example, Twitter, Wikipedia, or Reddit, Crossref’s uses a set of APIs to captures and records those events in their ‘Event dataset’. Events are tracked via their DOI and URLs, which enables Crossref to monitor where it’s been shared, linked, bookmarked, referenced or commented on. Crossref Event Data currently contains events from a range of data sources, including Crossref Metadata, DataCite Metadata, F1000Prime (Recommendations of research publications, Hypothes.is, The Lens (Cambia), Newsfeed, Reddit, Reddit Links, Stack Exchange Network, Wikipedia, and Wordpress.com

See the crossref events page, and data details, for more information.

This date-sharded table is created during the Onix workflow, and placed in the crossref dataset in multiple crossref_eventsYYYYMMDD tables.

Dataset Name

crossref

Table Name

crossref_events

Table Type

Sharded

Average Runtime

10-120 min

Average Download Size

10-500MB

Harvest Type

API

Each Run Includes All Data

Table Schema

nametypemodedescription

id

STRING

REQUIRED

Unique ID for the Event.

subj_id

STRING

NULLABLE

Subject persistent ID.

relation_type_id

STRING

NULLABLE

Type of the relationship between the subject and object.

obj_id

STRING

NULLABLE

Object persistent ID.

timestamp

TIMESTAMP

REQUIRED

Timestamp of when the Event was created.

occurred_at

TIMESTAMP

REQUIRED

Timestamp of when the Event is reported to have occurred.

experimental

BOOL

NULLABLE

total

INTEGER

NULLABLE

source_id

STRING

REQUIRED

A name for the source.

source_token

STRING

NULLABLE

Unique ID that identifies the Agent that generated the Event.

terms

STRING

NULLABLE

Terms of use for using the API at the point that you acquire the Event.

license

STRING

NULLABLE

A license under which the Event is made available.

evidence_record

STRING

NULLABLE

Link to an Evidence Record for this Event.

subj

RECORD

NULLABLE

Subject metadata.

subj.pid

STRING

NULLABLE

The persistent ID. Must correspond to 'subj_id' or 'obj_id'

subj.issued

TIMESTAMP

NULLABLE

Publication date.

subj.title

STRING

NULLABLE

The title of the webpage, comment, etc.

subj.author

RECORD

REPEATED

Author of the comment, blog etc.

subj.author.url

STRING

NULLABLE

subj.author.name

STRING

NULLABLE

subj.author.id

STRING

NULLABLE

subj.url

STRING

NULLABLE

URL where this was found. May be different to 'pid'

subj.alternative_id

STRING

NULLABLE

subj.original_tweet_author

STRING

NULLABLE

subj.original_tweet_url

STRING

NULLABLE

subj.type

STRING

NULLABLE

subj.work_type_id

STRING

NULLABLE

subj.work_subtype_id

STRING

NULLABLE

subj.jurisdiction

STRING

NULLABLE

subj.api_url

STRING

NULLABLE

subj.publisher

RECORD

REPEATED

subj.publisher.url

STRING

NULLABLE

subj.publisher.name

STRING

NULLABLE

subj.publisher.id

STRING

NULLABLE

subj.publisher.type

STRING

NULLABLE

subj.json_url

STRING

NULLABLE

subj.name

STRING

NULLABLE

subj.datePublished

STRING

NULLABLE

subj.registrantId

STRING

NULLABLE

subj.dateModified

TIMESTAMP

NULLABLE

subj.id

STRING

NULLABLE

subj.proxyIdentifiers

STRING

NULLABLE

subj.funder

RECORD

NULLABLE

subj.funder.id

STRING

NULLABLE

subj.funder.type

STRING

NULLABLE

subj.funder.name

STRING

NULLABLE

subj.issueNumber

STRING

NULLABLE

subj.periodical

RECORD

NULLABLE

subj.periodical.id

STRING

NULLABLE

subj.periodical.issn

STRING

NULLABLE

subj.periodical.type

STRING

NULLABLE

subj.periodical.name

STRING

NULLABLE

subj.pagination

STRING

NULLABLE

subj.version

STRING

NULLABLE

subj.volumeNumber

STRING

NULLABLE

subj.includedInDataCatalog

RECORD

NULLABLE

subj.includedInDataCatalog.id

STRING

NULLABLE

subj.includedInDataCatalog.type

STRING

NULLABLE

subj.includedInDataCatalog.name

STRING

NULLABLE

obj

RECORD

REPEATED

Object metadata.

obj.pid

STRING

NULLABLE

obj.url

STRING

NULLABLE

obj.method

STRING

NULLABLE

obj.verification

STRING

NULLABLE

obj.work_type_id

STRING

NULLABLE

obj.publisher

RECORD

REPEATED

obj.publisher.url

STRING

NULLABLE

obj.publisher.name

STRING

NULLABLE

obj.publisher.id

STRING

NULLABLE

obj.publisher.type

STRING

NULLABLE

obj.name

STRING

NULLABLE

obj.datePublished

STRING

NULLABLE

obj.registrantId

STRING

NULLABLE

obj.dateModified

TIMESTAMP

NULLABLE

obj.id

STRING

NULLABLE

obj.proxyIdentifiers

STRING

NULLABLE

obj.author

STRING

NULLABLE

obj.type

STRING

NULLABLE

obj.funder

RECORD

NULLABLE

obj.funder.id

STRING

NULLABLE

obj.funder.type

STRING

NULLABLE

obj.funder.name

STRING

NULLABLE

obj.issueNumber

STRING

NULLABLE

obj.periodical

RECORD

NULLABLE

obj.periodical.id

STRING

NULLABLE

obj.periodical.issn

STRING

NULLABLE

obj.periodical.type

STRING

NULLABLE

obj.periodical.name

STRING

NULLABLE

obj.pagination

STRING

NULLABLE

obj.version

STRING

NULLABLE

obj.volumeNumber

STRING

NULLABLE

obj.includedInDataCatalog

RECORD

NULLABLE

obj.includedInDataCatalog.id

STRING

NULLABLE

obj.includedInDataCatalog.type

STRING

NULLABLE

obj.includedInDataCatalog.name

STRING

NULLABLE

updated

STRING

NULLABLE

will have a value of 'deleted' or 'edited'

updated_reason

STRING

NULLABLE

optional, may point to an announcement page explaining the edit

updated_date

TIMESTAMP

NULLABLE

ISO8601 date string for when the event was updated

message_action

STRING

NULLABLE

action

STRING

NULLABLE

jwt

STRING

NULLABLE

Last updated