Dashboard data sources
Last updated
Last updated
To see the data sources for a specific Dashboard, click on About & FAQ on the Dashboard, and consult the list at Data Sources. The only obligatory data source is title metadata in ONIX format; each publisher then chooses the other data sources they wish to include.
The data sources currently available to be visualised in the Dashboard are detailed in the tables below. The standard data sources and variables used are included, other data sources and variables may be supported as an extra add-on service.
Data source | Status | Access |
---|---|---|
Data source | Status | Access | COUNTER-conformant? | Time aggregation |
---|---|---|---|---|
The public access data sources are those where data is made publicly available by the data source. No additional access permission is required from Dashboard partners for the Dashboard to access the following data sources if partners want them to be included on their dashboard/s.
Crossref is a not-for-profit membership organisation, and an official Digital Object Identifier (DOI) Registration Agency of the International DOI Foundation. They make metadata available for all DOIs registered with Crossref. BAS can use Crossref metadata to match ISBNs obtained from a publisher's ONIX feed to DOIs.
OAPEN enables libraries and aggregators to use the metadata of all available titles in the OAPEN Library. The metadata is available in different formats and BAS harvests the data in XML format and converts it into ONIX format for the OAPEN platform.
Thoth is a free, open metadata service that publishers can use as a metadata storage solution. Thoth can provide metadata in a number of formats. BAS uses the Thoth export API to download metadata for publishers in ONIX format.
University College London (UCL) is an eBook publisher, and partner in the BAD project. UCL Discovery is UCL's open access repository, showcasing and providing access to the full texts of UCL research publications.
Google Analytics Universal monitors and records web traffic for specific websites. If a Dashboard partner had configured Google Analytics on their publisher website, the Google Analytics data can be used to find out which countries and territories website visitors are from.
The Google Books Partner program hosts eBooks, including some free open access eBooks. eBook publishers can then download usage reports from Google Books. BAS uses data from the Google Play sales transaction report and the Google Books Traffic Report.
JSTOR is a digital library offering over 7,000 open access eBooks. Publisher usage reports offer details about the use (views and downloads) of eBooks by institution, and country.
ONIX is a standard that book publishers use to share information about the books that they have published. BAS dashboard partners that have ONIX feeds are given credentials and access to their own upload folder on the Mellon SFTP server. Each publisher uploads their ONIX feed to their upload folder on a weekly, fortnightly, or monthly basis. The BAS data workflow downloads the ONIX data, transforms it (with the ONIX parser Java command line tool) and then loads it into BigQuery for further processing.
IRUS provides COUNTER standard access reports for eBooks hosted on the Fulcrum platform. Fulcrum is a “community-developed, open source platform for digital scholarship” which provides “users the ability to read books with associated digital enhancements, such as: 3-D models, embedded audio, video, and databases; zoomable online images, and interactive media”.
IRUS provides COUNTER standard access reports for eBooks hosted on the OAPEN library and platform. OAPEN "promotes and supports the transition to open access for academic books by providing open infrastructure services to stakeholders in scholarly communication". Almost all eBooks on OAPEN are provided as a PDF file for the whole book. The reports show access figures for each month, and the location (IP address) of the access. Within the OAPEN Google Cloud project (located in Europe), IP addresses are replaced with geographical information (city and country). This means that IP addresses are not stored within BAS data, and only de-identified geographical information is transferred to BAS.
Data source | Events | Page Views |
---|---|---|
Data source | Book Views | Book Downloads |
---|---|---|
Data source | Chapter Downloads |
---|---|
Crossref metadata
Current
Public
OAPEN metadata
Current
Public
ONIX-FTP feed from publishers
Current
Private
Thoth
Current
Public
Crossref Event Data
Current
Public
n/a
Monthly
Google Analytics Universal
Not current
Private
No
Monthly
Google Books
Current
Private
No
Monthly
IRUS Fulcrum
Current
Private
Yes
Monthly
IRUS OAPEN
Current
Private
Yes
Monthly
JSTOR
Current
Private
Yes
Monthly
UCL Discovery
Current
Public
No
Monthly
Crossref Event Data
Y (count of event [id])
Google Analytics Universal
Y [page_views]
Google Analytics Universal
Y (with custom dimensions)
Google Books
Y [BV_with_Pages_Viewed]
Y [qty]
IRUS Fulcrum
[total_item_requests]
IRUS OAPEN
Y [title_requests] and [total_item_requests]
UCL Discovery
Y [total_downloads]
JSTOR
Y [total_item_requests]