support portal

Solution home General Unpaywall FAQ
What does is_paratext mean in the API?
Created by: Richard Orr
Modified on: Mon, 3 Feb, 2020 at 4:12 PM
Almost anything can have a DOI, so Unpaywall indexes a lot of things besides the journal articles most people use it for. This can be a problem for analyses that use Unpaywall. If you want to collect statistics about Open Access, you probably want to focus on articles, without including extraneous content like issue covers.

Crossref’s content type metadata helps with this: most articles have the type journal-article. But it’s not perfect; not every publisher uses these types consistently and a lot of journal content that isn’t an article is also called journal-article. Unpaywall still includes this information as-is in the API (as the genre field), but we also have added a new attribute that aims to identify this non-article content: is_paratext.

From Wikipedia:
The main text of published authors (e.g. the story, non-fiction description, poems, etc.) is often surrounded by other material supplied by the authors, editors, printers, and publishers, which is known as the paratex.

Paratext is pretty much the same as journal matter as described in this paper. It’s similar to what folks call front matter, but it excludes some items that are considered front matter (for example, letters to the editor) and includes some new ones (for example, back covers). We use a more restrictive and specific definition. If the item’s title begins with one of the following, it’s paratext:

author index
back cover
cover image
cover picture
editorial board
front cover
inside back cover
inside cover
inside front cover
issue information
list of contents
title page

We’ll keep adding to this list when we see journal-articles that aren’t journal articles.
Richard is the author of this solution article.
Did you find it helpful?
Related Articles
What do the fields in the API response and snapshot records mean?
What does oa_date mean and how is it determined?
Unpaywall Change Notes
What is an OA license?
How do I use the title search API?
What do the host_type values "publisher" and "repository" mean?
What do the types of oa_status (green, gold, hybrid, and bronze) mean?
Which DOIs does Unpaywall cover?
Link Resolver Integrations
How do we decide if a given journal is fully OA?
Home Solutions Cookie policy
Help Desk Software by Freshdesk
HomeSolutions Cookie policy