Research
S2ORC: The Semantic Scholar Open Research Corpus
Semantic Scholar • 2019
A large corpus of 81.1M English-language academic papers spanning many academic disciplines. Rich metadata, paper abstracts, resolved bibliographic references, as well as structured full text for 8.1M open access papers. Full text annotated with automatically-detected inline mentions of citations, figures, and tables, each linked to their corresponding paper objects. Aggregated papers from hundreds of academic publishers and digital archives into a unified source, and create the largest publicly-available collection of machine-readable academic text to date.
Download
Read Paper
View Website
View Repo
Authors
Work by Kyle Lo, Lucy Lu Wang, Mark Neumann, Rodney Kinney, Daniel S. Weld.
Please contact Kyle (kylel@allenai.org) or Lucy (lucyw@allenai.org) for details.
AI FOR THE COMMON GOOD
Email us: ai2-info@allenai.org
Call us: 206.548.5600
Follow us: @allen_ai
Subscribe to the AI2 Newsletter
© The Allen Institute for Artificial Intelligence - All Rights Reserved.
Privacy Policy
|Terms and Conditions|Business Code of Conduct
AI2Allen Institute for AIAI2