Discover/arxiv.org API
live

arxiv.org APIarxiv.org

Search arXiv research papers by keyword, author, title, or category. Fetch full metadata by arXiv ID. Browse the complete category taxonomy. 3 endpoints.

Endpoints
3
Updated
8d ago
Try it
General keyword search across all fields
Starting index for pagination (0-based)
Title keywords to search for
Author name to search for
Sort field: relevance, lastUpdatedDate, or submittedDate
arXiv category code (e.g., cs.AI, math.CO, hep-th)
Sort order: descending or ascending
Maximum number of results to return (max 100)
api.parse.bot/scraper/9380e1b0-fae2-4340-9056-3d416f86c775/<endpoint>
Ready to send
Fill in the parameters and hit sign in to send to see live response data here.
Use it in your codegrab a free API key at signup
curl -X GET 'https://api.parse.bot/scraper/9380e1b0-fae2-4340-9056-3d416f86c775/search_papers?query=transformer&max_results=2' \
  -H 'X-API-Key: $PARSE_API_KEY'
All endpoints · 3 totalclick to expand

Search arXiv for research papers using keyword, author, title, and category filters. Supports combining multiple search fields with AND logic. At least one search parameter (query, author, title, or category) must be provided. Returns paginated results.

Input
ParamTypeDescription
querystringGeneral keyword search across all fields
startintegerStarting index for pagination (0-based)
titlestringTitle keywords to search for
authorstringAuthor name to search for
sort_bystringSort field: relevance, lastUpdatedDate, or submittedDate
categorystringarXiv category code (e.g., cs.AI, math.CO, hep-th)
sort_orderstringSort order: descending or ascending
max_resultsintegerMaximum number of results to return (max 100)
Response
{
  "type": "object",
  "fields": {
    "papers": "array of paper objects with arxiv_id, title, authors, summary, categories, primary_category, published, updated, pdf_url, abs_url, comment, journal_ref, doi",
    "start_index": "integer pagination offset",
    "total_results": "integer total number of matching papers"
  },
  "sample": {
    "data": {
      "papers": [
        {
          "doi": null,
          "title": "PyramidTNT: Improved Transformer-in-Transformer Baselines with Pyramid Architecture",
          "abs_url": "https://arxiv.org/abs/2201.00978v1",
          "authors": [
            "Kai Han",
            "Jianyuan Guo",
            "Yehui Tang",
            "Yunhe Wang"
          ],
          "comment": "Tech Report",
          "pdf_url": "https://arxiv.org/pdf/2201.00978v1",
          "summary": "Transformer networks have achieved great progress...",
          "updated": "2022-01-04T04:56:57Z",
          "arxiv_id": "2201.00978v1",
          "published": "2022-01-04T04:56:57Z",
          "categories": [
            "cs.CV"
          ],
          "journal_ref": null,
          "primary_category": "cs.CV"
        }
      ],
      "start_index": 0,
      "total_results": 167798
    },
    "status": "success"
  }
}

About the arxiv.org API

The arXiv API covers 3 endpoints that let you search millions of research papers, retrieve detailed metadata for any paper by its arXiv ID, and browse the full category taxonomy across disciplines. The search_papers endpoint accepts keyword, author, title, and category filters simultaneously, returning paginated results with abstracts, authors, PDF links, and category assignments.

Searching Papers

The search_papers endpoint accepts up to four independent search fields — query (general keyword), author, title, and category — combined with AND logic. At least one must be provided. Results are paginated via start (0-based offset) and max_results (up to 100 per call). The total_results field in the response tells you how many papers match overall. Sort options include relevance, lastUpdatedDate, and submittedDate in either direction via sort_order.

Each paper object in the results array includes arxiv_id, title, authors (array of names), summary (full abstract text), categories (all assigned category codes), primary_category, published and updated timestamps in ISO 8601, and direct pdf_url and abs_url links.

Fetching a Specific Paper

The get_paper endpoint takes a single arxiv_id and returns the full metadata record for that paper. The ID can be in current format (2301.00001), versioned format (2301.00001v1), or legacy format (hep-th/9901001). The response includes the doi field (string or null), an optional comment field for author-provided notes, and all the same title, abstract, author, category, and link fields returned by search.

Category Taxonomy

get_category_taxonomy returns the complete arXiv subject hierarchy. Each entry in the groups array contains a group_name (e.g., Computer Science, Physics, Mathematics, Quantitative Biology, Quantitative Finance) and a categories array of objects with id, name, and description. Pass the optional group parameter to filter down to a single discipline, which is useful when building category pickers or validating category codes before passing them to search_papers.

Common use cases
  • Build a literature review tool that queries search_papers by author name and exports abstracts to a spreadsheet
  • Monitor a research area by polling search_papers with a category code like cs.AI sorted by submittedDate to surface new papers
  • Resolve a list of arXiv IDs to full metadata records including DOI and PDF URL using get_paper in bulk
  • Populate a category selector UI from get_category_taxonomy so users can filter searches by valid arXiv subject codes
  • Cross-reference papers in a citation graph by fetching doi and abs_url fields from get_paper for each node
  • Ingest paper metadata into a RAG pipeline using summary and authors fields from search_papers results
  • Track version history of a specific paper by fetching versioned IDs (e.g., 2301.00001v1, 2301.00001v2) via get_paper
Pricing & limitsSee full pricing →
TierPriceCredits/monthRate limit
Free$0/mo1005 req/min
Hobby$30/mo1,00020 req/min
Developer$100/mo5,000250 req/min

One credit = one API call regardless of which marketplace API you call. Exceeding the rate limit returns a 429 response. Authenticate with the X-API-Key header.

Frequently asked questions
Does arXiv have an official developer API?+
Yes. arXiv provides the arXiv API at https://arxiv.org/help/api/index, which exposes paper search and metadata over Atom XML using query syntax. The Parse API surfaces the same underlying data in JSON with structured parameters, avoiding manual XML parsing.
What does `get_paper` return that `search_papers` does not?+
get_paper returns two additional fields not present in search results: doi (a DOI string or null) and comment (an author-supplied note string or null, often containing page counts, conference names, or revision notes). Both endpoints return title, authors, summary, categories, timestamps, and URLs.
How does pagination work in `search_papers`?+
Use the start parameter (0-based) combined with max_results (maximum 100) to page through results. The response includes total_results so you can compute how many pages exist. For example, to fetch the second page of 25 results, set start=25 and max_results=25.
Does the API return full paper PDFs or citation counts?+
No. The API returns pdf_url (a direct link to the PDF on arXiv) and abs_url (the abstract page URL), but does not fetch or return the PDF content itself. Citation counts and reference lists are also not exposed — those are not part of arXiv's own metadata. You can fork this API on Parse and revise it to add an endpoint that fetches and parses the PDF content or integrates citation data from a source like Semantic Scholar.
Are preprint versions other than the latest accessible?+
Yes, for get_paper you can pass a versioned ID like 2301.00001v1 or 2301.00001v2 to retrieve metadata for a specific revision. However, search_papers results always reflect the latest version of each paper. Version-specific search filtering is not currently supported. You can fork this API on Parse and revise it to add version-aware search behavior.
Page content last updated . Spec covers 3 endpoints from arxiv.org.
Related APIs in EducationSee all →
kenpom.com API
Access comprehensive college basketball analytics and ratings including team efficiency stats, four factors, point distribution, and detailed team statistics from KenPom's renowned basketball evaluation system. Search teams and coaches, retrieve conference-specific ratings, and explore advanced metrics across Division I college basketball.
springer.com API
Search and retrieve metadata for millions of articles, books, and journals from Springer Nature's research library using DOI or ISBN lookups, with powerful filtering and pagination options. Get detailed information about academic publications including journal details, article metadata, and book information to power your research tools and discovery applications.
maxpreps.com API
Access high school sports data from MaxPreps. Search for schools, retrieve team rosters and schedules, look up athlete profiles, and browse national or state rankings across all sports.
amberstudent.com API
Search student accommodation listings across popular cities and access comprehensive property information including room types, pricing trends, and tenant reviews. Get detailed insights into student housing options to compare amenities, prices, and community feedback all in one place.
ieeexplore.ieee.org API
Search for scientific papers and retrieve their metadata, abstracts, references, and citations from IEEE Xplore's collection of journals and conferences. Look up author profiles, browse journals, and access paper details and full text sections all programmatically.
josaa.nic.in API
Access JoSAA (Joint Seat Allocation Authority) admission data for IITs, NITs, IIITs, and GFTIs. Retrieve opening and closing ranks by institute, program, category, quota, and round for the current counselling session as well as historical data from 2016 onwards. Also query seat matrices and full institute details.
ncaa.com API
Access live college sports scores, game schedules, detailed boxscores, play-by-play breakdowns, and team statistics across NCAA sports. Search for specific contests and retrieve comprehensive game information for any NCAA sport, division, or team.
oeis.org API
Search OEIS for integer sequences by keyword, A-number, or known terms, then retrieve full sequence entries and b-file term data.