From cc96e02f88db212d2ac3fb709a53bf26d8995aa7 Mon Sep 17 00:00:00 2001 From: Jules Laplace Date: Fri, 8 Feb 2019 23:37:44 +0100 Subject: readme --- scraper/README.md | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/scraper/README.md b/scraper/README.md index 4399abd3..33b2d975 100644 --- a/scraper/README.md +++ b/scraper/README.md @@ -42,6 +42,10 @@ We do a two-stage fetch process as only about 66% of their papers are in this da Loads titles from citations file and queries the S2 search API to get paper IDs, then uses the paper IDs from the search entries to query the S2 papers API to get first-degree citations, authors, etc. +### s2-papers.py + +Of course, searching is not totally accurate, so run the s2-papers.py script to build a report of all the papers, so you can correct any papers that did not resolve. Also reports papers without a location. + ### s2-dump-ids.py Dump all the paper IDs and citation IDs from the queried papers. -- cgit v1.2.3-70-g09d2