Diffstat (limited to 'scraper/README.md')
-rw-r--r-- scraper/README.md 4
1 files changed, 4 insertions, 0 deletions
diff --git a/scraper/README.md b/scraper/README.md
index 4399abd3..33b2d975 100644
--- a/scraper/README.md
+++ b/scraper/README.md
@@ -42,6 +42,10 @@ We do a two-stage fetch process as only about 66% of their papers are in this da
Loads titles from citations file and queries the S2 search API to get paper IDs, then uses the paper IDs from the search entries to query the S2 papers API to get first-degree citations, authors, etc.
+### s2-papers.py
+
+Of course, searching is not totally accurate, so run the s2-papers.py script to build a report of all the papers, so you can correct any papers that did not resolve. Also reports papers without a location.
+
### s2-dump-ids.py
Dump all the paper IDs and citation IDs from the queried papers.
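
As a rough illustration of the kind of report the new `s2-papers.py` section describes (listing papers that did not resolve, plus papers without a location), here is a minimal sketch. It is not the script's actual code, which this diff does not show. The record fields `title`, `paper_id`, and `location` and the function `build_report` are hypothetical names chosen for the example.

```python
from typing import Dict, List, Optional


def build_report(records: List[Dict[str, Optional[str]]]) -> Dict[str, List[str]]:
    """Group paper titles by the problem that needs manual correction.

    Each record is assumed (hypothetically) to carry:
      title     - the title loaded from the citations file
      paper_id  - the ID found by the search step, or None if it did not resolve
      location  - a location string, or None if none is known
    """
    # Papers whose search query returned no usable ID.
    unresolved = [r["title"] for r in records if not r.get("paper_id")]
    # Papers with no location, whether or not they resolved.
    no_location = [r["title"] for r in records if not r.get("location")]
    return {"unresolved": unresolved, "no_location": no_location}


if __name__ == "__main__":
    # Made-up sample data, only to show the shape of the output.
    sample = [
        {"title": "Paper A", "paper_id": "abc123", "location": "Somewhere"},
        {"title": "Paper B", "paper_id": None, "location": "Elsewhere"},
        {"title": "Paper C", "paper_id": "def456", "location": None},
    ]
    report = build_report(sample)
    for problem, titles in report.items():
        print(f"{problem}: {', '.join(titles) or '(none)'}")
```

Keeping the two lists separate lets a paper appear in both when it neither resolved nor has a location, so each problem can be corrected independently.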