diff options
| author | Jules Laplace <julescarbon@gmail.com> | 2018-11-05 23:14:56 +0100 |
|---|---|---|
| committer | Jules Laplace <julescarbon@gmail.com> | 2018-11-05 23:14:56 +0100 |
| commit | f616775cd805ef991bae5f3058bb9c7857896d5a (patch) | |
| tree | cb0c5c020f1bf2a40c0625609a513ad735cb77ba /pdf_dump_first_page.sh | |
| parent | d6f2c1a496fb478e6533730fef654b7aa8833f90 (diff) | |
dump first pages
Diffstat (limited to 'pdf_dump_first_page.sh')
| -rw-r--r-- | pdf_dump_first_page.sh | 8 |
1 files changed, 8 insertions, 0 deletions
diff --git a/pdf_dump_first_page.sh b/pdf_dump_first_page.sh new file mode 100644 index 00000000..dd67c87d --- /dev/null +++ b/pdf_dump_first_page.sh @@ -0,0 +1,8 @@ +#!/bin/bash + +for i in datasets/s2/pdf/*/*/*.pdf + do + OUTPUT="${i%.*}.txt" + pdf2txt.py -p 1 $i > $OUTPUT + echo $OUTPUT + done
\ No newline at end of file |
