This repository was archived by the owner on Jan 7, 2021. It is now read-only.
Searches the provided path and attempts to upload every PDF it finds. Metadata passed through the other keyword arguments is recorded for all uploads. Returns a list of the document objects that are created. Be warned: the search is recursive, so PDFs in subdirectories of the path you specify are uploaded too.
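To preview which files a recursive search like this would pick up before uploading anything, a minimal sketch follows. It is not the library's implementation: ``find_pdfs`` is a hypothetical helper, and matching the ``.pdf`` extension case-insensitively is an assumption.

```python
import os


def find_pdfs(path):
    """Return the sorted paths of every .pdf file under path, subdirectories included.

    Hypothetical helper for illustration only; the extension match is
    case-insensitive by assumption.
    """
    matches = []
    # os.walk descends into every subdirectory of path.
    for dirpath, _dirnames, filenames in os.walk(path):
        for name in filenames:
            if name.lower().endswith(".pdf"):
                matches.append(os.path.join(dirpath, name))
    return sorted(matches)
```

Running it on the directory first and reading the returned list is a cheap way to confirm that nothing unexpected sits in a nested folder.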
@@ -138,7 +142,7 @@ Metadata
 >>> obj.file_hash
 '872b9b858f5f3e6bb6086fec7f05dd464b60eb26'

-You could recreate this hexadecimal hash yourself using the `SHA-1 algorithm <https://en.wikipedia.org/wiki/SHA-1>`_.
+You could recreate this hexadecimal hash yourself using the `SHA-1 algorithm <https://en.wikipedia.org/wiki/SHA-1>`_.

 >>> import hashlib
 >>> hashlib.sha1(obj.pdf).hexdigest()
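The same check can be run against a local copy of a PDF. The sketch below assumes you have the file on disk and the expected hexadecimal hash as a string; ``sha1_hex`` and ``matches_file_hash`` are hypothetical helper names, not part of the library.

```python
import hashlib


def sha1_hex(data):
    """Return the SHA-1 digest of a bytes object as a 40-character hex string."""
    return hashlib.sha1(data).hexdigest()


def matches_file_hash(local_path, expected_hash):
    """Compare a local file's SHA-1 digest with an expected hex hash.

    Hypothetical helper: expected_hash would be a value such as obj.file_hash.
    """
    with open(local_path, "rb") as f:
        return sha1_hex(f.read()) == expected_hash.lower()
```

The file is opened in binary mode because the hash is computed over the raw bytes of the PDF.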
@@ -154,7 +158,7 @@ Metadata

 .. attribute:: document_obj.full_text_url

-Returns the URL that contains the full text of the document, as extracted from the original PDF by DocumentCloud.
+Returns the URL that contains the full text of the document, as extracted from the original PDF by DocumentCloud.

 .. method:: document_obj.get_page_text(page)

@@ -266,4 +270,3 @@ Metadata
 .. attribute:: document_obj.updated_at

 The date and time that the document was last updated, in Python's datetime format.