pdf_decrypt_retrieve_attachments

Decrypt PDF files and extract their embedded PDF attachments.

Credit

Based on https://piep.tech/posts/automatic-password-removal-in-paperless-ngx/

Setup

1. Create a Dictionary File

The first step in setting up the pre-consumption script is to create a dictionary file. This file lists every password the script should try when removing password protection from PDF files. To create it:

Open a text editor.
Enter each password on its own line.
Save the file as <paperless-ngx_root>/scripts/passwords.txt.

For example:

123456
123456789
qwerty
password
12345
qwerty123
1q2w3e
12345678
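
As a rough illustration of how a script could consume this file, the sketch below reads one password per line and skips blank lines. The function name `load_passwords` is hypothetical and is not taken from pre-consumption.py.

```python
from pathlib import Path
from typing import List


def load_passwords(path: Path) -> List[str]:
    """Return the non-blank lines of a dictionary file, in file order."""
    lines = path.read_text(encoding="utf-8").splitlines()
    # Only whitespace-only lines are dropped; other lines are kept unchanged
    # because leading or trailing spaces could be part of a password.
    return [line for line in lines if line.strip()]


if __name__ == "__main__":
    # Hypothetical location; adjust to your <paperless-ngx_root>.
    dictionary = Path("scripts/passwords.txt")
    if dictionary.is_file():
        print(f"{len(load_passwords(dictionary))} passwords loaded")
```

Keeping the file order means the most likely passwords can be placed at the top so they are tried first.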

2. Write the Pre-Consumption Script

Next, write the pre-consumption script. It uses the dictionary file to remove password protection from incoming PDF files and to extract their PDF attachments.

Open a text editor.
Copy the contents of the pre-consumption.py script from this repository.
Save the file as <paperless-ngx_root>/scripts/pre-consumption.py.
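
The core idea of such a script can be sketched as a loop that tries each dictionary entry until one opens the document. This is a minimal sketch, not the code in pre-consumption.py: `find_password`, `WrongPassword`, and the `try_open` callback are hypothetical names. In a real script the callback would wrap a PDF library call (for example `pikepdf.open(path, password=...)`, which raises `pikepdf.PasswordError` on a wrong password). The sketch also assumes that Paperless-ngx passes the path of the file being consumed in the `DOCUMENT_WORKING_PATH` environment variable.

```python
import os
from typing import Callable, Iterable, Optional


class WrongPassword(Exception):
    """Hypothetical error an opener raises when a password does not fit."""


def find_password(
    try_open: Callable[[str], None], candidates: Iterable[str]
) -> Optional[str]:
    """Return the first candidate that try_open accepts, or None."""
    for candidate in candidates:
        try:
            try_open(candidate)
        except WrongPassword:
            continue  # wrong guess, move on to the next dictionary entry
        return candidate
    return None


def working_path() -> Optional[str]:
    # Assumption: Paperless-ngx sets this variable for pre-consumption scripts.
    return os.environ.get("DOCUMENT_WORKING_PATH")


if __name__ == "__main__":
    # Stand-in opener for demonstration; pretends the PDF password is "qwerty".
    def fake_open(password: str) -> None:
        if password != "qwerty":
            raise WrongPassword(password)

    print(find_password(fake_open, ["123456", "qwerty", "password"]))  # prints qwerty
```

Passing the opener in as a callback keeps the search loop independent of any particular PDF library, which also makes it easy to test without real encrypted files.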

3. Configure the pre-consumption script to be run

Configure Paperless-ngx to run the Python script whenever it processes a new file.

docker-compose.yml

Open the Docker Compose file of your Paperless-ngx installation, <paperless-ngx_root>/docker-compose.yml.

ℹ️ See the example docker-compose.yml in this repository. Make sure that the scripts folder is mounted into the Docker container.

services.webserver.volumes:
    - <paperless-ngx_root>/scripts:/usr/src/paperless/scripts

Make sure that the environment file is loaded by the webserver service.
```
services.webserver.env_file: docker-compose.env
```

docker-compose.env

Open the Docker environment file of your Paperless-ngx installation, <paperless-ngx_root>/docker-compose.env.

ℹ️ See the example docker-compose.env in this repository. Set the script path.

PAPERLESS_PRE_CONSUME_SCRIPT=/usr/src/paperless/scripts/pre-consumption.py
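
A script configured this way is presumably executed directly, so it would need the execute permission and a shebang line. The sketch below checks both from the host side, assuming that is how Paperless-ngx invokes it; `check_script` is a hypothetical helper and the path in the demo is only an example.

```python
import os
from pathlib import Path
from typing import List


def check_script(path: Path) -> List[str]:
    """Return a list of human-readable problems; an empty list means none found."""
    if not path.is_file():
        return [f"{path} does not exist or is not a regular file"]
    problems = []
    if not os.access(path, os.X_OK):
        problems.append(f"{path} is not executable (try: chmod +x)")
    with path.open("rb") as handle:
        # A directly executed script needs '#!' as its first two bytes.
        if handle.read(2) != b"#!":
            problems.append(f"{path} has no shebang line")
    return problems


if __name__ == "__main__":
    # Hypothetical host-side path; adjust to your <paperless-ngx_root>.
    for problem in check_script(Path("scripts/pre-consumption.py")):
        print(problem)
```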

4. Restart the Paperless-ngx docker container

docker-compose up -d

Check that the environment variable was set properly.

docker exec -it paperless_webserver_1 printenv \
    | grep PAPERLESS_PRE_CONSUME_SCRIPT

The output should be:

PAPERLESS_PRE_CONSUME_SCRIPT=/usr/src/paperless/scripts/pre-consumption.py
