Feature Request: Combine documents #426

Zocker1999NET · 2021-01-24T12:59:53Z

My scanner is not aware of which scanned PNGs/PDFs belong together or not, and due to I simply want all my documents to be "OCRed" and searchable using paperless even before I was able to sort/combine them manually, it would be great if this could be integrated into paperless itself.

How could this be implemented on the UI:

Select the documents you want:
Click on a "Combine" button

What happens in the background:

Combine the original documents (not the archived versions!) for example using ImageMagisk: convert "$@" pdf:-
Delete all old entries of the selected documents
Reprocess the new document as it was simply placed into the consume directory

Known issues with this implementation:

The original source files maybe cannot be currently handled, so they may be lost. Possible workaround: Before combining the originals to a PDF document, pack them together into a zip/tar archive, store that as "original document" and enable paperless to work with zip/tar archives if possible
Will most likely not support formats not supported by ImageMagisk like Office documents, however should be able to combine JPEGs/PNGs/PDFs/TIFFs. Possible workaround: Before combining using ImageMagisk them, convert each file not supported by ImageMagisk to a PDF reusing current existing strategies.

The text was updated successfully, but these errors were encountered:

jonaswinkler · 2021-01-24T13:39:05Z

See #335

Zocker1999NET · 2021-01-24T14:00:17Z

Okay, closing this as duplicate

henfri · 2022-02-06T18:46:34Z

My scanner creates a filename plus suffix for each set of Documents I feed to it.
E.g.
set 1:
Receipt_004942.jpg
Receipt_004942_2.jpg
Receipt_004942_3.jpg
set 2:
Receipt_004946.jpg
Receipt_004946_2.jpg
Receipt_004946_3.jpg
Receipt_004946_4.jpg
Receipt_004946_5.jpg
Receipt_004946_6.jpg

I.e. every group of documents that I send in one go gets a new number. Following pages get a suffix _2, _3, ...

Are you sure that your scanner cannot do something similar?
For that, I have created a script, that could help:
#457 (comment)

Zocker1999NET mentioned this issue Jan 24, 2021

[Feature Request] Split and merge documents #335

Open

Zocker1999NET closed this as completed Jan 24, 2021

shamoon mentioned this issue Jan 25, 2021

Pre-receive Hook: Wait for file - software duplex #440

Closed

czyzlukasz mentioned this issue Dec 16, 2023

Add option to merge multiple files into single PDF when bulk downloading paperless-ngx/paperless-ngx#5002

Closed

11 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature Request: Combine documents #426

Feature Request: Combine documents #426

Zocker1999NET commented Jan 24, 2021 •

edited

Loading

jonaswinkler commented Jan 24, 2021

Zocker1999NET commented Jan 24, 2021

henfri commented Feb 6, 2022

Feature Request: Combine documents #426

Feature Request: Combine documents #426

Comments

Zocker1999NET commented Jan 24, 2021 • edited Loading

jonaswinkler commented Jan 24, 2021

Zocker1999NET commented Jan 24, 2021

henfri commented Feb 6, 2022

Zocker1999NET commented Jan 24, 2021 •

edited

Loading