Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is there an option to output ALTO XML to STDOUT? #143

Open
Sukii opened this issue May 22, 2022 · 3 comments
Open

Is there an option to output ALTO XML to STDOUT? #143

Sukii opened this issue May 22, 2022 · 3 comments
Labels
question Further information is requested

Comments

@Sukii
Copy link

Sukii commented May 22, 2022

I need it for a down-stream XSLT pipeline;
https://gitlab.coko.foundation/XSweet/XSweet/-/tree/pdf2html/applications/pdf2html

@Sukii Sukii changed the title Is there an option to output ALTO XML to STDIN? Is there an option to output ALTO XML to STDOUT? May 22, 2022
@kermitt2
Copy link
Owner

kermitt2 commented Jun 3, 2022

Hello @Sukii !

There is no such option currently. As the normal use case is to produce several files in addition to the ATLO document to cover information in the PDF that cannot be encoded in ALTO (for annotations, outline, ...), I didn't plan to add it so far.
I guess working with files is no problem, the interest of using pipes with stdout/stdin would be to speed up a bit the XSTL transformation?

@Sukii
Copy link
Author

Sukii commented Jun 3, 2022

Yes, not only the speed improvement, but also that Linux pipes help in sending the output directly to the webservices avoiding possible collisions, racing conditions etc. Of course, the images and stuff like that better remain outside as binary files, so it may be necessary to write that to hard-disk anyway.

@kermitt2 kermitt2 added the question Further information is requested label Jun 3, 2022
@Sukii
Copy link
Author

Sukii commented Jun 3, 2022

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants