Using rwp for archived content #95
Replies: 3 comments
-
I was unaware of the EPUB/a initiative; it's good to see progress in this area. Ideally, the
The report could potentially be saved inside the EPUB container, though this is not recommended as it would alter the checksum of the original file. Preserving HTTP links within book content is a separate challenge, as these links decay over time like any others. It would be useful to:
This feature could be particularly valuable in scholarly publishing.
-
In addition to what @atomotic mentions, which does not modify the EPUB file, I think there is also a use case for lossless optimization of an EPUB file.
-
It's a separate but complementary challenge:
I would treat that one differently: it would be more useful for optimizing EPUB files in a distribution workflow than for archiving them. We might want to open a separate discussion where we identify other ways of optimizing an EPUB (removing unused styles, for example).
-
A draft of EPUB/a was recently announced by ISO/NISO/CLOCKSS: https://clockss.org/new-draft-international-standard-for-the-preservation-of-ebooks-published-in-the-epub-format/
EPUB/a stands for EPUB archive, a profile of the EPUB spec designed with long term preservation in mind.
The draft is available at: https://clockss.org/wp-content/uploads/2024/06/EPUB-3-Archiving-for-Preservation.pdf
On the W3C mailing list, @iherman did a great job summarizing what makes this profile different from "normal" EPUB:
In a similar vein, @atomotic mentioned his interest in extracting external links from an EPUB, to check if they're still alive and return a 2xx HTTP code.
How does this group feel about adding support for such potential features to rwp and the go-toolkit in general?