feat(tex): support page parameter for includegraphics with multi-page pdf #1922

fk128 · 2025-03-16T17:42:23Z

Add support for selecting specific pages in multi-page PDFs

In LaTeX, when including a multi-page PDF as a graphic, it's possible to specify a page number:

 \includegraphics[width=1.0\textwidth,page=3]{figures/my_pic.pdf}

However, the page parameter is currently ignored, causing all the links to point only to the first page, even though ImageMagick extracts and converts all pages.

This PR adds support for the page parameter, ensuring that the correct page is selected when converting multi-page PDFs.

changeset-bot · 2025-03-16T17:42:26Z

🦋 Changeset detected

Latest commit: 56921a5

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 5 packages

Name	Type
myst-cli	Patch
tex-to-myst	Patch
myst-to-tex	Patch
mystmd	Patch
myst-migrate	Patch

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

rowanc1

Nice, looks great. Left a few minor things.

Could you also add page with a comment here:

https://github.com/executablebooks/mystmd/blob/main/packages/myst-spec-ext/src/types.ts#L117

e.g.

export type Image = SpecImage & {
  urlSource?: string;
  urlOptimized?: string;
  height?: string;
  placeholder?: boolean;
  /** Optional page number for PDF images, this ensure the correct page is extracted when converting to web and translated to LaTeX */
  page?: number;
};

rowanc1 · 2025-03-16T18:13:33Z

packages/myst-cli/src/utils/imagemagick.ts

    } ${output}`;
-    session.log.debug(`Executing: ${executable}`);
+
+    session.log.info(`Executing: ${executable}`);


Suggested change

session.log.info(`Executing: ${executable}`);

session.log.debug(`Executing: ${executable}`);

rowanc1 · 2025-03-16T18:13:45Z

packages/tex-to-myst/src/figures.ts

 import type { Handler, ITexParser } from './types.js';
-import { getArguments, texToText } from './utils.js';
+import { getArguments, extractParams, texToText } from './utils.js';
+import { group } from 'console';


I don't think this is used?

rowanc1 · 2025-03-16T18:16:42Z

packages/tex-to-myst/src/figures.ts

+    }
+    if (params.page) {
+      if (typeof params.page === 'number') {
+        params.page = Number(params.page) - 1; // Convert to 0-based for imagemagick


Do we need to round or parse this or have something similar to Number.isFinite(Number.parseFloat(params.page))?

You're right! Unless I'm mistaken, I think the extractParams should already have done the parsing, but I've added the missing Number.isFinite and the rounding.

agoose77 · 2025-03-17T07:44:54Z

@fwkoch @rowanc1 I'm wondering whether it makes sense to bring page awareness to our normal AST. Should we not treat this as a post-tex transform, and rewrite the image node with the proper figure?

fwkoch

I just looked at this PR now for the first time - looks great, and I think we should work on getting it landed.

First, just to summarize my understanding, this has two parts: (1) On parsing a tex file, this looks at image params and adds page to the image node (alongside width, which was previously supported). (2) On image conversion, if page is present, imagemagick extracts the correct page.

A couple points:

@agoose77 - I think your concern was around adding page to the image node. It feels a little clutter-y since it only applies in a few specific cases. Thinking through the alternative a bit: If we do not store page on the node, we would need to do initial image processing at parse time, pulling out the specific page as a separate file. This separate file would still need final conversion to correct format. This means we have extra intermediate files stored on users' machines.
I noticed if this page-specific PDF image is selected as the "implicit" thumbnail, it also causes problems: thumbnail is just a file, not an entire image node. That means thumbnail does not get the page value, and is stuck trying to process/convert the full PDF file. There are certainly ways around this where we could get these thumbnails working... but I also think we might not need to worry about this edge case. An "explicit" thumbnail can always be set if there are errors with the "implicit" thumbnail.

My inclination is to keep this as-is. It works nicely and it's relatively simple. The only downside is an extra page field on image nodes that will usually be undefined.

fwkoch · 2025-06-28T08:15:33Z

packages/tex-to-myst/src/utils.ts

  );
 }

+export function extractParams(args: { content: string }[]): Record<string, string | number> {


This seems like it has potential for wider use across other macros/parameter types. 👍

feat: support page parameter for includegraphics pdf

a4494a8

rowanc1 reviewed Mar 16, 2025

View reviewed changes

rowanc1 and others added 2 commits March 16, 2025 12:20

Create spicy-dingos-talk.md

5d93741

fix: update types, check finite and round page to int

56921a5

fwkoch reviewed Jun 28, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(tex): support page parameter for includegraphics with multi-page pdf #1922

feat(tex): support page parameter for includegraphics with multi-page pdf #1922

Uh oh!

fk128 commented Mar 16, 2025

Uh oh!

changeset-bot bot commented Mar 16, 2025 •

edited

Loading

Uh oh!

rowanc1 left a comment •

edited

Loading

Uh oh!

rowanc1 Mar 16, 2025

Uh oh!

fk128 Mar 16, 2025

Uh oh!

rowanc1 Mar 16, 2025

Uh oh!

fk128 Mar 16, 2025

Uh oh!

rowanc1 Mar 16, 2025

Uh oh!

fk128 Mar 16, 2025

Uh oh!

agoose77 commented Mar 17, 2025 •

edited

Loading

Uh oh!

fwkoch left a comment

Uh oh!

fwkoch Jun 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

	session.log.info(`Executing: ${executable}`);
	session.log.debug(`Executing: ${executable}`);

feat(tex): support page parameter for includegraphics with multi-page pdf #1922

Are you sure you want to change the base?

feat(tex): support page parameter for includegraphics with multi-page pdf #1922

Uh oh!

Conversation

fk128 commented Mar 16, 2025

Uh oh!

changeset-bot bot commented Mar 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🦋 Changeset detected

Uh oh!

rowanc1 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rowanc1 Mar 16, 2025

Choose a reason for hiding this comment

Uh oh!

fk128 Mar 16, 2025

Choose a reason for hiding this comment

Uh oh!

rowanc1 Mar 16, 2025

Choose a reason for hiding this comment

Uh oh!

fk128 Mar 16, 2025

Choose a reason for hiding this comment

Uh oh!

rowanc1 Mar 16, 2025

Choose a reason for hiding this comment

Uh oh!

fk128 Mar 16, 2025

Choose a reason for hiding this comment

Uh oh!

agoose77 commented Mar 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fwkoch left a comment

Choose a reason for hiding this comment

Uh oh!

fwkoch Jun 28, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

changeset-bot bot commented Mar 16, 2025 •

edited

Loading

rowanc1 left a comment •

edited

Loading

agoose77 commented Mar 17, 2025 •

edited

Loading