Codeblocks should not be parsed #12

anakojm · 2023-01-26T02:19:18Z

Obsidian-to-hugo wrongly convert the following:

```python
if foo==bar and foo==baz:
    L = [[12,42],[13,90]]
```

to

```python
if foo<mark>bar and foo</mark>baz:
    L = [12,42],[13,90]({{< ref "12,42],[13,90" >}})
```

Codeblocks should instead be skipped to prevent such false positives (I have no idea how to implement this).

devidw · 2023-01-28T19:35:10Z

Hey @anakojm

I guess regex lookarounds should work for this use case, this would ideally nail the regex down to only those matches that are not written in between triple quotes

If you would like to give it a shot, feel free to add a test case for this in the md marks suit

anakojm · 2023-01-29T00:25:35Z

I am willing to try but one problem I am facing is that I can't do something like that r"(?<!^```.*?$).*?==([^=\n]+)==.*?(?!^```$)"gsm because it is not supported: re.error: look-behind requires fixed-width pattern.

I think you would be better off dealing with this issue as I lack experience in the matter.

Also why did you restrict the issue to the marks processor?
The issue affect the wikilinks parser too, as shown by my example

In the meantime, I have written test cases, should I PR them? Maybe in another branch?

devidw · 2023-01-29T06:33:03Z

Alright I see

Also why did you restrict the issue to the marks processor?
The issue affect the wikilinks parser too, as shown by my example

Good point, have overseen the change in the second line of the example 🙈

If we want to point out the issue clearly and avoid misunderstandings, we can use the diff block on GH 😉

```python
- if foo==bar and foo==baz:
+ if foo<mark>bar and foo</mark>baz:
-    L = [[12,42],[13,90]]
+    L = [12,42],[13,90]({{< ref "12,42],[13,90" >}})
```

In the meantime, I have written test cases, should I PR them? Maybe in another branch?

Cool, yes that would be awesome, maybe an extra branch like bug-codeblocks

vonloxley · 2024-01-27T15:02:42Z

This might do the trick since Python 3.6:

    wiki_link_regex = r"(?ms:```.*?```)|\[\[(.*?)\]\]"
    for match in re.finditer(wiki_link_regex, text):
        if not match.group(1):
            continue

anakojm · 2024-01-29T02:45:07Z

it might work but i believe the problem is more fundamental. we can’t parse markdown with regex properly since markdown is not a regular language.

Fixes devidw#12

anakojm changed the title ~~Codeblock should not be parsed~~ Codeblocks should not be parsed Jan 26, 2023

devidw changed the title ~~Codeblocks should not be parsed~~ Marks processor: Codeblocks should not be parsed Jan 28, 2023

devidw changed the title ~~Marks processor: Codeblocks should not be parsed~~ Codeblocks should not be parsed Jan 29, 2023

vonloxley added a commit to vonloxley/obsidian-to-hugo that referenced this issue Feb 11, 2024

Ignore code blocks while searching wiki links

bcefb9f

Fixes devidw#12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Codeblocks should not be parsed #12

Codeblocks should not be parsed #12

anakojm commented Jan 26, 2023 •

edited

Loading

devidw commented Jan 28, 2023

anakojm commented Jan 29, 2023 •

edited

Loading

devidw commented Jan 29, 2023 •

edited

Loading

vonloxley commented Jan 27, 2024 •

edited

Loading

anakojm commented Jan 29, 2024 •

edited

Loading

Codeblocks should not be parsed #12

Codeblocks should not be parsed #12

Comments

anakojm commented Jan 26, 2023 • edited Loading

devidw commented Jan 28, 2023

anakojm commented Jan 29, 2023 • edited Loading

devidw commented Jan 29, 2023 • edited Loading

vonloxley commented Jan 27, 2024 • edited Loading

anakojm commented Jan 29, 2024 • edited Loading

anakojm commented Jan 26, 2023 •

edited

Loading

anakojm commented Jan 29, 2023 •

edited

Loading

devidw commented Jan 29, 2023 •

edited

Loading

vonloxley commented Jan 27, 2024 •

edited

Loading

anakojm commented Jan 29, 2024 •

edited

Loading