Fixes #36912 - support multi-line type definitions #367

evgeni · 2023-11-10T10:10:44Z

This strips line breaks from type definitions, thus making them parseable again.

This will probably break badly if a type manifest contains anything other than type definitions, but Puppet recommends [1] storing them in separate files, with only one type alias per file.

[1] https://www.puppet.com/docs/puppet/7/lang_type_aliases
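
To illustrate the idea in the description above, here is a minimal sketch of joining the lines of a manifest before matching. It is not the code from this pull request: the method name `parse_type_aliases` is hypothetical, and the regular expression is the `TYPE_DEFINITION` pattern quoted later in this conversation. It assumes one type alias per file, as the Puppet documentation recommends.

```ruby
# Hypothetical helper, not the implementation merged in this pull request.
def parse_type_aliases(source)
  # Same pattern as the TYPE_DEFINITION constant quoted in the review thread.
  type_definition = /^type\s+([^\s=]+)\s*=\s*(.+?)(\s+#.*)?\s*$/

  # Drop blank lines and full-line comments, then glue the remaining lines
  # together so a definition spanning several lines becomes a single line.
  joined = source.each_line
                 .map(&:strip)
                 .reject { |line| line.empty? || line.start_with?('#') }
                 .join

  match = type_definition.match(joined)
  match ? { match[1] => match[2] } : {}
end

types = parse_type_aliases("type IP = Variant[\n  IPv4,\n  IPv6,\n]")
puts types['IP'] # prints Variant[IPv4,IPv6,]
```

Because everything is joined into one string, a file holding two aliases would be matched as a single definition, which is the breakage the description warns about.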

evgeni · 2023-11-10T10:13:59Z

test/kafo/data_type_parser_test.rb

@@ -24,6 +24,16 @@ module Kafo
      it { _(parser.types).must_equal({'Ipv4' => 'Pattern[/^(\d+)\.(\d+)\.(\d+)\.(\d+)$/]'}) }
    end

+    describe "parse multiline alias" do
+      let(:file) { "type IP = Variant[\n  IPv4,\n  IPv6,\n]" }
+      it { _(parser.types).must_equal({'IP' => 'Variant[IPv4,IPv6,]'}) }


I wonder if that trailing comma will break something 😁

added a test to ensure that it's fine

ekohl

What if we use puppet-strings to parse it? There's this part in the JSON:

  "data_type_aliases": [
    {
      "name": "Candlepin::LogLevel",
      "file": "types/loglevel.pp",
      "line": 2,
      "docstring": {
        "text": "",
        "tags": [
          {
            "tag_name": "summary",
            "text": "A log4j log level"
          }
        ]
      },
      "alias_of": "Enum['ALL', 'DEBUG', 'INFO', 'WARN', 'ERROR', 'FATAL', 'OFF', 'TRACE']"
    }
  ],

In #343 I started to utilize the bulk parsing approach. Currently, kafo_parsers calls puppet strings once for every file. The bulk parsing approach is to call it for multiple files (or even for a whole module at once) and then extract the data from the resulting output.

Since the data type aliases are also in there, we could leverage that and avoid writing our own parser.
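
A rough sketch of what that extraction could look like, assuming the puppet-strings JSON is already available as a string. The method name `aliases_from_strings_json` is hypothetical; only the `data_type_aliases`, `name`, and `alias_of` keys are taken from the excerpt above.

```ruby
require 'json'

# Hypothetical helper: maps each alias name to its definition, given the
# JSON text produced by puppet-strings.
def aliases_from_strings_json(json_text)
  data = JSON.parse(json_text)
  data.fetch('data_type_aliases', []).to_h do |entry|
    [entry['name'], entry['alias_of']]
  end
end

# Sample input trimmed down from the excerpt quoted above.
sample = <<~'JSON'
  {
    "data_type_aliases": [
      {
        "name": "Candlepin::LogLevel",
        "file": "types/loglevel.pp",
        "line": 2,
        "alias_of": "Enum['ALL', 'DEBUG', 'INFO', 'WARN', 'ERROR', 'FATAL', 'OFF', 'TRACE']"
      }
    ]
  }
JSON

puts aliases_from_strings_json(sample)['Candlepin::LogLevel']
```

Since puppet-strings has already parsed the manifest, multi-line definitions and comments would not need any special handling on the Kafo side.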

ekohl · 2023-11-10T10:31:35Z

lib/kafo/data_type_parser.rb

+        line = line.force_encoding("UTF-8").strip
+        next if line.start_with?('#')
+
+        line = line.split('#').first.strip


This will break if you use a Pattern with # in it.

So will the current regex, no?

kafo/lib/kafo/data_type_parser.rb

Line 5 in 4b44b57

TYPE_DEFINITION = /^type\s+([^\s=]+)\s*=\s*(.+?)(\s+#.*)?\s*$/

Ah, it ensures at least one space before the #…

However, https://www.puppet.com/docs/puppet/7/lang_comments doesn't say anything about leading spaces

added a test for the one Pattern I could find using a # inside, Stdlib::Email :)
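
The difference between splitting on `#` and on ` #` can be seen in a small sketch. Both helper names and both sample aliases are made up for illustration; they are not taken from Kafo or stdlib.

```ruby
# Hypothetical helpers comparing the two comment-stripping approaches.
def strip_comment_naive(line)
  line.split('#').first.strip
end

def strip_comment_spaced(line)
  line.split(' #').first.strip
end

pattern_alias = 'type Example = Pattern[/^[a-z#]+$/]'
with_comment  = 'type Port = Integer[0, 65535] # TCP or UDP port'

puts strip_comment_naive(pattern_alias)  # prints type Example = Pattern[/^[a-z
puts strip_comment_spaced(pattern_alias) # prints type Example = Pattern[/^[a-z#]+$/]
puts strip_comment_spaced(with_comment)  # prints type Port = Integer[0, 65535]
```

Requiring a leading space is only a heuristic: a pattern containing a space followed by `#` would still be cut short, and a comment written without a leading space would not be stripped.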

evgeni · 2023-11-10T10:50:43Z

What if we use puppet-strings to parse it?

This sounds like the correct approach long term, but it also seems like a bigger re-work is required to get that going correctly?

ekohl

This sounds like the correct approach long term, but it also seems like a bigger re-work is required to get that going correctly?

Probably. The patches were written quite some time ago, but never really finished. It did give a massive speed up in building the installer so perhaps it's the excuse I need to properly wrap it up.

But short term this at least allows us to proceed.

ekohl · 2023-11-10T15:12:53Z

lib/kafo/data_type_parser.rb

+
+        line = line.split(' #').first.strip
+        if line =~ TYPE_DEFINITION
+          lines << last_line


What's the idea behind the last_line here? When is that ever relevant for a type definition?

Well, it's a buffer where I store all the lines in one string to drop the newlines.

I renamed the variable now, to make it more obvious

ekohl · 2023-11-10T17:19:16Z

Let's see what theforeman/foreman-installer@66f3669 does in CI.

evgeni · 2023-11-10T18:04:53Z

it didn't explode

pr-processor bot added the Not yet reviewed label Nov 10, 2023

evgeni mentioned this pull request Nov 10, 2023

Update to puppetlabs-stdlib 9.x theforeman/foreman-installer#898

Merged

evgeni force-pushed the i36912 branch from f161339 to 25676dc Compare November 10, 2023 10:12

evgeni commented Nov 10, 2023

evgeni force-pushed the i36912 branch from 25676dc to da8fa3a Compare November 10, 2023 10:32

ekohl reviewed Nov 10, 2023

evgeni force-pushed the i36912 branch from da8fa3a to cb16dbc Compare November 10, 2023 10:42

evgeni force-pushed the i36912 branch 2 times, most recently from 4928690 to a90cc79 Compare November 10, 2023 11:13

ekohl requested changes Nov 10, 2023

pr-processor bot added Waiting on contributor and removed Not yet reviewed labels Nov 10, 2023

evgeni force-pushed the i36912 branch from a90cc79 to d9c00a2 Compare November 10, 2023 15:27

pr-processor bot added Needs re-review and removed Waiting on contributor labels Nov 10, 2023

ekohl approved these changes Nov 10, 2023

pr-processor bot removed the Needs re-review label Nov 10, 2023

ekohl merged commit 1bd5fe5 into master Nov 10, 2023
3 checks passed

evgeni deleted the i36912 branch November 10, 2023 19:22
