Skip to content

Conversation

@Johan-Liebert1
Copy link
Collaborator

While working on integrating composefs-rs, received this error

ERROR Installing to disk: Creating composefs deployment: Failed to create image proxy: error

The actual error occurred deep into the containers-image-proxy crate and was a bit difficult to track down. After this patch below is what the error reporting looks like

ERROR Installing to disk: Creating composefs deployment: Failed to create image proxy: error
0: Error {
    context: "Failed to create image proxy",
    source: Other(
        "skopeo proxy unexpectedly exited during request method Initialize: exit status: 125\nError: please use unshare with rootless\n",
    ),
}
1: Other(
    "skopeo proxy unexpectedly exited during request method Initialize: exit status: 125\nError: please use unshare with rootless\n",
)

This is more for better dev-ex, so please let me know if this should be behind some sort of flag, like something similar to BOOTC_ERR_TRACE=1

@cgwalters
Copy link
Collaborator

Thanks for your patch! A minor procedural thing but this project tends to use "topic prefixes", so can you add e.g. cli: or so as a prefix? Also hopefully most patches are improvements, and the "imperative mood" is preferred so this could be condensed to just e.g.:

cli: Print error chain sources

Anyways on the actual code change. Yeah, today we use the "alternative printer" for anyhow which formats errors as a single line. I have personally felt in the past this is more readable than the default, especially as we use the #[context] attribute somewhat liberally.

In this specific case...I'm confused as to why we're only getting error at the end and not the full source. It maybe relates to our use of thiserror in the proxy crate?

This is more for better dev-ex, so please let me know if this should be behind some sort of flag, like something similar to BOOTC_ERR_TRACE=1

Perhaps when -v is used (ref #1154 ) we could also change how errors are printed to be more verbose.

In this specific case though it seems to me we should just fix the proxy code right?

Offhand I wonder if we should change https://github.com/containers/containers-image-proxy-rs/blob/832795839fb71ae9aba11dc1a8d86f029adb7f1a/src/imageproxy.rs#L35C1-L35C48 to use #[from] instead?

@cgwalters cgwalters added triaged This looks like a valid issue area/client Related to the client/CLI labels Apr 16, 2025
Copy link
Collaborator

@cgwalters cgwalters left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Forgot to mark as requested-changes per above

Currently only the top level error is printed which makes it hard to
pinpoint the origin of the error, this patch prints the error chain
similar to a backtrace.

Signed-off-by: Pragyan Poudyal <[email protected]>
@Johan-Liebert1
Copy link
Collaborator Author

Johan-Liebert1 commented Apr 17, 2025

Thanks for the review. Updated the commit message, I read the docs about commit message format but forgot them while actually committing.

In this specific case...I'm confused as to why we're only getting error at the end and not the full source

Seems like this is intentional behaviour on anyhow's end https://docs.rs/anyhow/1.0.28/anyhow/struct.Error.html#display-representations

The docs say {:?} will print a backtrace, but I guess that's only true in case of a panic?

Perhaps when -v is used

#1154 seems to be WIP as of now, but yes we could definitely have more verbose error reporting.

In this specific case though it seems to me we should just fix the proxy code right?

But, the fact still remains that anyhow will only print the error message on the top of the stack, so we will still end up missing out on printing the root cause.

// This code just captures any errors.
if let Err(e) = run() {
tracing::error!("{:#}", e);
e.chain().skip(1).enumerate().for_each(|(idx, error)| {
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'd find it helpful to understand this change with a concrete example. We could probably do this by:

  • Stop using tracing::error! for logging here (not sure it's helpful)
  • Move our error printing to a helper function that takes an impl Write
  • Add a unit test for it

or so?

There's...a lot of stuff in this space. For example https://lib.rs/crates/color-eyre

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

OK I typed this up just as a reference

diff --git i/Cargo.lock w/Cargo.lock
index a68cb12d..a177130e 100644
--- i/Cargo.lock
+++ w/Cargo.lock
@@ -179,6 +179,7 @@ dependencies = [
  "bootc-lib",
  "bootc-utils",
  "log",
+ "similar-asserts",
  "tokio",
  "tracing",
 ]
diff --git i/cli/Cargo.toml w/cli/Cargo.toml
index 374e89ab..d62dc966 100644
--- i/cli/Cargo.toml
+++ w/cli/Cargo.toml
@@ -21,5 +21,8 @@ tokio = { workspace = true, features = ["macros"] }
 log = "0.4.21"
 tracing = { workspace = true }
 
+[dev-dependencies]
+similar-asserts = { workspace = true }
+
 [lints]
 workspace = true
diff --git i/cli/src/main.rs w/cli/src/main.rs
index d9429aa3..48feea8f 100644
--- i/cli/src/main.rs
+++ w/cli/src/main.rs
@@ -1,5 +1,7 @@
 //! The main entrypoint for bootc, which just performs global initialization, and then
 //! calls out into the library.
+use std::io::Write;
+
 use anyhow::Result;
 
 /// The code called after we've done process global init and created
@@ -17,6 +19,9 @@ async fn async_main() -> Result<()> {
 /// Perform process global initialization, then create an async runtime
 /// and do the rest of the work there.
 fn run() -> Result<()> {
+    if std::env::var_os("FOO").is_some() {
+        return Err(anyhow::anyhow!("FOO is set").context("Checking env"));
+    }
     // Initialize global state before we've possibly created other threads, etc.
     bootc_lib::cli::global_init()?;
     // We only use the "current thread" runtime because we don't perform
@@ -31,12 +36,34 @@ fn run() -> Result<()> {
     runtime.block_on(async move { async_main().await })
 }
 
+fn print_error(mut w: impl Write, e: anyhow::Error) {
+    // Ignore recursive errors when writing errors
+    let _ = write!(w, "{e:#}\n");
+}
+
 fn main() {
     // In order to print the error in a custom format (with :#) our
     // main simply invokes a run() where all the work is done.
     // This code just captures any errors.
     if let Err(e) = run() {
-        tracing::error!("{:#}", e);
+        print_error(std::io::stderr(), e);
         std::process::exit(1);
     }
 }
+
+#[cfg(test)]
+mod tests {
+    use super::*;
+
+    #[test]
+    fn test_print_error() {
+        use std::io::Cursor;
+
+        let mut buf = Cursor::new(Vec::new());
+        let err = anyhow::anyhow!("test error").context("some context");
+        print_error(&mut buf, err);
+        let output = String::from_utf8(buf.into_inner()).unwrap();
+
+        similar_asserts::assert_eq!(output, "some context: test error\n");
+    }
+}

(missing color support, and honestly the more I look at this the cooler https://docs.rs/color-eyre/latest/color_eyre/ looks, especially that support for capturing tracing spans)

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I spent a few minutes looking deeper at eyre and ran into eyre-rs/eyre#31 which seems to currently make a partial conversion not viable because we lose the .context() that we heavily rely on and we'd have to port to use tracing spans I think?

Which would probably be an improvement in some cases but again a codebase-wide impact.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm sorry for leaving this hanging. Hopefully I'll find time to pick this up sometime later this week

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No worries at all! There's higher priority things going on.

I think my bottom line on this is I believe it's fixing a real bug, but what bothers me is basically as far as I understand it our way of printing errors is at least semi-standard and I am a bit confused why it's not doing the right thing here.

Or at least it seems to me we should be matching what other tools are doing.

What specifically just confuses me is why the error source isn't printed by default.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

area/client Related to the client/CLI triaged This looks like a valid issue

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants