
mark lazy mode deprecated and add logger.lazy() for lazy format value #1233

Open
wants to merge 11 commits into master

Conversation

@trim21 (Contributor) commented Nov 12, 2024

No description provided.

@trim21 (Contributor, Author) commented Nov 12, 2024

CI failed at Codecov, not at tests/lint.

@Delgan (Owner) commented Nov 20, 2024

Hi @trim21, thanks for opening a PR. I understand it refers to #1207.

I've been thinking about this lately, and to be honest I'm not very keen on the idea of adding a new LazyValue class just to optimize lazy evaluation of message formatting. So far, the logger has been the only publicly importable component of Loguru. That would be a significant change to this idiom for what seems to be a relatively small inconvenience.

I would welcome a new API for lazily evaluated arguments if it also addresses #782. However, this is not necessarily easy.

Another thing to consider is PEP 750. It's still a draft and there's no guarantee that it will be accepted, but it could be interesting for our use case, without requiring a new API.

@trim21 (Contributor, Author) commented Nov 20, 2024

Maybe we can add an API logger.lazy(fn, *args, **kwargs)?

I don't think PEP 750 will help #782 or this one; even with PEP 750 accepted, values in the template string are still calculated before the logger function is called.

@trim21 (Contributor, Author) commented Nov 20, 2024

Also, we only have a sync API; I don't know of a way to handle #782.

@Delgan (Owner) commented Nov 20, 2024

value in template str is still calculated before the logger function is called.

Indeed, but it's possible to pass a lambda (see the "Approaches to Lazy Evaluation" section of the PEP).

We could imagine that when a template string is used, Loguru converts callable arguments, but only if the log level requires it. This would make the opt(lazy=True) argument obsolete, while making it straightforward to combine lazy and non-lazy arguments.

This PEP is actively discussed here.
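The conversion described above could look roughly like this (a minimal illustration with made-up `MIN_LEVEL` and `log` names, not Loguru's actual code):

```python
MIN_LEVEL = 20


def log(level, message, *args, **kwargs):
    """Format only if the level passes; any callable argument is invoked lazily."""
    if level < MIN_LEVEL:
        return None  # callables are never invoked
    args = [a() if callable(a) else a for a in args]
    kwargs = {k: v() if callable(v) else v for k, v in kwargs.items()}
    return message.format(*args, **kwargs)


assert log(10, "debug: {}", lambda: "expensive") is None
assert log(30, "warning: {}", lambda: "expensive") == "warning: expensive"
```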

maybe we can add a api logger.lazy(fn, *args, **kwargs)?

I would prefer such API, yes. But still, that's a new method that increases the complexity of Loguru's API, for a marginal gain in my opinion.

also we have only sync api, I don't know there is a method to handle #782

Yes, unfortunately I don't know how to integrate it either.

@trim21 (Contributor, Author) commented Nov 20, 2024

We could imagine that when a template string is used, then Loguru will convert callable arguments, but only if the log level requires it.

Personally, I think it is confusing and dangerous behaviour to call callables found in arguments unless explicitly requested with lazy or something similar.
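The concern can be illustrated with a toy example (made-up names): if callables were converted implicitly, logging a callback object as a value would silently invoke it:

```python
MIN_LEVEL = 20


def log_implicit(level, message, **kwargs):
    """Hypothetical implicit behaviour: every callable kwarg is invoked."""
    if level < MIN_LEVEL:
        return None
    kwargs = {k: v() if callable(v) else v for k, v in kwargs.items()}
    return message.format(**kwargs)


def on_error():
    return "side effect!"  # imagine this mutated state instead


# The user wanted to log *which* callback is registered, not call it,
# yet the callable is invoked behind their back:
assert log_implicit(30, "registered handler: {handler}", handler=on_error) \
    == "registered handler: side effect!"
```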

@trim21 changed the title from "mark lazy mode is deprecated and add LazyValue for lazy format value" to "mark lazy mode deprecated and add LazyValue for lazy format value" on Nov 25, 2024
@trim21 (Contributor, Author) commented Nov 25, 2024

What do you think about a new public API loguru.utils.LazyValue/Lazy/lazy instead of loguru.LazyValue?

Even with PEP 750 accepted, I think the API should be logger.debug(t'hello {Lazy(fn, ...)}') or logger.debug(t'hello {logger.lazy(fn, ...)}').

@trim21 changed the title from "mark lazy mode deprecated and add LazyValue for lazy format value" to "mark lazy mode deprecated and add logger.lazy() for lazy format value" on Nov 25, 2024
@Delgan (Owner) commented Nov 26, 2024

Thanks for the update.

Sorry, I should have caught this earlier, but I realized there are two main problems with such an API:

  • The expensive function will be called multiple times if the argument is formatted multiple times (e.g. logger.info("{foo} {foo}", foo=logger.lazy(func))).
  • The extra dict would be populated with a LazyValue instead of the underlying value (for structured logging).
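The first problem is easy to reproduce with a naive wrapper (hypothetical `NaiveLazy`, just to show the double call; a cache would fix it):

```python
class NaiveLazy:
    """Lazy wrapper WITHOUT caching: fn runs once per formatting occurrence."""

    def __init__(self, fn):
        self.fn = fn
        self.calls = 0

    def __format__(self, format_spec):
        self.calls += 1
        return format(self.fn(), format_spec)


lazy = NaiveLazy(lambda: "bar")
assert "{foo} {foo}".format(foo=lazy) == "bar bar"
assert lazy.calls == 2  # the expensive function ran twice
```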

Even if we managed to get around these problems, to be honest I'm still not convinced by such an API change.

We're trying to fix a mere inconvenience. But there are other patterns that are not possible today and would be more interesting to handle. Notably #782, as said, or even:

# There is no equivalent possible in Loguru.
if logger.isEnabledFor(logging.DEBUG):
    foo = expensive_func() 
    logger.info("Foo: %s", foo.summary)
    logger.debug("Foo details: %s", foo.details)

@trim21 (Contributor, Author) commented Nov 26, 2024

I agree we should have a method for isEnabledFor.

These problems are easy to fix: we can simply cache the result, and adjust the default argument of the JSON encoder we use.

{"text": "2024-11-27 03:32:58.508 | INFO     | __main__:<module>:13 - 1 1\n", "record": {"elapsed": {"repr": "0:00:00.002001", "seconds": 0.002001}, "exception": null, "extra": {"foo": "1"}, "file": {"name": "a.py", "path": "C:\\Users\\Trim21\\proj\\loguru\\a.py"}, "function": "<module>", "level": {"icon": "ℹ️", "name": "INFO", "no": 20}, "line": 13, "message": "1 1", "module": "a", "name": "__main__", "process": {"id": 7940, "name": "MainProcess"}, "thread": {"id": 27388, "name": "MainThread"}, "time": {"repr": "2024-11-27 03:32:58.508597+08:00", "timestamp": 1732649578.508597}}}
import functools


class LazyValue:
    def __init__(self, fn, *args, **kwargs):
        self.fn = fn
        self.args = args
        self.kwargs = kwargs

    def __format__(self, format_spec: str):
        return format(self.__result, format_spec)

    def __str__(self):
        return str(self.__result)

    def __repr__(self):
        return repr(self.__result)

    @functools.cached_property
    def __result(self):
        return self.fn(*self.args, **self.kwargs)

@Delgan (Owner) commented Dec 3, 2024

Yes, I think a caching system is essential.

I was thinking again about this proposition the other day, and I believe we might be able to make it compatible with #782. Specifically, if the function to wrap is a coroutine, we could likely use create_task() (as is already done for async sinks) to delay the logging call internally.

It’s possible that the object returned by logger.lazy() should be awaitable (similar to logger.complete()). I’m not certain yet, as I haven’t experimented with this approach.

Additionally, I recalled another improvement I’d like to implement: lazy arguments should not be evaluated if the filter of all handlers returns False. This seems important, given that there are use cases where the handler level is set to 0 and everything else is handled in a custom filter function.

With these two elements in mind, there’s plenty to consider regarding the final API, especially in terms of when and how to perform lazy evaluation. That said, I think it’s doable.

I know this doesn't directly address your initial concerns in #1207. However, as I mentioned, I'm trying to get the best out of this future new method. Of course, this isn't something that needs to be addressed in this PR (which could be merged as a first step). These are just ideas at this stage.

In any case, it seems feasible to include these changes in the next minor release.

@trim21 (Contributor, Author) commented Dec 3, 2024

I was thinking again about this proposition the other day, and I believe we might be able to make it compatible with #782. Specifically, if the function to wrap is a coroutine, we could likely use create_task() (as is already done for async sinks) to delay the logging call internally.

We can emit logs to an async sink with asyncio.create_task, but it's not really doable to use an async LazyValue with a sync sink, I think?

for a minimal case:

import asyncio


class LazyValue:
    def __init__(self, fn, *args, **kwargs):
        self.fn = fn
        self.args = args
        self.kwargs = kwargs


async def asink(s: str):  # async sink
    print(s)


def sink(s: str):  # sync sink
    print(s)


def log(s, *args):
    ...  # how to get the expected message here?


async def main():
    log("hello {}", LazyValue(asyncio.sleep, 1, "world"))


if __name__ == "__main__":
    asyncio.run(main())

@trim21 (Contributor, Author) commented Dec 3, 2024

Additionally, I recalled another improvement I’d like to implement: lazy arguments should not be evaluated if the filter of all handlers returns False. This seems important, given that there are use cases where the handler level is set to 0 and everything else is handled in a custom filter function.

This could be done with caching, maybe? We would not generate the log message in _log, but instead pass a class with a @functools.cached_property def message() -> str: ..., and handlers would get the log line from that cached property.

And if no handler tries to get the log line, the lazy value is never evaluated.

@Delgan (Owner) commented Dec 3, 2024

We can emit logs to an async sink with asyncio.create_task, but it's not really doable to use an async LazyValue with a sync sink, I think?

Here is roughly what I had in mind:

import asyncio
import time

MIN_LEVEL = 20

tasks = []


class Lazy:
    def __init__(self, fn):
        self.fn = fn


def _sink(message, *args, **kwargs):
    print(message.format(*args, **kwargs))


async def _async_log(message: str, lazy: Lazy):
    _sink(message, lazy=await lazy.fn())


def _sync_log(message: str, lazy: Lazy):
    _sink(message, lazy=lazy.fn())


def log(level: int, message: str, lazy: Lazy):
    if level < MIN_LEVEL:
        return

    if asyncio.iscoroutinefunction(lazy.fn):
        task = asyncio.get_event_loop().create_task(_async_log(message, lazy))
        tasks.append(task)
    else:
        _sync_log(message, lazy)


async def _async_expensive():
    await asyncio.sleep(1)
    return "<Expensive async result>"


def _sync_expensive():
    time.sleep(1)
    return "<Expensive sync result>"


async def async_main():
    lazy = Lazy(_async_expensive)
    log(10, "Result async 1: {lazy}", lazy=lazy)
    log(30, "Result async 2: {lazy}", lazy=lazy)
    await asyncio.gather(*tasks)


def sync_main():
    lazy = Lazy(_sync_expensive)
    log(10, "Result sync 1: {lazy}", lazy=lazy)
    log(30, "Result sync 2: {lazy}", lazy=lazy)


if __name__ == "__main__":
    start = time.time()
    sync_main()
    asyncio.run(async_main())
    print(f"Total time: {time.time() - start}")

I didn't implement the caching mechanism, but it's trivial.

@Delgan (Owner) commented Dec 3, 2024

This can be done with caching, maybe? we do not generate the log message in _log but pass a class with @functools.cached_property def message() -> str: ... and handlers get the log line from that cached property.

Yes, something along these lines. I was thinking maybe the Lazy could have a special __getattribute__ that lazily evaluates the attribute access and caches the result. But that's yet to be defined.
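One possible shape for that idea, using __getattr__ instead of __getattribute__ to keep the sketch simple (hypothetical, not a settled API):

```python
_UNSET = object()


class Lazy:
    """Proxy: fn runs on first use, the result is cached."""

    def __init__(self, fn):
        self._fn = fn
        self._value = _UNSET

    def _resolve(self):
        if self._value is _UNSET:
            self._value = self._fn()
        return self._value

    def __getattr__(self, name):
        # Fires only for attributes missing on the proxy itself,
        # so _fn, _value and _resolve are not intercepted.
        return getattr(self._resolve(), name)

    def __format__(self, format_spec):
        return format(self._resolve(), format_spec)


lazy = Lazy(lambda: 3 + 4j)
assert lazy.real == 3.0               # evaluation happens here, once
assert "{}".format(lazy) == "(3+4j)"  # reuses the cached value
```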

@trim21 (Contributor, Author) commented Dec 3, 2024

We can emit logs to an async sink with asyncio.create_task, but it's not really doable to use an async LazyValue with a sync sink, I think?

Here is roughly what I had in mind:

This doesn't look right to me...

For example, consider a logger whose only output is sys.stdout/sys.stderr, which is a sync sink, right?

An example like #782 won't be able to write its log message to it.

@trim21 (Contributor, Author) commented Dec 3, 2024

OK, I got it: we call the _async_log method if the fn of any Lazy is a coroutine function.

@trim21 (Contributor, Author) commented Dec 3, 2024

A more detailed implementation I have in mind:

import asyncio

MIN_LEVEL = 20

tasks = []


class Lazy:
    def __init__(self, fn):
        self.fn = fn

    async def as_async_result(self):
        if asyncio.iscoroutinefunction(self.fn):
            return await self.fn()
        return self.fn()


def _sink(message: str):
    print(message)


async def _async_log(message: str, *args, **kwargs):
    _sink(await __get_real_log_line(message, *args, **kwargs))


def _sync_log(message: str, *args, **kwargs):
    _sink(message.format(*args, **kwargs))


async def __get_real_log_line(message: str, *args, **kwargs) -> str:
    # lazy values should ideally be evaluated in parallel here
    args = [(await v.as_async_result()) if isinstance(v, Lazy) else v for v in args]
    kwargs = {
        k: ((await v.as_async_result()) if isinstance(v, Lazy) else v) for k, v in kwargs.items()
    }
    return message.format(*args, **kwargs)


def is_async_lazy(*args, **kwargs):
    return any(asyncio.iscoroutinefunction(v.fn) for v in args if isinstance(v, Lazy)) or any(
        asyncio.iscoroutinefunction(v.fn) for v in kwargs.values() if isinstance(v, Lazy)
    )


def log(level: int, message: str, *args, **kwargs):
    if level < MIN_LEVEL:
        return

    if is_async_lazy(*args, **kwargs):
        task = asyncio.get_event_loop().create_task(_async_log(message, *args, **kwargs))
        tasks.append(task)
    else:
        _sync_log(message, *args, **kwargs)

@trim21 (Contributor, Author) commented Dec 3, 2024

🤔 We will also need to maintain a background task group in the logger, in case the tasks get freed (garbage-collected) before completion.
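Holding strong references is indeed necessary, since the event loop keeps only weak references to tasks; the usual pattern (with a hypothetical helper name) looks like:

```python
import asyncio

_background_tasks = set()


def spawn_logging_task(coro):
    """Schedule coro and hold a strong reference until it completes."""
    task = asyncio.get_running_loop().create_task(coro)
    _background_tasks.add(task)
    # Drop the reference once done, so the set doesn't grow forever.
    task.add_done_callback(_background_tasks.discard)
    return task


async def main():
    results = []

    async def emit():
        results.append("logged")

    await spawn_logging_task(emit())
    assert results == ["logged"]


asyncio.run(main())
```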

@trim21 (Contributor, Author) commented Dec 3, 2024

So I think we can do these in separate PRs:

  1. mark lazy mode deprecated and add Lazy for sync fn (this one)
  2. add asyncio support.
  3. make lazy value evaluate after filter.

@Delgan (Owner) commented Dec 4, 2024

There are also other possible APIs that would not require testing *args and **kwargs, for example:

logger.lazy(val=expensive_func).debug("My message: {val}")

But yes, once the API is fully defined, taking into account the technical aspect, it can be implemented in several PRs.
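A toy sketch of that chained style, modeled on bind() (a made-up minimal logger, not Loguru internals):

```python
MIN_LEVEL = 20


class ToyLogger:
    def __init__(self, lazy_kwargs=None):
        self._lazy = lazy_kwargs or {}

    def lazy(self, **kwargs):
        # Like bind(), but values are zero-arg callables evaluated on emit.
        return ToyLogger({**self._lazy, **kwargs})

    def _log(self, level, message):
        if level < MIN_LEVEL:
            return None  # lazy callables are never invoked
        values = {k: fn() for k, fn in self._lazy.items()}
        return message.format(**values)

    def debug(self, message):
        return self._log(10, message)

    def warning(self, message):
        return self._log(30, message)


logger = ToyLogger()
assert logger.lazy(val=lambda: "expensive").debug("My message: {val}") is None
assert logger.lazy(val=lambda: "expensive").warning("My message: {val}") == "My message: expensive"
```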

@trim21 (Contributor, Author) commented Dec 6, 2024

Oh, I think this API looks better; maybe the best solution until PEP 750 is accepted.
