Memoize macro for wrapping IO operation in async Task

nielsls commented 2 years ago

Hey - thanks for this excellent package! My code is littered with @memoize! I have a feature request/idea:

I have a lot of IO-operations and have thus made my code highly concurrent (littered with @async...). So to @memoize IO operations while making sure I only load stuff once, I frequently use the following pattern:

load_stuff(params) = fetch(_load_stuff(params))
@memoize _load_stuff(params) = @async expensive_load(params)

Hence, by memoizing the Task created by @async, we make sure stuff is only loaded once even when multiple concurrent tasks might like to load the same thing at the same time.

It would be nice if the above could be wrapped in a macro (e.g. @task_memoize ?). As a performance enhancement, the @task_memoize macro could potentially unwrap the result from its task once the task is done. Then the task can be GC'ed and only the result needs to be saved.

Thoughts/suggestions welcome - can't help thinking the above use-case is relatively generic/common. I might take a stab at it myself - although I have doubts my macro-manipulating skills are currently fit for the job.

cstjean commented 2 years ago

That sounds very nice! In fact, if the overhead isn't too large, maybe this should be the default, although it's good to start with a separate macro.

I might take a stab at it myself - although I have doubts my macro-manipulating skills are currently fit for the job.

Have at it! To be frank, it's unlikely to be implemented by anyone else.

nielsls commented 2 years ago

Well, the overhead would be minimal and amount to this:

return x isa Task ? fetch(x) : x  # replaces: return x

where x is the memoized value. Making this the default is interesting, although it would be breaking (as you then can't memoize a Task).

JuliaCollections / Memoize.jl

Memoize macro for wrapping IO operation in async Task #76