LuaLS / lua-language-server

A language server that offers Lua language support - programmed in Lua
https://luals.github.io
MIT License
3.21k stars 296 forks source link

Function overloads with function args not narrowing inner function params #2758

Closed CodeGreen0386 closed 3 weeks ago

CodeGreen0386 commented 1 month ago

How are you using the lua-language-server?

Visual Studio Code Extension (sumneko.lua)

Which OS are you using?

Windows

What is the issue affecting?

Annotations, Type Checking, Hover

Expected Behaviour

LuaLS correctly narrows a function overload based on the arguments of the function and the inner function params are also narrowed correctly.

Actual Behaviour

LuaLS either defaults to the base outer function definition, or narrows enough to get the hover over the inner function params correct, but type checking the params fail to narrow fields of said params.

Reproduction steps

I have no clue how to reproduce this, it just happens whenever it feels like it after a while and does not persist after a refresh. However, it still happens quite often and it is getting mildly annoying. Sometimes it happens after modifying the file a little bit, it just seems random.

Additional Notes

The Factorio modding community heavily uses automatically generated LuaLS annotations from the machine readable API documentation for Factorio. One function in particular, script.on_event(), seems to have endlessly caused trouble with LuaLS over the years and recently we have switched to using overloads, since that fit the purpose of the function better. Several of us have been encountering an issue with this method specifically, and nobody has been able to reproduce it consistently.

Here's a link to the API docs for the aforementioned function: https://lua-api.factorio.com/latest/classes/LuaBootstrap.html#on_event

The first parameter, event, further referred to as the event ID, is either an enum value or a string, or an array of enum values or strings. For the purposes of this bug report, we can ignore the array, because it is irrelevant to the issue.

The second parameter, handler, is a function that takes one argument, also called event, which contains all of the information relevant to the event indicated by the enum. Each event ID has it's own overload for script.on_event, and each overload's event parameter is a separate class, all of which extend from a base class, EventData.

The third parameter, filters, is irrelevant to the bug.

Sometimes, for no apparent reason whatsoever, LuaLS fails to narrow the handler event parameter based on the event ID. The most commonly experienced bug is when someone tries to use event.player_index. A significant number of events have a field for player_index, an integer, but not all of them do, so it is not a part of the base EventData class. This means that if LuaLS ever fails to narrow an overload for one of these functions, event.player_index is a very common target for undefined-field warnings.

Here's a screenshot of the bug in action: image

And here's a github gist for all of the related typedefs for script.on_event: https://gist.github.com/CodeGreen0386/3f78b82dbeb7eadf07e66c983e6de637

Log File

No response

justarandomgeek commented 1 month ago

The third parameter, filters, is irrelevant to the bug.

well, it kind of is but it's mostly a repeat of the same thing as the second: it has narrower types when properly matched, or doesn't exist at all on some overloads.

I'd also note that our defines.events enum (like all of our enums) is a little odd specifically to make it an opaque enum - the actual numeric values are not supposed to be visible anywhere, so they're masked away to just their types here for the overload resolution to see. I suspect this is involved somehow, but i'm not sure why it would cause any trouble.

tomlau10 commented 1 month ago

I think I've been able to reproduce this issue more reliably.

Setup

First create a workspace with the following files:

defines = {}

script = { ---@param event (defines.events)|(string)|(((defines.events)|(string))[]) ---@param handler (fun(event:EventData))|(nil) ---@overload fun(event:string, handler:fun(event:EventData.CustomInputEvent)) ---@overload fun(event:defines.events.on_ai_command_completed, handler:fun(event:EventData.on_ai_command_completed)) ---@overload fun(event:defines.events.on_area_cloned, handler:fun(event:EventData.on_area_cloned)) ---@overload fun(event:integer, handler:fun(event:EventData)) on_event = function(event, handler) end }

---@enum defines.events defines.events={ on_ai_command_completed=#{} --[[@as defines.events.on_ai_command_completed]], on_area_cloned=#{} --[[@as defines.events.on_area_cloned]], }

---@class EventData ---@field name defines.events ---@field tick integer ---@field mod_name? string

---@class EventData.CustomInputEvent:EventData ---@field CustomInputEvent true

---@class EventData.on_ai_command_completed:EventData ---@field on_ai_command_completed true

---@class EventData.on_area_cloned:EventData ---@field on_area_cloned true

- `test.lua`
yes, the event name is missing a `d`, but this just sets up the reproduction environment 🙂 
```lua
local e = defines.events
script.on_event(e.on_ai_command_complete, function (event)
end)

Reproduction steps

Thoughts

tomlau10 commented 1 month ago

Some updates and findings on this issue:

But then, by just adding trailing a new line, the type narrow will stop working ...

By adding or removing trailing new lines on the above reproduction file that I provided, this issue randomly triggers 😕 Each time I add/remove newline, it may correct itself, or trigger the bug.

As far as I know, the inferred view of a source object is based on its node's view. And in order to change / add a view to a node, vm.setNode(src, node) is called. So I added some debugging print log / debug.traceback() there when the source object is matched.


Some interesting findings:


Each time the file is edited, the file will be re-parsed, and the diagnostics will be rechecked. And the above suggests that if a node recompilation is triggered by diagnostic, the final inferred view will be incorrect 😳 .


This is of course not a proper fix, but at worst may be a way to "mitigate" the effect of this bug. 🤔

tomlau10 commented 1 month ago

Good news 🎉 I have found the root cause of this issue. The logic doesn't clear node caches of call.args after doing the function type narrowing, thus their views are not recomputed based on the type narrowed function.

It's the call order of vm.compileNode() which matters. I guess in the problematic call order, the arg nodes are keeping the outdated cache of parent which has all overload signatures.

To fix this, the logic has to clear call.args node caches after setting the type narrowed node in matchCall(): https://github.com/LuaLS/lua-language-server/blob/f221eb304e914736f88412b97ed3e1406754a907/script/vm/compiler.lua#L573-L577 insert codes below at line 577 =>

        if call.args then
            -- clear node caches of args to allow recomputation with the type narrowed call
            for _, arg in ipairs(call.args) do
                vm.removeNode(arg)
            end
        end

I have tested this against the reproduction case above, and it solves the problem 🎉 Both the random bug triggered when adding / removing trailing newlines, as well as just after completed the typing e.on_ai_command_completed. I will soon open a PR for this 😄

Meanwhile can you guys test it out to confirm if it can solve the issue? Say by just modifying the source code of your installed extension locally. @CodeGreen0386 @justarandomgeek