TASEmulators / BizHawk

BizHawk is a multi-system emulator written in C#. BizHawk provides nice features for casual gamers such as full screen, and joypad support in addition to full rerecording and debugging tools for all system cores.
http://tasvideos.org/BizHawk.html
Other
2.2k stars 384 forks source link

Firmware UX improvements, mainly re: status #3974

Open YoshiRulz opened 3 months ago

YoshiRulz commented 3 months ago
Morilli commented 3 months ago

The invariant of "exactly 1 Ideal option per record" is currently broken

Looking at e18a349, it feels like setting all to ideal makes sense, so maybe this invariant doesn't need to exist? In any case, unit tests could work nicely to enforce the first two points.

YoshiRulz commented 3 months ago

Yes, I forgot to mention these aren't covered by the existing unit tests, hence they've been neglected.

vadosnaprimer commented 3 months ago

So maybe just... have TASVideos judges read them over?

What do you mean?

In general regarding firmwares, I think we need different icons, because the current ones are too cryptic.

Bad could be a "thumbs down", acceptable a "thumbs up", unacceptable - a red cross, and ideal - a star from SMW.

Also I don't see the benefit of only allowing 1 to be ideal.

YoshiRulz commented 3 months ago

So maybe just... have TASVideos judges read them over?

What do you mean?

Asking people from the TASing community with extensive knowledge of a particular console, such as judges, to confirm that we've made the best choices. (Which I believe is how the FW database was populated in the first place.)

I think we need different icons, because the current ones are too cryptic. [...] a star from SMW.

I'm sympathetic to this, but we only "just" got them working properly in https://github.com/TASEmulators/BizHawk/commit/1026503d9241203987dc695a01e4907dcaaffb1f (2.9). If you do change them, please use only Public Domain images—see also my personal notes #36.

Also I don't see the benefit of only allowing 1 to be ideal.

It would help reduce the diversity of setups. (A boon for judges?) Unfortunately, these bugs prevent that.

vadosnaprimer commented 3 months ago

Asking people from the TASing community with extensive knowledge of a particular console, such as judges, to confirm that we've made the best choices. (Which I believe is how the FW database was populated in the first place.)

People who added the cores had the most knowledge about firmwares. I don't think judges were involved at any point in any way.

I'm sympathetic to this, but we only "just" got them working properly in https://github.com/TASEmulators/BizHawk/commit/1026503d9241203987dc695a01e4907dcaaffb1f (2.9).

"Properly" is a subjective assessment, and I don't remember anyone for whom the current icons were immediately perfectly clear. Also 2.9 was just last year, I don't think everyone accustomed to them once and for all.

It would help reduce the diversity of setups. (A boon for judges?)

Has this been discussed with them, to determine it's the problem they're having? Since hawk auto selects the needed firmware given the path, all judges look at is whether it's an acceptable BIOS for a given machine, which hawk indicates.

CasualPokePlayer commented 3 months ago

Ideal should generally be what's ideal to the core. The ideal choices for PSX for example would the v3.0 images, as that's what mednafen was designed with. This would have nothing involving TASVideos judges.

For the example SGB records, iirc I did that under the assumption that "all of them are equally ideal" and by logic assigning multiple ideal tags (although that was long ago so idk)

Morilli commented 2 months ago

Can we consider this resolved now or does anyone disagree with the changes made? Namely that ideal does NOT need to exist exactly once and that the firmware manager will now prefer ideal over acceptable firmware in case both are available?

vadosnaprimer commented 2 months ago

I guess so.

YoshiRulz commented 2 months ago

Leaving this open as a reminder to double-check firmwares' statuses, but the part I wanted to get resolved for 2.10 now has been, thank you.

vadosnaprimer commented 2 months ago

How do we double-check them?

YoshiRulz commented 2 months ago

In a semi-automated way, by comparing against others' lists.

In a manual way, by having someone review the firmware's variants/history and sort them into the same status "buckets" we use.

I think it's more important that we downgrade unusable options which are incorrectly marked as good than vice-versa.