Benchmark leaderboard - Githubissues

hosseinfani commented 1 year ago

Building something like this: https://openbenchmark.github.io/BarsCTR/leaderboard/avazu_x1.html

rezaBarzgar commented 1 year ago

For the Benchmark Leaderboard, we aim to create a web page that displays the performance of each model based on the metrics provided in the test.pred.eval.mean.csv file. The task involves the following steps:

Crawling the Output directory to locate the results of each model.
Generating a graphical representation of the results in a clear and understandable manner. While there is no specific preferred visualization method, one idea is to create a plot where the models are displayed on the X-axis and the Y-axis represents a range from 0 to 1.
Incorporating a table that presents the results, with the best-performing outcomes highlighted.

I would greatly appreciate your ideas and suggestions, @hosseinfani, regarding this task. Additionally, as suggested by @MarcoKurepa, we could consider creating a separate repository specifically for this task, making it reusable for other projects as well.

MarcoKurepa commented 1 year ago

I'm done with the preliminary design, right now I am using manually inputted dummy values (no crawler yet, that's next). My current TODO:

Some basic responsiveness
Crawler
Working metrics buttons to switch between graphs
Highlight legend based on metric selected
Table highlights

Current Version Link

If there's anything you'd like me to add or change now, please do tell:)

MarcoKurepa commented 1 year ago

Alright, I've done everything except the table highlights ('ll probably just bold things in to highlight them). It may look a bit funky with only 2 models, but when we put all 11 in it'll look fine. To prepare it you first need t convert all the csv files to json files (I just used a website, I can make a script for this too) and then run 'extract_model_data.json'. From there you should just be able to open index.html with chrome.

Current Version

@rezaBarzgar You can review it now, suggest changes. If it is to your satisfaction, the only next steps are to get the real data in and the real logo.

rezaBarzgar commented 1 year ago

@MarcoKurepa I cloned the repo on my PC and executed it. However, I encountered a few issues that I believe are bugs in the code.

leaderboard

Specifically, there seems to be an error in reading the data. Furthermore, I noticed that there is a difference in the background color below the table. Currently, it appears to be white, whereas it should match the background color of the table and graph for a consistent visual experience.

MarcoKurepa commented 1 year ago

Hello Reza, this is not a bug in the code but rather my instructions on how to run it. Since local directory files are needed, the index.html cannot simply be opened in chrome using the 'file://' protocol. You'll need to host a local http server using Node.

Here is a stackoverflow thread detailing how to do so, I just tested it and it worked for me:)

Apologies for the inconvenience!

MarcoKurepa commented 1 year ago

Hello Reza, this is not a bug in the code but rather my instructions on how to run it. Since local directory files are needed, the index.html cannot simply be opened in chrome using the 'file://' protocol. You'll need to host a local http server using Node.

Here is a stackoverflow thread detailing how to do so, I just tested it and it worked for me:)

Apologies for the inconvenience!

Note: The whitespace at the bottom of the screen is due to the failure to load the data, without the data a graph cant be made.

rezaBarzgar commented 1 year ago

@MarcoKurepa Thank you for your additional explanation. Sorry about that. :)

rezaBarzgar commented 1 year ago

@MarcoKurepa, Would it be possible for you to add a README file to the repository? This README file would contain detailed instructions that anyone can follow to run the leaderboard page. Having clear and concise instructions would greatly facilitate the usage of the codebase and ensure a smooth experience for all users. Thank you!

MarcoKurepa commented 1 year ago

Will do!

MarcoKurepa commented 1 year ago

@rezaBarzgar I have updated the repository as specified.

https://github.com/MarcoKurepa/OpeNTF-Benchmark-Leaderboard/tree/main

rezaBarzgar commented 1 year ago

@MarcoKurepa great! Thanks.

rezaBarzgar commented 1 year ago

@MarcoKurepa Thanks for the update. We need the following changes:

Adding dataset: we need to fix this from the pipeline, adding domain name
Selecting the cutoffs based on selection of metric
convert the lines to bars (possibly overlays like this: https://github.com/fani-lab/learning_to_refine_query/issues/23)
there is a bug when hovering the mouse

MarcoKurepa commented 1 year ago

Changelog:

Switch from line to bar chart (adjacent bars)
made color uniform for all eval metrics
legend only shows relevant eval metrics (based on what the user selects)
link to github on opentf (with hover animation)
updated logo (added animation and link to lab webpage)
added model selection and ordering to side menu
fixed hover bug & made cutoffs dynamic
added eval metric name when you hover over the graph

TODO:

Crawling local files for data
Dataset selection tab

I will start to work in the TODO immediately, however I'd appreciate it if you'd go over my changes and make sure they're agreeable, or tell me if I'm missing something.

Repo Link

@rezaBarzgar @hosseinfani

MarcoKurepa commented 1 year ago

@hosseinfani Can you explain in a bit more detail how you'd like to restructure the output folder for the dynamic crawler?

hosseinfani commented 1 year ago

@MarcoKurepa I talked with @rezaBarzgar . You need to make a minor change in opentf driver code to add a subfolder based on the domain name to the output folder that user give. that is:

https://github.com/fani-lab/OpeNTF/blob/9a3c933c1d281205a5d0640046dea4d0d0fb6a7d/src/main.py#L159

to

output_path = f"{output}\{d_name}{os.path.split(datapath)[-1]}..."

MarcoKurepa commented 1 year ago

@rezaBarzgar Alright I made all the necessary changes, you can review them on my fork! I am updating the README.md right now to bring it up-to-date.

MarcoKurepa commented 1 year ago

@rezaBarzgar @hosseinfani Should we schedule another review of the benchmark leaderboard to ensure that it performs as intended? I'd like to wrap up this issue before school begins for me (next Tuesday) as after that I won't be available so often.

rezaBarzgar commented 1 year ago

@MarcoKurepa, I'll check it again. Hopefully, if it works fine, I'll merge it with the main branch alongside the dataset retrieval feature.

rezaBarzgar commented 1 year ago

@MarcoKurepa I reviewed the leaderboard. It works fine. I appreciate your help. @hosseinfani I think we can close this issue. We'll merge Marco's fork to the main branch when #191 is resolved.

hosseinfani commented 1 year ago

@MarcoKurepa thank you for developing the leaderboard. @rezaBarzgar thank you also.

fani-lab / OpeNTF

Benchmark leaderboard #192