rempsyc / busara_dashboard

The Missing Majority in Behavioural Science Dashboard
https://remi-theriault.com/dashboards/missing_majority
1 stars 0 forks source link

Readiness for "soft launch" at SIPS 2024, June 10-12 #33

Closed psforscher closed 1 month ago

psforscher commented 5 months ago

I'd like us to be prepared for a "soft launch" at the annual conference for the Society for the Improvement of Psychological Science, which Busara is co-hosting this year in Nairobi on June 10-12.

To be prepared for a soft launch, I would like to accomplish the following:

psforscher commented 3 months ago

the improvements make a huge difference! just a couple comments on this version:

Screenshot 2024-05-20 at 14 39 01 Screenshot 2024-05-20 at 14 41 14
psforscher commented 3 months ago

here are some possibilities for how we can approach dissemination:

of these, what do you feel comfortable with?

rempsyc commented 3 months ago

just a couple comments on this version:

Good catch, thanks! I opened an issue in #47.

rempsyc commented 3 months ago

One issue I have also is that I fear behavioural science only represents a fraction of all Nature, Science, PNAS, and PloS One papers, so I don't know if we can legitimately use these data to make claims about behavioural science as a whole since because of their volume, these general journals overwhelm the rest of the data by a factor of 5, so the "All fields" page and gauges are misleading.

That said, the data provided by OpenAlex includes some topics tags, e.g.,

> (data %>% 
+   filter(id == "https://openalex.org/W2055287564") %>% 
+   pull(concepts))[[1]]
                                 id                               wikidata             display_name level     score
1   https://openalex.org/C122980154  https://www.wikidata.org/wiki/Q205555                  Feeling     2 0.6483988
2   https://openalex.org/C118563197  https://www.wikidata.org/wiki/Q185030                   Genius     2 0.6240414
3    https://openalex.org/C15744967    https://www.wikidata.org/wiki/Q9418               Psychology     0 0.5501474
4  https://openalex.org/C2780583480 https://www.wikidata.org/wiki/Q1366327        Tone (literature)     2 0.5363786
5   https://openalex.org/C146902061  https://www.wikidata.org/wiki/Q476590         Music psychology     3 0.5164297
6    https://openalex.org/C13553968   https://www.wikidata.org/wiki/Q27908          Music education     2 0.4173431
7   https://openalex.org/C107038049   https://www.wikidata.org/wiki/Q35986               Aesthetics     1 0.4106070
8   https://openalex.org/C153349607   https://www.wikidata.org/wiki/Q36649              Visual arts     1 0.3683655
9   https://openalex.org/C180747234   https://www.wikidata.org/wiki/Q23373     Cognitive psychology     1 0.3277530
10  https://openalex.org/C142362112     https://www.wikidata.org/wiki/Q735                      Art     0 0.2974586
11  https://openalex.org/C124952713    https://www.wikidata.org/wiki/Q8242               Literature     1 0.1627391
12   https://openalex.org/C77805123  https://www.wikidata.org/wiki/Q161272        Social psychology     1 0.1458697
13  https://openalex.org/C138496976  https://www.wikidata.org/wiki/Q175002 Developmental psychology     1 0.1236818

Perhaps it would be possible to filter all the general journal data for behavioural science-related tags? Again, I'm not sure if this is realistic in time for SIPS, but something to think about for the future... Or is it useful to have "all fields" as a reference to compare psych and econ to? But then that shouldn't represent the aggregate data from the home page where we discuss the global south?

rempsyc commented 3 months ago
  • you could potentially deliver a presentation on this at the online post-event for SIPS, June 19-20
  • i could potentially do a pop-up talk on it at the in-person version of SIPS (obviously there was not enough time to arrange to fly you out to Nairobi haha)
  • the dashboard could be advertised through Busara social media during SIPS, taking care to advertise it as an "alpha" product

Yeah I think in the end I am too busy to fly there in person and also I was thinking that is probably a lot of money for the benefits, which would probably be better spent locally or in more dashboard development :)

Because of my schedule, I'm afraid even an online presentation would not really work for me in the current timing :(

Would you be ok to present it with option 2? If you can't, then I could try to make it work for option 1 (online presentation), and I think option 3 can be done anyhow now, as I feel much more comfortable with the openAlex data :)

psforscher commented 3 months ago

One issue I have also is that I fear behavioural science only represents a fraction of all Nature, Science, PNAS, and PloS One papers, so I don't know if we can legitimately use these data to make claims about behavioural science as a whole since because of their volume, these general journals overwhelm the rest of the data by a factor of 5, so the "All fields" page and gauges are misleading.

good point. i think for now the landing page should just draw from psych and economics and the "general" page should make clear that it's not behavioral science, but rather some general-interest journals. after SIPS, maybe you can look at using the openalex codes to filter the non-behavioral-science.

Yeah I think in the end I am too busy to fly there in person and also I was thinking that is probably a lot of money for the benefits, which would probably be better spent locally or in more dashboard development :)

Because of my schedule, I'm afraid even an online presentation would not really work for me in the current timing :(

Would you be ok to present it with option 2? If you can't, then I could try to make it work for option 1 (online presentation), and I think option 3 can be done anyhow now, as I feel much more comfortable with the openAlex data :)

yes, i'm happy to do option 2 and to do a social media release. :)

rempsyc commented 3 months ago

i think for now the landing page should just draw from psych and economics and the "general" page should make clear that it's not behavioral science, but rather some general-interest journals.

If we decide not to go ahead with already using keywords as in #48, having the landing page just from psych and econ would bring up additional questions: Would the "All fields" be meaningful at all anymore? If they are meant to be compared, not be aggregated, especially since the subfields get "erased" by the differential ratio of papers, should I remove the combined panel with all the data?

Or should "All fields" be kept but only include economics and psychology? Lastly, behavioural science is less homogenous than its composite fields. Does it even make sense to combine economics and psychology together if they clearly belong to different populations? Aren't we hiding the real data patterns doing so?

psforscher commented 3 months ago

hmm now that i see the new header labels, i've changed my mind again -- lol!

i see now that the longer labels are making the header run off the screen awkwardly on my laptop, and in addition, "Behavioral Science Journals" to me seems to imply that the set of journals is different rather than the combination of Psychology and Economics.

Screenshot 2024-05-21 at 11 05 25

maybe we can change these again to "Economics + Psychology", "Psychology", "Economics", and "General Interest"? this way, they don't run off the page, and it's clearer what goes into what used to be called "Behavioral Science". people can get a sense of why we group together economics and psychology based on how we describe behavioral science on the landing page.

rempsyc commented 3 months ago

"Behavioral Science Journals" to me seems to imply that the set of journals is different rather than the combination of Psychology and Economics. ... maybe we can change these again to "Economics + Psychology"

Exacly what I was thinking ;)

Also, do we need the instructions page anymore if every page basically has the full instructions? 🤔

psforscher commented 3 months ago

ah, good point. do you feel that all the content on the instructions page is replicated elsewhere? if so, i say delete the instructions page

rempsyc commented 3 months ago

Okie. And are you OK with the instructions as they are (for those included with the figures and tables), or would you change / shorten them? Are some of these things too obvious / self-explanatory? I'm afraid that by putting too much, people are not going to read all and then miss the fact that they can use the vertical scrolling bar for example to see the rest of the countries for the scatter plots 🙃

psforscher commented 3 months ago

i like the instructions as-is. i suppose this is something we can get feedback on during the soft launch, though. for now, let's keep them and see how people react?

rempsyc commented 3 months ago

Seems like the flexdashboard navigation menu does not handle special characters like + or & well, so I have to use Economics and Psychology instead.

psforscher commented 3 months ago

the current instructions (were they expanded?) seem to be overflowing the right side-bar. maybe they can be revised as follows:

The scatter plots use the plotly package, which converts regular images into interactive web graphics. Here are some tips to make the most of plotly figures.

When you hover over a plotly figure, you will see a menu appear on the top-right of the figure. This is called the Chart Studio Modebar. Some of these options include: (a) Download Plot as a PNG; (b) Zoom and Pan Buttons; (c) Zoom In/Out; (d) Autoscale and Reset Axes; and (e) Hover Options.

Interacting With Data Points and Legend Groupings

  1. When hovering over individual data points, you can see the raw data
  2. When hovering over the regression line, you can see the predicted data
  3. Click, hold, and drag to zoom into a specific window of the chart. Double-click anywhere to come back to the original zoom level
  4. Double-clicking on a group or line will isolate it. Double-clicking again will discontinue the isolation
  5. Single-clicking on a group or line will remove it. Single-clicking again will bring it back
  6. When the legend is too long, use the scroll bar to see the rest of it
psforscher commented 3 months ago

here's a simplification for the table instructions:

All tables use the DT package, which converts regular dataframes into interactive HTML table widgets. Here are some tips to make the most of these tables.

  1. You can click on the top left to change the number of entries to show
  2. You can change page by clicking on the bottom right area
  3. You can sort by column by clicking on the column of your choice
  4. You can search for specific values (for example journals or countries) by using the search bar at the top-right
psforscher commented 3 months ago

one more minor one -- in the top bar, change "Economics and Psychology" to "Psychology and Economics" to match the order of the menus in the top-bar

rempsyc commented 3 months ago

the current instructions (were they expanded?)

Yes, I expended them a little bit trying to fit all the instructions on it to get rid of the instructions page. I will go with your suggestions for changing instructions but just for future reference, I can only specify the size of the panels in absolute units, so screen size unfortunately will always matter a lot for these things. For example, this is how I see it on my screen (so I thought it was ok):

image

But if I zoom to simulate a smaller screen, then I see what you mean:

image

I suppose the best is that it works on smaller screens first since on bigger screens it will also naturally fit (if only adding more white space).

rempsyc commented 3 months ago

maybe they can be revised as follows:

Thanks, those are great suggestions, I like it. Shorter is better! I've kept some of the old stuff though, I hope that's fine. Now though it will hopefully fit most screens.

psforscher commented 3 months ago

image

seems like there's a bug here

psforscher commented 3 months ago

seems to affect all country waffle plots

image

rempsyc commented 3 months ago

Damn! That's because of my recent change for keeping the top 5 and clustering all the rest as the "Other" category. Unfortunately I won't have time to fix it until I come back I'm afraid!

psforscher commented 3 months ago

ok! the other graphs work though! enjoy your vacation <3

rempsyc commented 3 months ago

Updated waffle from your previous screenshot:

image

I changed the previous rule of only showing countries in the legend that had a high enough percentage because that still resulting in too many countries in the legend (the colour palette only has 8 colours so that's kind of the hard limit for number of countries). The new rule then only takes the top 8 countries globally.

This is why for example if you look at the economics journals:

image

You have some waffles that are almost entirely empty (like ADR, AJARE, and AJEMS). This is because that journal has more countries outside than inside the overall top 8 countries. This can create bias for example because small journals are less likely to pull their weight on the selection of the top 8 since they report fewer articles, and so if the countries are more diverse, this is less likely to show.

This is kind of a visualization challenge and I'd be happy to get your thoughts on it.

rempsyc commented 3 months ago

Another possiblity to better report small percentages is to have more than 100 squares, so you can better represent fractions like for the Global South continents. Here's an example for what I mean: https://github.com/haghish/shapley/blob/master/man/figures/waffle.png

So e.g., each square could represent .20% instead of 1%. But I am not so sure about it, it is harder to explain when communicating to a general audience. But just wanted to share this option.

psforscher commented 3 months ago

not what to do about the color pallette issue, but an easy fix to having empty squares is to add another category called "Another country" and populate the waffle plots with that

rempsyc commented 3 months ago

Oh true, this is what we used to do!

psforscher commented 3 months ago

one small thing that i noticed -- the "About" sidebar on the Github page still refers to "The Neglected 95% Dashboard". small thing but i thought i'd mention it!

Screenshot 2024-05-30 at 17 04 14
rempsyc commented 3 months ago

the "About" sidebar

Corrected, thanks!

rempsyc commented 3 months ago

having empty squares is to add another category called "Another country"

I'm going to try the same principle for the scatter plot by country also

rempsyc commented 3 months ago

The strategy to add a category "Other" for other countries does not work with the waffle plots using country flags because no flag can be identified with country name "Other", so it is not plotted.

psforscher commented 3 months ago

ok. maybe that one can be unchanged, but for the other graphs, we go with the "other" category

rempsyc commented 3 months ago

Ok. A little update for each panel:

  1. Continent: OK
  2. Continent, by Year: OK
  3. Continent, by Journal: OK
  4. Country: OK. I changed it so that the graph matched the table values, using full country names and getting rid of the legend title, and having the other countries as NA so it appears empty (best workaround for our problem)
  5. Country, by Year: OK (added eigth country cumulative "Other")
  6. Country, by Journal: OK (added eigth country cumulative "Other")

Is there anything else you'd like me to work on before the presentation day?

psforscher commented 3 months ago

looks great. we have the dashboard scheduled in the Busara SIPS social media campaign on June 4. i'll also be doing my own posting about the dashboard on the same day on both LinkedIn and Bluesky. finally, i plan to do a pop-up session about a few CREME projects during SIPS, with a strong focus on the dashboard. finally, the Busara Voice and Impact team is excited to link to the dashboard via the Busara website. so i think we have some decent initial dissemination planned!

psforscher commented 3 months ago

did i mention already that i am SO STOKED about this dashboard??? lol. truly, it already surpasses my wildest imaginings for what this project could have become.

some time after initial dissemination, i'd like to send the dashboard to a select few journal editors to see how they react to it. journal editors were my original targets for change. i'm open to your feedback about how comfortable you are with this, but i think we have a product we can show to at least a few of these

psforscher commented 3 months ago

update on dissemination plans: the busara social media team is pretty overloaded during SIPS, so i've been advised that it might make more sense to do the big social media push in august rather than in june. given that information, what i might do is do some soft promotion of the dashboard (through a talk) during SIPS and delay the big push at dissemination until around august.

does that plan sound reasonable to you as well?

rempsyc commented 3 months ago

update on dissemination plans: the busara social media team is pretty overloaded during SIPS, so i've been advised that it might make more sense to do the big social media push in august rather than in june. given that information, what i might do is do some soft promotion of the dashboard (through a talk) during SIPS and delay the big push at dissemination until around august. does that plan sound reasonable to you as well?

Yes, sounds reasonable.

did i mention already that i am SO STOKED about this dashboard??? lol. truly, it already surpasses my wildest imaginings for what this project could have become.

Thank you for partnering with me to make this dream come true 💙🌈

some time after initial dissemination, i'd like to send the dashboard to a select few journal editors to see how they react to it. journal editors were my original targets for change. i'm open to your feedback about how comfortable you are with this, but i think we have a product we can show to at least a few of these

That's OK, I think we have an Alpha Version 1.0 that we can stand behind now that we rely on OpenAlex. Of course, there is still MUCH work to be done, e.g., showing numbers by capita, which I'm sure we will hear again and again. There are also still many comments from the original round of review which I haven't finished addressing yet...

rempsyc commented 1 month ago

Closing this issue now that SIPS 2024 is over