Open bappun opened 2 years ago
@bappun I just saw you announce https://cdsp-scpo.github.io/dataverse-feed/build/ at https://dataversecommunity.slack.com/archives/C5V66TV6Y/p1638987754055400
I expected to see 127 results when checking the box for "Elections" under "Topic Classification Term" but I got 0 items (screenshots below). Is this because of this issue you're reporting?
@pdurbin Yes! I noticed the issue while working on this prototype. The online demo is using Dataverse v4.20 but I also tried with a pre-production instance using v5.8 and noticed the same problem on my implementation.
I would like to reproduce the same behavior as in the Dataverse UI where selecting the Elections
facet adds the Politics.Elections
filter:
FWIW: My guess is that it is just a bug that the filter shown above the results uses the base term. For the external vocab mechanism, both the facets and the filter are translated in the UI - I think as requested in review.
I'll also note that the issue here makes it hard to do a simple search for CVV as well, i.e., if the translation of Politics.Elections was 'Voting' (anything that didn't have the words 'politics' or 'elections' in it) simple search for the term visible on the page wouldn't get any results either. So, it isn't just an API issue.
After some discussion, it sounds like indexing the CVV values for all configured languages could be a reasonable way to solve this. (I think this can be done so the facets aren't affected but filtering for the base term or any translation would get a hit.) Unless there are concerns/somebody can see a problem with this approach, I'll look into it on Sciences PO's behalf.
it sounds like indexing the CVV values for all configured languages could be a reasonable way to solve this
Sure, I think that approach is worth exploring, at least.
What steps does it take to reproduce the issue? In a TSV file I have a controlled vocabulary with the value
Politics.Elections
forTopic Classification Term
. This value is then translated in two languages using the java properties:Elections
Élections
When I query the search API with facets enabled (
show_facet: true
), the label for this field is taken from the english translation. I get this:This becomes an issue when I try to search Dataverse using this facet. When I search
topicClassValue_ss:"Elections"
I get no results because the needed value for the search is the one not translated:topicClassValue_ss:"Politics.Elections"
. However, there is no way to get the needed value from the API.When does this issue occur? When querying the Search API with a translated facet where the english translation is different from the value in the TSV file.
Which page(s) does it occurs on? In the search API.
What happens? The query returns 0 elements.
To whom does it occur (all users, curators, superusers)? Tested only on public API (with no token), but it should occur for any user.
What did you expect to happen? Be able to use the translated value for facets or have the correct name intended to be used in the API as a new field.
Which version of Dataverse are you using? 5.8
Any related open or closed issues to this bug report?
8286