ukwa / ukwa-ui

A new user interface for the UK Web Archive
BSD 3-Clause "New" or "Revised" License
0 stars 6 forks source link

Open Access content not viewable #240

Open crarugal opened 5 years ago

crarugal commented 5 years ago

Unable to view what should be an Open Access site https://www.webarchive.org.uk/en/ukwa/collection/990

Screenshot 2019-06-12 11 34 05

Clicking the link to 3i using public interface on external pc: Screenshot 2019-06-12 11 34 12

https://www.webarchive.org.uk/act/targets/75812#licensing Screenshot 2019-06-12 11 34 29

anjackson commented 5 years ago

Hmm, weird. This should get carried through correctly. I'm looking at related issues at the moment, so will use this as a test case.

anjackson commented 5 years ago

Having poked around in the data, this seems to be an issue because the homepage appears under two Targets in W3ACT: https://www.webarchive.org.uk/act/targets/lookup?f=http%3A%2F%2Fwww.3i.com

https://www.webarchive.org.uk/act/targets/75812 https://www.webarchive.org.uk/act/targets/45145

anjackson commented 4 years ago

To be clear, this is a Target curation issue, as the whole system expects one Target per URL. Short term, removing the non-OA Target is the only solution.

crarugal commented 4 years ago

Seems like the duplication of targets with the same URL is due to the WCT migration

nicolabingham commented 4 years ago

I have deleted Target 45145 as this one did not have a permission for access. Will need to check whether the website becomes available in open access.

crarugal commented 3 years ago

Not sure if this is a quirk that I've forgotten, or a bug. It seems that Open Access hasn't come through(granted a few weeks ago), and there's no duplicate seed clashing in another target

image https://www.webarchive.org.uk/en/ukwa/search?text=www.timescapes.leeds.ac.uk&search_location=full_text&reset_filters=false&content_type=Web+Page

image https://www.webarchive.org.uk/en/ukwa/search?view_filter=va&content_type=Web+Page&from_date=&to_date=&modal_filter_domains_vals=&modal_filter_suffix_vals=&modal_filter_documenttypes_vals=&modal_filter_collections_vals=&filter_source=1&filter_array_x=&filter_array_x_item=&search_location=full_text&text=http%3A%2F%2Fwww.timescapes.leeds.ac.uk&view_sort=relevant&view_count=50

. . image

. .

http://www.timescapes.leeds.ac.uk/ https://www.webarchive.org.uk/act/targets/119544 image

crarugal commented 3 years ago

This is another issue where the results show that it's Open Access, but clicking through shows it not to be the case. It should be restricted access as the target has no licence

Domain: http://leics.police.uk/ https://www.webarchive.org.uk/en/ukwa/search?text=http%3A%2F%2Fleics.police.uk%2F&search_location=full_text&reset_filters=false&content_type=Web+Page image https://www.webarchive.org.uk/en/ukwa/wayback/OA/20130418183028/http://leics.police.uk/

https://www.webarchive.org.uk/act/targets/4684#crawlpolicy image

anjackson commented 3 years ago

The OA information in the Solr index is very out of date, or this could be a bug in that code. (maybe mistakely inferring OA from the record that covers uk,police,leics-pa).

I'm afraid we're struggling with getting around to updated the indexes.

HelenaByrne commented 1 year ago

@anjackson Just adding a recent example of the open access permission not automatically going through.

I created this record in the summer and the open access was signed a few weeks after: https://www.webarchive.org.uk/act/targets/163998 image

On the website the reading room message is under this record: https://www.webarchive.org.uk/en/ukwa/collection/4279

image

We didn't have any open access issues with the main website.

HelenaByrne commented 1 year ago

Hi @anjackson during the summer this target was Open Access on the website. However, now on the reading room it is Reading Room only. https://www.webarchive.org.uk/act/targets/160720

image

image

anjackson commented 1 year ago

Hi @HelenaByrne can you tell me the URL that your second screenshot came from? I can't find it.

Note that there are two separate issues here:

HelenaByrne commented 1 year ago

Hi Andy,

This is where the target sits on the UKWA website: https://www.webarchive.org.uk/en/ukwa/collection/4281

However, when I just went to retrieve the URL the target is now open access.

I put screenshots of the same page from this morning and now below.

Thanks,

Helena.

@. @. From: Andy Jackson @.> Sent: 11 January 2023 15:25 To: ukwa/ukwa-ui @.> Cc: Byrne, Helena @.>; Mention @.> Subject: Re: [ukwa/ukwa-ui] Open Access content not viewable (#240)

Hi @HelenaByrnehttps://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2FHelenaByrne&data=05%7C01%7C%7C910dd13cff04444db25708daf3e7fe9d%7C21a44cb7f9c34f009afabd1e8e88bcd9%7C0%7C0%7C638090474978746618%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=l%2FbO4MetnZbY35sllJmrbxwpGTsmv8SKvA%2FePeQBCHc%3D&reserved=0 can you tell me the URL that your second screenshot came from? I can't find it.

Note that there are two separate issues here:

- Reply to this email directly, view it on GitHubhttps://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fukwa%2Fukwa-ui%2Fissues%2F240%23issuecomment-1378941254&data=05%7C01%7C%7C910dd13cff04444db25708daf3e7fe9d%7C21a44cb7f9c34f009afabd1e8e88bcd9%7C0%7C0%7C638090474978746618%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=xFDNjZT3qS70fiIUmzKNDch%2BxQCdbtW9i1%2B2OmMPdtM%3D&reserved=0, or unsubscribehttps://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FAKOKLW2SFIAY4DNN6XZKS7DWR3GENANCNFSM4HXHUB3A&data=05%7C01%7C%7C910dd13cff04444db25708daf3e7fe9d%7C21a44cb7f9c34f009afabd1e8e88bcd9%7C0%7C0%7C638090474978746618%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=rJdDRQpqDxJUEMyBLkDnznWE9jeB5lg1sb%2BNnLebVh4%3D&reserved=0. You are receiving this because you were mentioned.Message ID: @.**@.>>


Experience the British Library online at www.bl.ukhttp://www.bl.uk/ The British Library's latest Annual Report and Accounts : www.bl.uk/aboutus/annrep/index.htmlhttp://www.bl.uk/aboutus/annrep/index.html Help the British Library conserve the world's knowledge. Adopt a Book. www.bl.uk/adoptabookhttp://www.bl.uk/adoptabook The Library's St Pancras site is WiFi - enabled


The information contained in this e-mail is confidential and may be legally privileged. It is intended for the addressee(s) only. If you are not the intended recipient, please delete this e-mail and notify the @.**@.> : The contents of this e-mail must not be disclosed or copied without the sender's consent. The statements and opinions expressed in this message are those of the author and do not necessarily reflect those of the British Library. The British Library does not take any responsibility for the views of the author.


Think before you print