iipc / openwayback

The OpenWayback Development
http://www.netpreserve.org/openwayback
Apache License 2.0
483 stars 274 forks source link

Problem with openwayback #396

Open jafamo opened 5 years ago

jafamo commented 5 years ago

Hello, I am using

I have a problem to show a website. I attached an image: When I try to search for this domain, I do not get anything, This is the URL imagen

This is the query result: imagen

but if I put the domain with a directory: http://girona.cat/testing I see these two links.

openwayback

Here my output from catalina.out Mar 22, 2019 1:41:10 PM org.archive.wayback.webapp.AccessPoint logNotInArchive INFO: NotInArchive standardaccesspoint http://girona.cat

I configured my wayback.xml and my CDXCollection.xml but I don't know what happens. I open the warc and I think it's OK I checked CDX file and checked permissions in my WARC dir.

What can I do ?

Thank you, Jafamo.

MohammedElsayyed commented 5 years ago

It sounds like OWB index is working properly. Can you please try to write

http://girona.cat/

in the search text field and click "Take Me Back" button?

Please give it a try and let us know how it turns out.

jafamo commented 5 years ago

Hi @MohammedElsayyed, When I search http://girona.cat/ and clicked button I have this response: FRONTEND

Resource Not In Archive 
The Resource you requested is not in this archive.

LOGS in Catalia.log

Mar 25, 2019 8:29:06 AM org.archive.wayback.webapp.AccessPoint logNotInArchive
INFO: NotInArchive  standardaccesspoint http://girona.cat/
MohammedElsayyed commented 5 years ago

Can we move discussing this issue on openwayback-dev group

https://groups.google.com/forum/#!forum/openwayback-dev

?

If it is a configuration-related issue, then it is better to carry on handling it there. Otherwise, someone is going to fix.

Just create an issue and upload your wayback.xml and CDXCollection.xml.

jafamo commented 5 years ago

Ok, thank you, I am waiting response to start a new issue.

regards, Jafamo

MohammedElsayyed commented 5 years ago

It is OK to use https://pastebin.com/ to share wayback.xml and CDXCollection.xml here for troubleshooting.

anjackson commented 5 years ago

I note these links appear to redirect to http://www2.girona.cat/ca (i.e. with a www2. instead of a www.) -- is it possible that didn't fall into the scope of the crawl?