DASISH / dwan-client-wiredmarker

DASISH Task 5.6. Client for annotation backend (Firefox)

A cached representation is incorrect #10

Open 5a2 opened 10 years ago

5a2 commented 10 years ago

7/4/2014 - I went to the web page www.keeleveeb.ee and chose ‘Õigekeelsussõnaraamat 2013’. I ran a query for the word ‘lõpp-punkt’ and annotated the word ‘lõpp-punkt’ on the query result page. [Screenshot: keeleveeb_lpp-punkt]

The query result with the annotation is saved on the server. After a while I tried to open the cached representation (Cached representation -> Open remote cache), and it is incorrect. [Screenshot: keeleveeb_lpp-punkt_cached representation]

I have noticed the same problem on the web page www.keeleveeb.ee before as well (for example on 7/3/2014 at www.keeleveeb.ee -> Morfoanalüsaator (Filosoft)).

• DASISH Web Annotator 2.0
• Windows 7
• Firefox 30.0

nostneji commented 10 years ago

I tried to investigate this problem, but I do not know if the following information is relevant or not.

1) I made a new annotation on www.keeleveeb.ee (URI=https://lux17.mpi.nl/ds/webannotator/api/annotations/564a3a26-46c2-45a8-8046-cb3f58258fc8).
2) The target's ref is https://lux17.mpi.nl/ds/webannotator/api/targets/80526bf4-dbb4-4bd4-986b-ce2c7b1ce21c and its link element is http://www.keeleveeb.ee/#xpointer(start-point(string-range(/html[1]/body[1]/table[1]/tbody[1]/tr[1]/td[1]/text()[1],'',0))/range-to(string-range(/html[1]/body[1]/table[1]/tbody[1]/tr[1]/td[1]/text()[1],'',4)))
3) When I asked to view the cached representation, the files from the local cache were used (see https://github.com/DASISH/dwan-testing/blob/master/files/dwan-client-wiredmarker-issue-10/DWAN_Local_Cache_20140710113435.ZIP).

Now I noticed that there is a difference between the #xpointer part of the target's link (/html[1]/body[1]/table[1]/tbody[1]/tr[1]/td[1]/text()) and the web page structure, which is something like (/html[1]/body[1]/div[1]/iframe[2]/html[1]/body[1]/table[1]/tbody[1]/tr[1]/td[1]/text()).
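
To make the mismatch easier to check, here is a rough Python sketch that pulls the XPath out of the #xpointer part of a target link and tests whether it only matches the tail of the full page path, which would suggest the path was recorded relative to a frame document. The function names are made up for illustration and are not part of the DWAN client; the page path is the approximate one quoted above.

```python
import re

# Target link as reported in this issue.
TARGET_LINK = (
    "http://www.keeleveeb.ee/#xpointer(start-point(string-range("
    "/html[1]/body[1]/table[1]/tbody[1]/tr[1]/td[1]/text()[1],'',0))"
    "/range-to(string-range("
    "/html[1]/body[1]/table[1]/tbody[1]/tr[1]/td[1]/text()[1],'',4)))"
)

# Approximate structure of the live page, as quoted in the comment above.
PAGE_PATH = (
    "/html[1]/body[1]/div[1]/iframe[2]"
    "/html[1]/body[1]/table[1]/tbody[1]/tr[1]/td[1]/text()"
)


def string_range_paths(link):
    """Return the XPath argument of every string-range(...) in the fragment."""
    fragment = link.partition("#")[2]
    return re.findall(r"string-range\(([^,]+),", fragment)


def strip_final_index(path):
    """Drop a trailing positional predicate, e.g. text()[1] -> text()."""
    return re.sub(r"\[\d+\]$", "", path)


def is_frame_relative(stored, page):
    """True when the stored path matches only the tail of the page path."""
    stored = strip_final_index(stored)
    return page != stored and page.endswith(stored)


paths = string_range_paths(TARGET_LINK)
for p in paths:
    print(p, "-> frame-relative:", is_frame_relative(p, PAGE_PATH))
```

If the check is true, the part of the page path in front of the stored path (here /html[1]/body[1]/div[1]/iframe[2]) points at the frame the annotation was actually made in.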

As I said, I don't know whether this helps or not.

borsna commented 10 years ago

Hmm, it seems that keeleveeb.ee uses frames and only the parent frame is stored in the cached representation. The remote cache holds only one document at the moment, so this could be hard to solve with the current way of posting/retrieving the cache.

A planned feature was to send the whole site: all HTML files in frames, CSS, JS and images. I added this as a feature to implement (#15).
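
As a rough illustration of what "the whole site" would involve, the Python sketch below walks one HTML document and lists the frame documents and the CSS, JS and image files it refers to. This is only an assumption about how such a collector could look, not the client's code; the class name, the sample markup and the example.org URLs are all hypothetical.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class ResourceCollector(HTMLParser):
    """Collect frame, stylesheet, script and image URLs from one document."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.frames = []  # sub-documents that would need their own cache entry
        self.assets = []  # CSS, JS and image files

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag in ("iframe", "frame") and attrs.get("src"):
            self.frames.append(urljoin(self.base_url, attrs["src"]))
        elif tag in ("script", "img") and attrs.get("src"):
            self.assets.append(urljoin(self.base_url, attrs["src"]))
        elif tag == "link" and attrs.get("href"):
            rel = (attrs.get("rel") or "").lower().split()
            if "stylesheet" in rel:
                self.assets.append(urljoin(self.base_url, attrs["href"]))


def collect(html, base_url):
    """Return (frames, assets) as absolute URLs, in document order."""
    collector = ResourceCollector(base_url)
    collector.feed(html)
    collector.close()
    return collector.frames, collector.assets


# Hypothetical parent page with two frames, similar in shape to the report.
SAMPLE = """<html><head><link rel="stylesheet" href="style.css">
<script src="/js/app.js"></script></head>
<body><div><iframe src="menu.html"></iframe>
<iframe src="result.html"></iframe></div>
<img src="logo.png"></body></html>"""

frames, assets = collect(SAMPLE, "http://example.org/dir/index.html")
print(frames)
print(assets)
```

Each URL in frames would have to be fetched and run through the same collector again, since a frame can itself contain frames and its own CSS, JS and images.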

5a2 commented 10 years ago

A problem with the cached representation occurs on the web page www.vallaste.ee as well. The cached representation (Cached representation -> Open remote cache) looks like this: [Screenshot: it-terminite sonastik_cached representation]