ecologylab / BigSemanticsJavaScript

Ecologylab project ecologylabSemanticsJavaScript from svn import.
2 stars 1 forks source link

MICE is no longer Displaying Images and Article Bodies Collection #17

Closed andru1d closed 10 years ago

andru1d commented 10 years ago

test case:

http://ecologylab.net/research/bigsemantics/MICE/index.html?url=http://well.blogs.nytimes.com/2014/07/30/running-just-5-minutes-a-day-has-long-lasting-benefits

adunofaiur commented 10 years ago

@quyin "http://localhost:8080/BigSemanticsService/mmd.jsonp?callback=MetadataLoader.setMetaMetadata&url=http%3A%2F%2Fwww.nytimes.com%2Fglogin%3FURI%3Dhttp%3A%2F%2Fwell.blogs.nytimes.com%2F2014%2F07%2F30%2Frunning-just-5-minutes-a-day-has-long-lasting-benefits%2F%26OQ%3D_phpQ3DtrueQ26_typeQ3DblogsQ26_rQ3D0%26OP%3D5b9972a6Q2FQ20Q7D10Q20gu0Q20777Q20M11yQ3AwQ7Dboj1T1&withurl"

This seems to be a problem with the service. The service correctly returns metadata for the page, but the service loads a different pages for the meta-metadata. Mice, correctly, does not display any metadata it can't find mmd for.

quyin commented 10 years ago

here is an example that still fails:

http://ecologylab.net/research/bigsemantics/MICE/index.html?url=http://www.ikea.com/us/en/catalog/products/70245792/

adunofaiur commented 10 years ago

I've got good news and bad news.

Good news is that the image composite in the ikea wrapper is now fixed (it didn't have a type).

Bad news is that the main_images problem for ikea appears to be a service issue. The xpath works fine (even with javascript disabled), and the raw_metadata from the service does not contain any location fields for MICE to download the image from.

andru1d commented 10 years ago

i see a field labeled image in the metadata:

"image": { "mm_name": "image", "location": " http://www.ikea.com/us/en/images/products/morvik-wardrobe__0180624_PE332691_S4.JPG ", "site_name": "IKEA US/EN", "download_status": "UNPROCESSED", "description": "IKEA - MORVIK, Wardrobe, black-brown/mirror glass , Sliding doors allow more room for furniture because they donᅢᄁ€ï¿¢ï¾„ï¾¢t take any space to open.A mirrored door saves space, no need to take up room on the wall or floor with a separate mirror.Adjustable shelves make it easy to customize the space according to your needs.If you want to organize inside, you can complement with organizing accessories from the SKUBB and GARNITYR series." }

{ "ikea_product": { "mm_name": "ikea_product", "location": " http://www.ikea.com/us/en/catalog/products/70245792/", "site_name": "IKEA US/EN", "download_status": "DOWNLOAD_DONE", "price": "$259.00", "title": "MORVIK \n \n \n Wardrobe, black-brown, mirror glass", "description": "Sliding doors allow more room for furniture because they don’t take any space to open. \n \n Read more \n \n \n Read more", "service_log_record": { "document_url": "http://www.ikea.com/us/en/catalog/products/70245792/", "in_mem_document_cache_hit": "true", "requester_ip": "128.194.128.19", "request_url": "http://www.ikea.com/us/en/catalog/products/70245792/", "begin_time": "Tue Aug 05 12:28:15 CDT 2014", "ms_total": 120 }, "main_images": [ { "mm_name": "image", "download_status": "UNPROCESSED" } ], "image": { "mm_name": "image", "location": " http://www.ikea.com/us/en/images/products/morvik-wardrobe__0180624_PE332691_S4.JPG ", "site_name": "IKEA US/EN", "download_status": "UNPROCESSED", "description": "IKEA - MORVIK, Wardrobe, black-brown/mirror glass , Sliding doors allow more room for furniture because they donᅢᄁ€ï¿¢ï¾„ï¾¢t take any space to open.A mirrored door saves space, no need to take up room on the wall or floor with a separate mirror.Adjustable shelves make it easy to customize the space according to your needs.If you want to organize inside, you can complement with organizing accessories from the SKUBB and GARNITYR series." } }

On Tue, Aug 5, 2014 at 12:20 PM, adunofaiur notifications@github.com wrote:

I've got good news and bad news.

Good news is that the image composite in the ikea wrapper is now fixed (it didn't have a type).

Bad news is that the main_images problem for ikea appears to be a service issue. The xpath works fine (even with javascript disabled), and the raw_metadata from the service does not contain any location fields for MICE to download the image from.

— Reply to this email directly or view it on GitHub https://github.com/ecologylab/BigSemanticsJavaScript/issues/17#issuecomment-51229833 .

andruid kerne, ph.d. director, interface ecology lab associate professor, department of computer science and engineering texas a&m university 979.862.3684 fax college station, tx 77843-3112 http://ecologylab.net

http://facebook.com/ecologylab

Interfaces are the multidimensional border zones through which the interdependent relationships of people, activities, codes, components, and systems are constituted. Interface ecology investigates the dynamic interactions of media, cultures, and disciplines that flow through interfaces.

adunofaiur commented 10 years ago

That is correct. IKEA has its own image composite in addition to the main_images collection. I just now pushed a fix to the wrapper that will allow the image composite to render.

For the main_images collection, however, it seems to only have a mm_name and DOWNLOAD_STATUS field, which isn't enough for MICE to display an image. The main_image xpath (found in primitives.xml) works if evaluated in a browser, which leads me to believe that the issue does not lie in the Wrapper or in mice.

andru1d commented 10 years ago

i think i see. are you saying that there is an inheritance problem, that the main_image xpath from primitives.xml would work, but is not being applied correctly?

if so, what is the meta-metadata we are seeing for main_images? how does it compare to what is specified?

andruid

On Tue, Aug 5, 2014 at 2:19 PM, adunofaiur notifications@github.com wrote:

That is correct. IKEA has its own image composite in addition to the main_images collection. I just now pushed a fix to the wrapper that will allow the image composite to render.

For the main_images collection, however, it seems to only have a mm_name and DOWNLOAD_STATUS field, which isn't enough for MICE to display an image. The main_image xpath (found in primitives.xml) works if evaluated in a browser, which leads me to believe that the issue does not lie in the Wrapper or in mice.

— Reply to this email directly or view it on GitHub https://github.com/ecologylab/BigSemanticsJavaScript/issues/17#issuecomment-51245850 .

andruid kerne, ph.d. director, interface ecology lab associate professor, department of computer science and engineering texas a&m university 979.862.3684 fax college station, tx 77843-3112 http://ecologylab.net

http://facebook.com/ecologylab

Interfaces are the multidimensional border zones through which the interdependent relationships of people, activities, codes, components, and systems are constituted. Interface ecology investigates the dynamic interactions of media, cultures, and disciplines that flow through interfaces.

adunofaiur commented 10 years ago

The mmd seems correct (it's a collection that hold composites which can hold scalar links). I talked to Yin, and he says he'll look at the service soon.

quyin commented 10 years ago

it seems that xpaths are not getting inherited properly.

this is also related to the problem of displaying "type" instead of "title", which is also an inheritance problem.

I'll work on it.

andru1d commented 10 years ago

that's great that we have use cases to help identify the problem.

On Tue, Aug 5, 2014 at 5:21 PM, Yin Qu (屈垠) notifications@github.com wrote:

it seems that xpaths are not getting inherited properly.

this is also related to the problem of displaying "type" instead of "title", which is also an inheritance problem.

I'll work on it.

— Reply to this email directly or view it on GitHub https://github.com/ecologylab/BigSemanticsJavaScript/issues/17#issuecomment-51268743 .

andruid kerne, ph.d. director, interface ecology lab associate professor, department of computer science and engineering texas a&m university 979.862.3684 fax college station, tx 77843-3112 http://ecologylab.net

http://facebook.com/ecologylab

Interfaces are the multidimensional border zones through which the interdependent relationships of people, activities, codes, components, and systems are constituted. Interface ecology investigates the dynamic interactions of media, cultures, and disciplines that flow through interfaces.

andru1d commented 10 years ago

As a small note, @adunofaiur, I was a bit confused when you called it a "service" issue, since the problem is in the BigSemanticsCore project, specifically in the inheritance code of the meta-metadata type system. The problem happens at initialization time, before any requests are made to the service.

This, i realize, is a detail. If you didn't have any sense of these details, that's fine. As a teacher, part of my job is to encourage you to think about and articulate the structure.

cheers,

@andru1d

On Tue, Aug 5, 2014 at 12:20 PM, adunofaiur notifications@github.com wrote:

I've got good news and bad news.

Good news is that the image composite in the ikea wrapper is now fixed (it didn't have a type).

Bad news is that the main_images problem for ikea appears to be a service issue. The xpath works fine (even with javascript disabled), and the raw_metadata from the service does not contain any location fields for MICE to download the image from.

— Reply to this email directly or view it on GitHub https://github.com/ecologylab/BigSemanticsJavaScript/issues/17#issuecomment-51229833 .

andruid kerne, ph.d. director, interface ecology lab associate professor, department of computer science and engineering texas a&m university 979.862.3684 fax college station, tx 77843-3112 http://ecologylab.net

http://facebook.com/ecologylab

Interfaces are the multidimensional border zones through which the interdependent relationships of people, activities, codes, components, and systems are constituted. Interface ecology investigates the dynamic interactions of media, cultures, and disciplines that flow through interfaces.