PILLUTLAAVINASH / google-enterprise-connector-manager

Automatically exported from code.google.com/p/google-enterprise-connector-manager
0 stars 0 forks source link

site: operator doesn't work with connector feeds #58

Closed GoogleCodeExporter closed 8 years ago

GoogleCodeExporter commented 8 years ago
What steps will reproduce the problem?
1. Index content with a connector.
2. Search for results.
3. Search for results using a site: operator and the index content's display 
URL.

What is the expected output? What do you see instead?

I expected the same output for steps 2 and 3. But in step 3 I get no results. 
This also affects the 
"More results from <site>" links, which consistently return no results.

What version of the product are you using? On what operating system?

GSA 5.0.0.G.14, Connector Manager 1.0.2.

Please provide any additional information below.

 I have content indexed with a display URL that looks like

    http://swift.example.com:9510/Livelink95/livelink?...

If I search for

    steeleye

I get results. If I search for

    steeleye site:swift.example.com
or
    steeleye site:swift.example.com:9510
or
    steeleye site:swift.example.com:9510/Livelink95
or
    steeleye site:swift.example.com:9510/Livelink95/livelink

I get no results.

Since the internal URL for these documents is

    googleconnector://dostoevsky.localhost/doc?docid=...

I tried

    steeleye site:dostoevsky
and
    steeleye site:dostoevsky.localhost
and
    steeleye site:dostoevsky.localhost/doc
and
    steeleye site:googleconnector://dostoevsky.localhost

Given that inmeta searches require some characters to be URL-escaped, I tried 
URL-escaping the 
host and port:

    steeleye site:swift.example.com%3A9510

That doesn't work. I noticed, however, that a query like

    foo site:www.example.com:8080

works on the Google web site, so the URL escaping doesn't appear to be 
necessary.

Finally, I thought to look at the Crawl Diagnostics page, which I know is not 
very helpful for 
connector feeds, but it has a strange entry. It has one entry, for 
"internal_host". Clicking that link 
shows one crawled URL for "googleconnector:". Clicking that shows one document 
with a File/
Directory of "/'. Clicking that link is an error. But to undigress, I tried

    steeleye site:internal_host

and that returned no results, but 

    steeleye site:googleconnector:

worked. Adding one or two trailing slashes works, too. Adding any more of the 
internal URL (or 
extra slashes) fails, as we saw before.

Original issue reported on code.google.com by jl1615@gmail.com on 19 Oct 2007 at 2:32

GoogleCodeExporter commented 8 years ago
P2, or really P1.5. It's an existing feature that is broken.

Original comment by jl1615@gmail.com on 19 Oct 2007 at 5:34

GoogleCodeExporter commented 8 years ago

Original comment by donald.z...@gmail.com on 18 Apr 2008 at 10:23

GoogleCodeExporter commented 8 years ago
GSA bug 1092511

Original comment by mobe...@gmail.com on 18 Apr 2008 at 10:25

GoogleCodeExporter commented 8 years ago
Fixed in GSA 5.2.0.G.6

Original comment by mgron...@gmail.com on 18 Nov 2008 at 12:47

GoogleCodeExporter commented 8 years ago

Original comment by jl1615@gmail.com on 12 Jan 2009 at 3:28