PILLUTLAAVINASH / google-enterprise-connector-manager

Automatically exported from code.google.com/p/google-enterprise-connector-manager
0 stars 0 forks source link

filetype: operator doesn't work with connector feeds #113

Closed GoogleCodeExporter closed 8 years ago

GoogleCodeExporter commented 8 years ago
What steps will reproduce the problem?
1. Index content with a connector.
2. Search for results.
3. Search for results adding the filetype: operator.

What is the expected output? What do you see instead?
I expected the same output for steps 2 and 3. But in step 3 I get no results.

What version of the product are you using? On what operating system?
GSA 5.0.0.G.22, Connector Manager 1.1.0.

Please provide any additional information below.

We have developed a connector using Google Connector Manager that uses
content feeds and everything is working properly (traversal and searches).
But we have detected a problem with filetype filtering in searches for
documents fed by a connector.

I include a feed sample of a ppt file:

<?xml version='1.0' encoding='UTF-8'?>
<!DOCTYPE gsafeed PUBLIC "-//Google//DTD GSA Feeds//EN"
"gsafeed.dtd"><gsafeed><header><datasource>ConectorPI</datasource>
<feedtype>incremental</feedtype>
</header>
<group>
<record url="googleconnector://ConectorPI.localhost/doc?docid=1de3445ds"
displayurl="http://host/documentos/presen2_no_public.ppt"
mimetype="application/vnd.ms-powerpoint" last-modified="mié, 12 nov 2008
09:23:13 GMT" authmethod="httpbasic">
<metadata>
<meta name="google:lastmodified" content="2008-11-12"/>
<meta name="google:mimetype" content="application/vnd.ms-powerpoint"/>
<meta name="google:displayurl"
content="http://host/documentos/presen2_no_public.ppt"/>
<meta name="TIPO_DOCUMENTO" content="ppt"/>
<meta name="google:ispublic" content="FALSE"/>

</metadata>
<content encoding="base64binary">PCFET0NUWVBFIGh0bWwgU
....
........
C9odG1sPg0K</content>
</record>
</group>
</gsafeed>

I send you some queries we made, and the corresponding results:

=> Query "StarOffice Tutorial" gets one result (the document I included above)
=> Query "StarOffice Tutorial inmeta:tipo_documento=ppt" gets the same
document as result.
=> Query "StarOffice Tutorial filetype:ppt" gets no results.

If we change the record url of the content feed, deleting the
"googleconnector://" prefix, filetype filtering works properly.

We were able to repeat the same tests with documents fed by supported
Documentum connector, and we got the same results: no documents when using
filetype filtering.

Original issue reported on code.google.com by scasalde...@gmail.com on 11 Dec 2008 at 12:02

GoogleCodeExporter commented 8 years ago

Original comment by mgron...@gmail.com on 28 Jan 2009 at 10:56

GoogleCodeExporter commented 8 years ago
This issue is a duplicate of Google bug #1540867.

Original comment by jl1615@gmail.com on 3 Feb 2009 at 10:15

GoogleCodeExporter commented 8 years ago
Is there any indication if Google bug #1540867 will require a fix on the GSA 
itself
or on the Connector Manager/Connector?

Original comment by Brad.Thu...@gmail.com on 29 Apr 2009 at 1:52

GoogleCodeExporter commented 8 years ago
This bug is still an issue in 5.2.0.G.32

Original comment by Brad.Thu...@gmail.com on 8 May 2009 at 4:05

GoogleCodeExporter commented 8 years ago
It's still an issue in GSA 6.0, also. There are multiple ideas floating around,
including issue 136, but in any case I suspect the fix will involve changes to 
both
the connector manager and the GSA.

Original comment by jl1615@gmail.com on 22 Oct 2009 at 4:42