googlegsa / manager.v3

Google Search Appliance Connector Manager
Apache License 2.0

AuthorizeDocIds receives incomplete docid when the docid contains the char & #214

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1. Send a non-public content feed with a docid that contains the character '&'.
2. Crawl the content.
3. Search for the content in the doc. authorizeDocId receives a docid 
that contains only the characters before the '&'.

What is the expected output? What do you see instead?
Expected the complete docid in authorizeDocIds. Instead it gets only a partial 
id, which cannot be used to authorize the document.

What version of the product are you using? On what operating system?
GSA 6.2 , Connector Manager 2.4.2

Please provide any additional information below.
Crawl diagnostics shows the proper docid, but when it is received in the 
authorizeDocIds method it loses the characters after '&' due to some 
parsing constraints.

Original issue reported on code.google.com by dharani....@valero.com on 24 Feb 2010 at 8:56

GoogleCodeExporter commented 9 years ago
Verified this as a problem. The docid is not URL-encoded when we put it into 
the googleconnector URL, so we mis-parse it when we extract the docid from 
the googleconnector URL.
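
A minimal sketch of the failure mode described above, not the Connector Manager's actual code. The class name DocidUrlSketch, its methods, and the URL layout (googleconnector://NAME.localhost/doc?docid=ID) are assumptions made for illustration. It shows how an unencoded '&' in the docid is read as a query-parameter separator, and how URL-encoding the docid before building the URL lets the parser recover the full value.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLDecoder;
import java.net.URLEncoder;

// Hypothetical helper, for illustration only.
final class DocidUrlSketch {
  // Assumed URL layout; the real format may differ.
  private static final String PREFIX = "googleconnector://";
  private static final String SUFFIX = ".localhost/doc?docid=";

  // Reproduces the bug: the docid is appended to the URL as-is.
  static String buildUnencoded(String connector, String docid) {
    return PREFIX + connector + SUFFIX + docid;
  }

  // Possible fix: URL-encode the docid so '&' becomes %26.
  static String buildEncoded(String connector, String docid) {
    try {
      return PREFIX + connector + SUFFIX + URLEncoder.encode(docid, "UTF-8");
    } catch (UnsupportedEncodingException e) {
      throw new IllegalStateException("UTF-8 is always supported", e);
    }
  }

  // Splits the query string on '&' and returns the decoded docid value,
  // or null if the URL has no docid parameter.
  static String extractDocid(String url) {
    int q = url.indexOf('?');
    if (q < 0) {
      return null;
    }
    for (String param : url.substring(q + 1).split("&")) {
      String[] pair = param.split("=", 2);
      if (pair.length == 2 && pair[0].equals("docid")) {
        try {
          return URLDecoder.decode(pair[1], "UTF-8");
        } catch (UnsupportedEncodingException e) {
          throw new IllegalStateException("UTF-8 is always supported", e);
        }
      }
    }
    return null;
  }

  public static void main(String[] args) {
    String docid = "report&2010";
    // Unencoded: the parser stops at the '&' and prints "report".
    System.out.println(extractDocid(buildUnencoded("connector1", docid)));
    // Encoded: the full docid survives and prints "report&2010".
    System.out.println(extractDocid(buildEncoded("connector1", docid)));
  }
}
```

The same round trip applies to any other reserved character in a docid, such as '=', '#', or '%', provided the encoding and decoding sides use the same charset.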

Original comment by jla...@google.com on 1 Mar 2010 at 11:31

GoogleCodeExporter commented 9 years ago

Original comment by jla...@google.com on 25 Jun 2011 at 1:45

GoogleCodeExporter commented 9 years ago
This issue is filed as Google issue #6514016

Original comment by tdnguyen@google.com on 18 May 2012 at 9:38