Open GoogleCodeExporter opened 9 years ago
Hi there,
I have finished re-indexing the enwiki-20090618-pages-articles.xml.bz2 for the
third time now and it all seems to work properly. Could you please verify the
MD5 sum of your dump file? Maybe it was damaged during download. Also, could you
please provide the hardware specs of your PC? Maybe the bug is somehow related
to the amount of memory available on your machine.
Regards.
Original comment by halle...@gmail.com
on 12 Jul 2009 at 10:20
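For anyone following along, the MD5 check suggested above can be done with a few lines of script. This is only a minimal sketch in Python and not part of BzReader; the function name `md5_of_file` and the 1 MB chunk size are my own choices for illustration. It reads the file in chunks, so it should also work on a machine with little RAM.

```python
import hashlib


def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the hex MD5 digest of a file, reading it in fixed-size chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        # Read chunk by chunk so a multi-gigabyte dump never has to fit in RAM.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

Calling `md5_of_file("enwiki-20090618-pages-articles.xml.bz2")` and comparing the result with the checksum published alongside the dump would show whether the download was damaged.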
Hi,
The dump file was downloaded properly. I verified it using WikiTaxi.
My hardware specs: AMD 1.67 GHz, RAM: 480 MB, HDD: 40 GB.
Original comment by santhosh...@gmail.com
on 12 Jul 2009 at 4:15
Right... Maybe you could post here the full exception text? Just copy&paste the
text from the exception window. That might give me a clue.
Vlad
Original comment by halle...@gmail.com
on 12 Jul 2009 at 9:06
Hi Vlad,
I downloaded the latest dump file successfully. I will try with this dump and
let you know the results.
Original comment by santhosh...@gmail.com
on 13 Jul 2009 at 12:34
Details of the exception are as follows.
See the end of this message for details on invoking
just-in-time (JIT) debugging instead of this dialog box.
************** Exception Text **************
System.Exception: The index does not exist for D:\My Documents Backup\Downloads\enwiki-latest-pages-articles.xml.bz2
at BzReader.Indexer.Search(String term, IEnumerable`1 indexers, Int32 maxResults)
at BzReader.BrowseForm.LaunchSearch(Boolean interactive)
at BzReader.BrowseForm.searchBox_PreviewKeyDown(Object sender, PreviewKeyDownEventArgs e)
at System.Windows.Forms.Control.OnPreviewKeyDown(PreviewKeyDownEventArgs e)
at System.Windows.Forms.Control.PreProcessControlMessageInternal(Control target, Message& msg)
at System.Windows.Forms.Application.ThreadContext.PreTranslateMessage(MSG& msg)
************** Loaded Assemblies **************
mscorlib
Assembly Version: 2.0.0.0
Win32 Version: 2.0.50727.42 (RTM.050727-4200)
CodeBase: file:///C:/WINNT/Microsoft.NET/Framework/v2.0.50727/mscorlib.dll
----------------------------------------
BzReader
Assembly Version: 1.0.11.0
Win32 Version: 1.0.11.0
CodeBase: file:///C:/Documents%20and%20Settings/jois/Local%20Settings/Application%20Data/Apps/BzReader/BzReader.exe
----------------------------------------
System.Windows.Forms
Assembly Version: 2.0.0.0
Win32 Version: 2.0.50727.42 (RTM.050727-4200)
CodeBase: file:///C:/WINNT/assembly/GAC_MSIL/System.Windows.Forms/2.0.0.0__b77a5c561934e089/System.Windows.Forms.dll
----------------------------------------
System
Assembly Version: 2.0.0.0
Win32 Version: 2.0.50727.42 (RTM.050727-4200)
CodeBase: file:///C:/WINNT/assembly/GAC_MSIL/System/2.0.0.0__b77a5c561934e089/System.dll
----------------------------------------
System.Drawing
Assembly Version: 2.0.0.0
Win32 Version: 2.0.50727.42 (RTM.050727-4200)
CodeBase: file:///C:/WINNT/assembly/GAC_MSIL/System.Drawing/2.0.0.0__b03f5f7f11d50a3a/System.Drawing.dll
----------------------------------------
Accessibility
Assembly Version: 2.0.0.0
Win32 Version: 2.0.50727.42 (RTM.050727-4200)
CodeBase: file:///C:/WINNT/assembly/GAC_MSIL/Accessibility/2.0.0.0__b03f5f7f11d50a3a/Accessibility.dll
----------------------------------------
System.Configuration
Assembly Version: 2.0.0.0
Win32 Version: 2.0.50727.42 (RTM.050727-4200)
CodeBase: file:///C:/WINNT/assembly/GAC_MSIL/System.Configuration/2.0.0.0__b03f5f7f11d50a3a/System.Configuration.dll
----------------------------------------
System.Xml
Assembly Version: 2.0.0.0
Win32 Version: 2.0.50727.42 (RTM.050727-4200)
CodeBase: file:///C:/WINNT/assembly/GAC_MSIL/System.Xml/2.0.0.0__b77a5c561934e089/System.Xml.dll
----------------------------------------
vpyx1ao1
Assembly Version: 2.0.0.0
Win32 Version: 2.0.50727.42 (RTM.050727-4200)
CodeBase: file:///C:/WINNT/assembly/GAC_MSIL/System/2.0.0.0__b77a5c561934e089/System.dll
----------------------------------------
Lucene.Net
Assembly Version: 1.9.0.5
Win32 Version: 1.9.0.5
CodeBase: file:///C:/Documents%20and%20Settings/jois/Local%20Settings/Application%20Data/Apps/BzReader/Lucene.Net.DLL
----------------------------------------
Snowball.Net
Assembly Version: 1.4.3.1
Win32 Version: 1.4.3.1
CodeBase: file:///C:/Documents%20and%20Settings/jois/Local%20Settings/Application%20Data/Apps/BzReader/Snowball.Net.DLL
----------------------------------------
************** JIT Debugging **************
To enable just-in-time (JIT) debugging, the .config file for this
application or computer (machine.config) must have the
jitDebugging value set in the system.windows.forms section.
The application must also be compiled with debugging
enabled.
For example:
<configuration>
<system.windows.forms jitDebugging="true" />
</configuration>
When JIT debugging is enabled, any unhandled exception
will be sent to the JIT debugger registered on the computer
rather than be handled by this dialog box.
Original comment by santhosh...@gmail.com
on 13 Jul 2009 at 1:06
Attachments:
Does that happen while indexing the dump?
Original comment by halle...@gmail.com
on 13 Jul 2009 at 1:39
No, it happens afterwards. Indexing fails, and when we then search for a string
we get this exception.
Original comment by santhosh...@gmail.com
on 13 Jul 2009 at 1:47
And could you please copy&paste the error message shown during the indexing? I
can only see the end of the message on the screenshot you have provided :-/
Original comment by halle...@gmail.com
on 13 Jul 2009 at 1:56
Hi Vlad,
It is difficult to paste that information because it is runtime output that
vanishes suddenly while the dump is being indexed. I was lucky to catch it with
PrtSc.
Original comment by santhosh...@gmail.com
on 13 Jul 2009 at 2:01
Ah, OK. I will prepare a debug version of BzReader that will keep the indexing
window open until you specifically close it. I will post an update once that's
done.
Thanks for your help :-)
Original comment by halle...@gmail.com
on 13 Jul 2009 at 2:09
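As a side note, another general way to stop an error message from vanishing with its window is to write the full exception text to a log file. The sketch below is in Python and is not how BzReader works; `run_and_log`, `task` and `log_path` are hypothetical names used only for illustration.

```python
import traceback


def run_and_log(task, log_path):
    """Run task(); on failure append the full traceback to log_path.

    Returns True if task() finished normally, False if it raised.
    """
    try:
        task()
        return True
    except Exception:
        # Persist the complete stack trace so it survives the window closing.
        with open(log_path, "a", encoding="utf-8") as log:
            log.write(traceback.format_exc())
            log.write("\n")
        return False
```

With something like this, the reporter could attach the log file instead of trying to catch the message with a screenshot.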
Hi again,
I've prepared a debug build of BzReader which should leave the progress window
open on failure. Please rename the file to .exe and replace the original
BzReader.exe in your BzReader install location.
Vlad
Original comment by halle...@gmail.com
on 13 Jul 2009 at 2:56
Attachments:
Hi Vlad,
I have started indexing the new dump. I got the following message while indexing.
Locating the blocks
Indexing
System.OutOfMemoryException: Exception of type 'System.OutOfMemoryException' was thrown.
at Lucene.Net.Store.RAMOutputStream.FlushBuffer(Byte[] src, Int32 len)
at Lucene.Net.Store.BufferedIndexOutput.Flush()
at Lucene.Net.Store.RAMOutputStream.Seek(Int64 pos)
at Lucene.Net.Index.TermInfosWriter.Close()
at Lucene.Net.Index.TermInfosWriter.Close()
at Lucene.Net.Index.DocumentWriter.WritePostings(Posting[] postings, String segment)
at Lucene.Net.Index.DocumentWriter.AddDocument(String segment, Document doc)
at Lucene.Net.Index.IndexWriter.AddDocument(Document doc, Analyzer analyzer)
at Lucene.Net.Index.IndexWriter.AddDocument(Document doc)
at BzReader.Indexer.TokenizeAndAdd(Object state)
at BzReader.Indexer.IndexString(String currentText, Int64 beginning, Int64 end, Int32 charCarryOver, Boolean lastBlock)
at BzReader.Indexer.CreateIndexAsync()
Original comment by santhosh...@gmail.com
on 13 Jul 2009 at 4:32
OK, so this really is a memory problem. I have prepared another debug build of
BzReader with a lower memory requirement. I would be grateful if you could test
the indexing with this build.
Also, please advise on the time it takes to index the dump - I'm just curious :-)
Vlad
Original comment by halle...@gmail.com
on 13 Jul 2009 at 4:45
Attachments:
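The trace above ends in RAMOutputStream, which suggests the failure happened while index data was being buffered in memory. One common way to lower an indexer's memory requirement, in general, is to flush buffered postings to disk more often. The toy sketch below illustrates that idea in Python. It is an assumption about the general technique, not the change made in the debug build and not Lucene.Net's API; `SegmentedIndexer` and `max_buffered_docs` are hypothetical names.

```python
import json
import os
from collections import defaultdict


class SegmentedIndexer:
    """Toy inverted index that spills to disk every max_buffered_docs documents."""

    def __init__(self, out_dir, max_buffered_docs=1000):
        self.out_dir = out_dir
        self.max_buffered_docs = max_buffered_docs
        self.postings = defaultdict(list)  # term -> list of document ids
        self.buffered_docs = 0
        self.segment_count = 0
        os.makedirs(out_dir, exist_ok=True)

    def add_document(self, doc_id, text):
        # Record each distinct term of the document once.
        for term in set(text.lower().split()):
            self.postings[term].append(doc_id)
        self.buffered_docs += 1
        # Bound memory use: write a segment once the buffer is full.
        if self.buffered_docs >= self.max_buffered_docs:
            self.flush()

    def flush(self):
        """Write buffered postings to a new segment file and empty the buffer."""
        if self.postings:
            name = "segment_%d.json" % self.segment_count
            with open(os.path.join(self.out_dir, name), "w", encoding="utf-8") as f:
                json.dump(self.postings, f)
            self.segment_count += 1
        self.postings.clear()
        self.buffered_docs = 0
```

A smaller `max_buffered_docs` trades indexing speed (more, smaller segment files to merge or search later) for a lower peak memory footprint. The caller is expected to call `flush()` once more after the last document.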
Hi Vlad,
I have started the indexing now. Well, hoping that it will work.
Original comment by santhosh...@gmail.com
on 13 Jul 2009 at 5:07
Hi Vlad,
Still getting the same error.
Original comment by santhosh...@gmail.com
on 13 Jul 2009 at 5:38
Tried once again. Getting the following message during indexing.
Locating the blocks
Indexing
System.SystemException: System.Reflection.TargetInvocationException: Exception has been thrown by the target of an invocation. ---> System.OutOfMemoryException: Exception of type 'System.OutOfMemoryException' was thrown.
at SF.Snowball.Ext.EnglishStemmer.InitBlock()
at SF.Snowball.Ext.EnglishStemmer..ctor()
at System.RuntimeType.CreateInstanceImpl(Boolean publicOnly, Boolean skipVisibilityChecks, Boolean fillCache)
--- End of inner exception stack trace ---
at System.RuntimeType.CreateInstanceImpl(Boolean publicOnly, Boolean skipVisibilityChecks, Boolean fillCache)
at System.Activator.CreateInstance(Type type, Boolean nonPublic)
at Lucene.Net.Analysis.Snowball.SnowballFilter..ctor(TokenStream in_Renamed, String name)
at Lucene.Net.Analysis.Snowball.SnowballFilter..ctor(TokenStream in_Renamed, String name)
at Lucene.Net.Analysis.Snowball.SnowballAnalyzer.TokenStream(String fieldName, TextReader reader)
at Lucene.Net.Index.DocumentWriter.InvertDocument(Document doc)
at Lucene.Net.Index.DocumentWriter.AddDocument(String segment, Document doc)
at Lucene.Net.Index.IndexWriter.AddDocument(Document doc, Analyzer analyzer)
at Lucene.Net.Index.IndexWriter.AddDocument(Document doc)
at BzReader.Indexer.TokenizeAndAdd(Object state)
at BzReader.Indexer.IndexString(String currentText, Int64 beginning, Int64 end, Int32 charCarryOver, Boolean lastBlock)
at BzReader.Indexer.CreateIndexAsync()
Original comment by santhosh...@gmail.com
on 13 Jul 2009 at 6:13
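In this second trace a new EnglishStemmer is constructed inside the SnowballFilter constructor, which is reached from SnowballAnalyzer.TokenStream while a document is being inverted, so a stemmer appears to be created for every token stream. The trace only shows where the allocation failed, not necessarily what used up the memory. Still, reusing one stemmer per worker thread is a general way to reduce allocation pressure. The Python sketch below illustrates that pattern under those assumptions; `EnglishStemmerStub`, `get_stemmer` and `analyze` are hypothetical stand-ins, not Snowball.Net or Lucene.Net code.

```python
import threading


class EnglishStemmerStub:
    """Stand-in for a stemmer that is expensive to construct (hypothetical)."""

    instances_created = 0  # demonstration counter, not thread-safe

    def __init__(self):
        EnglishStemmerStub.instances_created += 1

    def stem(self, word):
        # Deliberately naive: strip one trailing "s". Not a real Snowball stemmer.
        return word[:-1] if word.endswith("s") and len(word) > 1 else word


_local = threading.local()


def get_stemmer():
    """Return this thread's stemmer, creating it on first use only."""
    stemmer = getattr(_local, "stemmer", None)
    if stemmer is None:
        stemmer = EnglishStemmerStub()
        _local.stemmer = stemmer
    return stemmer


def analyze(text):
    """Lower-case, split on whitespace and stem each token with the cached stemmer."""
    stemmer = get_stemmer()
    return [stemmer.stem(token) for token in text.lower().split()]
```

Each thread ends up with one long-lived stemmer instead of one per document, which only helps if the stemmer keeps no state between words.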
All right, I think that's enough testing on the live system; I will continue on
the VMware image. Thanks for reporting.
Original comment by halle...@gmail.com
on 14 Jul 2009 at 10:46
[deleted comment]
I'm getting the same error.
Original comment by MaciejOs...@gmail.com
on 18 Jan 2012 at 2:30
Original issue reported on code.google.com by santhosh...@gmail.com
on 8 Jul 2009 at 5:23
Attachments: