AnantLabs / bzreader

Automatically exported from code.google.com/p/bzreader
0 stars 0 forks source link

Unhandled Exception #14

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1. Open enwiki-latest-pages-articles.xml
2. Search for an article
3. Exception will state that the index is not available
What is the expected output? What do you see instead?
I tried simplewiki-latest-pages-articles.xml to verify whether there is any
issue with my settings and configuration.Everything was perfectly
indexed.But with enwiki-latest-pages-articles.xml indexing is failing.

What version of the product are you using? On what operating system?
BzReader.v1.0.11.Windows XP

Please provide any additional information below.
Indexing is failing for current wikipedia dump
enwiki-latest-pages-articles.xml(downloaded on 1st July,2009)

Please find the attachment containing exception description.

Original issue reported on code.google.com by santhosh...@gmail.com on 8 Jul 2009 at 5:23

Attachments:

GoogleCodeExporter commented 9 years ago
Hi there,

I have finished re-indexing the enwiki-20090618-pages-articles.xml.bz2 for the 
third
time now and it all seems to work properly. Could you please verify the MD5 sum 
of
your dump file? Maybe it was damaged during download. Also, could you please 
provide
the hardware specs of your PC? Maybe the bug is somehow related to the amount of
memory available on your machine.

Regards.

Original comment by halle...@gmail.com on 12 Jul 2009 at 10:20

GoogleCodeExporter commented 9 years ago
Hi,

Dump file was download properly.I verified using wikiTaxi.
My hardware specs ,AMD 1.67GHz, RAM : 480MB and HDD : 40GB.

Original comment by santhosh...@gmail.com on 12 Jul 2009 at 4:15

GoogleCodeExporter commented 9 years ago
Right... Maybe you could post here the full exception text? Just copy&paste the 
text
from the exception window. That might give me a clue.

Vlad

Original comment by halle...@gmail.com on 12 Jul 2009 at 9:06

GoogleCodeExporter commented 9 years ago
Hi Vlad,

I downloaded the latest dump file successfully.I will try with this dump and 
let you
know the results.

Original comment by santhosh...@gmail.com on 13 Jul 2009 at 12:34

GoogleCodeExporter commented 9 years ago
Details on the exception is as follows.

See the end of this message for details on invoking 
just-in-time (JIT) debugging instead of this dialog box.

************** Exception Text **************
System.Exception: The index does not exist for D:\My Documents
Backup\Downloads\enwiki-latest-pages-articles.xml.bz2
   at BzReader.Indexer.Search(String term, IEnumerable`1 indexers, Int32 maxResults)
   at BzReader.BrowseForm.LaunchSearch(Boolean interactive)
   at BzReader.BrowseForm.searchBox_PreviewKeyDown(Object sender,
PreviewKeyDownEventArgs e)
   at System.Windows.Forms.Control.OnPreviewKeyDown(PreviewKeyDownEventArgs e)
   at System.Windows.Forms.Control.PreProcessControlMessageInternal(Control target,
Message& msg)
   at System.Windows.Forms.Application.ThreadContext.PreTranslateMessage(MSG& msg)

************** Loaded Assemblies **************
mscorlib
    Assembly Version: 2.0.0.0
    Win32 Version: 2.0.50727.42 (RTM.050727-4200)
    CodeBase: file:///C:/WINNT/Microsoft.NET/Framework/v2.0.50727/mscorlib.dll
----------------------------------------
BzReader
    Assembly Version: 1.0.11.0
    Win32 Version: 1.0.11.0
    CodeBase:
file:///C:/Documents%20and%20Settings/jois/Local%20Settings/Application%20Data/A
pps/BzReader/BzReader.exe
----------------------------------------
System.Windows.Forms
    Assembly Version: 2.0.0.0
    Win32 Version: 2.0.50727.42 (RTM.050727-4200)
    CodeBase:
file:///C:/WINNT/assembly/GAC_MSIL/System.Windows.Forms/2.0.0.0__b77a5c561934e08
9/System.Windows.Forms.dll
----------------------------------------
System
    Assembly Version: 2.0.0.0
    Win32 Version: 2.0.50727.42 (RTM.050727-4200)
    CodeBase:
file:///C:/WINNT/assembly/GAC_MSIL/System/2.0.0.0__b77a5c561934e089/System.dll
----------------------------------------
System.Drawing
    Assembly Version: 2.0.0.0
    Win32 Version: 2.0.50727.42 (RTM.050727-4200)
    CodeBase:
file:///C:/WINNT/assembly/GAC_MSIL/System.Drawing/2.0.0.0__b03f5f7f11d50a3a/Syst
em.Drawing.dll
----------------------------------------
Accessibility
    Assembly Version: 2.0.0.0
    Win32 Version: 2.0.50727.42 (RTM.050727-4200)
    CodeBase:
file:///C:/WINNT/assembly/GAC_MSIL/Accessibility/2.0.0.0__b03f5f7f11d50a3a/Acces
sibility.dll
----------------------------------------
System.Configuration
    Assembly Version: 2.0.0.0
    Win32 Version: 2.0.50727.42 (RTM.050727-4200)
    CodeBase:
file:///C:/WINNT/assembly/GAC_MSIL/System.Configuration/2.0.0.0__b03f5f7f11d50a3
a/System.Configuration.dll
----------------------------------------
System.Xml
    Assembly Version: 2.0.0.0
    Win32 Version: 2.0.50727.42 (RTM.050727-4200)
    CodeBase:
file:///C:/WINNT/assembly/GAC_MSIL/System.Xml/2.0.0.0__b77a5c561934e089/System.X
ml.dll
----------------------------------------
vpyx1ao1
    Assembly Version: 2.0.0.0
    Win32 Version: 2.0.50727.42 (RTM.050727-4200)
    CodeBase:
file:///C:/WINNT/assembly/GAC_MSIL/System/2.0.0.0__b77a5c561934e089/System.dll
----------------------------------------
Lucene.Net
    Assembly Version: 1.9.0.5
    Win32 Version: 1.9.0.5
    CodeBase:
file:///C:/Documents%20and%20Settings/jois/Local%20Settings/Application%20Data/A
pps/BzReader/Lucene.Net.DLL
----------------------------------------
Snowball.Net
    Assembly Version: 1.4.3.1
    Win32 Version: 1.4.3.1
    CodeBase:
file:///C:/Documents%20and%20Settings/jois/Local%20Settings/Application%20Data/A
pps/BzReader/Snowball.Net.DLL
----------------------------------------

************** JIT Debugging **************
To enable just-in-time (JIT) debugging, the .config file for this
application or computer (machine.config) must have the
jitDebugging value set in the system.windows.forms section.
The application must also be compiled with debugging
enabled.

For example:

<configuration>
    <system.windows.forms jitDebugging="true" />
</configuration>

When JIT debugging is enabled, any unhandled exception
will be sent to the JIT debugger registered on the computer
rather than be handled by this dialog box.

Original comment by santhosh...@gmail.com on 13 Jul 2009 at 1:06

Attachments:

GoogleCodeExporter commented 9 years ago
Does that happen while indexing the dump?

Original comment by halle...@gmail.com on 13 Jul 2009 at 1:39

GoogleCodeExporter commented 9 years ago
After indexing(Indexing fails),when we search for a string we will get this 
exception.

Original comment by santhosh...@gmail.com on 13 Jul 2009 at 1:47

GoogleCodeExporter commented 9 years ago
And could please copy&paste the error message during the indexing? I can only 
see the
end of the message on the screenshot you have provided :-/

Original comment by halle...@gmail.com on 13 Jul 2009 at 1:56

GoogleCodeExporter commented 9 years ago
Hi Vlad,

It is difficult paste that information because it's a run time output and 
vanishes
suddenly during indexing the dump.I was lucky when hitting PrtSc.

Original comment by santhosh...@gmail.com on 13 Jul 2009 at 2:01

GoogleCodeExporter commented 9 years ago
Ah, OK. I will prepare a debug version of BzReader that will keep the indexing 
window
open until you specifically close it. I will post an update once that's done.

Thanks for your help :-)

Original comment by halle...@gmail.com on 13 Jul 2009 at 2:09

GoogleCodeExporter commented 9 years ago
Hi again,

I've prepared a debug build of BzReader which should leave the progress window 
open
on failure. Please rename the file to .exe and replace the original 
BzReader.exe in
your BzReader install location.

Vlad

Original comment by halle...@gmail.com on 13 Jul 2009 at 2:56

Attachments:

GoogleCodeExporter commented 9 years ago
Hi Vlad,

I have started indexing the new dump.Following message I got while indexing.

Locating the blocks
Indexing
System.OutOfMemoryException: Exception of type 'System.OutOfMemoryException' 
was thrown.
   at Lucene.Net.Store.RAMOutputStream.FlushBuffer(Byte[] src, Int32 len)
   at Lucene.Net.Store.BufferedIndexOutput.Flush()
   at Lucene.Net.Store.RAMOutputStream.Seek(Int64 pos)
   at Lucene.Net.Index.TermInfosWriter.Close()
   at Lucene.Net.Index.TermInfosWriter.Close()
   at Lucene.Net.Index.DocumentWriter.WritePostings(Posting[] postings, String segment)
   at Lucene.Net.Index.DocumentWriter.AddDocument(String segment, Document doc)
   at Lucene.Net.Index.IndexWriter.AddDocument(Document doc, Analyzer analyzer)
   at Lucene.Net.Index.IndexWriter.AddDocument(Document doc)
   at BzReader.Indexer.TokenizeAndAdd(Object state)
   at BzReader.Indexer.IndexString(String currentText, Int64 beginning, Int64 end,
Int32 charCarryOver, Boolean lastBlock)
   at BzReader.Indexer.CreateIndexAsync()

Original comment by santhosh...@gmail.com on 13 Jul 2009 at 4:32

GoogleCodeExporter commented 9 years ago
OK, so this really is a memory problem. I have prepared another debug build of
BzReader with a lower memory requirement. I would be grateful if you could test 
the
indexing with this build. 

Also, please advise on the time it takes to index the dump - I'm just curious 
:-)

Vlad

Original comment by halle...@gmail.com on 13 Jul 2009 at 4:45

Attachments:

GoogleCodeExporter commented 9 years ago
Hi Vlad,

I have started the indexing now.Well,hoping that it will work.

Original comment by santhosh...@gmail.com on 13 Jul 2009 at 5:07

GoogleCodeExporter commented 9 years ago
Hi Vlad,

Still getting the same error.

Original comment by santhosh...@gmail.com on 13 Jul 2009 at 5:38

GoogleCodeExporter commented 9 years ago
Tried once again.Getting the following message during indexing.
Locating the blocks
Indexing
System.SystemException: System.Reflection.TargetInvocationException: Exception 
has
been thrown by the target of an invocation. ---> System.OutOfMemoryException:
Exception of type 'System.OutOfMemoryException' was thrown.
   at SF.Snowball.Ext.EnglishStemmer.InitBlock()
   at SF.Snowball.Ext.EnglishStemmer..ctor()
   at System.RuntimeType.CreateInstanceImpl(Boolean publicOnly, Boolean
skipVisibilityChecks, Boolean fillCache)
   --- End of inner exception stack trace ---
   at System.RuntimeType.CreateInstanceImpl(Boolean publicOnly, Boolean
skipVisibilityChecks, Boolean fillCache)
   at System.Activator.CreateInstance(Type type, Boolean nonPublic)
   at Lucene.Net.Analysis.Snowball.SnowballFilter..ctor(TokenStream in_Renamed,
String name)
   at Lucene.Net.Analysis.Snowball.SnowballFilter..ctor(TokenStream in_Renamed,
String name)
   at Lucene.Net.Analysis.Snowball.SnowballAnalyzer.TokenStream(String fieldName,
TextReader reader)
   at Lucene.Net.Index.DocumentWriter.InvertDocument(Document doc)
   at Lucene.Net.Index.DocumentWriter.AddDocument(String segment, Document doc)
   at Lucene.Net.Index.IndexWriter.AddDocument(Document doc, Analyzer analyzer)
   at Lucene.Net.Index.IndexWriter.AddDocument(Document doc)
   at BzReader.Indexer.TokenizeAndAdd(Object state)
   at BzReader.Indexer.IndexString(String currentText, Int64 beginning, Int64 end,
Int32 charCarryOver, Boolean lastBlock)
   at BzReader.Indexer.CreateIndexAsync()

Original comment by santhosh...@gmail.com on 13 Jul 2009 at 6:13

GoogleCodeExporter commented 9 years ago
All right, I think that's enough of testing on the live system, I will continue 
on
the VmWare image. Thanks for reporting.

Original comment by halle...@gmail.com on 14 Jul 2009 at 10:46

GoogleCodeExporter commented 9 years ago
[deleted comment]
GoogleCodeExporter commented 9 years ago
I'm getting the same error.

Original comment by MaciejOs...@gmail.com on 18 Jan 2012 at 2:30