nathan2718 / maui-indexer

Automatically exported from code.google.com/p/maui-indexer

Missing table during Model Building for wikipedia extraction #4

Open GoogleCodeExporter opened 8 years ago

GoogleCodeExporter commented 8 years ago
What steps will reproduce the problem?
1. Install Wikipedia Miner (English)
2. Comment out lines 272-280 of Examples.java (topic extraction)
3. Uncomment lines 262-270 of Examples.java (model build)
4. Execute maui.main.Examples indexing_with_wikipedia

Expected: a model is built.
Actual: exceptions are thrown (see below) and the model is not built. If I then launch an extraction, it does not work:
-- Reading instance
-- Converting instance for document 0018
Warning! This documents does not contain valid keyphrases
---- Extracting candidates... 
---- Disambiguating candidates...
54 candidates 
0 positive; 54 negative instances
-- Processing document: 0018
-- Keyphrases and feature values:

What version of the product are you using? On what operating system?
Maui 1.0 on Windows XP JDK 1.6

Here's the exception:

java.sql.SQLException: Table 'wiki_db_en.anchor_occurance_casefolder' doesn't exist
    at com.mysql.jdbc.MysqlIO.checkErrorPacket(MysqlIO.java:2975)
    at com.mysql.jdbc.MysqlIO.sendCommand(MysqlIO.java:1600)
    at com.mysql.jdbc.MysqlIO.sqlQueryDirect(MysqlIO.java:1695)
    at com.mysql.jdbc.Connection.execSQL(Connection.java:2998)
    at com.mysql.jdbc.Connection.execSQL(Connection.java:2927)
    at com.mysql.jdbc.Statement.executeQuery(Statement.java:956)
    at org.wikipedia.miner.model.Anchor.initializeFromDatabase(Anchor.java:105)
    at org.wikipedia.miner.model.Anchor.<init>(Anchor.java:69)
    at maui.filters.MauiFilter.getCandidates(MauiFilter.java:1556)
    at maui.filters.MauiFilter.selectCandidates(MauiFilter.java:660)
    at maui.filters.MauiFilter.batchFinished(MauiFilter.java:626)
    at maui.main.MauiModelBuilder.buildModel(MauiModelBuilder.java:785)
    at maui.main.Examples.testIndexingWithWikipedia(Examples.java:269)
    at maui.main.Examples.main(Examples.java:319)
Error adding ngram approach

The complete trace is attached.

Original issue reported on code.google.com by cinl...@gmail.com on 26 Aug 2009 at 7:59

GoogleCodeExporter commented 8 years ago
Sorry, I made a mistake: the version is Maui 1.1, not 1.0.

Original comment by cinl...@gmail.com on 26 Aug 2009 at 8:04

GoogleCodeExporter commented 8 years ago
I fixed the problem by replacing line 1557:
   anchor = new Anchor(form, textProcessor, wikipedia.getDatabase());
with:
   anchor = new Anchor(form, null, wikipedia.getDatabase());
I don't know whether that textProcessor argument was important or not...

Original comment by cinl...@gmail.com on 26 Aug 2009 at 8:58

GoogleCodeExporter commented 8 years ago
In WikipediaMiner, different kinds of tables with pre-processed article titles can be generated. When document phrases are mapped to Wikipedia articles, they can be case-folded, stemmed, etc.
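
To illustrate the idea, here is a minimal, self-contained sketch of case folding and of how a per-processor table name could be derived. It is not Wikipedia Miner's actual code: `PhraseProcessor`, `CASE_FOLDER` and `tableFor` are hypothetical names, and the naming rule (base table name, underscore, processor name) is an assumption inferred from the table names mentioned in this thread.

```java
import java.util.Locale;

public class CaseFoldingSketch {

    /** Hypothetical stand-in for a text processor; not Wikipedia Miner's own class. */
    public interface PhraseProcessor {
        String name();
        String process(String phrase);
    }

    /** Case folding: lower-case a phrase so "Data Mining" and "data mining" match. */
    public static final PhraseProcessor CASE_FOLDER = new PhraseProcessor() {
        @Override
        public String name() {
            return "CaseFolder";
        }

        @Override
        public String process(String phrase) {
            return phrase.toLowerCase(Locale.ROOT);
        }
    };

    /**
     * Assumed naming rule: base table name + "_" + processor name.
     * With no processor (null), the unprocessed base table is used.
     */
    public static String tableFor(String baseTable, PhraseProcessor processor) {
        return processor == null ? baseTable : baseTable + "_" + processor.name();
    }

    public static void main(String[] args) {
        System.out.println(CASE_FOLDER.process("Data Mining"));
        System.out.println(tableFor("anchor_occurance", CASE_FOLDER));
        System.out.println(tableFor("anchor_occurance", null));
    }
}
```

Under that assumption, a lookup with a case-folding processor needs its own pre-generated table, whereas passing null falls back to the base table, which would explain why the null workaround above avoids the exception.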

Original comment by medel...@gmail.com on 27 Aug 2009 at 8:52

GoogleCodeExporter commented 8 years ago
Hi,
What is the best table to use for keyword extraction with the newly uploaded MauiTopicExtractor model?

Thank you
Jason  

Original comment by charlesj...@gmail.com on 29 Sep 2010 at 6:35

GoogleCodeExporter commented 8 years ago
In Wikipedia-miner, while creating and populating the tables using the 
loadData() method in WikipediaDatabase.java, also uncomment the following line:
wikipedia.getDatabase().prepareForTextProcessor(new CaseFolder());

This will create the tables anchor_CaseFolder and anchor_occurance_CaseFolder 
and then Maui will build the model correctly.
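
Before rebuilding the model, it may help to confirm that both tables actually exist. The sketch below uses only standard JDBC metadata calls. The class and method names are hypothetical, and the comparison ignores case on the assumption that the server may store table names in lower case, as the error message above suggests.

```java
import java.sql.Connection;
import java.sql.DatabaseMetaData;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class AnchorTableCheck {

    /** The two tables named in this thread. */
    public static final List<String> REQUIRED =
            Arrays.asList("anchor_CaseFolder", "anchor_occurance_CaseFolder");

    /** Returns the required tables that are absent from the given list (case-insensitive). */
    public static List<String> missingTables(List<String> existing) {
        List<String> missing = new ArrayList<>();
        for (String required : REQUIRED) {
            boolean found = false;
            for (String name : existing) {
                if (name.equalsIgnoreCase(required)) {
                    found = true;
                    break;
                }
            }
            if (!found) {
                missing.add(required);
            }
        }
        return missing;
    }

    /** Lists the table names in the connection's current catalog via JDBC metadata. */
    public static List<String> listTables(Connection connection) throws SQLException {
        List<String> names = new ArrayList<>();
        DatabaseMetaData meta = connection.getMetaData();
        try (ResultSet rs = meta.getTables(connection.getCatalog(), null, "%",
                new String[] {"TABLE"})) {
            while (rs.next()) {
                names.add(rs.getString("TABLE_NAME"));
            }
        }
        return names;
    }
}
```

With an open connection to the Wikipedia Miner database, `AnchorTableCheck.missingTables(AnchorTableCheck.listTables(connection))` should return an empty list once the prepareForTextProcessor step has run.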

Original comment by amrita.l...@gmail.com on 23 Nov 2010 at 10:55