Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing table during Model Building for wikipedia extraction #4

Open
GoogleCodeExporter opened this issue Feb 20, 2016 · 5 comments
Open

Comments

@GoogleCodeExporter
Copy link

What steps will reproduce the problem?
1. Install Wikipedia Miner english
2. Comment lines 272-280 (topic extraction)
3. Uncomment lines 262-270 (model build)
3. Execute maui.main.Examples indexing_with_wikipedia

Expect to build a model.
Throw exceptions. Cf. below.
It doesn't build the model. If I launch an extraction after, it doesn't work : 
-- Reading instance
-- Converting instance for document 0018
Warning! This documents does not contain valid keyphrases
---- Extracting candidates... 
---- Disambiguating candidates...
54 candidates 
0 positive; 54 negative instances
-- Processing document: 0018
-- Keyphrases and feature values:

What version of the product are you using? On what operating system?
Maui 1.0 on Windows XP JDK 1.6

Here's the exception:

java.sql.SQLException: Table 'wiki_db_en.anchor_occurance_casefolder'
doesn't exist
    at com.mysql.jdbc.MysqlIO.checkErrorPacket(MysqlIO.java:2975)
    at com.mysql.jdbc.MysqlIO.sendCommand(MysqlIO.java:1600)
    at com.mysql.jdbc.MysqlIO.sqlQueryDirect(MysqlIO.java:1695)
    at com.mysql.jdbc.Connection.execSQL(Connection.java:2998)
    at com.mysql.jdbc.Connection.execSQL(Connection.java:2927)
    at com.mysql.jdbc.Statement.executeQuery(Statement.java:956)
    at org.wikipedia.miner.model.Anchor.initializeFromDatabase(Anchor.java:105)
    at org.wikipedia.miner.model.Anchor.<init>(Anchor.java:69)
    at maui.filters.MauiFilter.getCandidates(MauiFilter.java:1556)
    at maui.filters.MauiFilter.selectCandidates(MauiFilter.java:660)
    at maui.filters.MauiFilter.batchFinished(MauiFilter.java:626)
    at maui.main.MauiModelBuilder.buildModel(MauiModelBuilder.java:785)
    at maui.main.Examples.testIndexingWithWikipedia(Examples.java:269)
    at maui.main.Examples.main(Examples.java:319)
Error adding ngram approach

The complete trace is attached.

Original issue reported on code.google.com by [email protected] on 26 Aug 2009 at 7:59

Attachments:

@GoogleCodeExporter
Copy link
Author

Sorry, a mistake
>>>>>>>>
The version is Maui 1.1.
<<<<<<<

Original comment by [email protected] on 26 Aug 2009 at 8:04

@GoogleCodeExporter
Copy link
Author

I fixed the problem replacing line 1557:
   anchor = new Anchor(form, textProcessor, wikipedia.getDatabase());
by:
   anchor = new Anchor(form, null, wikipedia.getDatabase());
Don't know if that "TextProcessor" was important or not...

Original comment by [email protected] on 26 Aug 2009 at 8:58

@GoogleCodeExporter
Copy link
Author

In WikipediaMiner different kind of tables with pre-processed article titles 
can be generated. When document 
phrases are mapped to Wikipedia articles they can be case folded, stemmed etc. 

Original comment by [email protected] on 27 Aug 2009 at 8:52

@GoogleCodeExporter
Copy link
Author

Hi,
What is the best table to use for keyword extraction with the newly uploaded 
MauiTopicExtractor model.

Thank you
Jason  

Original comment by [email protected] on 29 Sep 2010 at 6:35

@GoogleCodeExporter
Copy link
Author

In Wikipedia-miner, while creating and populating the tables using the 
loadData() method in WikipediaDatabase.java, also uncomment the following line:
wikipedia.getDatabase().prepareForTextProcessor(new CaseFolder()) ;

This will create the tables anchor_CaseFolder and anchor_occurance_CaseFolder 
and then Maui will build the model correctly.

Original comment by [email protected] on 23 Nov 2010 at 10:55

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant