Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Extraction and linking #89

Merged
merged 63 commits into from
May 3, 2016
Merged

Extraction and linking #89

merged 63 commits into from
May 3, 2016

Conversation

bolandka
Copy link
Member

@bolandka bolandka commented May 2, 2016

bolandka and others added 30 commits February 21, 2016 12:48
…ted extraction of contexts to tokenized input text; inserted some tokenizer markup to stopword list
…lgorithms that will be run on the extracted files as default
…dTimeMatcher defined in RegexUtils; treat years as stopwords
bolandka and others added 28 commits April 11, 2016 12:31
…patterns and contexts created when searching for a seed now stored only temporarily; use of custom PostingsHighlighter relying on tokenized input instead of applying lucene's sentence splitting
…ed as patternRegex, old patternRegex is not needed anymore due to extracting contexts using a lucene highlighter
…to tokenizer; removed empty quotations in regex
…eral already accepted pattern (as before) but allow acceptance of equally general patterns
…en; queryService uri for searchResult is set in solrQS; QS reliability score is used in reliability score calculation in SearchResultLinker (#53)
…tstrapping (#16); post intermediate results to temporary data store (#87)
@kba kba merged commit d3d507d into master May 3, 2016
@kba kba deleted the extractionAndLinking branch May 4, 2016 13:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants