Feature List

boolean inEntry;
boolean allWordsCapitalized;
boolean inLink;
boolean inMetaSection;
boolean inTitle;
boolean inURL;
// queryFreq is currently unused
double termFreq, docFreq, queryFreq;
// docLength is a word count, phraseLength is a character count
double docLength, phraseLength;
// posInDoc is the ratio of (word position / document word count)
double posInDoc;
// number of phrases that are a (non-proper) subset of this phrase
double numberOfSubPhrases;
// probability that this is a keyphrase (output)
double probability;
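
The declarations above could be grouped into a single feature record. The following is a minimal sketch, not the original implementation: the class name `KeyphraseFeatures`, the factory method `of`, and the helper `allWordsCapitalized` are hypothetical. It assumes that `posInDoc` uses a word position counted in the same units as the document word count, and that "capitalized" means each word starts with an uppercase letter; the page does not specify either detail.

```java
// Hypothetical container for the features listed above. The class name and
// both methods are illustrative assumptions, not part of the original page.
final class KeyphraseFeatures {
    boolean inEntry, allWordsCapitalized, inLink, inMetaSection, inTitle, inURL;
    // queryFreq is currently unused
    double termFreq, docFreq, queryFreq;
    // docLength is a word count, phraseLength is a character count
    double docLength, phraseLength;
    // ratio of (word position / document word count)
    double posInDoc;
    double numberOfSubPhrases;
    // probability that this is a keyphrase (output)
    double probability;

    // Fills in the features that can be derived from the phrase text and its
    // position alone; all other fields keep their default values.
    static KeyphraseFeatures of(String phrase, int wordPosition, int docWordCount) {
        if (docWordCount <= 0) {
            throw new IllegalArgumentException("docWordCount must be positive");
        }
        KeyphraseFeatures f = new KeyphraseFeatures();
        f.docLength = docWordCount;
        f.phraseLength = phrase.length();
        f.posInDoc = (double) wordPosition / docWordCount;
        f.allWordsCapitalized = allWordsCapitalized(phrase);
        return f;
    }

    // Assumption: a phrase counts as capitalized when every
    // whitespace-separated word begins with an uppercase letter.
    static boolean allWordsCapitalized(String phrase) {
        String trimmed = phrase.trim();
        if (trimmed.isEmpty()) {
            return false;
        }
        for (String word : trimmed.split("\\s+")) {
            if (!Character.isUpperCase(word.charAt(0))) {
                return false;
            }
        }
        return true;
    }
}
```

For example, `KeyphraseFeatures.of("New York", 25, 100)` would set `phraseLength` to 8 characters, `docLength` to 100 words, and `posInDoc` to 0.25.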
nlp-private/feature-list.txt · Last modified: 2015/04/22 15:07 by ryancha
CC Attribution-Share Alike 4.0 International