Back to ComputerTerms, InformationRetrieval
A StopList is a list of words that, for reasons of volume or of ["Precision"] and ["Recall"], are excluded from the index and are therefore not searchable, e.g. "and", "or", "not".
There are two ways to filter stoplist words from an input token stream:
- Examine the lexical analyzer output and remove any stopwords.
- Remove stopwords as part of the lexical analysis. This is one of the more efficient ways to implement a StopList.
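
The two approaches above can be sketched in Python roughly as follows. This is only an illustration: the contents of `STOPLIST`, the token pattern, and the function names are assumptions made for the example and do not come from any particular indexing system. A production lexical analyzer would more likely build stopword recognition into its scanning automaton than test set membership inside a loop.

```python
import re

# Hypothetical stoplist for illustration; a real one would be much longer.
STOPLIST = frozenset({"and", "or", "not", "the", "a", "of"})

# Assumed token definition: runs of ASCII letters and digits.
TOKEN_RE = re.compile(r"[A-Za-z0-9]+")


def tokenize(text):
    """Plain lexical analyzer: yield every word token, lowercased."""
    for match in TOKEN_RE.finditer(text):
        yield match.group().lower()


def filter_after_lexing(text, stoplist=STOPLIST):
    """Approach 1: run the lexer first, then drop stopwords from its output."""
    return [token for token in tokenize(text) if token not in stoplist]


def tokenize_without_stopwords(text, stoplist=STOPLIST):
    """Approach 2: the lexer itself never emits a stopword."""
    tokens = []
    for match in TOKEN_RE.finditer(text):
        word = match.group().lower()
        if word in stoplist:
            continue  # discarded during scanning, never becomes a token
        tokens.append(word)
    return tokens
```

Both functions return the same tokens for the same input. The difference is that the second never hands stopwords to a later stage, which avoids a separate pass over the token stream.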