Size: 582
Comment:
|
← Revision 4 as of 2006-02-19 20:50:24 ⇥
Size: 582
Comment:
|
Deletions are marked like this. | Additions are marked like this. |
Line 3: | Line 3: |
Signature files tipically use Super Imposed Coding | Signature files typically use Super Imposed Coding |
Back to ComputerTerms, InformationRetrieval
Signature files typically use Super Imposed Coding
Each document is divided into logical blocks containing D distinct words (StopWords are usually removed before we make the block)
Each word yields a binary "word signature" using some kind of hash code that is F bits in length with m bits set to 1.
The word signature are OR'd together to form the block signature
- The block signatures are concatenated together to form the document signature.
Back to ComputerTerms, InformationRetrieval