An accessor to an inverted file.
#include <../libInvertedFile/include/CAcSQLInvertedFile.h>
Inherits: CAcURL2FTS
An accessor to an inverted file. At present this access is done "by hand", which is not really efficient; however, we plan to move to memory-mapped files.
For testing whether the inverted file is correctly constructed
This opens an existing inverted file and then initialises this structure. After that it is fully usable.
As a parameter it takes an XMLElement which contains a "collection" element and its content.
If the attribute vi-generate-inverted-file is true, then a new inverted file will be generated using the parameters given in inCollectionElement. You will NOT be able to use *this afterwards.
The REAL constructor.
Called by the constructors
Destructor
Translate a DocumentID to a URL (for output)
Translate a URL to its document ID
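The two translation members above amount to a lookup in both directions. A minimal sketch of that idea follows, assuming integer document IDs and an in-memory table; the class name UrlIdTable and its member names are hypothetical and are not taken from the documented class.

```cpp
#include <cassert>
#include <map>
#include <string>

// Hypothetical stand-in for the URL <-> document ID translation.
// The documented class may store this mapping quite differently.
class UrlIdTable {
public:
    // Registers a URL and returns its document ID.
    // A URL that is already known keeps the ID it was given first.
    int add(const std::string& inURL) {
        auto found = mURLToID.find(inURL);
        if (found != mURLToID.end()) return found->second;
        const int id = static_cast<int>(mIDToURL.size());
        mURLToID[inURL] = id;
        mIDToURL[id] = inURL;
        return id;
    }

    // URL -> document ID. Returns false if the URL is unknown.
    bool urlToID(const std::string& inURL, int& outID) const {
        auto found = mURLToID.find(inURL);
        if (found == mURLToID.end()) return false;
        outID = found->second;
        return true;
    }

    // Document ID -> URL (for output). Returns false if the ID is unknown.
    bool idToURL(int inID, std::string& outURL) const {
        auto found = mIDToURL.find(inID);
        if (found == mIDToURL.end()) return false;
        outURL = found->second;
        return true;
    }

private:
    std::map<std::string, int> mURLToID;
    std::map<int, std::string> mIDToURL;
};
```

Keeping two maps costs memory but makes both directions a logarithmic lookup, which seems a reasonable trade for a table that is read far more often than it is written.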
List of documents containing the feature
List of features contained by a document
List of features contained by a document with ID inDID
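The three list accessors above describe the two views an inverted file offers: from a feature to the documents containing it, and from a document to its features. The sketch below illustrates both views with plain in-memory maps. The names Entry and TinyInvertedIndex, and the idea of pairing each ID with a frequency value, are assumptions made for illustration only.

```cpp
#include <cassert>
#include <map>
#include <vector>

// One list element: an ID paired with a frequency value (assumed layout).
struct Entry {
    int id;
    double frequency;
};

// Hypothetical in-memory index that answers both list queries.
class TinyInvertedIndex {
public:
    // Records that document inDID contains feature inFeatureID.
    void add(int inDID, int inFeatureID, double inFrequency) {
        mFeatureToDocs[inFeatureID].push_back(Entry{inDID, inFrequency});
        mDocToFeatures[inDID].push_back(Entry{inFeatureID, inFrequency});
    }

    // List of documents containing the feature (empty if unknown).
    const std::vector<Entry>& documentsOf(int inFeatureID) const {
        return lookup(mFeatureToDocs, inFeatureID);
    }

    // List of features contained by the document (empty if unknown).
    const std::vector<Entry>& featuresOf(int inDID) const {
        return lookup(mDocToFeatures, inDID);
    }

private:
    typedef std::map<int, std::vector<Entry> > ListMap;

    static const std::vector<Entry>& lookup(const ListMap& inMap, int inKey) {
        static const std::vector<Entry> kEmpty;
        auto found = inMap.find(inKey);
        return found == inMap.end() ? kEmpty : found->second;
    }

    ListMap mFeatureToDocs;
    ListMap mDocToFeatures;
};
```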
Collection frequency for a given feature
What kind of feature is the feature with ID inFeatureID?
Returns the maximum document frequency for one document ID
Returns the document-frequency square sum for a given document ID
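As a rough illustration of the two per-document statistics above, the sketch below computes a maximum and a square sum over the document frequencies of one document's features. It assumes the frequencies are already available as a vector of doubles; DocumentStats and computeStats are hypothetical names, and the documented class may well precompute and store these values instead.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical holder for the two statistics of one document.
struct DocumentStats {
    double maxDF;        // largest document frequency in the feature list
    double dfSquareSum;  // sum of the squared document frequencies
};

// Computes both statistics in one pass over the document's frequencies.
DocumentStats computeStats(const std::vector<double>& inDocumentFrequencies) {
    DocumentStats stats = {0.0, 0.0};
    for (double df : inDocumentFrequencies) {
        stats.maxDF = std::max(stats.maxDF, df);
        stats.dfSquareSum += df * df;
    }
    return stats;
}
```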
Returns this function for a given document ID
Generates an inverted file if there is none, using a fast but naive in-memory method. This method is very fast if the whole inverted file (and a bit more) can be kept in memory at runtime. If this is not the case, extensive swapping is the result, virtually halting the inverted file creation.
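The in-memory approach can be pictured as follows: every document's feature list is read, and each entry is appended to the list of the feature it mentions. This is a minimal sketch under the assumption that feature lists are (ID, frequency) pairs; Posting, PostingMap and invertInMemory are hypothetical names, not members of the documented class.

```cpp
#include <cassert>
#include <map>
#include <utility>
#include <vector>

typedef std::pair<int, double> Posting;                    // (ID, frequency)
typedef std::map<int, std::vector<Posting> > PostingMap;   // ID -> list

// Turns "document -> features" into "feature -> documents".
// Everything lives in memory, so the whole result must fit in RAM.
PostingMap invertInMemory(const PostingMap& inDocToFeatures) {
    PostingMap featureToDocs;
    for (const auto& doc : inDocToFeatures) {
        for (const auto& feature : doc.second) {
            // feature.first is the feature ID, doc.first the document ID.
            featureToDocs[feature.first].push_back(
                Posting(doc.first, feature.second));
        }
    }
    return featureToDocs;
}
```

Because the outer map is ordered by document ID, each feature's list comes out sorted by document ID without an extra sorting step.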
Generates an inverted file if there is none, employing the two-way merge method described in "Managing Gigabytes", chapter 5.2, Sort-based inversion (page 181).
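The general shape of a sort-based inversion with two-way merging can be sketched as below. This is not the documented implementation: in-memory vectors stand in for the temporary files a real run would use, and Triple, tripleLess and sortBasedInversion are hypothetical names. The input is cut into runs small enough to sort on their own, and the sorted runs are then merged pairwise until one remains.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <iterator>
#include <tuple>
#include <vector>

// One record of the inversion: feature, document, frequency (assumed layout).
struct Triple {
    int featureID;
    int documentID;
    double frequency;
};

// Orders by feature ID first, then by document ID.
bool tripleLess(const Triple& a, const Triple& b) {
    return std::tie(a.featureID, a.documentID) <
           std::tie(b.featureID, b.documentID);
}

std::vector<Triple> sortBasedInversion(const std::vector<Triple>& inTriples,
                                       std::size_t inRunLength) {
    if (inRunLength == 0) inRunLength = 1;

    // Phase 1: cut the input into runs and sort each run separately.
    std::vector<std::vector<Triple> > runs;
    for (std::size_t start = 0; start < inTriples.size(); start += inRunLength) {
        const std::size_t end = std::min(start + inRunLength, inTriples.size());
        std::vector<Triple> run(inTriples.begin() + start, inTriples.begin() + end);
        std::sort(run.begin(), run.end(), tripleLess);
        runs.push_back(run);
    }

    // Phase 2: merge neighbouring runs two at a time until one run is left.
    while (runs.size() > 1) {
        std::vector<std::vector<Triple> > merged;
        for (std::size_t i = 0; i + 1 < runs.size(); i += 2) {
            std::vector<Triple> out;
            std::merge(runs[i].begin(), runs[i].end(),
                       runs[i + 1].begin(), runs[i + 1].end(),
                       std::back_inserter(out), tripleLess);
            merged.push_back(out);
        }
        if (runs.size() % 2 == 1) merged.push_back(runs.back());  // odd run out
        runs.swap(merged);
    }
    return runs.empty() ? std::vector<Triple>() : runs.front();
}
```

Only one run has to be sorted at a time, and a merge reads its two inputs sequentially, which is what makes this family of methods workable when the data does not fit in memory.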
Check the consistency of the inverted file system accessed
Is the Document with inDocumentID contained in the document frequency list of the feature inFeatureID and
This is interesting for browsing
Gets a list of all features contained in this. This function is necessary because in the present system only about 50 percent of the features are really used.
A feature is considered used if it arises in mIDToOffset.
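Under that definition, listing the used features is just a walk over the keys of the offset map. A minimal sketch, assuming mIDToOffset behaves like a std::map from an integer feature ID to a file offset (the key and value types here are guesses, and usedFeatureIDs is a hypothetical free function):

```cpp
#include <cassert>
#include <map>
#include <vector>

// Collects every feature ID that has an entry in the offset map.
// The result is in ascending ID order because std::map is ordered.
std::vector<int> usedFeatureIDs(const std::map<int, long>& inIDToOffset) {
    std::vector<int> ids;
    ids.reserve(inIDToOffset.size());
    for (const auto& entry : inIDToOffset) {
        ids.push_back(entry.first);
    }
    return ids;
}
```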
the maximum feature ID arising in this file
Feature -> Offset in inverted file
File of feature descriptions
Name of the inverted file
Name of the Offset file
Name for the file with the feature description
map from feature id to the offset for this feature
Additional information about the document, e.g. the Euclidean length of the feature list.
Add a (FeatureID, Offset) pair to the open offset file (helper function for inverted file construction)
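One plausible shape for such a helper is sketched below, together with a reader that rebuilds the feature-to-offset map from the file. The on-disk record layout (a 32-bit feature ID followed by a 64-bit offset, in host byte order) is purely an assumption for illustration, as are the names appendOffsetPair and readOffsetFile; the real offset file format is not described here.

```cpp
#include <cassert>
#include <cstdint>
#include <istream>
#include <map>
#include <ostream>
#include <sstream>

// Appends one (feature ID, offset) record to an already open stream.
// Record layout is assumed: 4 bytes of ID, then 8 bytes of offset.
void appendOffsetPair(std::ostream& out, std::uint32_t inFeatureID,
                      std::uint64_t inOffset) {
    out.write(reinterpret_cast<const char*>(&inFeatureID), sizeof inFeatureID);
    out.write(reinterpret_cast<const char*>(&inOffset), sizeof inOffset);
}

// Reads records until the stream ends and returns feature ID -> offset.
// A trailing partial record is ignored.
std::map<std::uint32_t, std::uint64_t> readOffsetFile(std::istream& in) {
    std::map<std::uint32_t, std::uint64_t> idToOffset;
    std::uint32_t featureID = 0;
    std::uint64_t offset = 0;
    while (in.read(reinterpret_cast<char*>(&featureID), sizeof featureID) &&
           in.read(reinterpret_cast<char*>(&offset), sizeof offset)) {
        idToOffset[featureID] = offset;
    }
    return idToOffset;
}
```

Writing raw host-order integers keeps the sketch short but ties the file to one machine architecture; a portable format would fix the byte order explicitly.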
Loads a *.fts file and returns the feature list
Documentation generated by muellerw@pc7170 on Sun Oct 8 16:04:40 CEST 2000 | Kdoc |