|
|
For details about the exact searching process and the implementation of incremental query expansion please refer to Appendix G.
BSS_PARMPATH | The database parameter files |
BSS_TEMPPATH | Directory for storage of temporary files |
GUI_CONFIG_FILES | The locations of the two interface configuration files ".okapi_rc" and ".okapi_db" |
OKAPI_LOGS_DIR. | The location of the logfiles for each search |
NOTE: The system will not operate unless these instructions have been followed.
topic_no | An INTEGER number assigned to the query. |
user_id | A character string identifier for the searcher |
rf_flag | An INTEGER flag to turn relevance feedback off (0) or on (1). |
E.g. For user zebra to search on topic 262 with relevance feed back ON we would enter at the Unix prompt:
The parameters <user_id> and <topic_no> define the subdirectory of <OKAPI_LOGS_DIR> in which the three logfiles history , termset and relsfile will be kept. The purpose and structure of these three files is discussed in Appendix M.
Typing the preceding command will result in the display of the Main Window (Figure 1) . Full documents that the user wishes to see as the result of a search will be shown in a separate, Pop-Up Window (Figure 2) . The following sections will describe the structure of the interface and user interaction.
1. Term Entry Box |
2. Working query |
3. Hitlist |
4. Removed Terms |
5. Rels Pool |
6a. Search | 6b. Options | 6c. Exit |
Figure 1
NOTE: The "Exit" button is always enabled. The "Search" button is disabled immediately after a search has been executed. It is re-enabled when the "working query" is modified in any way.
Full Document Display |
Relevant | Not Relevant |
Figure 2
The functionality to carry out a passage search is included in the interface. When a search is performed on a database that has been indexed to include positional information about paragraphs the system will search for the best sub-passage of each document. In this case, when each full document is displayed a third button will be available which allows the user to choose the sub-passage for relevance feedback rather than the whole document. This will not be the case for the sample databases provided with "Okapi-Pack" as the records are too short. However, by examining the indexing programs described in Appendix E and Appendix F it should be possible to see how to produce a suitable database.
rules regulations laws
stock market +
A phrase defined in this way, if it exists in the database, will consist of:
The type of phrase found will be determined by the value of the environment variable <BOTH_PHRASE_OPS> which is set in .okapi_rc.
BOTH_PHRASE_OPS | Meaning |
---|---|
0 | TRUE PHRASES only, possibly with intervening stopwords |
1 | SAME SENTENCE occurrences and TRUE PHRASES if they also exist. |
The default value of <BOTH_PHRASE_OPS> is 1. Change it to 0 if you only want true phrases. This may be necessary if you have a text database in which there are no sentence delimiter characters in the fields.
The way in which the interface creates user-defined phrases is described in more detail in Appendix G.
In both (a) and (b) entry is completed by pressing the <Return> key. Errors may be corrected in the normal way by moving the cursor and inserting/deleting characters.
If all input terms are found in the database, the term entry box will be cleared once the terms are displayed in the "working query" window. If, however, one or more terms do not exist in the database the entered terms will be highlighted in the term entry box. The text may be edited or deleted in the usual way.
The functions "Search" and "Exit" may be executed by pressing "Ctrl-S" and "Ctrl-X" respectively.
The initial "working query" is built by entering terms via the keyboard.
Enter in the Term Entry Box :
>> | stock market rules + <Ret> |
>> | insider trading + <Ret> |
>> | regulations laws <Ret> |
These will be displayed in the
where:
Each entry of the ranked
hitlist will consist of three components.
Note: Terms may occur here in different forms than in
the working query. This is because:
This will result in the document being displayed in a
pop-up window.
The user may scroll up and down through the document using the
"Up"/"Down" and "Page Up"/"Page Down" cursor keys.
At the bottom of the window are
two buttons:
The user must make a relevance judgement before moving onto any other
part of the search.
Making a positive relevance judgement will cause terms extracted from
the appropriate section of the document to be merged with the current
set of "candidate terms". The title information about every document
judged as relevant by the user is displayed in the
relspool window immediately
below the hitlist window.
149
0 :
stock market rules (B)
308
0 :
insider trading
10472
0 :
regulations
13292
0 :
laws
5.2. Searching the database on the
Working query.
>>
Single-left-click the Search button
2
rank order
in the set
FT924-14664
document
identifier
[736]
Okapi weight
normalised onto
the range 1-1000
1/2 pages
Passage / Document Lengths
in screen pages of
2000 characters
insider trading (2) regulations (1) law (1)
5.3. Viewing full documents.
>>
Double-left-click on any line in the appropriate
hitlist
entry (including the blank line at the end).
Relevant
Relevance Feedback (RF) from the full record text.
Not Relevant
5.4. Make a relevance judgement:
>>
Single-left-click on one of the three buttons.
5.5. Modifying the working query.
The
working query can/may be modified at any time as
follows.
5.6. Further searches.
Further searches can be made once the working query has been
modified as described
above. The result will be a new document set from which a new
hitlist is
formed. A member of the new document set that occured in a previous
"hitlist" and has already been "viewed" will:
5.8. Quitting.
Clicking the quit button finishes the application cleanly. The database
and all open files are closed. All temporary files are deleted.
Okapi-Pack Main Menu
Mail Okapi Support
Registration
Last modified: 12th November 2001