Information Retrieval (Cont.)


Filtering
Filtering is a typical transformation in information retrieval, for example to reduce the size of text, and/or standardize it to simplify searching. Text is the input and a processed or filtered version of the text is the output. Major filtering techniques include: Indexing
Almost all types of indexes are based on some kind of trees or hashing, except clustered data structures, and direct acyclic word graph.


DBMSs versus Files
Using a DBMS can relieve your task of designing a file structure for your applications. However, flexibility is the cost. In fact, a DBMS is, first and foremost, a disk access manager.