1. Taxonomy
of IR Models – Classic models- Set theoretic model-
Algebraic models- Probabilistic model- Structured text
retrieval models- Models for browsing- Retrieval evaluations-Reference
collections
2. Query languages-query operations-text and multimedia
languages-Text operations-document preprocessing- matrix
decompositions and latent semantic indexing-text compression
–indexing and searching-inverted files-suffix trees-
Boolean queries-sequential searching-pattern matching
3. Text Classification, and Naïve bayes-vector space
classification-support vector machines and machine learning
on documents-flat clustering –hierarchical clustering
4. Web search basics-web characteristics-index size
and estimation- near duplicates and shingling-web crawling-distributing
indexes- connectivity servers-link analysis-web as a
graph-PageRank-Hubs and authorities- question answering
5. Online IR systems- online public access catalogs-digital
libraries-architectural issues-document models -representations
and access- protocols |