Главная    Ex Libris    Книги    Журналы    Статьи    Серии    Каталог    Wanted    Загрузка    ХудЛит    Справка    Поиск по индексам    Поиск    Форум   
blank
Авторизация

       
blank
Поиск по указателям

blank
blank
blank
Красота
blank
Clarke C.L.A., Cormack G.V. — Information Retrieval: Implementing and Evaluating Search Engines
Clarke C.L.A., Cormack G.V. — Information Retrieval: Implementing and Evaluating Search Engines



Обсудите книгу на научном форуме



Нашли опечатку?
Выделите ее мышкой и нажмите Ctrl+Enter


Название: Information Retrieval: Implementing and Evaluating Search Engines

Авторы: Clarke C.L.A., Cormack G.V.

Аннотация:

Information retrieval is the foundation for modern search engines. This textbook offers an introduction to the core topics underlying modern search technologies, including algorithms, data structures, indexing, retrieval, and evaluation. The emphasis is on implementation and experimentation; each chapter includes exercises and suggestions for student projects. Wumpus, a multi-user open-source information retrieval system developed by one of the authors and available online, provides model implementations and a basis for student work.
The modular structure of the book allows instructors to use it in a variety of graduate-level courses, including courses taught from a database systems implementation perspective, traditional information retrieval courses with a focus on IR theory, and courses covering the basics of Web retrieval. Additionally, professionals in computer science, computer engineering, and software engineering will find Information Retrieval a valuable reference.
After an introduction to the basics of information retrieval, the text covers three major topic areas — indexing, retrieval, and evaluation — in self-contained parts. The final part of the book draws on and extends the general material in the earlier parts, treating specific application areas, including parallel search engines, link analysis, crawling, and information retrieval over collections of XML documents. End-of-chapter references point to further reading; end-of-chapter exercises range from pencil and paper problems to substantial programming projects.


Язык: en

Рубрика: Технология/

Статус предметного указателя: Готов указатель с номерами страниц

ed2k: ed2k stats

Год издания: 2010

Количество страниц: 632

Добавлена в каталог: 18.06.2014

Операции: Положить на полку | Скопировать ссылку для форума | Скопировать ID
blank
Предметный указатель
Reciprocal rank      409 461
Reciprocal rank fusion      380
Redundancy      496—498
Refresh policy      548
Region algebra      160—168 169 567
Relevance      8 24 67 261 442
Relevance feedback      273—275 319 326 354
Relevance Ranking      3
Relevance, binary      8 407
Relevance, graded      8 395 451—453
Replacement algorithm      482
Replication      471 497
Replication, dormant      498
Replication, partial      497
Resampling      424
research hypothesis      427
Response time      8 75 470 476
Restart probability (PageRank)      518
Retrieval model, Boolean      63
Retrieval model, language modeling      258 286
Retrieval model, probabilistic      258
Retrieval model, vector space      54
Retrieval status value      see "Score"
Rice code      see "Golomb code"
Robertson, Stephen      258
Robertson/Spaerck Jones weighting formula      265
Robots Exclusion Protocol      544
Robots.txt      544
ROC curve      329 397
Rocchio classifier      354
Rocchio feedback      280
Romeo and Juliet      50 58
Routing      4 310
RSV      see "Score"
Run (TREC)      73 411
Russian      94
S stemmer      97
SALSA algorithm      532—534 554
Salton, Gerald      54
Sample      420
Scalar product      55
Scheduling algorithm      478
Schema-dependent index      48
Schema-independent index      33 48 49
Score      7 59
Scottish Gaelic      94
Seek latency      493 592
Segmentation      94 95
Selective dissemination of information      4 310
Selector      193 208
Self-indexing      112
Self-information      299
Semi-static coding      177
Semi-supervised learning      336
Sensitivity      332
SEO      553
Sequential scan      40
SERP      510
Service rate      see "Throughput"
Service time      470
SGML      11 568
Shakespeare in Love      303
Shakespeare, William      9 33 51 89 160 263 278 302 536
Shannon's Theorem      see "Source coding th'm"
Shannon, Claude      180 191
Shape property      141
Shard      500
Shingle      550
Shortest job first      478
Signature file      77 131
significance      425
Significance level      416
Significance test      430—438
Significant inversion rate      445
Simple-9      207
SJF      see "Shortest job first"
SMART      78
Smoothing      20—21 264 290 340 450
Smoothing, Dirichlet      291 295
Smoothing, Jelinek — Mercer      291 295
Smoothing, linear      291
Snippet      7 131 540
Snippet, generation      302
Snowball stemmer      95 97
Source coding theorem      180 188
Source population      415
Spaerck Jones, Karen      258
Spam      507 555
spam filtering      325 342
Spanish      94 95 98
Spec      469
Specificity      8 332 584
Spelling correction      98
Splits      384
stacking      376 381—385
Standard error      386 422 429
Standard Generalized Markup Language      see "SGML"
Starvation      479
Static coding      177
Static page      510
Static rank      54 517—535
Stationary distribution of a Markov chain      530
Steady state      231 248 249
Stemming      84 86—89 95 97
STL      120
Stochastic matrix      22
Stochastically independent      268
stopping      84
Stopword      85 89—90
Structural index      585
Structural metadata      568
Student's t-distribution      423
Suffix array      77 131
Suffix tree      77 131 133
summarization      5
Supervised learning      319 336
Support vector machine      see "SVM"
Support vectors      353
SVM      353 368
SVM, multicategory      393
SVM, ranking      396
Swedish      94 95
Symbol      14 176
Synchronization point      112 195 219
Synonymy      78
Systematic error      413
t-distribution      423
Table-driven decoding      208—209
Target population      415
tdt      see "Topic detection and tracking"
Teleport (PageRank)      523
Term      6 15
Term descriptor      217
Term frequency      48 53 57 266
Term partitioning      493—495 496
Term proximity      54 60—63 302—304
Term selection value      274
Term vector      51
Test collection      23—26 411 453
Test collection, ClueWeb      09 25
Test collection, construction      73—75
Test collection, GOV2      25
Test collection, INEX      583
Test collection, TREC45      26
Text REtrieval Conference      see "TREC"
TF      see "Term frequency"
TF-IDF      57 270 293
The Merry Wives of Windsor      20
The Winter's Tale      14
Thompson, Ken      97
Throughput      8 75 470 477
Token      13
Toolbars      526
Topic      5
Topic detection and tracking      5
Traffic intensity      see "Utilization"
Transactional query      514
Transductive learning      336
Transfer function      356 422
Transferability      415
Transition matrix      22
TREC      23—26 67 98 272 410—412
TREC, Filtering Track      314
TREC, Million Query Track      25
TREC, Public Spam Corpus      325
TREC, Robust Track      282
TREC, Spam Track      371
TREC, Terabyte Track      25 213 539
TREC45 collection      26
Trigram      115
True negative      332
True negative rate      332
True positive      332
True positive rate      332
Trust bias      540
TrustRank      555
TSV      see "Term selection value"
Turkish      95
Twig      585
Two-Poisson model      267
Type 1 error      331
Type 2 error      331
Unary code      192
Unicode      13 91 95 97
Universal codeword set      223
University of Massachusetts      27
Unsupervised learning      337
URL      525
User intent      513
user satisfaction      410 453 470
UTF-8      13 91 97
utilization      476 477 493
Validity      406 413 434—438
Validity, external      415
Validity, internal      415
vByte      205—206 213 220 223 253
Vector space model      54—60 78
Vocabulary      14
W3C      564 572
Warmup period (cache)      480
Web crawler      507 541—552 556
Web crawler, crawler trap      557
Web crawler, incremental      547
Web graph      508
Web query      507 513
Web search evaluation      538
Web spam      507 555
Web, hidden      511
Web, indexable      511
Webster, John      18
Wikipedia      3 27 277
Wilcoxon T distribution      433
Winnie the Pooh      80
Wisdom of crowds      376
Word-aligned coding      206—207
World Wide Web Consortium      564
WUMPUS      28
XCG      584
XML      11 29 160 565—570
XML Query      574
XML schema      568 570
XML, declaration      565
XML, DTD      568
XML, empty-element tag      566
XML, exhaustivity      584
XML, overlap      580
XML, ranked retrieval      579—584
XML, specificity      584
XML, well-formed document      568
XML, XCG      584
XML, XML Schema      568
XPath      564 571—572
XQuery      564 574—576
XSL      572
Zelazny, Roger      39
Zero-order model      178 286
Zipf's law      16 107 121 237 239 480 513
Zipf, George      16
Ziv — Lempel compression      191 220
1 2 3
blank
Реклама
blank
blank
HR
@Mail.ru
       © Электронная библиотека попечительского совета мехмата МГУ, 2004-2024
Электронная библиотека мехмата МГУ | Valid HTML 4.01! | Valid CSS! О проекте