Pearson R.K. — Mining imperfect data: dealing with contamination and incomplete records :: Электронная библиотека попечительского совета мехмата МГУ

Главная Ex Libris Книги Журналы Статьи Серии Каталог Wanted Загрузка ХудЛит Справка Поиск по индексам Поиск Форум

Авторизация

Поиск по указателям

Красота

Pearson R.K. — Mining imperfect data: dealing with contamination and incomplete records

Pearson R.K. — Mining imperfect data: dealing with contamination and incomplete records

Обсудите книгу на научном форуме

Нашли опечатку?
Выделите ее мышкой и нажмите Ctrl+Enter

Название: Mining imperfect data: dealing with contamination and incomplete records

Автор: Pearson R.K.

Аннотация:

Data mining is concerned with the analysis of databases large enough that various anomalies, including outliers, incomplete data records, and more subtle phenomena such as misalignment errors, are virtually certain to be present. Mining Imperfect Data: Dealing with Contamination and Incomplete Records describes in detail a number of these problems, as well as their sources, their consequences, their detection, and their treatment. Specific strategies for data pretreatment and analytical validation that are broadly applicable are described, making them useful in conjunction with most data mining analysis methods. Examples are presented to illustrate the performance of the pretreatment and validation methods in a variety of situations; these include simulation-based examples in which "correct" results are known unambiguously as well as real data examples that illustrate typical cases met in practice.
Mining Imperfect Data, which deals with a wider range of data anomalies than are usually treated in one book, includes a discussion of detecting anomalies through generalized sensitivity analysis (GSA), a process of identifying inconsistencies using systematic and extensive comparisons of results obtained by analysis of exchangeable datasets or subsets. The book makes extensive use of real data, both in the form of a detailed analysis of a few real datasets and various published examples. Also included is a succinct introduction to functional equations that illustrates their utility in describing various forms of qualitative behavior for useful data characterizations.

Язык:

Рубрика: Математика/

Статус предметного указателя: Готов указатель с номерами страниц

ed2k: ed2k stats

Год издания: 2005

Количество страниц: 316

Добавлена в каталог: 23.10.2010

Операции: Положить на полку | Скопировать ссылку для форума | Скопировать ID

Предметный указатель

Skewness, definition      42
Skewness, industrial pressure dataset      235
Skewness, outlier-sensitivity      43
Software errors      56—58 272
Stampede phenomena      65
Standard deviation, bounds      163—166
Standard deviation, comparisons      29—30
Starburst plot      204
Stick-slip phenomena      65
Stratification      186—188 225—230
Stratification principle      228
Subdistributive      169
Subscenario      28 186—190
Supsmu smoother      254

Swamping      77
t-test      45 129
Three-valued logic      61—63
Trimmed mean      275
Trimmed mean filter      253
Trimmed mean, comparisons      27—29 186
Trimmed mean, definition      27
Undersampling      231
Unknown-but-bounded      see "Set-valued variables"
Unmeasurable variable      108—110 276
Vague problems      177
Variance outlier-sensitivity      43
Volcano plot      45—47
White noise      245
z-score      220—222 246 248 250

1 2

Реклама

© Электронная библиотека попечительского совета мехмата МГУ, 2004-2026

Электронная библиотека мехмата МГУ

Valid HTML 4.01!

|

Valid CSS!

О проекте