Главная    Ex Libris    Книги    Журналы    Статьи    Серии    Каталог    Wanted    Загрузка    ХудЛит    Справка    Поиск по индексам    Поиск    Форум   
blank
Авторизация

       
blank
Поиск по указателям

blank
blank
blank
Красота
blank
Pearson R.K. — Mining imperfect data: dealing with contamination and incomplete records
Pearson R.K. — Mining imperfect data: dealing with contamination and incomplete records



Обсудите книгу на научном форуме



Нашли опечатку?
Выделите ее мышкой и нажмите Ctrl+Enter


Название: Mining imperfect data: dealing with contamination and incomplete records

Автор: Pearson R.K.

Аннотация:

Data mining is concerned with the analysis of databases large enough that various anomalies, including outliers, incomplete data records, and more subtle phenomena such as misalignment errors, are virtually certain to be present. Mining Imperfect Data: Dealing with Contamination and Incomplete Records describes in detail a number of these problems, as well as their sources, their consequences, their detection, and their treatment. Specific strategies for data pretreatment and analytical validation that are broadly applicable are described, making them useful in conjunction with most data mining analysis methods. Examples are presented to illustrate the performance of the pretreatment and validation methods in a variety of situations; these include simulation-based examples in which "correct" results are known unambiguously as well as real data examples that illustrate typical cases met in practice.
Mining Imperfect Data, which deals with a wider range of data anomalies than are usually treated in one book, includes a discussion of detecting anomalies through generalized sensitivity analysis (GSA), a process of identifying inconsistencies using systematic and extensive comparisons of results obtained by analysis of exchangeable datasets or subsets. The book makes extensive use of real data, both in the form of a detailed analysis of a few real datasets and various published examples. Also included is a succinct introduction to functional equations that illustrates their utility in describing various forms of qualitative behavior for useful data characterizations.


Язык: en

Рубрика: Математика/

Статус предметного указателя: Готов указатель с номерами страниц

ed2k: ed2k stats

Год издания: 2005

Количество страниц: 316

Добавлена в каталог: 23.10.2010

Операции: Положить на полку | Скопировать ссылку для форума | Скопировать ID
blank
Предметный указатель
Skewness, definition      42
Skewness, industrial pressure dataset      235
Skewness, outlier-sensitivity      43
Software errors      56—58 272
Stampede phenomena      65
Standard deviation, bounds      163—166
Standard deviation, comparisons      29—30
Starburst plot      204
Stick-slip phenomena      65
Stratification      186—188 225—230
Stratification principle      228
Subdistributive      169
Subscenario      28 186—190
Supsmu smoother      254
Swamping      77
t-test      45 129
Three-valued logic      61—63
Trimmed mean      275
Trimmed mean filter      253
Trimmed mean, comparisons      27—29 186
Trimmed mean, definition      27
Undersampling      231
Unknown-but-bounded      see "Set-valued variables"
Unmeasurable variable      108—110 276
Vague problems      177
Variance outlier-sensitivity      43
Volcano plot      45—47
White noise      245
z-score      220—222 246 248 250
1 2
blank
Реклама
blank
blank
HR
@Mail.ru
       © Электронная библиотека попечительского совета мехмата МГУ, 2004-2024
Электронная библиотека мехмата МГУ | Valid HTML 4.01! | Valid CSS! О проекте