Adaptive Stream Mining

Author: Albert Bifet
Publisher: IOS Press
Total Pages: 224
Release: 2010
Genre: Computers
ISBN: 1607500906

This book is a significant contribution to the subject of mining time-changing data streams and addresses the design of learning algorithms for this purpose. It introduces new contributions on several different aspects of the problem, identifying research opportunities and increasing the scope for applications. It also includes an in-depth study of stream mining and a theoretical analysis of proposed methods and algorithms. The first section is concerned with the use of an adaptive sliding window algorithm (ADWIN). Because ADWIN has rigorous performance guarantees, using it in place of counters or accumulators offers the possibility of extending such guarantees to learning and mining algorithms not initially designed for drifting data. Testing with several methods, including Naïve Bayes, clustering, decision trees and ensemble methods, is discussed as well. The second part of the book describes a formal study of connected acyclic graphs, or 'trees', from the point of view of closure-based mining, presenting efficient algorithms for subtree testing and for mining ordered and unordered frequent closed trees. Lastly, a general methodology to identify closed patterns in a data stream is outlined. This is applied to develop an incremental method, a sliding-window based method, and a method that mines closed trees adaptively from data streams. These are used to introduce classification methods for tree data streams.

Adaptivity in Data Stream Mining

Author: Conny Franke
Publisher:
Total Pages:
Release: 2009
Genre:
ISBN: 9781109661774

In recent years, data streams have become a ubiquitous source of information, and stream mining has thus emerged as a new field in database research. Due to the inherently dynamic nature of data streams, stream mining algorithms benefit from being adaptive to changes in the properties of a data stream. In addition, when stream mining is done in a dynamic environment like a data stream management system or a sensor network, stream mining algorithms also profit from being adaptive to the changing conditions in this environment. This work investigates two kinds of adaptivity in data stream mining. First, a model for quality-driven resource adaptive stream mining is developed. The model is applied to stream mining algorithms so they efficiently utilize available resources to achieve mining results of the highest quality possible. Every stream mining algorithm is unique in its parameters, quality measures, and resource consumption patterns. We generalize these characteristics and develop a model that captures the interactions and correlations between variables involved in the stream mining process. We then express resource adaptive stream mining as a multiobjective optimization problem and use its solution to tune the input parameters of stream mining algorithms, which results in high-quality mining and optimal resource utilization. The second topic investigated in this work is feature adaptive stream mining, which is concerned with adjusting the focus of the mining process to interesting features detected in the data stream. This research is motivated by the need to efficiently detect environmental phenomena from sensor data streams. We propose methods to detect and predict heterogeneous outlier regions, which represent areas of environmental phenomena of different intensities. With the help of predictions about the location and size of outlier regions, the sampling rate of individual sensors is adapted such that sensors in the vicinity of environmental phenomena obtain new measurements more frequently than other sensors in the network, allowing precise and timely region tracking. The research in this work advances the state of the art in data stream mining by making stream mining algorithms more flexible in adapting to changes in the data stream and the mining environment.

Machine Learning for Data Streams

Author: Albert Bifet
Publisher: MIT Press
Total Pages: 255
Release: 2018-03-16
Genre: Computers
ISBN: 0262346052

A hands-on approach to tasks and techniques in data stream mining and real-time analytics, with examples in MOA, a popular freely available open-source software framework. Today many information sources—including sensor networks, financial markets, social networks, and healthcare monitoring—are so-called data streams, arriving sequentially and at high speed. Analysis must take place in real time, with partial data and without the capacity to store the entire data set. This book presents algorithms and techniques used in data stream mining and real-time analytics. Taking a hands-on approach, the book demonstrates the techniques using MOA (Massive Online Analysis), a popular, freely available open-source software framework, allowing readers to try out the techniques after reading the explanations. The book first offers a brief introduction to the topic, covering big data mining, basic methodologies for mining data streams, and a simple example of MOA. More detailed discussions follow, with chapters on sketching techniques, change, classification, ensemble methods, regression, clustering, and frequent pattern mining. Most of these chapters include exercises, an MOA-based lab session, or both. Finally, the book discusses the MOA software, covering the MOA graphical user interface, the command line, use of its API, and the development of new methods within MOA. The book will be an essential reference for readers who want to use data stream mining as a tool, researchers in innovation or data stream mining, and programmers who want to create new algorithms for MOA.

Advances in Knowledge Discovery and Data Mining

Author: Honghua Dai
Publisher: Springer Science & Business Media
Total Pages: 731
Release: 2004-05-11
Genre: Business & Economics
ISBN: 354022064X

This book constitutes the refereed proceedings of the 8th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2004, held in Sydney, Australia in May 2004. The 50 revised full papers and 31 revised short papers presented were carefully reviewed and selected from a total of 238 submissions. The papers are organized in topical sections on classification; clustering; association rules; novel algorithms; event mining, anomaly detection, and intrusion detection; ensemble learning; Bayesian network and graph mining; text mining; multimedia mining; text mining and Web mining; statistical methods, sequential data mining, and time series mining; and biomedical data mining.

Advances in Machine Learning

Author: Zhi-Hua Zhou
Publisher: Springer
Total Pages: 426
Release: 2009-11-03
Genre: Computers
ISBN: 364205224X

The First Asian Conference on Machine Learning (ACML 2009) was held at Nanjing, China during November 2–4, 2009. This was the first edition of a series of annual conferences which aim to provide a leading international forum for researchers in machine learning and related fields to share their new ideas and research findings. This year we received 113 submissions from 18 countries and regions in Asia, Australasia, Europe and North America. The submissions went through a rigorous double-blind reviewing process. Most submissions received four reviews, a few submissions received five reviews, while only several submissions received three reviews. Each submission was handled by an Area Chair who coordinated discussions among reviewers and made recommendation on the submission. The Program Committee Chairs examined the reviews and meta-reviews to further guarantee the reliability and integrity of the reviewing process. Twenty-nine papers were selected after this process. To ensure that important revisions required by reviewers were incorporated into the final accepted papers, and to allow submissions which would have potential after a careful revision, this year we launched a “revision double-check” process. In short, the above-mentioned 29 papers were conditionally accepted, and the authors were requested to incorporate the “important-and-must” revisions summarized by area chairs based on reviewers’ comments. The revised final version and the revision list of each conditionally accepted paper was examined by the Area Chair and Program Committee Chairs. Papers that failed to pass the examination were finally rejected.

Adaptive, Hands-Off Stream Mining

Author:
Publisher:
Total Pages: 32
Release: 2002
Genre:
ISBN:

Sensor devices and embedded processors are becoming ubiquitous, especially in measurement and monitoring applications. Automatic discovery of patterns and trends in the large volumes of such data is of paramount importance. The combination of relatively limited resources (CPU, memory and/or communication bandwidth and power) poses some interesting challenges. We need both powerful and concise languages to represent the important features of the data, which can (a) adapt and handle arbitrary periodic components, including bursts, and (b) require little memory and a single pass over the data. This allows sensors to automatically (a) discover interesting patterns and trends in the data, and (b) perform outlier detection to alert users. We need a way so that a sensor can discover something like the hourly phone call volume so far follows a daily and a weekly periodicity, with bursts roughly every year, which a human might recognize as, e.g., the Mother's Day surge. When possible and if desired, the user can then issue explicit queries to further investigate the reported patterns. In this work we propose AWSOM (Arbitrary Window Stream mOdeling Method), which allows sensors operating in remote or hostile environments to discover patterns efficiently and effectively, with practically no user intervention. Our algorithms require limited resources and can thus be incorporated in individual sensors, possibly alongside a distributed query processing engine [CCC+02, BGS01, MSHR02]. Updates are performed in constant time, using sub-linear (in fact, logarithmic) space. Existing, state of the art forecasting methods (AR, SARIMA, GARCH, etc.) fall short on one or more of these requirements. To the best of our knowledge, AWSOM is the first method that has all the above characteristics.

Knowledge Discovery from Data Streams

Author: João Gama
Publisher: CRC Press
Total Pages: 256
Release: 2010-05-25
Genre: Business & Economics
ISBN: 1439826129

Since the beginning of the Internet age and the increased use of ubiquitous computing devices, the large volume and continuous flow of distributed data have imposed new constraints on the design of learning algorithms. Exploring how to extract knowledge structures from evolving and time-changing data, Knowledge Discovery from Data Streams presents

Green IT Engineering: Concepts, Models, Complex Systems Architectures

Author: Vyacheslav Kharchenko
Publisher: Springer
Total Pages: 308
Release: 2016-09-21
Genre: Technology & Engineering
ISBN: 3319441620

This volume provides a comprehensive state-of-the-art overview of a series of advanced trends and concepts that have recently been proposed in the area of green information technologies engineering, as well as of design and development methodologies for models and complex systems architectures and their intelligent components. The contributions included in the volume have their roots in the authors’ presentations, and the vivid discussions that followed them, at a series of workshops and seminars held within the international TEMPUS project GreenCo in the United Kingdom, Italy, Portugal, Sweden and Ukraine during 2013-2015, and at the 1st - 5th Workshops on Green and Safe Computing (GreenSCom) held in Russia, Slovakia and Ukraine. The book presents a systematic exposition of research on principles, models, components and complex systems and a description of industry- and society-oriented aspects of green IT engineering. A chapter-oriented structure has been adopted for this book following a “vertical view” of green IT, from hardware (CPU and FPGA) and software components to complex industrial systems. The 15 chapters of the book are grouped into five sections: (1) Methodology and Principles of Green IT Engineering for Complex Systems, (2) Green Components and Programmable Systems, (3) Green Internet Computing, Cloud and Communication Systems, (4) Modeling and Assessment of Green Computer Systems and Infrastructures, and (5) Green PLC-Based Systems for Industry Applications. The chapters provide an easy-to-follow, comprehensive introduction to the topics that are addressed, including the most relevant references, so that interested readers can use these references as a starting point for further study. At the same time, all of them correspond to different aspects of the work in progress being carried out by various research groups throughout the world and, therefore, provide information on the state of the art of some of these topics, challenges and perspectives.

Learning from Data Streams

Author: João Gama
Publisher: Springer Science & Business Media
Total Pages: 486
Release: 2007-10-11
Genre: Computers
ISBN: 3540736786

Processing data streams has raised new research challenges over the last few years. This book provides the reader with a comprehensive overview of stream data processing, including famous prototype implementations like the Nile system and the TinyOS operating system. Applications in security, the natural sciences, and education are presented. The huge bibliography offers an excellent starting point for further reading and future research.