Learning from Data Streams in Evolving Environments

Learning from Data Streams in Evolving Environments
Author: Moamar Sayed-Mouchaweh
Publisher: Springer
Total Pages: 320
Release: 2018-07-28
Genre: Technology & Engineering
ISBN: 3319898035

Download Learning from Data Streams in Evolving Environments Book in PDF, Epub and Kindle

This edited book covers recent advances of techniques, methods and tools treating the problem of learning from data streams generated by evolving non-stationary processes. The goal is to discuss and overview the advanced techniques, methods and tools that are dedicated to manage, exploit and interpret data streams in non-stationary environments. The book includes the required notions, definitions, and background to understand the problem of learning from data streams in non-stationary environments and synthesizes the state-of-the-art in the domain, discussing advanced aspects and concepts and presenting open problems and future challenges in this field. Provides multiple examples to facilitate the understanding data streams in non-stationary environments; Presents several application cases to show how the methods solve different real world problems; Discusses the links between methods to help stimulate new research and application directions.

Learning from Data Streams in Dynamic Environments

Learning from Data Streams in Dynamic Environments
Author: Moamar Sayed-Mouchaweh
Publisher: Springer
Total Pages: 82
Release: 2015-12-10
Genre: Technology & Engineering
ISBN: 331925667X

Download Learning from Data Streams in Dynamic Environments Book in PDF, Epub and Kindle

This book addresses the problems of modeling, prediction, classification, data understanding and processing in non-stationary and unpredictable environments. It presents major and well-known methods and approaches for the design of systems able to learn and to fully adapt its structure and to adjust its parameters according to the changes in their environments. Also presents the problem of learning in non-stationary environments, its interests, its applications and challenges and studies the complementarities and the links between the different methods and techniques of learning in evolving and non-stationary environments.

A Reservoir of Adaptive Algorithms for Online Learning from Evolving Data Streams

A Reservoir of Adaptive Algorithms for Online Learning from Evolving Data Streams
Author: Ali Pesaranghader
Publisher:
Total Pages:
Release: 2018
Genre:
ISBN:

Download A Reservoir of Adaptive Algorithms for Online Learning from Evolving Data Streams Book in PDF, Epub and Kindle

Continuous change and development are essential aspects of evolving environments and applications, including, but not limited to, smart cities, military, medicine, nuclear reactors, self-driving cars, aviation, and aerospace. That is, the fundamental characteristics of such environments may evolve, and so cause dangerous consequences, e.g., putting people lives at stake, if no reaction is adopted. Therefore, learning systems need to apply intelligent algorithms to monitor evolvement in their environments and update themselves effectively. Further, we may experience fluctuations regarding the performance of learning algorithms due to the nature of incoming data as it continuously evolves. That is, the current efficient learning approach may become deprecated after a change in data or environment. Hence, the question 'how to have an efficient learning algorithm over time against evolving data?' has to be addressed. In this thesis, we have made two contributions to settle the challenges described above. In the machine learning literature, the phenomenon of (distributional) change in data is known as concept drift. Concept drift may shift decision boundaries, and cause a decline in accuracy. Learning algorithms, indeed, have to detect concept drift in evolving data streams and replace their predictive models accordingly. To address this challenge, adaptive learners have been devised which may utilize drift detection methods to locate the drift points in dynamic and changing data streams. A drift detection method able to discover the drift points quickly, with the lowest false positive and false negative rates, is preferred. False positive refers to incorrectly alarming for concept drift, and false negative refers to not alarming for concept drift. In this thesis, we introduce three algorithms, called as the Fast Hoeffding Drift Detection Method (FHDDM), the Stacking Fast Hoeffding Drift Detection Method (FHDDMS), and the McDiarmid Drift Detection Methods (MDDMs), for detecting drift points with the minimum delay, false positive, and false negative rates. FHDDM is a sliding window-based algorithm and applies Hoeffding's inequality (Hoeffding, 1963) to detect concept drift. FHDDM slides its window over the prediction results, which are either 1 (for a correct prediction) or 0 (for a wrong prediction). Meanwhile, it compares the mean of elements inside the window with the maximum mean observed so far; subsequently, a significant difference between the two means, upper-bounded by the Hoeffding inequality, indicates the occurrence of concept drift. The FHDDMS extends the FHDDM algorithm by sliding multiple windows over its entries for a better drift detection regarding the detection delay and false negative rate. In contrast to FHDDM/S, the MDDM variants assign weights to their entries, i.e., higher weights are associated with the most recent entries in the sliding window, for faster detection of concept drift. The rationale is that recent examples reflect the ongoing situation adequately. Then, by putting higher weights on the latest entries, we may detect concept drift quickly. An MDDM algorithm bounds the difference between the weighted mean of elements in the sliding window and the maximum weighted mean seen so far, using McDiarmid's inequality (McDiarmid, 1989). Eventually, it alarms for concept drift once a significant difference is experienced. We experimentally show that FHDDM/S and MDDMs outperform the state-of-the-art by representing promising results in terms of the adaptation and classification measures. Due to the evolving nature of data streams, the performance of an adaptive learner, which is defined by the classification, adaptation, and resource consumption measures, may fluctuate over time. In fact, a learning algorithm, in the form of a (classifier, detector) pair, may present a significant performance before a concept drift point, but not after. We define this problem by the question 'how can we ensure that an efficient classifier-detector pair is present at any time in an evolving environment?' To answer this, we have developed the Tornado framework which runs various kinds of learning algorithms simultaneously against evolving data streams. Each algorithm incrementally and independently trains a predictive model and updates the statistics of its drift detector. Meanwhile, our framework monitors the (classifier, detector) pairs, and recommends the efficient one, concerning the classification, adaptation, and resource consumption performance, to the user. We further define the holistic CAR measure that integrates the classification, adaptation, and resource consumption measures for evaluating the performance of adaptive learning algorithms. Our experiments confirm that the most efficient algorithm may differ over time because of the developing and evolving nature of data streams.

Learning in Non-Stationary Environments

Learning in Non-Stationary Environments
Author: Moamar Sayed-Mouchaweh
Publisher: Springer Science & Business Media
Total Pages: 439
Release: 2012-04-13
Genre: Technology & Engineering
ISBN: 1441980202

Download Learning in Non-Stationary Environments Book in PDF, Epub and Kindle

Recent decades have seen rapid advances in automatization processes, supported by modern machines and computers. The result is significant increases in system complexity and state changes, information sources, the need for faster data handling and the integration of environmental influences. Intelligent systems, equipped with a taxonomy of data-driven system identification and machine learning algorithms, can handle these problems partially. Conventional learning algorithms in a batch off-line setting fail whenever dynamic changes of the process appear due to non-stationary environments and external influences. Learning in Non-Stationary Environments: Methods and Applications offers a wide-ranging, comprehensive review of recent developments and important methodologies in the field. The coverage focuses on dynamic learning in unsupervised problems, dynamic learning in supervised classification and dynamic learning in supervised regression problems. A later section is dedicated to applications in which dynamic learning methods serve as keystones for achieving models with high accuracy. Rather than rely on a mathematical theorem/proof style, the editors highlight numerous figures, tables, examples and applications, together with their explanations. This approach offers a useful basis for further investigation and fresh ideas and motivates and inspires newcomers to explore this promising and still emerging field of research.

Machine Learning for Data Streams

Machine Learning for Data Streams
Author: Albert Bifet
Publisher: MIT Press
Total Pages: 255
Release: 2018-03-16
Genre: Computers
ISBN: 0262346052

Download Machine Learning for Data Streams Book in PDF, Epub and Kindle

A hands-on approach to tasks and techniques in data stream mining and real-time analytics, with examples in MOA, a popular freely available open-source software framework. Today many information sources—including sensor networks, financial markets, social networks, and healthcare monitoring—are so-called data streams, arriving sequentially and at high speed. Analysis must take place in real time, with partial data and without the capacity to store the entire data set. This book presents algorithms and techniques used in data stream mining and real-time analytics. Taking a hands-on approach, the book demonstrates the techniques using MOA (Massive Online Analysis), a popular, freely available open-source software framework, allowing readers to try out the techniques after reading the explanations. The book first offers a brief introduction to the topic, covering big data mining, basic methodologies for mining data streams, and a simple example of MOA. More detailed discussions follow, with chapters on sketching techniques, change, classification, ensemble methods, regression, clustering, and frequent pattern mining. Most of these chapters include exercises, an MOA-based lab session, or both. Finally, the book discusses the MOA software, covering the MOA graphical user interface, the command line, use of its API, and the development of new methods within MOA. The book will be an essential reference for readers who want to use data stream mining as a tool, researchers in innovation or data stream mining, and programmers who want to create new algorithms for MOA.

Adaptive Stream Mining

Adaptive Stream Mining
Author: Albert Bifet
Publisher: IOS Press
Total Pages: 224
Release: 2010
Genre: Computers
ISBN: 1607500906

Download Adaptive Stream Mining Book in PDF, Epub and Kindle

This book is a significant contribution to the subject of mining time-changing data streams and addresses the design of learning algorithms for this purpose. It introduces new contributions on several different aspects of the problem, identifying research opportunities and increasing the scope for applications. It also includes an in-depth study of stream mining and a theoretical analysis of proposed methods and algorithms. The first section is concerned with the use of an adaptive sliding window algorithm (ADWIN). Since this has rigorous performance guarantees, using it in place of counters or accumulators, it offers the possibility of extending such guarantees to learning and mining algorithms not initially designed for drifting data. Testing with several methods, including Naïve Bayes, clustering, decision trees and ensemble methods, is discussed as well. The second part of the book describes a formal study of connected acyclic graphs, or 'trees', from the point of view of closure-based mining, presenting efficient algorithms for subtree testing and for mining ordered and unordered frequent closed trees. Lastly, a general methodology to identify closed patterns in a data stream is outlined. This is applied to develop an incremental method, a sliding-window based method, and a method that mines closed trees adaptively from data streams. These are used to introduce classification methods for tree data streams.

Machine Learning and Knowledge Discovery in Databases

Machine Learning and Knowledge Discovery in Databases
Author: Paolo Frasconi
Publisher: Springer
Total Pages: 850
Release: 2016-09-03
Genre: Computers
ISBN: 331946227X

Download Machine Learning and Knowledge Discovery in Databases Book in PDF, Epub and Kindle

The three volume set LNAI 9851, LNAI 9852, and LNAI 9853 constitutes the refereed proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases, ECML PKDD 2016, held in Riva del Garda, Italy, in September 2016. The 123 full papers and 16 short papers presented were carefully reviewed and selected from a total of 460 submissions. The papers presented focus on practical and real-world studies of machine learning, knowledge discovery, data mining; innovative prototype implementations or mature systems that use machine learning techniques and knowledge discovery processes in a real setting; recent advances at the frontier of machine learning and data mining with other disciplines. Part I and Part II of the proceedings contain the full papers of the contributions presented in the scientific track and abstracts of the scientific plenary talks. Part III contains the full papers of the contributions presented in the industrial track, short papers describing demonstration, the nectar papers, and the abstracts of the industrial plenary talks.

15th International Conference on Soft Computing Models in Industrial and Environmental Applications (SOCO 2020)

15th International Conference on Soft Computing Models in Industrial and Environmental Applications (SOCO 2020)
Author: Álvaro Herrero
Publisher: Springer Nature
Total Pages: 880
Release: 2020-08-28
Genre: Technology & Engineering
ISBN: 303057802X

Download 15th International Conference on Soft Computing Models in Industrial and Environmental Applications (SOCO 2020) Book in PDF, Epub and Kindle

This book contains accepted papers presented at SOCO 2020 conference held in the beautiful and historic city of Burgos (Spain), in September 2020. Soft computing represents a collection or set of computational techniques in machine learning, computer science and some engineering disciplines, which investigate, simulate, and analyze very complex issues and phenomena. After a through peer-review process, the SOCO 2020 International Program Committee selected 83 papers which are published in these conference proceedings and represents an acceptance rate of 35%. Due to the COVID-19 outbreak, the SOCO 2020 edition was blended, combining on-site and on-line participation. In this relevant edition a special emphasis was put on the organization of special sessions. Eleven special session were organized related to relevant topics such as: Soft Computing Applications in Precision Agriculture, Manufacturing and Management Systems, Management of Industrial and Environmental Enterprises, Logistics and Transportation Systems, Robotics and Autonomous Vehicles, Computer Vision, Laser-Based Sensing and Measurement and other topics such as Forecasting Industrial Time Series, IoT, Big Data and Cyber Physical Systems, Non-linear Dynamical Systems and Fluid Dynamics, Modeling and Control systems The selection of papers was extremely rigorous in order to maintain the high quality of SOCO conference editions and we would like to thank the members of the Program Committees for their hard work in the reviewing process. This is a crucial process to the creation of a high standard conference and the SOCO conference would not exist without their help.

Advances in Data Mining - Theoretical Aspects and Applications

Advances in Data Mining - Theoretical Aspects and Applications
Author: Petra Perner
Publisher: Springer
Total Pages: 362
Release: 2007-08-18
Genre: Computers
ISBN: 354073435X

Download Advances in Data Mining - Theoretical Aspects and Applications Book in PDF, Epub and Kindle

The papers in this volume represent the proceedings of the 7th Industrial Conference on Data Mining. They are organized into topical sections on aspects of classification and prediction, clustering, web mining, data mining in medicine, applications of data mining, time series and frequent pattern mining, and association rule mining. Readers gain new insights into theories underlying data mining and discover state-of-the-technology applications.

Proceedings. 20. Workshop Computational Intelligence, Dortmund, 1. Dezember - 3. Dezember 2010

Proceedings. 20. Workshop Computational Intelligence, Dortmund, 1. Dezember - 3. Dezember 2010
Author: Frank Hoffmann
Publisher: KIT Scientific Publishing
Total Pages: 328
Release: 2014-08-14
Genre: Computers
ISBN: 3866445806

Download Proceedings. 20. Workshop Computational Intelligence, Dortmund, 1. Dezember - 3. Dezember 2010 Book in PDF, Epub and Kindle

Dieser Tagungsband enthält die Beiträge des 20. Workshops "Computational Intelligence" des Fachausschusses 5.14 der VDI/VDE-Gesellschaft für Mess- und Automatisierungstechnik (GMA) der vom 1.-3. Dezember 2010 im Haus Bommerholz (Dortmund) stattfand. Die Schwerpunkte waren Methoden, Anwendungen und Tools für- Fuzzy-Systeme, - Künstliche Neuronale Netze, - Evolutionäre Algorithmen und- Data-Mining-Verfahrensowie der Methodenvergleich anhand von industriellen und Benchmark-Problemen.