Machine Learning and Knowledge Discovery in Databases

Machine Learning and Knowledge Discovery in Databases
Author: Walter Daelemans
Publisher: Springer Science & Business Media
Total Pages: 714
Release: 2008-09-04
Genre: Computers
ISBN: 354087478X

Download Machine Learning and Knowledge Discovery in Databases Book in PDF, Epub and Kindle

This book constitutes the refereed proceedings of the joint conference on Machine Learning and Knowledge Discovery in Databases: ECML PKDD 2008, held in Antwerp, Belgium, in September 2008. The 100 papers presented in two volumes, together with 5 invited talks, were carefully reviewed and selected from 521 submissions. In addition to the regular papers the volume contains 14 abstracts of papers appearing in full version in the Machine Learning Journal and the Knowledge Discovery and Databases Journal of Springer. The conference intends to provide an international forum for the discussion of the latest high quality research results in all areas related to machine learning and knowledge discovery in databases. The topics addressed are application of machine learning and data mining methods to real-world problems, particularly exploratory research that describes novel learning and mining tasks and applications requiring non-standard techniques.

Community Structure of Complex Networks

Community Structure of Complex Networks
Author: Hua-Wei Shen
Publisher: Springer Science & Business Media
Total Pages: 128
Release: 2013-01-06
Genre: Computers
ISBN: 3642318215

Download Community Structure of Complex Networks Book in PDF, Epub and Kindle

Community structure is a salient structural characteristic of many real-world networks. Communities are generally hierarchical, overlapping, multi-scale and coexist with other types of structural regularities of networks. This poses major challenges for conventional methods of community detection. This book will comprehensively introduce the latest advances in community detection, especially the detection of overlapping and hierarchical community structures, the detection of multi-scale communities in heterogeneous networks, and the exploration of multiple types of structural regularities. These advances have been successfully applied to analyze large-scale online social networks, such as Facebook and Twitter. This book provides readers a convenient way to grasp the cutting edge of community detection in complex networks. The thesis on which this book is based was honored with the “Top 100 Excellent Doctoral Dissertations Award” from the Chinese Academy of Sciences and was nominated as the “Outstanding Doctoral Dissertation” by the Chinese Computer Federation.

Overlapping Communities on Large-Scale Networks

Overlapping Communities on Large-Scale Networks
Author: Alessandro Antonio Grande
Publisher:
Total Pages: 0
Release: 2022
Genre:
ISBN:

Download Overlapping Communities on Large-Scale Networks Book in PDF, Epub and Kindle

We study AdaSVI on a matrix factorization model and find that it significantly improves SVI, leading to faster convergence on synthetic data.

Overlapping Community Detection in Massive Social Networks

Overlapping Community Detection in Massive Social Networks
Author: Joyce Jiyoung Whang
Publisher:
Total Pages: 258
Release: 2015
Genre:
ISBN:

Download Overlapping Community Detection in Massive Social Networks Book in PDF, Epub and Kindle

Massive social networks have become increasingly popular in recent years. Community detection is one of the most important techniques for the analysis of such complex networks. A community is a set of cohesive vertices that has more connections inside the set than outside. In many social and information networks, these communities naturally overlap. For instance, in a social network, each vertex in a graph corresponds to an individual who usually participates in multiple communities. In this thesis, we propose scalable overlapping community detection algorithms that effectively identify high quality overlapping communities in various real-world networks. We first develop an efficient overlapping community detection algorithm using a seed set expansion approach. The key idea of this algorithm is to find good seeds and then greedily expand these seeds using a personalized PageRank clustering scheme. Experimental results show that our algorithm significantly outperforms other state-of-the-art overlapping community detection methods in terms of run time, cohesiveness of communities, and ground-truth accuracy. To develop more principled methods, we formulate the overlapping community detection problem as a non-exhaustive, overlapping graph clustering problem where clusters are allowed to overlap with each other, and some nodes are allowed to be outside of any cluster. To tackle this non-exhaustive, overlapping clustering problem, we propose a simple and intuitive objective function that captures the issues of overlap and non-exhaustiveness in a unified manner. To optimize the objective, we develop not only fast iterative algorithms but also more sophisticated algorithms using a low-rank semidefinite programming technique. Our experimental results show that the new objective and the algorithms are effective in finding ground-truth clusterings that have varied overlap and non-exhaustiveness. We extend our non-exhaustive, overlapping clustering techniques to co-clustering where the goal is to simultaneously identify a clustering of the rows as well as the columns of a data matrix. As an example application, consider recommender systems where users have ratings on items. This can be represented by a bipartite graph where users and items are denoted by two different types of nodes, and the ratings are denoted by weighted edges between the users and the items. In this case, co-clustering would be a simultaneous clustering of users and items. We propose a new co-clustering objective function and an efficient co-clustering algorithm that is able to identify overlapping clusters as well as outliers on both types of the nodes in the bipartite graph. We show that our co-clustering algorithm is able to effectively capture the underlying co-clustering structure of the data, which results in boosting the performance of a standard one-dimensional clustering. Finally, we study the design of parallel data-driven algorithms, which enables us to further increase the scalability of our overlapping community detection algorithms. Using PageRank as a model problem, we look at three algorithm design axes: work activation, data access pattern, and scheduling. We investigate the impact of different algorithm design choices. Using these design axes, we design and test a variety of PageRank implementations finding that data-driven, push-based algorithms are able to achieve a significantly superior scalability than standard PageRank implementations. The design choices affect both single-threaded performance as well as parallel scalability. The lessons learned from this study not only guide efficient implementations of many graph mining algorithms but also provide a framework for designing new scalable algorithms, especially for large-scale community detection.

LARGE SCALE COMPLEX NETWORK ANALYSIS

LARGE SCALE COMPLEX NETWORK ANALYSIS
Author: Subhankar Dhar
Publisher: Academic Publishers
Total Pages: 100
Release: 2015-12-19
Genre: Computers
ISBN: 9383420723

Download LARGE SCALE COMPLEX NETWORK ANALYSIS Book in PDF, Epub and Kindle

Workshop Proceedings, Indian Statistical Institute, Kolkata December 19-20, 2015

Overlapping Communities on Social Networks

Overlapping Communities on Social Networks
Author: Tianyi Li (S.M.)
Publisher:
Total Pages: 41
Release: 2020
Genre:
ISBN:

Download Overlapping Communities on Social Networks Book in PDF, Epub and Kindle

Community detection is a central topic in network studies, whereas no community detection algorithm can be optimal for all possible networks; thus it is important to identify whether the algorithm is suitable for a given network. We propose a multi-step algorithmic solution scheme for overlapping community detection based on an advanced label propagation process, which imitates the community formation process on social networks. Our algorithm is parameter-free and is able to reveal the hierarchical order of communities in the graph. The unique property of our solution scheme is self-falsifiability; an automatic quality check of the results is conducted after the detection, and the fitness of the algorithm for the specific network is reported. Extensive experiments show that our algorithm is self-consistent, reliable on networks of a wide range of size and different sorts, and is more robust than existing algorithms on both sparse and large-scale social networks. Results further suggest that our solution scheme may uncover features of networks’ intrinsic community structures, which implies that this study builds up potential theoretical ground for future research, beyond expected applications in a wider-scale.

Graph Theoretic Approaches for Analyzing Large-Scale Social Networks

Graph Theoretic Approaches for Analyzing Large-Scale Social Networks
Author: Meghanathan, Natarajan
Publisher: IGI Global
Total Pages: 376
Release: 2017-07-13
Genre: Computers
ISBN: 1522528156

Download Graph Theoretic Approaches for Analyzing Large-Scale Social Networks Book in PDF, Epub and Kindle

Social network analysis has created novel opportunities within the field of data science. The complexity of these networks requires new techniques to optimize the extraction of useful information. Graph Theoretic Approaches for Analyzing Large-Scale Social Networks is a pivotal reference source for the latest academic research on emerging algorithms and methods for the analysis of social networks. Highlighting a range of pertinent topics such as influence maximization, probabilistic exploration, and distributed memory, this book is ideally designed for academics, graduate students, professionals, and practitioners actively involved in the field of data science.