Linking Sensitive Data

Author: Peter Christen
Publisher:
Total Pages: 476
Release: 2020
Genre: Computer security
ISBN: 3030597067

This book provides modern technical answers to the legal requirements of pseudonymisation as recommended by privacy legislation. It covers topics such as modern regulatory frameworks for sharing and linking sensitive information, concepts and algorithms for privacy-preserving record linkage and their computational aspects, practical considerations such as dealing with dirty and missing data, as well as privacy, risk, and performance assessment measures. Existing techniques for privacy-preserving record linkage are evaluated empirically and real-world application examples that scale to population sizes are described. The book also includes pointers to freely available software tools, benchmark data sets, and tools to generate synthetic data that can be used to test and evaluate linkage techniques.
This book consists of fourteen chapters grouped into four parts, and two appendices. The first part introduces the reader to the topic of linking sensitive data, the second part covers methods and techniques to link such data, the third part discusses aspects of practical importance, and the fourth part provides an outlook of future challenges and open research problems relevant to linking sensitive databases. The appendices provide pointers and describe freely available, open-source software systems that allow the linkage of sensitive data, and provide further details about the evaluations presented. A companion Web site at https://dmm.anu.edu.au/lsdbook2020 provides additional material and Python programs used in the book.
This book is mainly written for applied scientists, researchers, and advanced practitioners in governments, industry, and universities who are concerned with developing, implementing, and deploying systems and tools to share sensitive information in administrative, commercial, or medical databases.
“The book describes how linkage methods work and how to evaluate their performance. It covers all the major concepts and methods and also discusses practical matters such as computational efficiency, which are critical if the methods are to be used in practice - and it does all this in a highly accessible way!” David J. Hand, Imperial College, London.

Data Matching

Author: Peter Christen
Publisher: Springer Science & Business Media
Total Pages: 279
Release: 2012-07-04
Genre: Computers
ISBN: 3642311644

Data matching (also known as record or data linkage, entity resolution, object identification, or field matching) is the task of identifying, matching and merging records that correspond to the same entities from several databases or even within one database. Based on research in various domains including applied statistics, health informatics, data mining, machine learning, artificial intelligence, database management, and digital libraries, significant advances have been achieved over the last decade in all aspects of the data matching process, especially on how to improve the accuracy of data matching, and its scalability to large databases. Peter Christen’s book is divided into three parts: Part I, “Overview”, introduces the subject by presenting several sample applications and their special challenges, as well as a general overview of a generic data matching process. Part II, “Steps of the Data Matching Process”, then details its main steps like pre-processing, indexing, field and record comparison, classification, and quality evaluation. Lastly, part III, “Further Topics”, deals with specific aspects like privacy, real-time matching, or matching unstructured data. Finally, it briefly describes the main features of many research and open source systems available today. By providing the reader with a broad range of data matching concepts and techniques and touching on all aspects of the data matching process, this book helps researchers as well as students specializing in data quality or data matching aspects to familiarize themselves with recent research advances and to identify open research challenges in the area of data matching. To this end, each chapter of the book includes a final section that provides pointers to further background and research material. Practitioners will better understand the current state of the art in data matching as well as the internal workings and limitations of current systems. 
In particular, they will learn that it is often not feasible to simply implement an existing off-the-shelf data matching system without substantial adaptation and customization. Such practical considerations are discussed for each of the major steps in the data matching process.
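
The main steps named in this description (pre-processing, indexing, field and record comparison, classification, and quality evaluation) can be illustrated with a minimal Python sketch. It is not taken from the book: the field names, the first-letter blocking key, the `difflib` similarity measure, and the 0.8 threshold are all assumptions chosen only to keep the example small.

```python
from collections import defaultdict
from difflib import SequenceMatcher

FIELDS = ("given", "surname", "city")  # hypothetical field names


def preprocess(record):
    """Pre-processing: trim whitespace and lower-case every field."""
    return {k: v.strip().lower() for k, v in record.items()}


def block_key(record):
    """Indexing: only records sharing a surname initial are compared."""
    return record["surname"][:1]


def compare(a, b):
    """Field comparison: one similarity in [0, 1] per field."""
    return [SequenceMatcher(None, a[f], b[f]).ratio() for f in FIELDS]


def classify(similarities, threshold=0.8):
    """Classification: match if the mean similarity reaches the threshold."""
    return sum(similarities) / len(similarities) >= threshold


def match(db_a, db_b):
    """Run all steps and return the set of matched index pairs (i, j)."""
    a_clean = [preprocess(r) for r in db_a]
    b_clean = [preprocess(r) for r in db_b]
    index = defaultdict(list)
    for j, b in enumerate(b_clean):
        index[block_key(b)].append(j)
    matches = set()
    for i, a in enumerate(a_clean):
        for j in index.get(block_key(a), []):
            if classify(compare(a, b_clean[j])):
                matches.add((i, j))
    return matches


def evaluate(found, truth):
    """Quality evaluation: precision and recall against known true matches."""
    true_positives = len(found & truth)
    precision = true_positives / len(found) if found else 0.0
    recall = true_positives / len(truth) if truth else 0.0
    return precision, recall


# Made-up example records, for illustration only.
db_a = [
    {"given": "Peter ", "surname": "Miller", "city": "Canberra"},
    {"given": "Anna", "surname": "Smith", "city": "Sydney"},
]
db_b = [
    {"given": "peter", "surname": "Miler", "city": "canberra"},
    {"given": "Anne", "surname": "Jones", "city": "Perth"},
]
found = match(db_a, db_b)
precision, recall = evaluate(found, {(0, 0)})
```

A real system would likely use more robust blocking keys and field-specific comparison functions; the sketch only shows how the steps fit together.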

Record Linkage and Privacy

Author: United States. General Accounting Office
Publisher: DIANE Publishing
Total Pages: 172
Release: 2001
Genre: Electronic records
ISBN: 1428949291

Record Linkage and Privacy

Author:
Publisher:
Total Pages: 174
Release: 2001
Genre: Electronic records
ISBN:

Data Quality and Record Linkage Techniques

Author: Thomas N. Herzog
Publisher: Springer Science & Business Media
Total Pages: 225
Release: 2007-05-23
Genre: Computers
ISBN: 0387695052

This book offers a practical understanding of issues involved in improving data quality through editing, imputation, and record linkage. The first part of the book deals with methods and models, focusing on the Fellegi-Holt edit-imputation model, the Little-Rubin multiple-imputation scheme, and the Fellegi-Sunter record linkage model. The second part presents case studies in which these techniques are applied in a variety of areas, including mortgage guarantee insurance, medical, biomedical, highway safety, and social insurance as well as the construction of list frames and administrative lists. This book offers a mixture of practical advice, mathematical rigor, management insight and philosophy.
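
As a rough illustration of the Fellegi-Sunter record linkage model mentioned above, the following Python sketch computes a match weight as a sum of per-field log-likelihood ratios and compares it with two thresholds. The field names, the m- and u-probabilities, and the threshold values are hypothetical and chosen only for illustration; in practice they would be estimated from data.

```python
from math import log2

# Hypothetical per-field probabilities (illustrative values only):
#   m = P(field agrees | the two records are a true match)
#   u = P(field agrees | the two records are not a match)
PROBS = {
    "surname": (0.95, 0.01),
    "birth_year": (0.98, 0.05),
    "postcode": (0.90, 0.02),
}


def field_weight(agrees, m, u):
    """Agreement weight log2(m/u), disagreement weight log2((1-m)/(1-u))."""
    return log2(m / u) if agrees else log2((1 - m) / (1 - u))


def match_weight(agreement):
    """Sum the field weights, assuming fields are conditionally independent."""
    return sum(field_weight(agreement[f], m, u) for f, (m, u) in PROBS.items())


def decide(weight, lower=0.0, upper=8.0):
    """Three-way decision; the thresholds here are assumed, not derived."""
    if weight >= upper:
        return "match"
    if weight <= lower:
        return "non-match"
    return "possible match"


all_agree = {"surname": True, "birth_year": True, "postcode": True}
none_agree = {"surname": False, "birth_year": False, "postcode": False}
partial = {"surname": True, "birth_year": False, "postcode": True}

decisions = [decide(match_weight(a)) for a in (all_agree, none_agree, partial)]
```

Record pairs falling between the two thresholds are the ones the model leaves for clerical review.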

Methodological Developments in Data Linkage

Author: Katie Harron
Publisher: John Wiley & Sons
Total Pages: 286
Release: 2015-12-14
Genre: Medical
ISBN: 1118745876

A comprehensive compilation of new developments in data linkage methodology.
The increasing availability of large administrative databases has led to a dramatic rise in the use of data linkage, yet the standard texts on linkage are still those which describe the seminal work from the 1950s and 1960s, with some updates. Linkage and analysis of data across sources remains problematic due to lack of discriminatory and accurate identifiers, missing data and regulatory issues. Recent developments in data linkage methodology have concentrated on bias and analysis of linked data, novel approaches to organising relationships between databases and privacy-preserving linkage. Methodological Developments in Data Linkage brings together a collection of contributions from members of the international data linkage community, covering cutting edge methodology in this field. It presents opportunities and challenges provided by linkage of large and often complex datasets, including analysis problems, legal and security aspects, models for data access and the development of novel research areas. New methods for handling uncertainty in analysis of linked data, solutions for anonymised linkage and alternative models for data collection are also discussed.
Key Features:
Presents cutting edge methods for a topic of increasing importance to a wide range of research areas, with applications to data linkage systems internationally.
Covers the essential issues associated with data linkage today.
Includes examples based on real data linkage systems, highlighting the opportunities, successes and challenges that the increasing availability of linkage data provides.
Novel approach incorporates technical aspects of linkage, management and analysis of linked data.
This book will be of core interest to academics, government employees, data holders, data managers, analysts and statisticians who use administrative data. It will also appeal to researchers in a variety of areas, including epidemiology, biostatistics, social statistics, informatics, policy and public health.

Advances in Business Statistics, Methods and Data Collection

Author: Ger Snijkers
Publisher: John Wiley & Sons
Total Pages: 900
Release: 2023-02-22
Genre: Business & Economics
ISBN: 1119672309

Advances in Business Statistics, Methods and Data Collection delivers insights into the latest state of play in producing establishment statistics, obtained from businesses, farms and institutions. Presenting materials and reflecting discussions from the 6th International Conference on Establishment Statistics (ICES-VI), this edited volume provides a broad overview of methodology underlying current establishment statistics from every aspect of the production life cycle while spotlighting innovative and impactful advancements in the development, conduct, and evaluation of modern establishment statistics programs.
Highlights include:
Practical discussions on agile, timely, and accurate measurement of rapidly evolving economic phenomena such as globalization, new computer technologies, and the informal sector.
Comprehensive explorations of administrative and new data sources and technologies, covering big (organic) data sources and methods for data integration, linking, machine learning and visualization.
Detailed compilations of statistical programs’ responses to wide-ranging data collection and production challenges, including those caused by the Covid-19 pandemic.
In-depth examinations of business survey questionnaire design, computerization, pretesting methods, experimentation, and paradata.
Methodical presentations of conventional and emerging procedures in survey statistics techniques for establishment statistics, encompassing probability sampling designs and sample coordination, non-probability sampling, missing data treatments, small area estimation and Bayesian methods.
Providing a broad overview of the most up-to-date science, this book challenges the status quo and prepares researchers for current and future challenges in establishment statistics and methods. Perfect for survey researchers, government statisticians, National Bank employees, economists, and undergraduate and graduate students in survey research and economics, Advances in Business Statistics, Methods and Data Collection will also earn a place in the toolkit of researchers working with data in industries across a variety of fields.

Computer Matching and Privacy Protection Act of 1987

Author: United States. Congress. House. Committee on Government Operations. Government Information, Justice, and Agriculture Subcommittee
Publisher:
Total Pages: 152
Release: 1987
Genre: Administrative agencies
ISBN:

Linking Data for Health Services Research

Author: Agency for Healthcare Research and Quality
Publisher: Createspace Independent Publishing Platform
Total Pages: 0
Release: 2014-12-31
Genre: Linked data
ISBN: 9781505859430

Health registries greatly enhance health services research, especially when linked with other data sources such as administrative claims. Recently, concerns about patient privacy and data security have produced policies such as the Health Insurance Portability and Accountability Act (HIPAA) that reduce the availability of sensitive identifying information. In this context, the development of effective record linkage approaches for varying scenarios of data availability is critical. This report presents a conceptual framework and instructional information that scientifically describe the strengths and limitations of different approaches to record linkage of registries to other data sources, and it defines the requirements for high-quality linkage. By explaining the spectrum of activities involved, it serves as an instructional guide for researchers designing new comparative effectiveness research (CER) studies using patient registries linked with other secondary data sources. Through this report, we provide an overview of linkage from registries to administrative claims, including considerations for researchers, data managers, information technology managers, and other stakeholders who are likely to be involved in the process of data linkage. We also apply the data linkage framework to a real-world problem and discuss the results.