Browse Results

Showing 52,176 through 52,200 of 61,805 results

Statistical Causal Discovery: LiNGAM Approach (SpringerBriefs in Statistics)

by Shohei Shimizu

This is the first book to provide a comprehensive introduction to a new semiparametric causal discovery approach known as LiNGAM, with the fundamental background needed to understand it. It offers a general overview of the basics of the LiNGAM approach for causal discovery, estimation principles, and algorithms. This semiparametric approach is one of the most exciting new topics in the field of causal discovery. The new framework assumes parametric assumptions on the functional forms of structural equations but makes no assumption on the distributions of exogenous variables other than non-Gaussianity. It provides data-analysis tools capable of estimating a much wider class of causal relations even in the presence of hidden common causes. This feature is in contrast to conventional nonparametric approaches based on conditional independence of variables. This book is highly recommended to readers who seek an in-depth and up-to-date overview of this new causal discovery approach to advance the technique as well as to those who are interested in applying this approach to real-world problems. This LiNGAM approach should become a standard item in the toolbox of statisticians, machine learners, and practitioners who need to perform observational studies.

Statistical Computing in C++ and R (Chapman & Hall/CRC The R Series)

by Randall L. Eubank Ana Kupresanin

Parallel processing can be ideally suited for the solving of more complex problems in statistical computing. This book discusses code development in C++ and R, before going beyond to look at the valuable use of these two languages in unison. It covers linear equation solution with regression and linear models motivation, optimization with maximum likelihood and nonlinear least squares motivation, and random number generation. While the text does require a working knowledge of basic concepts in statistics and experience in programming, it does not require knowledge specific to C++ or R.

Statistical Data Analysis Using SAS: Intermediate Statistical Methods (Springer Texts in Statistics)

by Mervyn G. Marasinghe Kenneth J. Koehler

The aim of this textbook (previously titled SAS for Data Analytics) is to teach the use of SAS for statistical analysis of data for advanced undergraduate and graduate students in statistics, data science, and disciplines involving analyzing data.The book begins with an introduction beyond the basics of SAS, illustrated with non-trivial, real-world, worked examples. It proceeds to SAS programming and applications, SAS graphics, statistical analysis of regression models, analysis of variance models, analysis of variance with random and mixed effects models, and then takes the discussion beyond regression and analysis of variance to conclude.Pedagogically, the authors introduce theory and methodological basis topic by topic, present a problem as an application, followed by a SAS analysis of the data provided and a discussion of results. The text focuses on applied statistical problems and methods. Key features include: end of chapter exercises, downloadable SAS code and data sets, and advanced material suitable for a second course in applied statistics with every method explained using SAS analysis to illustrate a real-world problem.New to this edition:• Covers SAS v9.2 and incorporates new commands• Uses SAS ODS (output delivery system) for reproduction of tables and graphics output• Presents new commands needed to produce ODS output• All chapters rewritten for clarity• New and updated examples throughout• All SAS outputs are new and updated, including graphics• More exercises and problems• Completely new chapter on analysis of nonlinear and generalized linear models• Completely new appendixMervyn G. Marasinghe, PhD, is Associate Professor Emeritus of Statistics at Iowa State University, where he has taught courses in statistical methods and statistical computing.Kenneth J. Koehler, PhD, is University Professor of Statistics at Iowa State University, where he teaches courses in statistical methodology at both graduate and undergraduate levels and primarily uses SAS to supplement his teaching.

Statistical Data Cleaning with Applications in R

by Edwin De Jonge Mark van der Loo

A comprehensive guide to automated statistical data cleaning <p><p> The production of clean data is a complex and time-consuming process that requires both technical know-how and statistical expertise. Statistical Data Cleaning with Applications in R brings together a wide range of techniques for cleaning textual, numeric or categorical data. This book examines technical data cleaning methods relating to data representation and data structure. A prominent role is given to statistical data validation, data cleaning based on predefined restrictions, and data cleaning strategy.

Statistical Data Mining Using SAS Applications (Chapman & Hall/CRC Data Mining and Knowledge Discovery Series)

by George Fernandez

Statistical Data Mining Using SAS Applications, Second Edition describes statistical data mining concepts and demonstrates the features of user-friendly data mining SAS tools. Integrating the statistical and graphical analysis tools available in SAS systems, the book provides complete statistical data mining solutions without writing SAS program co

Statistical Data Mining and Knowledge Discovery

by Hamparsum Bozdogan

Massive data sets pose a great challenge to many cross-disciplinary fields, including statistics. The high dimensionality and different data types and structures have now outstripped the capabilities of traditional statistical, graphical, and data visualization tools. Extracting useful information from such large data sets calls for novel approache

Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis

by Alfred P. Rovai

The purpose of this book is to provide a working background of descriptive and inferential statistics and step-by-step examples of how to perform various statistical procedures using Microsoft Excel's native operators and functions. Automated procedures are also described using Excel's Analysis TookPak and AnalystSoft StatPlus.

Statistical Inference via Data Science: A ModernDive into R and the Tidyverse (Chapman & Hall/CRC The R Series)

by Chester Ismay Albert Y. Kim Arturo Valdivia

Statistical Inference via Data Science: A ModernDive into R and the Tidyverse, Second Edition offers a comprehensive guide to learning statistical inference with data science tools widely used in industry, academia, and government. The first part of this book introduces the tidyverse suite of R packages, including ggplot2 for data visualization and dplyr for data wrangling. The second part introduces data modeling via simple and multiple linear regression. The third part presents statistical inference using simulation-based methods within a general framework implemented in R via the infer package, a suitable complement to the tidyverse. By working with these methods, readers can implement effective exploratory data analyses, conduct statistical modeling with data, and carry out statistical inference via confidence intervals and hypothesis testing. All of these tasks are performed by strongly emphasizing data visualization.Key Features in the Second Edition: Minimal Prerequisites: No prior calculus or coding experience is needed, making the content accessible to a wide audience. Real-World Data: Learn with real-world datasets, including all domestic flights leaving New York City in 2023, the Gapminder project, FiveThirtyEight.com data, and new datasets on health, global development, music, coffee quality, and geyser eruptions. Simulation-Based Inference: Statistical inference through simulation-based methods. Expanded Theoretical Discussions: Includes deeper coverage of theory-based approaches, their connection with simulation-based approaches, and a presentation of intuitive and formal aspects of these methods. Enhanced Use of the infer Package: Leverages the infer package for “tidy” and transparent statistical inference, enabling readers to construct confidence intervals and conduct hypothesis tests through multiple linear regression and beyond. Dynamic Online Resources: All code and output are embedded in the text, with additional interactive exercises, discussions, and solutions available online. Broadened Applications: Suitable for undergraduate and graduate courses, including statistics, data science, and courses emphasizing reproducible research. The first edition of the book has been used in so many different ways--for courses in statistical inference, statistical programming, business analytics, and data science for social policy, and by professionals in many other means. Ideal for those new to statistics or looking to deepen their knowledge, this edition provides a clear entry point into data science and modern statistical methods.

Statistical Inference: The Minimum Distance Approach (Chapman & Hall/CRC Monographs on Statistics and Applied Probability)

by Ayanendranath Basu Hiroyuki Shioya Chanseok Park

In many ways, estimation by an appropriate minimum distance method is one of the most natural ideas in statistics. However, there are many different ways of constructing an appropriate distance between the data and the model: the scope of study referred to by "Minimum Distance Estimation" is literally huge. Filling a statistical resource gap, Stati

Statistical Language and Speech Processing

by Carlos Martín-Vide Adrian-Horia Dediu Laurent Besacier

This book constitutes the refereed proceedings of the Second International Conference on Statistical Language and Speech Processing, SLSP 2014, held in Grenoble, France, in October 2014. The 18 full papers presented together with three invited talks were carefully reviewed and selected from 53 submissions. The papers are organized in topical sections on machine translation, speech and speaker recognition, machine learning methods, text extraction and categorization, and mining text.

Statistical Language and Speech Processing

by Carlos Martín-Vide Adrian-Horia Dediu Klára Vicsi

This book constitutes the refereed proceedings of theThird International Conference on Statistical Language and Speech Processing,SLSP 2015, held in Budapest, Hungary, in November 2015. The 26 full papers presented together with twoinvited talks were carefully reviewed and selected from 71 submissions. Thepapers cover topics such as: anaphora and coreference resolution; authorshipidentification, plagiarism and spam filtering; computer-aided translation;corpora and language resources; data mining and semantic Web; informationextraction; information retrieval; knowledge representation and ontologies;lexicons and dictionaries; machine translation; multimodal technologies;natural language understanding; neural representation of speech and language;opinion mining and sentiment analysis; parsing; part-of-speech tagging;question-answering systems; semantic role labelling; speaker identification andverification; speech and language generation; speech recognition; speechsynthesis; speech transcription; spelling correction; spoken dialogue systems;term extraction; text categorisation; text summarisation; and user modeling.

Statistical Language and Speech Processing: 4th International Conference, SLSP 2016, Pilsen, Czech Republic, October 11-12, 2016, Proceedings (Lecture Notes in Computer Science #9918)

by Carlos Martín-Vide Pavel Král

This book constitutes the refereed proceedings of the 4th International Conference on Statistical Language and Speech Processing, SLSP 2016, held in Pilsen, Czech Republic, in October 2016. The 11 full papers presented together with two invited talks were carefully reviewed and selected from 38 submissions. The papers cover topics such as anaphora and coreference resolution; authorship identification, plagiarism and spam filtering; computer-aided translation; corpora and language resources; data mining and semantic web; information extraction; information retrieval; knowledge representation and ontologies; lexicons and dictionaries; machine translation; multimodal technologies; natural language understanding; neural representation of speech and language; opinion mining and sentiment analysis; parsing; part-of-speech tagging; question and answering systems; semantic role labeling; speaker identification and verification; speech and language generation; speech recognition; speech synthesis; speech transcription; speech correction; spoken dialogue systems; term extraction; text categorization; test summarization; user modeling.

Statistical Language and Speech Processing: 5th International Conference, SLSP 2017, Le Mans, France, October 23–25, 2017, Proceedings (Lecture Notes in Computer Science #10583)

by Nathalie Camelin, Yannick Estève and Carlos Martín-Vide

This book constitutes the refereed proceedings of the 5th International Conference on Statistical Language and Speech Processing, SLSP 2017, held in Le Mans, France, in October 2017. The 21 full papers presented were carefully reviewed and selected from 39 submissions. The papers cover topics such as anaphora and conference resolution; authorship identification, plagiarism and spam filtering; computer-aided translation; corpora and language resources; data mining and semanticweb; information extraction; information retrieval; knowledge representation and ontologies; lexicons and dictionaries; machine translation; multimodal technologies; natural language understanding; neural representation of speech and language; opinion mining and sentiment analysis; parsing; part-of-speech tagging; question and answering systems; semantic role labeling; speaker identification and verification; speech and language generation; speech recognition; speech synthesis; speech transcription; speech correction; spoken dialogue systems; term extraction; text categorization; test summarization; user modeling. They are organized in the following sections: language and information extraction; post-processing and applications of automatic transcriptions; speech paralinguistics and synthesis; speech recognition: modeling and resources.

Statistical Language and Speech Processing: 6th International Conference, SLSP 2018, Mons, Belgium, October 15–16, 2018, Proceedings (Lecture Notes in Computer Science #11171)

by Carlos Martín-Vide Thierry Dutoit Gueorgui Pironkov

This book constitutes the proceedings of the 6th International Conference on Statistical Language and Speech Processing, SLSP 2018, held in Mons, Belgium, in October 2018. The 15 full papers presented in this volume were carefully reviewed and selected from 40 submissions. They were organized in topical sections named: speech synthesis and spoken language generation; speech recognition and post-processing; natural language processing and understanding; and text processing and analysis.

Statistical Language and Speech Processing: 7th International Conference, SLSP 2019, Ljubljana, Slovenia, October 14–16, 2019, Proceedings (Lecture Notes in Computer Science #11816)

by Carlos Martín-Vide Matthew Purver Senja Pollak

This book constitutes the proceedings of the 7th International Conference on Statistical Language and Speech Processing, SLSP 2019, held in Ljubljana, Slovenia, in October 2019. The 25 full papers presented together with one invited paper in this volume were carefully reviewed and selected from 48 submissions. They were organized in topical sections named: Dialogue and Spoken Language Understanding; Language Analysis and Generation; Speech Analysis and Synthesis; Speech Recognition; Text Analysis and Classification.

Statistical Language and Speech Processing: 8th International Conference, SLSP 2020, Cardiff, UK, October 14–16, 2020, Proceedings (Lecture Notes in Computer Science #12379)

by Carlos Martín-Vide Luis Espinosa-Anke Irena Spasić

This book constitutes the proceedings of the 8th International Conference on Statistical Language and Speech Processing, SLSP 2020, held in Cardiff, UK, in October 2020. The 13 full papers presented together with one invited paper in this volume were carefully reviewed and selected from 25 submissions. They papers cover the wide spectrum of statistical methods that are currently in use in computational language or speech processing.

Statistical Language and Speech Processing: 9th International Conference, SLSP 2021, Virtual Event, November 22-26, 2021, Proceedings (Lecture Notes in Computer Science #13062)

by Carlos Martín-Vide Luis Espinosa-Anke Irena Spasić

This book constitutes the proceedings of the 9th International Conference on Statistical Language and Speech Processing, SLSP 2021, held in Cardiff, UK, in November 2021.The 9 full papers presented in this volume were carefully reviewed and selected from 21 submissions. The papers present topics of either theoretical or applied interest discussing the employment of statistical models (including machine learning) within language and speech processing.

Statistical Laws in Complex Systems: Combining Mechanistic Models and Data Analysis (Understanding Complex Systems)

by Eduardo G. Altmann

This book provides a unifying approach to the study of statistical laws, critically evaluating their role in the theoretical understanding of complex systems and the different data-analysis methods used to evaluate them. Statistical laws describe regular patterns observed in diverse scientific domains, ranging from the magnitude of earthquakes (Gutenberg-Richter law) and metabolic rates in organisms (Kleiber's law), to the frequency distribution of words in texts (Zipf's and Herdan-Heaps' laws), and productivity metrics of cities (urban scaling laws). The origins of these laws, their empirical validity, and the insights they provide into underlying systems have been subjects of scientific inquiry for centuries. Through a historical review and a unified analysis, this book argues that the persistent controversies on the validity of statistical laws are predominantly rooted not in novel empirical findings but in the discordance among data-analysis techniques, mechanistic models, and the interpretations of statistical laws. Starting with simple examples and progressing to more advanced time-series and statistical methods, this book and its accompanying repository provide comprehensive material for researchers interested in analyzing data, testing and comparing different laws, and interpreting results in both existing and new datasets.

Statistical Learning Tools for Electricity Load Forecasting (Statistics for Industry, Technology, and Engineering)

by Jean-Michel Poggi Anestis Antoniadis Jairo Cugliari Matteo Fasiolo Yannig Goude

This monograph explores a set of statistical and machine learning tools that can be effectively utilized for applied data analysis in the context of electricity load forecasting. Drawing on their substantial research and experience with forecasting electricity demand in industrial settings, the authors guide readers through several modern forecasting methods and tools from both industrial and applied perspectives – generalized additive models (GAMs), probabilistic GAMs, functional time series and wavelets, random forests, aggregation of experts, and mixed effects models. A collection of case studies based on sizable high-resolution datasets, together with relevant R packages, then illustrate the implementation of these techniques. Five real datasets at three different levels of aggregation (nation-wide, region-wide, or individual) from four different countries (UK, France, Ireland, and the USA) are utilized to study five problems: short-term point-wise forecasting, selection of relevant variables for prediction, construction of prediction bands, peak demand prediction, and use of individual consumer data. This text is intended for practitioners, researchers, and post-graduate students working on electricity load forecasting; it may also be of interest to applied academics or scientists wanting to learn about cutting-edge forecasting tools for application in other areas. Readers are assumed to be familiar with standard statistical concepts such as random variables, probability density functions, and expected values, and to possess some minimal modeling experience.

Statistical Learning Using Neural Networks: A Guide for Statisticians and Data Scientists with Python

by Basilio de Braganca Pereira Calyampudi Radhakrishna Rao Fabio Borges de Oliveira

Statistical Learning using Neural Networks: A Guide for Statisticians and Data Scientists with Python introduces artificial neural networks starting from the basics and increasingly demanding more effort from readers, who can learn the theory and its applications in statistical methods with concrete Python code examples. It presents a wide range of widely used statistical methodologies, applied in several research areas with Python code examples, which are available online. It is suitable for scientists and developers as well as graduate students.Key Features: Discusses applications in several research areas Covers a wide range of widely used statistical methodologies Includes Python code examples Gives numerous neural network models This book covers fundamental concepts on Neural Networks including Multivariate Statistics Neural Networks, Regression Neural Network Models, Survival Analysis Networks, Time Series Forecasting Networks, Control Chart Networks, and Statistical Inference Results. This book is suitable for both teaching and research. It introduces neural networks and is a guide for outsiders of academia working in data mining and artificial intelligence (AI). This book brings together data analysis from statistics to computer science using neural networks.

Statistical Learning and Data Science

by Fionn Murtagh Mireille Gettler Summa Léon Bottou Bernard Goldfarb Catherine Pardoux Myriam Touati

Data analysis is changing fast. Driven by a vast range of application domains and affordable tools, machine learning has become mainstream. Unsupervised data analysis, including cluster analysis, factor analysis, and low dimensionality mapping methods continually being updated, have reached new heights of achievement in the incredibly rich data wor

Statistical Learning of Complex Data (Studies in Classification, Data Analysis, and Knowledge Organization)

by Maurizio Vichi Francesca Greselin Laura Deldossi Luca Bagnato

This book of peer-reviewed contributions presents the latest findings in classification, statistical learning, data analysis and related areas, including supervised and unsupervised classification, clustering, statistical analysis of mixed-type data, big data analysis, statistical modeling, graphical models and social networks. It covers both methodological aspects as well as applications to a wide range of fields such as economics, architecture, medicine, data management, consumer behavior and the gender gap. In addition, it describes the basic features of the software behind the data analysis results, and provides links to the corresponding codes and data sets where necessary. This book is intended for researchers and practitioners who are interested in the latest developments and applications in the field of data analysis and classification. It gathers selected and peer-reviewed contributions presented at the 11th Scientific Meeting of the Classification and Data Analysis Group of the Italian Statistical Society (CLADAG 2017), held in Milan, Italy, on September 13–15, 2017.

Statistical Learning with Math and Python: 100 Exercises for Building Logic

by Joe Suzuki

The most crucial ability for machine learning and data science is mathematical logic for grasping their essence rather than knowledge and experience. This textbook approaches the essence of machine learning and data science by considering math problems and building Python programs. As the preliminary part, Chapter 1 provides a concise introduction to linear algebra, which will help novices read further to the following main chapters. Those succeeding chapters present essential topics in statistical learning: linear regression, classification, resampling, information criteria, regularization, nonlinear regression, decision trees, support vector machines, and unsupervised learning. Each chapter mathematically formulates and solves machine learning problems and builds the programs. The body of a chapter is accompanied by proofs and programs in an appendix, with exercises at the end of the chapter. Because the book is carefully organized to provide the solutions to the exercises in each chapter, readers can solve the total of 100 exercises by simply following the contents of each chapter. This textbook is suitable for an undergraduate or graduate course consisting of about 12 lectures. Written in an easy-to-follow and self-contained style, this book will also be perfect material for independent learning.

Statistical Learning with Math and R: 100 Exercises for Building Logic

by Joe Suzuki

The most crucial ability for machine learning and data science is mathematical logic for grasping their essence rather than knowledge and experience. This textbook approaches the essence of machine learning and data science by considering math problems and building R programs. As the preliminary part, Chapter 1 provides a concise introduction to linear algebra, which will help novices read further to the following main chapters. Those succeeding chapters present essential topics in statistical learning: linear regression, classification, resampling, information criteria, regularization, nonlinear regression, decision trees, support vector machines, and unsupervised learning. Each chapter mathematically formulates and solves machine learning problems and builds the programs. The body of a chapter is accompanied by proofs and programs in an appendix, with exercises at the end of the chapter. Because the book is carefully organized to provide the solutions to the exercises in each chapter, readers can solve the total of 100 exercises by simply following the contents of each chapter. This textbook is suitable for an undergraduate or graduate course consisting of about 12 lectures. Written in an easy-to-follow and self-contained style, this book will also be perfect material for independent learning.

Statistical Learning with Sparsity: The Lasso and Generalizations (Chapman & Hall/CRC Monographs on Statistics and Applied Probability)

by Trevor Hastie Robert Tibshirani Martin Wainwright

Discover New Methods for Dealing with High-Dimensional DataA sparse statistical model has only a small number of nonzero parameters or weights; therefore, it is much easier to estimate and interpret than a dense model. Statistical Learning with Sparsity: The Lasso and Generalizations presents methods that exploit sparsity to help recover the underl

Refine Search

Showing 52,176 through 52,200 of 61,805 results