The Foundations of Statistics: A Simulation-based Approach
Title | The Foundations of Statistics: A Simulation-based Approach PDF eBook |
Author | Shravan Vasishth |
Publisher | Springer Science & Business Media |
Total Pages | 187 |
Release | 2010-11-11 |
Genre | Mathematics |
ISBN | 3642163130 |
Statistics and hypothesis testing are routinely used in areas (such as linguistics) that are traditionally not mathematically intensive. In such fields, when faced with experimental data, many students and researchers tend to rely on commercial packages to carry out statistical data analysis, often without understanding the logic of the statistical tests they rely on. As a consequence, results are often misinterpreted, and users have difficulty in flexibly applying techniques relevant to their own research — they use whatever they happen to have learned. A simple solution is to teach the fundamental ideas of statistical hypothesis testing without using too much mathematics. This book provides a non-mathematical, simulation-based introduction to basic statistical concepts and encourages readers to try out the simulations themselves using the source code and data provided (the freely available programming language R is used throughout). Since the code presented in the text almost always requires the use of previously introduced programming constructs, diligent students also acquire basic programming abilities in R. The book is intended for advanced undergraduate and graduate students in any discipline, although the focus is on linguistics, psychology, and cognitive science. It is designed for self-instruction, but it can also be used as a textbook for a first course on statistics. Earlier versions of the book have been used in undergraduate and graduate courses in Europe and the US. ”Vasishth and Broe have written an attractive introduction to the foundations of statistics. It is concise, surprisingly comprehensive, self-contained and yet quite accessible. Highly recommended.” Harald Baayen, Professor of Linguistics, University of Alberta, Canada ”By using the text students not only learn to do the specific things outlined in the book, they also gain a skill set that empowers them to explore new areas that lie beyond the book’s coverage.” Colin Phillips, Professor of Linguistics, University of Maryland, USA
Probability, Statistics, and Data
Title | Probability, Statistics, and Data PDF eBook |
Author | Darrin Speegle |
Publisher | CRC Press |
Total Pages | 644 |
Release | 2021-11-26 |
Genre | Business & Economics |
ISBN | 1000504514 |
This book is a fresh approach to a calculus based, first course in probability and statistics, using R throughout to give a central role to data and simulation. The book introduces probability with Monte Carlo simulation as an essential tool. Simulation makes challenging probability questions quickly accessible and easily understandable. Mathematical approaches are included, using calculus when appropriate, but are always connected to experimental computations. Using R and simulation gives a nuanced understanding of statistical inference. The impact of departure from assumptions in statistical tests is emphasized, quantified using simulations, and demonstrated with real data. The book compares parametric and non-parametric methods through simulation, allowing for a thorough investigation of testing error and power. The text builds R skills from the outset, allowing modern methods of resampling and cross validation to be introduced along with traditional statistical techniques. Fifty-two data sets are included in the complementary R package fosdata. Most of these data sets are from recently published papers, so that you are working with current, real data, which is often large and messy. Two central chapters use powerful tidyverse tools (dplyr, ggplot2, tidyr, stringr) to wrangle data and produce meaningful visualizations. Preliminary versions of the book have been used for five semesters at Saint Louis University, and the majority of the more than 400 exercises have been classroom tested.
OpenIntro Statistics
Title | OpenIntro Statistics PDF eBook |
Author | David Diez |
Publisher | |
Total Pages | |
Release | 2015-07-02 |
Genre | |
ISBN | 9781943450046 |
The OpenIntro project was founded in 2009 to improve the quality and availability of education by producing exceptional books and teaching tools that are free to use and easy to modify. We feature real data whenever possible, and files for the entire textbook are freely available at openintro.org. Visit our website, openintro.org. We provide free videos, statistical software labs, lecture slides, course management tools, and many other helpful resources.
Monte-Carlo Simulation-Based Statistical Modeling
Title | Monte-Carlo Simulation-Based Statistical Modeling PDF eBook |
Author | Ding-Geng (Din) Chen |
Publisher | Springer |
Total Pages | 430 |
Release | 2017-02-01 |
Genre | Medical |
ISBN | 9811033072 |
This book brings together expert researchers engaged in Monte-Carlo simulation-based statistical modeling, offering them a forum to present and discuss recent issues in methodological development as well as public health applications. It is divided into three parts, with the first providing an overview of Monte-Carlo techniques, the second focusing on missing data Monte-Carlo methods, and the third addressing Bayesian and general statistical modeling using Monte-Carlo simulations. The data and computer programs used here will also be made publicly available, allowing readers to replicate the model development and data analysis presented in each chapter, and to readily apply them in their own research. Featuring highly topical content, the book has the potential to impact model development and data analyses across a wide spectrum of fields, and to spark further research in this direction.
Statistical Foundations of Data Science
Title | Statistical Foundations of Data Science PDF eBook |
Author | Jianqing Fan |
Publisher | CRC Press |
Total Pages | 752 |
Release | 2020-09-21 |
Genre | Mathematics |
ISBN | 1466510854 |
Statistical Foundations of Data Science gives a thorough introduction to commonly used statistical models, contemporary statistical machine learning techniques and algorithms, along with their mathematical insights and statistical theories. It aims to serve as a graduate-level textbook and a research monograph on high-dimensional statistics, sparsity and covariance learning, machine learning, and statistical inference. It includes ample exercises that involve both theoretical studies as well as empirical applications. The book begins with an introduction to the stylized features of big data and their impacts on statistical analysis. It then introduces multiple linear regression and expands the techniques of model building via nonparametric regression and kernel tricks. It provides a comprehensive account on sparsity explorations and model selections for multiple regression, generalized linear models, quantile regression, robust regression, hazards regression, among others. High-dimensional inference is also thoroughly addressed and so is feature screening. The book also provides a comprehensive account on high-dimensional covariance estimation, learning latent factors and hidden structures, as well as their applications to statistical estimation, inference, prediction and machine learning problems. It also introduces thoroughly statistical machine learning theory and methods for classification, clustering, and prediction. These include CART, random forests, boosting, support vector machines, clustering algorithms, sparse PCA, and deep learning.
An Introduction to Statistical Computing
Title | An Introduction to Statistical Computing PDF eBook |
Author | Jochen Voss |
Publisher | John Wiley & Sons |
Total Pages | 322 |
Release | 2013-08-28 |
Genre | Mathematics |
ISBN | 1118728025 |
A comprehensive introduction to sampling-based methods in statistical computing The use of computers in mathematics and statistics has opened up a wide range of techniques for studying otherwise intractable problems. Sampling-based simulation techniques are now an invaluable tool for exploring statistical models. This book gives a comprehensive introduction to the exciting area of sampling-based methods. An Introduction to Statistical Computing introduces the classical topics of random number generation and Monte Carlo methods. It also includes some advanced methods such as the reversible jump Markov chain Monte Carlo algorithm and modern methods such as approximate Bayesian computation and multilevel Monte Carlo techniques An Introduction to Statistical Computing: Fully covers the traditional topics of statistical computing. Discusses both practical aspects and the theoretical background. Includes a chapter about continuous-time models. Illustrates all methods using examples and exercises. Provides answers to the exercises (using the statistical computing environment R); the corresponding source code is available online. Includes an introduction to programming in R. This book is mostly self-contained; the only prerequisites are basic knowledge of probability up to the law of large numbers. Careful presentation and examples make this book accessible to a wide range of students and suitable for self-study or as the basis of a taught course.
Introductory Statistics with Randomization and Simulation
Title | Introductory Statistics with Randomization and Simulation PDF eBook |
Author | David M. Diez |
Publisher | |
Total Pages | 354 |
Release | 2014-07-18 |
Genre | Statistics |
ISBN | 9781500576691 |
This textbook may be downloaded as a free PDF on the project's website, and the paperback is sold royalty-free. OpenIntro develops free textbooks and course resources for introductory statistics that exceeds the quality standards of traditional textbooks and resources, and that maximizes accessibility options for the typical student. The approach taken in this textbooks differs from OpenIntro Statistics in its introduction to inference. The foundations for inference are provided using randomization and simulation methods. Once a solid foundation is formed, a transition is made to traditional approaches, where the normal and t distributions are used for hypothesis testing and the construction of confidence intervals.