The Discipline of Data

Title The Discipline of Data
Author Jerald Savin
Publisher Taylor & Francis
Total Pages 234
Release 2023-07-06
Genre Business & Economics
ISBN 1000894525

Pulling aside the curtain of ‘Big Data’ buzz, this book introduces C-suite and other non-technical senior leaders to the essentials of obtaining and maintaining accurate, reliable data, especially for decision-making purposes. Bad data begets bad decisions, and an understanding of data fundamentals — how data is generated, organized, stored, evaluated, and maintained — has never been more important when solving problems such as the pandemic-related supply chain crisis. This book addresses the data-related challenges that businesses face, answering questions such as: What are the characteristics of high-quality data? How do you get from bad data to good data? What procedures and practices ensure high-quality data? How do you know whether your data supports the decisions you need to make? This clear and valuable resource will appeal to C-suite executives and top-line managers across industries, as well as business analysts at all career stages and data analytics students.

The Real Work of Data Science

Title The Real Work of Data Science
Author Ron S. Kenett
Publisher John Wiley & Sons
Total Pages 142
Release 2019-04-01
Genre Science
ISBN 111957076X

The essential guide for data scientists and for leaders who must get more from their data science teams.

The Economist boldly claims that data are now "the world's most valuable resource." But, as Kenett and Redman so richly describe, unlocking that value requires far more than technical excellence. The Real Work of Data Science explores understanding the problems, dealing with quality issues, building trust with decision makers, putting data science teams in the right organizational spots, and helping companies become data-driven. This is the work that spells the difference between a good data scientist and a great one, between a team that makes marginal contributions and one that drives the business, between a company that gains some value from its data and one in which data truly is "the most valuable resource."

"These two authors are world-class experts on analytics, data management, and data quality; they've forgotten more about these topics than most of us will ever know. Their book is pragmatic, understandable, and focused on what really counts. If you want to do data science in any capacity, you need to read it." - Thomas H. Davenport, Distinguished Professor, Babson College and Fellow, MIT Initiative on the Digital Economy

"I like your book. The chapters address problems that have faced statisticians for generations, updated to reflect today's issues, such as computational Big Data." - Sir David Cox, Warden of Nuffield College and Professor of Statistics, Oxford University

"Data science is critical for competitiveness, for good government, for correct decisions. But what is data science? Kenett and Redman give, by far, the best introduction to the subject I have seen anywhere. They address the critical questions of formulating the right problem, collecting the right data, doing the right analyses, making the right decisions, and measuring the actual impact of the decisions. This book should become required reading in statistics and computer science departments, business schools, analytics institutes and, most importantly, by all business managers." - A. Blanton Godfrey, Joseph D. Moore Distinguished University Professor, Wilson College of Textiles, North Carolina State University

Envisioning the Data Science Discipline

Title Envisioning the Data Science Discipline
Author National Academies of Sciences, Engineering, and Medicine
Publisher National Academies Press
Total Pages 69
Release 2018-03-05
Genre Education
ISBN 0309465052

The need to manage, analyze, and extract knowledge from data is pervasive across industry, government, and academia. Scientists, engineers, and executives routinely encounter enormous volumes of data, and new techniques and tools are emerging to create knowledge out of these data, some of them capable of working with real-time streams of data. The nation's ability to make use of these data depends on the availability of an educated workforce with necessary expertise. With these new capabilities have come novel ethical challenges regarding the effectiveness and appropriateness of broad applications of data analyses. The field of data science has emerged to address the proliferation of data and the need to manage and understand it. Data science is a hybrid of multiple disciplines and skill sets, draws on diverse fields (including computer science, statistics, and mathematics), encompasses topics in ethics and privacy, and depends on specifics of the domains to which it is applied. Fueled by the explosion of data, jobs that involve data science have proliferated and an array of data science programs at the undergraduate and graduate levels have been established. Nevertheless, data science is still in its infancy, which suggests the importance of envisioning what the field might look like in the future and what key steps can be taken now to move data science education in that direction. This study will set forth a vision for the emerging discipline of data science at the undergraduate level. This interim report lays out some of the information and comments that the committee has gathered and heard during the first half of its study, offers perspectives on the current state of data science education, and poses some questions that may shape the way data science education evolves in the future. The study will conclude in early 2018 with a final report that lays out a vision for future data science education.

Doing Data Science

Title Doing Data Science
Author Cathy O'Neil
Publisher O'Reilly Media, Inc.
Total Pages 408
Release 2013-10-09
Genre Computers
ISBN 144936389X

Now that people are aware that data can make the difference in an election or a business model, data science as an occupation is gaining ground. But how can you get started working in a wide-ranging, interdisciplinary field that’s so clouded in hype? This insightful book, based on Columbia University’s Introduction to Data Science class, tells you what you need to know. In many of these chapter-long lectures, data scientists from companies such as Google, Microsoft, and eBay share new algorithms, methods, and models by presenting case studies and the code they use. If you’re familiar with linear algebra, probability, and statistics, and have programming experience, this book is an ideal introduction to data science. Topics include: statistical inference, exploratory data analysis, and the data science process; algorithms; spam filters, Naive Bayes, and data wrangling; logistic regression; financial modeling; recommendation engines and causality; data visualization; social networks and data journalism; data engineering, MapReduce, Pregel, and Hadoop. Doing Data Science is a collaboration between course instructor Rachel Schutt, Senior VP of Data Science at News Corp, and data science consultant Cathy O’Neil, a senior data scientist at Johnson Research Labs, who attended and blogged about the course.

Responsible Data Science

Title Responsible Data Science
Author Peter C. Bruce
Publisher John Wiley & Sons
Total Pages 304
Release 2021-04-13
Genre Computers
ISBN 1119741777

Explore the most serious and prevalent ethical issues in data science with this insightful new resource. The increasing popularity of data science has resulted in numerous well-publicized cases of bias, injustice, and discrimination. The widespread deployment of “black box” algorithms that are difficult or impossible to understand and explain, even for their developers, is a primary source of these unanticipated harms, making modern techniques and methods for manipulating large data sets seem sinister, even dangerous. When put in the hands of authoritarian governments, these algorithms have enabled suppression of political dissent and persecution of minorities. To prevent these harms, data scientists everywhere must come to understand how the algorithms that they build and deploy may harm certain groups or be unfair. Responsible Data Science delivers a comprehensive, practical treatment of how to implement data science solutions in an even-handed and ethical manner that minimizes the risk of undue harm to vulnerable members of society. Both data science practitioners and managers of analytics teams will learn how to: improve model transparency, even for black box models; diagnose bias and unfairness within models using multiple metrics; and audit projects to ensure fairness and minimize the possibility of unintended harm. Perfect for data science practitioners, Responsible Data Science will also earn a spot on the bookshelves of technically inclined managers, software developers, and statisticians.

The 9 Pitfalls of Data Science

Title The 9 Pitfalls of Data Science
Author Gary Smith
Publisher Oxford University Press
Total Pages 240
Release 2019-07-08
Genre Computers
ISBN 0192582755

Data science has never had more influence on the world. Large companies are now seeing the benefit of employing data scientists to interpret the vast amounts of data that now exist. However, the field is so new and is evolving so rapidly that the analysis produced can be haphazard at best. The 9 Pitfalls of Data Science shows us real-world examples of what can go wrong. Written to be an entertaining read, this invaluable guide investigates the all-too-common mistakes of data scientists, who can be plagued by lazy thinking, whims, hunches, and prejudices, and indicates how they have been at the root of many disasters, including the Great Recession. Gary Smith and Jay Cordes emphasise how scientific rigor and critical thinking skills are indispensable in this age of Big Data, as machines often find meaningless patterns that can lead to dangerous false conclusions. The 9 Pitfalls of Data Science is loaded with entertaining tales of both successful and misguided approaches to interpreting data, from grand successes to epic failures. These cautionary tales will not only help data scientists be more effective, but also help the public distinguish between good and bad data science.

30-Second Data Science

Title 30-Second Data Science
Author Liberty Vittert
Publisher Ivy Press
Total Pages 163
Release 2020-09-29
Genre Computers
ISBN 0711261954

30-Second Data Science is the quickest way to discover how data is a driving force not just in the big issues, such as climate change and healthcare, but in our daily lives. Data science is an entirely new discipline that encompasses a new era of information, from finding criminals to predicting epidemics. But there’s more to it than the vast quantities of information gathered by our computers, smartphones, and credit cards. Carefully compiled by experts in the field, 30-Second Data Science covers the basic statistical principles that drive the algorithms and how data affects us in every way (science, society, business, pleasure), along with the ethical quandaries and its future promise of a better world. Each 30-Second entry details a different facet of data science in just 300 words and one picture, showing how the concept of bringing together different types of data, and using powerful computer programs to find patterns no human eye could spot, is already transforming our world. Exploring key ideas and featuring biographies of the people behind them, 30-Second Data Science explains clearly and concisely all you need to know about data science, from basics to ethics. The 30-Second series presents concise, informative guides to the most important topics that shape the world around us, presenting terms that are key to understanding the subject in 30 seconds, 300 words, and one image.