Williams Professor Shares Stories of Statistical Sins Photo: Ryan Ma |
The talk, aptly titled “The Seven Deadly Sins of Data Science– and How to Avoid Them,” was held in the evening at Cendana Rector’s Commons. Among first-year students taking the Common Curriculum, Prof. De Veaux is known as an author of their Quantitative Reasoning (QR) textbook, “Stats: Data and Models”. As a statistician, his illustrious career has brought him to Stanford, Princeton, and the consulting industry. He has also patented several statistical methods, and he is currently the Vice-President of the American Statistical Association.
At the talk, Prof. De Veaux explored a crucial yet unsettling question – what could go wrong in statistical research? Given the ubiquity of big data in academia and industry, this question concerns all of us regardless of which field we are pursuing.
There is a difference between data scientists and statisticians, he said. The problem with data scientists is that they do not think enough about the problem they are solving. “What do statisticians bring to data science? They ask questions,” he said...
As students, we may be surprised that seasoned statisticians would make such rookie-level mistakes. After all, the vulnerability of mean values to extreme outliers would have been covered in any introductory statistics class. This story shows that data is hardly as objective as many imagine it to be.
Read more...
Source: The Octant