Over a year ago, I tested the claim that Big Data was the most hyped technology ever. Using Google Trends, I compared the term “Big Data” with “Web 2.0” and “cloud computing”. It turned out that the Web 2.0 and cloud computing were more hyped than Big Data (as measured by number of searches on the topics). At […]
Tag Archives | Data science
Maximizing the Impact of Data Science Using the Scientific Method
We live in a Big Data world where everything is being quantified. As a result, businesses are trying to make sense of their ever-expanding, diverse, streaming data sources to drive their business forward. If your competitors have access to the same type of data (CRM, ERP, weather, etc.) that you do, how can you keep ahead […]
How IBM is Transforming Data Science
I am at the IBM InterConnect 2016 event in Las Vegas. While this event, IBM’s largest (estimated 24,000 attendees!), is billed as a cloud and mobile conference, sessions focused on a variety of related solutions around analytics, security, DevOps (which I learned is a methodology) and Watson Internet of Things. I was invited by IBM as […]
For Data Scientists, Big Data is not so Big
In our study of data scientists, we found that only about a third of them possessed skills needed to handle big and distributed data. These results are in line with findings from other studies that find that data scientists typically analyze small data sets. We examined the proficiency of data scientists across 25 different data science skills. […]
The Most Important Skill in Data Science: Mining and Visualizing your Data
While data scientists have many resources in their tool belt, our research shows that proficiency with data mining and visualization tools consistently ranks as one of the most important skills in determining project success. We used two methods to rank data science skills. The first way was based on the frequency with which professionals possessed the skills. This method identified data […]
Why Data Science Needs Subject Matter Expertise: Data Have Meaning
In our Big Data world, we are awash in data. We have a lot of it. It comes at us quickly. And it comes in different forms. Those three Vs (i.e., Volume, Velocity and Variety) represent significant hurdles to businesses in their race to extract value from their data. Each of those hurdles have been removed […]
Making Sense of Our Big Data World: Samples, Populations and Sampling Error
As part of my series on Making Sense of Our Big Data World, today’s post is on sampling error. See the overview, Making Sense of Our Big Data World: Statistics for the 99%, to understand the importance and value of understanding statistics and statistical thinking. This Big Data world is defined by the enormous amount […]
Empirically-Based Approach to Understanding the Structure of Data Science
Based on a study of 620+ data professionals, we found that data science skills fall into three broad areas: domain expertise (in our case, business), technology/programming and math/statistics. I discuss the implications of study findings for current data scientists, would-be data scientists and the recruiters who try to find them. Data science is our ability to extract […]