Data science projects require data professionals to devote their energy toward different activities toward project completion. Results of a recent study of over 23,000 data professionals found that data scientists spend about 40% of gathering and cleaning data, 20% of their time building and selecting models and 11% of their time finding insights and communicating […]
Archive | Data Science
Top Cloud Computing Products and Services Used by Data Scientists
A recent survey revealed that 69% of data pros have used at least one cloud computing product in the last 5 years while 62% of data pros have used at least one cloud computing service in the last 5 years. The most popular cloud computing products include AWS Elastic Compute, Google Cloud Engine and AWS […]
Most Popular Machine Learning Frameworks and Products Used by Data Professionals
A recent survey revealed that 84% of data pros have used at least one ML framework in the last 5 years while 51% of data pros have used at least one ML product in the last 5 years. The most popular ML frameworks include Scikit-Learn, Tensorflow and Keras. The most popular ML products include SAS, […]
Usage-Driven Groupings of Data Science and Machine Learning Programming Languages
Analysis of usage patterns of 16 data science programming languages by over 18,000 data professionals showed that programming languages can be grouped into a smaller set (specifically, 5 groupings). That is, some programming languages tend to be used together apart from other programming languages. A few of the different groupings of languages reflect specific types […]
Programming Languages Most Used and Recommended by Data Scientists
The practice of data science requires the use of analytics tools, technologies and programming languages to help data professionals extract insights and value from data. A recent survey of nearly 24,000 data professionals by Kaggle revealed that Python, SQL and R are the most popular programming languages. The most popular, by far, was Python (83% […]
Top BOB Blog Posts of 2018: Data Science, Machine Learning and the Net Promoter Score
The importance of data science and machine learning continues to grow in business and beyond. I did my part this year to spread interest in data science to more people. All of my top blog posts of 2018 (most reads) are all related to data science, with posts that address the practice of data science, […]
How Data Integration and Machine Learning Improve Retention Marketing
Retention marketing is about preventing your valuable customers from churning. Reducing customer churn requires you to know two things: 1) which customers are about to churn and 2) which remedies will keep them from churning. In this paper, I show you how marketers can improve their customer retention efforts by 1) integrating disparate data silos […]
IBM and Data Science are Helping Save the World through Call for Code
The devastating impact of natural disasters can be measured in human suffering, loss of life and economic impact. It is estimated that 2.5 million people have been directly affected by natural disasters since 2000. Additionally, natural disasters have had an economic impact of $1.3 trillion since 2003. Even though natural events such as floods, earthquakes […]