Top of Page

Data Science/Analytics Tools, Technologies and Languages used in 2017

Most Used Data Science Tools and Technologies in 2017 and What to Expect for 2018

The practice of data science requires the use of analytics tools, technologies and languages to help data professionals extract insights and value from data. A recent survey by Kaggle revealed that data professionals relied on Python, R and SQL more than other tools in 2017. Looking ahead to 2018, the survey results showed that data professionals are most interested in learning Tensorflow, Python and R. 

Kaggle conducted a survey in August 2017 of over 16,000 data professionals (2017 State of Data Science and Machine Learning). Their survey included a variety of questions about data science, machine learning, education and more. Kaggle released the raw survey data and many of their members have analyzed the data (see link above). I will be exploring their survey data over the next couple of months. When I find something interesting, I’ll be sure to post it here on my blog. Today’s post is about the data science and machine learning tools and technologies data professionals used in 2017 and the tool and technologies that excite them in 2018.

Most Popular Data Science/Analytics Tools, Technologies and Languages in 2017

Data Science/Analytics Tools, Technologies and Languages used in 2017

Figure 1. Data Science/Analytics Tools, Technologies and Languages used in 2017. Click image to enlarge.

The survey included a question for data professionals who were empoloyed, “For work, which data science/analytics tools, technologies, and languages have you used in the past year? (Select all that apply).” On average, data professionals used 4 (median) tools in 2017. The top used tool in 2017 was Python (60% of respondents said they used this in the previous year), followed by R (46%) and SQL (42%). The top 10 tools are rounded out by TensorFlow, Amazon Web Services, Unix shell / awk, Tableau, C/C++, NoSQL and MATLAB/Octave.

recent poll by KDNuggets also found that Python, R and SQL were the top tools used in 2017. While there was agreement across these two studies with respect to the top three data science tools used, adoption rates for other data science tools varied across the Kaggle and KDNuggets results like RapidMiner (e.g., KDNuggets: 33%; Kaggle: 3%), KNIME (KDNuggets: 19%; Kaggle: 3%). Differences across the two studies could simply reflect differences in samples of respondents used in each study.

I found that 71% of working data professionals used either Python, R or both in the previous year.

Which Data Science Tool and Technology are Data Pros Most Excited about Learning in 2018?

Figure 2. Data Science Tool / Technology Data Pros are Most Excited about Learning in 2018. Click image to enlarge.

The survey also asked all data professionals (working and not) about the tool they are most excited about learning in the next year (see Figure 2). Results showed that data professionals are most interested in learning TensorFlow (24%), followed by Python (16%) and R (8%).

Summary

The results of the Kaggle survey of over 16,000 data professionals paint a clear picture about the most popular data science tools. A majority of current data professionals use either Python or R. Also, data professionals pick these two tools as the ones they are most excited about learning in the coming year.

While TensorFlow, an open source library for fast numerical computing, was the 5th most popular data science tool used last year, it garnered the most interest from data professionals in the coming year. This interest in TensorFlow is likely the result of the growing interest in developing machine learning (and deep learning) models.

While data professionals have access to many different data science tools, it appears that Python and R are becoming the standard analytics tools for the field of data science and machine learning.

 

, , ,

Trackbacks/Pingbacks

  1. Top Machine Learning and Data Science Methods Used at Work | Business Over Broadway - January 15, 2018

    […] Not surprisingly, 4 out of 10 data professionals said they are most excited to learn about deep learning in the next year. Deep learning is a class of machine learning algorithms that are modeled after the information processing and communication patterns of the brain. Deep learning uses layers of units or nodes for feature extraction and transformation, each layer using the output of the previous layer as input. Deep learning methods can be used in such areas like marketing, automotive, speech recognition and more. In parallel with this interest in learning deep learning, survey results also showed that TensorFlow, a popular open-source software used for deep learning, was the top tool data professionals are interested in learning. […]

  2. Which Data Science Tools are Used Together? | Business Over Broadway - January 10, 2018

    […] tools, technologies and languages to help them get insights from data. A recent study showed that data professionals typically use around 4 data science tools. So, which data science tools they use? Are some tools used together? A previous study of data […]

bob@businessoverbroadway.com | 206.372.5990

UA-23043697-1