Today’s data science roles won’t exist in 10 years (2024)

AutoML is poised to turn developers into data scientists — and vice versa. Here’s how AutoML will radically change data science for the better.

Today’s data science roles won’t exist in 10 years (2)

In the coming decade, the data scientist role as we know it will look very different than it does today. But don’t worry, no one is predicting lost jobs, just changed jobs.

Data scientists will be fine —according to the Bureau of Labor Statistics, the role is still projected to grow at a higher than average clip through 2029. But advancements in technologywill be the impetus for a huge shift in a data scientist’s responsibilities and in the way businesses approach analytics as a whole. And AutoML tools, which help automate the machine learning pipeline from raw data to a usable model, will lead this revolution.

In 10 years, data scientists will have entirely different sets of skills and tools, but their function will remain the same: to serve as confident and competent technology guides that can make sense of complex data to solve business problems.

AutoML democratizes data science

Until recently, machine learning algorithms and processes were almost exclusively the domain of more traditional data science roles—those with formal education and advanced degrees, or working for large technology corporations. Data scientists have played an invaluable role in every part of the machine learning development spectrum. But in time, their role will become more collaborative and strategic. With tools like AutoML to automate some of their more academic skills, data scientists can focus on guiding organizations toward solutions to business problems via data.

In many ways, this is because AutoML democratizes the effort of putting machine learning into practice. Vendors from startups to cloud hyperscalers have launched solutions easy enough for developers to use and experiment on without a large educational or experiential barrier to entry. Similarly, some AutoML applications are intuitive and simple enough that non-technical workers can try their hands at creating solutions to problems in their own departments—creating a “citizen data scientist” of sorts within organizations.

In order to explore the possibilities these types of tools unlock for both developers and data scientists, we first have to understand the current state of data science as it relates to machine learning development. It’s easiest to understand when placed on a maturity scale.

Smaller organizations and businesses with more traditional roles in charge of digital transformation (i.e., not classically trained data scientists) typically fall on this end of this scale. Right now, they are the biggest customers for out-of-the-box machine learning applications, which are more geared toward an audience unfamiliar with the intricacies of machine learning.

  • Pros: These turnkey applications tend to be easy to implement, and relatively cheap and easy to deploy. For smaller companies with a very specific process to automate or improve, there are likely several viable options on the market. The low barrier to entry makes these applications perfect for data scientists wading into machine learning for the first time. Because some of the applications are so intuitive, they even allow non-technical employees a chance to experiment with automation and advanced data capabilities—potentially introducing a valuable sandbox into an organization.
  • Cons: This class of machine learning applications is notoriously inflexible. While they can be easy to implement, they aren’t easily customized. As such, certain levels of accuracy may be impossible for certain applications. Additionally, these applications can be severely limited by their reliance on pretrained models and data.

Examples of these applications include Amazon Comprehend, Amazon Lex, and Amazon Forecast from Amazon Web Services and Azure Speech Services and Azure Language Understanding (LUIS) from Microsoft Azure. These tools are often sufficient enough for burgeoning data scientists to take the first steps in machine learning and usher their organizations further down the maturity spectrum.

Customizable solutions with AutoML

Organizations with large yet relatively common data sets—think customer transaction data or marketing email metrics—need more flexibility when using machine learning to solve problems. Enter AutoML. AutoML takes the steps of a manual machine learning workflow (data discovery, exploratory data analysis, hyperparameter tuning, etc.) and condenses them into a configurable stack.

  • Pros: AutoML applications allow more experiments to be run on data in a larger space. But the real superpower of AutoML is the accessibility —custom configurations can be built and inputs can be refined relatively easily. What’s more, AutoML isn’t made exclusively with data scientists as an audience. Developers can also easily tinker within the sandbox to bring machine learning elements into their own products or projects.
  • Cons: While it comes close, AutoML’s limitations mean accuracy in outputs will be difficult to perfect. Because of this, degree-holding, card carrying data scientists often look down upon applications built with the help of AutoML —even if the result is accurate enough to solve the problem at hand.

Examples of these applications include Amazon SageMaker AutoPilot or Google Cloud AutoML. Data scientists a decade from now will undoubtedly need to be familiar with tools like these. Like a developer who is proficient in multiple programming languages, data scientists will need to have proficiency with multiple AutoML environments in order to be considered top talent.

“Hand-rolled” and homegrown machine learning solutions

The largest enterprise-scale businesses and Fortune 500 companies are where most of the advanced and proprietary machine learning applications are currently being developed. Data scientists at these organizations are part of large teams perfecting machine learning algorithms using troves of historical company data, and building these applications from the ground up. Custom applications like these are only possible with considerable resources and talent, which is why the payoff and risks are so great.

  • Pros: Like any application built from scratch, custom machine learning is “state-of-the-art” and is built based on a deep understanding of the problem at hand. It’s also more accurate —if only by small margins —than AutoML and out-of-the-box machine learning solutions.
  • Cons: Getting a custom machine learning application to reach certain accuracy thresholds can be extremely difficult, and often requires heavy lifting by teams of data scientists. Additionally, custom machine learning options are the most time-consuming and most expensive to develop.

An example of a hand-rolled machine learning solution is starting with a blank Jupyter notebook, manually importing data, and then conducting each step from exploratory data analysis through model tuning by hand. This is often achieved by writing custom code using open source machine learning frameworks such as Scikit-learn, TensorFlow, PyTorch, and many others. This approach requires a high degree of both experience and intuition, but can produce results that often outperform both turnkey machine learning services and AutoML.

Tools like AutoML will shift data science roles and responsibilities over the next 10 years. AutoML takes the burden of developing machine learning from scratch off of data scientists, and instead puts the possibilities of machine learning technology directly in the hands of other problem solvers. With time freed up to focus on what they know—the data and the inputs themselves —data scientists a decade from now will serve as even more valuable guides for their organizations.

Eric Miller serves as the senior director of technical strategy at Rackspace, where he provides strategic consulting leadership with a proven track record of practice building in the Amazon Partner Network (APN) ecosystem.An accomplished tech leader with 20 years of proven success in enterprise IT, Eric has led several AWS and solutions architecture initiatives, including AWS Well Architected Framework (WAF) Assessment Partner Program, Amazon EC2 for Windows Server AWS Service Delivery Program, and a wide range of AWS rewrites for multi-billion dollar organizations.

New Tech Forum provides a venue to explore and discuss emerging enterprise technology in unprecedented depth and breadth. The selection is subjective, based on our pick of the technologies we believe to be important and of greatest interest to InfoWorld readers. InfoWorld does not accept marketing collateral for publication and reserves the right to edit all contributed content. Send all inquiries tonewtechforum@infoworld.com.

Next read this:

  • Why companies are leaving the cloud
  • 5 easy ways to run an LLM locally
  • Coding with AI: Tips and best practices from developers
  • Meet Zig: The modern alternative to C
  • What is generative AI? Artificial intelligence that creates
  • The best open source software of 2023

Related:

  • Machine Learning
  • Deep Learning
  • Artificial Intelligence
  • Data Science
  • Analytics

Copyright © 2020 IDG Communications, Inc.

Today’s data science roles won’t exist in 10 years (2024)

FAQs

Will data science exist in 10 years? ›

In conclusion, the application of Data Science is expected to grow significantly over the next 10 years as more organizations recognize its importance in today's digital world. Industries ranging from finance, healthcare, education, entertainment, and more will benefit and transform with the help of data analytics.

Why there will be no data science job titles by 2029? ›

The coming Trough of Disillusionment with data science job titles will be the following: Many data science teams have not delivered results that can be measured in ROI by executives. The excitement of AI and ML has temporary led people to ignore the basic question: What does a data scientist actually do?

Why so many data scientists are leaving their jobs? ›

Low employee engagement is a leading cause of employee turnover for tech organizations. Low employee engagement means that your data scientists have low levels of enthusiasm and dedication to your company and their job. They do not feel the impact of their roles or care about their performance in a job.

What is a major problem with data science? ›

As organizations continue to generate increasingly large volumes of data, data scientists face the challenge of handling and processing these massive datasets. Traditional data processing tools and techniques may not be well-suited for big data analytics, leading to performance issues and longer processing times.

What is the future of data science in 2030? ›

With the advent of artificial intelligence (AI), machine learning, and advanced analytics, the capabilities of data science have expanded exponentially. In 2030, we can anticipate even more sophisticated tools and algorithms that will enable data scientists to glean deeper insights from complex datasets.

Is data science jobs declining? ›

The overall data science job market is down 15% year over year when you factor in analysts, ML engineers, and data based product managers. But data scientists are losing ground faster. But it's likely because the data science role is getting split into multiple different titles.

Will AI replace data science jobs? ›

The Unique Value of Human Data Scientists

While AI is automating more data science tasks, human data scientists still provide unique value AI cannot currently replicate: Domain Expertise - Data scientists often have deep domain knowledge because of their industry, letting them better contextualize data insights.

Is data science a future proof career? ›

Data science, data engineering, and software engineering are all promising career paths in the technology industry. As businesses increasingly rely on data and technology to drive their operations, these careers are likely to remain in high demand for the foreseeable future.

What will replace data scientist? ›

AI can automate repetitive and time-consuming tasks in Data Science, including data cleaning, feature selection, and even some aspects of model selection and optimization. This automation can help Data Scientists focus on more strategic and innovative aspects of their work.

Is there really a shortage of data scientist? ›

In a survey of hiring managers, Upwork reports that data scientist is one of the most challenging roles to recruit. Anaconda's 2022 State of Data Science report echoes this finding, with 63 percent of respondents saying their organization is at least moderately worried about the field's talent shortage.

Is data science still a hot job? ›

According to the US Bureau of Labor Statistics, the number of jobs requiring data science skills is expected to grow by 27.9 percent by 2026. There is no automated tool that can replace the skillset of a data scientist, as long as you continuously learn and create data-driven solutions.”

Is there still a shortage of data scientist? ›

The demand for specialist data skills is growing. In 2023, the World Economic Forum surveyed 803 global companies and found that 'AI and Machine Learning Specialists' and 'Data Analysts and Scientists' roles were in the top 10 jobs expected to grow fastest between 2023 and 2027.

Why not to get into data science? ›

A data scientist job requires a lot of commitment and time, with most of your daily time spent on data cleaning, which is repetitive and tedious. Data scientists can't work independently and must collaborate with other professionals like data analysts, stakeholders, and SMEs for any project.

What is the future of data science? ›

Collaborations with Domain Experts: In the future, data scientists will increasingly collaborate with domain experts in fields such as healthcare, finance, and climate science. This partnership will allow data scientists to apply their skills to real-world problems, enhancing decision-making and problem-solving.

What is the biggest challenge for data scientist? ›

The role of a data scientist aligns with business strategy, and their fundamental goal is to improve decision-making in the organization. The biggest challenge faced by data scientists is to communicate their results or analyses with business executives.

Will data science have a future? ›

With so many businesses now using data science, it's no wonder that by the end of 2023, the big data analytics market is anticipated to grow to $103 billion. It is also one of the fastest-growing industries right now.

Does data scientist have a future? ›

Therefore, there is a need for a data scientist in every industry. Self-analysis is vital if any business needs to grow and stand out. A data scientist does this analysis. So, the job of a data scientist is very high in demand and will remain as such in the near future.

Will data science be relevant in the future? ›

Continued Growth and Demand: The demand for skilled data scientists is expected to continue growing significantly, driven by the ever-increasing volume of data generated across various industries.

How long will data science be in demand? ›

The U.S. Bureau of Labor Statistics forecasts that the employment of data scientists will grow 35% from 2022 to 2032, much faster than the average for all occupations. About 17,700 openings for data scientists are projected each year, on average, over the decade.

Top Articles
Latest Posts
Article information

Author: Nathanael Baumbach

Last Updated:

Views: 5967

Rating: 4.4 / 5 (55 voted)

Reviews: 86% of readers found this page helpful

Author information

Name: Nathanael Baumbach

Birthday: 1998-12-02

Address: Apt. 829 751 Glover View, West Orlando, IN 22436

Phone: +901025288581

Job: Internal IT Coordinator

Hobby: Gunsmithing, Motor sports, Flying, Skiing, Hooping, Lego building, Ice skating

Introduction: My name is Nathanael Baumbach, I am a fantastic, nice, victorious, brave, healthy, cute, glorious person who loves writing and wants to share my knowledge and understanding with you.