Today’s data science roles won’t exist in 10 years (2024)

New Tech Forum

By Eric Miller, InfoWorld |

About |

Emerging tech dissected by technologists

AutoML is poised to turn developers into data scientists — and vice versa. Here’s how AutoML will radically change data science for the better.

Today’s data science roles won’t exist in 10 years (2)

In the coming decade, the data scientist role as we know it will look very different than it does today. But don’t worry, no one is predicting lost jobs, just changed jobs.

Data scientists will be fine —according to the Bureau of Labor Statistics, the role is still projected to grow at a higher than average clip through 2029. But advancements in technologywill be the impetus for a huge shift in a data scientist’s responsibilities and in the way businesses approach analytics as a whole. And AutoML tools, which help automate the machine learning pipeline from raw data to a usable model, will lead this revolution.

In 10 years, data scientists will have entirely different sets of skills and tools, but their function will remain the same: to serve as confident and competent technology guides that can make sense of complex data to solve business problems.

AutoML democratizes data science

Until recently, machine learning algorithms and processes were almost exclusively the domain of more traditional data science roles—those with formal education and advanced degrees, or working for large technology corporations. Data scientists have played an invaluable role in every part of the machine learning development spectrum. But in time, their role will become more collaborative and strategic. With tools like AutoML to automate some of their more academic skills, data scientists can focus on guiding organizations toward solutions to business problems via data.

In many ways, this is because AutoML democratizes the effort of putting machine learning into practice. Vendors from startups to cloud hyperscalers have launched solutions easy enough for developers to use and experiment on without a large educational or experiential barrier to entry. Similarly, some AutoML applications are intuitive and simple enough that non-technical workers can try their hands at creating solutions to problems in their own departments—creating a “citizen data scientist” of sorts within organizations.

In order to explore the possibilities these types of tools unlock for both developers and data scientists, we first have to understand the current state of data science as it relates to machine learning development. It’s easiest to understand when placed on a maturity scale.

Smaller organizations and businesses with more traditional roles in charge of digital transformation (i.e., not classically trained data scientists) typically fall on this end of this scale. Right now, they are the biggest customers for out-of-the-box machine learning applications, which are more geared toward an audience unfamiliar with the intricacies of machine learning.

Pros: These turnkey applications tend to be easy to implement, and relatively cheap and easy to deploy. For smaller companies with a very specific process to automate or improve, there are likely several viable options on the market. The low barrier to entry makes these applications perfect for data scientists wading into machine learning for the first time. Because some of the applications are so intuitive, they even allow non-technical employees a chance to experiment with automation and advanced data capabilities—potentially introducing a valuable sandbox into an organization.
Cons: This class of machine learning applications is notoriously inflexible. While they can be easy to implement, they aren’t easily customized. As such, certain levels of accuracy may be impossible for certain applications. Additionally, these applications can be severely limited by their reliance on pretrained models and data.

Examples of these applications include Amazon Comprehend, Amazon Lex, and Amazon Forecast from Amazon Web Services and Azure Speech Services and Azure Language Understanding (LUIS) from Microsoft Azure. These tools are often sufficient enough for burgeoning data scientists to take the first steps in machine learning and usher their organizations further down the maturity spectrum.

Customizable solutions with AutoML

Organizations with large yet relatively common data sets—think customer transaction data or marketing email metrics—need more flexibility when using machine learning to solve problems. Enter AutoML. AutoML takes the steps of a manual machine learning workflow (data discovery, exploratory data analysis, hyperparameter tuning, etc.) and condenses them into a configurable stack.

Pros: AutoML applications allow more experiments to be run on data in a larger space. But the real superpower of AutoML is the accessibility —custom configurations can be built and inputs can be refined relatively easily. What’s more, AutoML isn’t made exclusively with data scientists as an audience. Developers can also easily tinker within the sandbox to bring machine learning elements into their own products or projects.
Cons: While it comes close, AutoML’s limitations mean accuracy in outputs will be difficult to perfect. Because of this, degree-holding, card carrying data scientists often look down upon applications built with the help of AutoML —even if the result is accurate enough to solve the problem at hand.

Examples of these applications include Amazon SageMaker AutoPilot or Google Cloud AutoML. Data scientists a decade from now will undoubtedly need to be familiar with tools like these. Like a developer who is proficient in multiple programming languages, data scientists will need to have proficiency with multiple AutoML environments in order to be considered top talent.

“Hand-rolled” and homegrown machine learning solutions

The largest enterprise-scale businesses and Fortune 500 companies are where most of the advanced and proprietary machine learning applications are currently being developed. Data scientists at these organizations are part of large teams perfecting machine learning algorithms using troves of historical company data, and building these applications from the ground up. Custom applications like these are only possible with considerable resources and talent, which is why the payoff and risks are so great.

Pros: Like any application built from scratch, custom machine learning is “state-of-the-art” and is built based on a deep understanding of the problem at hand. It’s also more accurate —if only by small margins —than AutoML and out-of-the-box machine learning solutions.
Cons: Getting a custom machine learning application to reach certain accuracy thresholds can be extremely difficult, and often requires heavy lifting by teams of data scientists. Additionally, custom machine learning options are the most time-consuming and most expensive to develop.

An example of a hand-rolled machine learning solution is starting with a blank Jupyter notebook, manually importing data, and then conducting each step from exploratory data analysis through model tuning by hand. This is often achieved by writing custom code using open source machine learning frameworks such as Scikit-learn, TensorFlow, PyTorch, and many others. This approach requires a high degree of both experience and intuition, but can produce results that often outperform both turnkey machine learning services and AutoML.

Tools like AutoML will shift data science roles and responsibilities over the next 10 years. AutoML takes the burden of developing machine learning from scratch off of data scientists, and instead puts the possibilities of machine learning technology directly in the hands of other problem solvers. With time freed up to focus on what they know—the data and the inputs themselves —data scientists a decade from now will serve as even more valuable guides for their organizations.

Eric Miller serves as the senior director of technical strategy at Rackspace, where he provides strategic consulting leadership with a proven track record of practice building in the Amazon Partner Network (APN) ecosystem.An accomplished tech leader with 20 years of proven success in enterprise IT, Eric has led several AWS and solutions architecture initiatives, including AWS Well Architected Framework (WAF) Assessment Partner Program, Amazon EC2 for Windows Server AWS Service Delivery Program, and a wide range of AWS rewrites for multi-billion dollar organizations.

—

New Tech Forum provides a venue to explore and discuss emerging enterprise technology in unprecedented depth and breadth. The selection is subjective, based on our pick of the technologies we believe to be important and of greatest interest to InfoWorld readers. InfoWorld does not accept marketing collateral for publication and reserves the right to edit all contributed content. Send all inquiries tonewtechforum@infoworld.com.

Next read this:

Why companies are leaving the cloud
5 easy ways to run an LLM locally
Coding with AI: Tips and best practices from developers
Meet Zig: The modern alternative to C
What is generative AI? Artificial intelligence that creates
The best open source software of 2023

Machine Learning
Deep Learning
Artificial Intelligence
Data Science
Analytics

Today’s data science roles won’t exist in 10 years (2024)

FAQs

Will data science exist in 10 years? ›

In conclusion, the application of Data Science is expected to grow significantly over the next 10 years as more organizations recognize its importance in today's digital world. Industries ranging from finance, healthcare, education, entertainment, and more will benefit and transform with the help of data analytics.

Explore More ›

Why there will be no data science job titles by 2029? ›

The coming Trough of Disillusionment with data science job titles will be the following: Many data science teams have not delivered results that can be measured in ROI by executives. The excitement of AI and ML has temporary led people to ignore the basic question: What does a data scientist actually do?

Know More ›

Why so many data scientists are leaving their jobs? ›

Low employee engagement is a leading cause of employee turnover for tech organizations. Low employee engagement means that your data scientists have low levels of enthusiasm and dedication to your company and their job. They do not feel the impact of their roles or care about their performance in a job.

Get More Info Here ›

What is a major problem with data science? ›

As organizations continue to generate increasingly large volumes of data, data scientists face the challenge of handling and processing these massive datasets. Traditional data processing tools and techniques may not be well-suited for big data analytics, leading to performance issues and longer processing times.

Know More ›

What is the future of data science in 2030? ›

With the advent of artificial intelligence (AI), machine learning, and advanced analytics, the capabilities of data science have expanded exponentially. In 2030, we can anticipate even more sophisticated tools and algorithms that will enable data scientists to glean deeper insights from complex datasets.

Read The Full Story ›

Is data science jobs declining? ›

The overall data science job market is down 15% year over year when you factor in analysts, ML engineers, and data based product managers. But data scientists are losing ground faster. But it's likely because the data science role is getting split into multiple different titles.

Will AI replace data science jobs? ›

The Unique Value of Human Data Scientists

While AI is automating more data science tasks, human data scientists still provide unique value AI cannot currently replicate: Domain Expertise - Data scientists often have deep domain knowledge because of their industry, letting them better contextualize data insights.

Is data science a future proof career? ›

Data science, data engineering, and software engineering are all promising career paths in the technology industry. As businesses increasingly rely on data and technology to drive their operations, these careers are likely to remain in high demand for the foreseeable future.

Get More Info ›

What will replace data scientist? ›

AI can automate repetitive and time-consuming tasks in Data Science, including data cleaning, feature selection, and even some aspects of model selection and optimization. This automation can help Data Scientists focus on more strategic and innovative aspects of their work.

Learn More Now ›

Is there really a shortage of data scientist? ›

In a survey of hiring managers, Upwork reports that data scientist is one of the most challenging roles to recruit. Anaconda's 2022 State of Data Science report echoes this finding, with 63 percent of respondents saying their organization is at least moderately worried about the field's talent shortage.

Read The Full Story ›

Is data science still a hot job? ›

According to the US Bureau of Labor Statistics, the number of jobs requiring data science skills is expected to grow by 27.9 percent by 2026. There is no automated tool that can replace the skillset of a data scientist, as long as you continuously learn and create data-driven solutions.”