IT Pro is supported by its audience. When you purchase through links on our site, we may earn an affiliate commission. Learn more

Why DIY data collection should come before automation

If you’re getting started on your data journey, there’s no substitute for the personal touch

Data has been described as everything from the new gold, to the new oil - but really, it’s much more than that. Data has become more than just a valuable commodity, for many businesses, it’s essential for their very survival. However, immersing yourself in data carries its own risks.

For Or Lenchner, CEO of data collection platform Bright Data, many organisations make the mistake of trying to run before they can walk, gathering as much data as they can with little thought to what their actual needs are, and how they’re served by the data they’re gathering.

“Data isn’t the new gold,” says Lenchner, speaking on The IT Pro Podcast; “it's the new water. But you know, you can drown with too much water – so just take what you need. No more. You can always scale up in the future.”

The most valuable tip for businesses who want to experiment with data collection and analysis, he says, is to eschew fancy AI-powered tools altogether, and start with manual processes.

“There’s no replacement for just doing it yourself for the first time. You can throw terms like AI  and ML into the air, and yeah, they'll handle it. But what the hell; I mean, you know what data you need. So just grab the data that you need, and later on scale up.”

Implementing mass-scale data collection, he said, is a waste of time and money if you don’t know what you’re looking for. When Lenchner started as a product manager, he said that his most valuable data came from simply calling customers and asking them a few basic questions.

“As a data company, it took us, like, four years to implement our first data warehouse,” he says, “not because we couldn't, but because we didn't want to; we had so much to do with the basic data that we collected internally.”

“If you're doing a good job, you'll need to scale up to more places, gathering more data, which is great. Just, you know, start reasonably.”

Featured Resources

2022 State of the multi-cloud report

What are the biggest multi-cloud motivations for decision-makers, and what are the leading challenges

Free Download

The Total Economic Impact™ of IBM robotic process automation

Cost savings and business benefits enabled by robotic process automation

Free Download

Multi-cloud data integration for data leaders

A holistic data-fabric approach to multi-cloud integration

Free Download

MLOps and trustworthy AI for data leaders

A data fabric approach to MLOps and trustworthy AI

Free Download

Recommended

The IT Pro Podcast: Making sense of the telco transformation
Network & Internet

The IT Pro Podcast: Making sense of the telco transformation

2 Dec 2022
The IT Pro Podcast: The front line of fraud tech
cyber crime

The IT Pro Podcast: The front line of fraud tech

29 Nov 2022
The IT Pro Podcast: How secure is metaverse tech?
Security

The IT Pro Podcast: How secure is metaverse tech?

18 Nov 2022
The IT Pro Podcast: The rise of automated contracts
automation

The IT Pro Podcast: The rise of automated contracts

11 Nov 2022

Most Popular

Empowering employees to truly work anywhere
Sponsored

Empowering employees to truly work anywhere

22 Nov 2022
Q&A: Fred Voccola, Kaseya
channel

Q&A: Fred Voccola, Kaseya

30 Nov 2022
How to boot Windows 11 in Safe Mode
Microsoft Windows

How to boot Windows 11 in Safe Mode

15 Nov 2022