Why DIY data collection should come before automation
If you’re getting started on your data journey, there’s no substitute for the personal touch
Data has been described as everything from the new gold, to the new oil - but really, it’s much more than that. Data has become more than just a valuable commodity, for many businesses, it’s essential for their very survival. However, immersing yourself in data carries its own risks.
For Or Lenchner, CEO of data collection platform Bright Data, many organisations make the mistake of trying to run before they can walk, gathering as much data as they can with little thought to what their actual needs are, and how they’re served by the data they’re gathering.
“Data isn’t the new gold,” says Lenchner, speaking on The IT Pro Podcast; “it's the new water. But you know, you can drown with too much water – so just take what you need. No more. You can always scale up in the future.”
The most valuable tip for businesses who want to experiment with data collection and analysis, he says, is to eschew fancy AI-powered tools altogether, and start with manual processes.
“There’s no replacement for just doing it yourself for the first time. You can throw terms like AI and ML into the air, and yeah, they'll handle it. But what the hell; I mean, you know what data you need. So just grab the data that you need, and later on scale up.”
Implementing mass-scale data collection, he said, is a waste of time and money if you don’t know what you’re looking for. When Lenchner started as a product manager, he said that his most valuable data came from simply calling customers and asking them a few basic questions.
“As a data company, it took us, like, four years to implement our first data warehouse,” he says, “not because we couldn't, but because we didn't want to; we had so much to do with the basic data that we collected internally.”
“If you're doing a good job, you'll need to scale up to more places, gathering more data, which is great. Just, you know, start reasonably.”
2022 State of the multi-cloud report
What are the biggest multi-cloud motivations for decision-makers, and what are the leading challengesFree Download
The Total Economic Impact™ of IBM robotic process automation
Cost savings and business benefits enabled by robotic process automationFree Download
Multi-cloud data integration for data leaders
A holistic data-fabric approach to multi-cloud integrationFree Download
MLOps and trustworthy AI for data leaders
A data fabric approach to MLOps and trustworthy AIFree Download