Dell Technologies and Starburst announce collaboration on new data lakehouse platform and query engine
A senior figure at Starburst described current single source of truth repositories as “a mess”
Sign up today and you will receive a free copy of our Future Focus 2025 report - the leading guidance on AI, cybersecurity and other IT challenges as per 700+ senior executives
You are now subscribed
Your newsletter sign-up was successful
Dell Technologies and data analytics platform Starburst have announced a new partnership, with the intention of building an advanced data lakehouse solution for better oversight and control of enterprise data.
The initiative, announced at Big Data London, will use Dell’s storage expertise and Starburst’s engine to allow on-demand access to decentralized data.
Customers will then be able to federate and activate data around this lakehouse from a single point of access, which the firms hope will enable more detailed data analysis and for customers to have more oversight of training for artificial intelligence (AI) and machine learning (ML) systems.
Data lakehouses are a model that has arisen in the past few years that combine the structured and unstructured information stored in data warehouses and data lakes. They are particularly useful for performing responsive searches on raw data.
“Dell Technologies is on a journey to a data lakehouse architecture,” said Joe Steiner, CTO of unstructured data solutions at Dell Technologies.
“We have big plans, and step one on our journey is a common query engine, and that's what we're doing with Starburst.
“For far too long our customers, like you, have been bound by the limitations of proprietary databases, data lakes, and data warehouses. My personal feeling is that's going to come to an end. An open ecosystem is emerging, and my customers want these open ecosystem capabilities.
Sign up today and you will receive a free copy of our Future Focus 2025 report - the leading guidance on AI, cybersecurity and other IT challenges as per 700+ senior executives
“We're co-engineering solutions, and we're going to deliver some incredible capabilities very soon."
Rick DeMare, global business development leader at Starburst said that his firm’s engine will “sit on top” of Dell’s data lakehouse, with the aim of giving customers warehouse-like speed over all the forms of data contained within. This will also allow customers to federate and activate their data across the lakehouse from a single point of access.
RELATED RESOURCE
Create the ideal hybrid workplace that will keep you competitive.
On average, Starburst says this approach can help customer systems go 90% faster and reduce the cost of ownership by 53%. In spite of its new partnership with Dell, Starburst will maintain its vendor-agnostic approach and to this end is committed to upholding open file and table formats across its systems.
DeMare also rejected the use of the phrase ‘single source of truth’ used by competitors, describing it as a “single source of lies”.
"It's a mess, it's never been more of a mess,” he said.
DeMare cited a report by S&P Global Market Intelligence which found that on average, firms now maintain 5.4 copies of data between their cloud environments and on-premise data.
He criticized solutions that bill themselves as new technologies but are in effect just data silos, including data lakes, and argued that existing ‘single source of truth’ architecture produces monolithic, closed systems that are expensive to scale.
Easier access to data lakehouses could work to address CIO concerns over cloud complexity. A recent study by Dynatrace found that 47% of CIOs were in favor of more lakehouse structures, to enable greater use of automation.
Prepping for AI, and reducing CIO strain
Steiner and DeMare made the announcement on the keynote stage at Big Data London 2023, an event this year dominated by strategies and solutions aimed at organizing business data for use in AI and ML applications.
The explosion of interest in generative AI, in particular, has put new demands on data teams. Large language models (LLMs) require vast swathes of curated data to function optimally, which requires firms to have a good grip on both structured and unstructured data, and oversight of which data is being used for AI systems to ensure privacy, security, and safety is upheld.
At Dell Technologies World 2023, Dell Technologies global CTO John Roese told ITPro that curation of data was the most important factor for making any LLM work correctly.
Dell’s own effort to remove non-inclusive language such as ‘whitelist’ and ‘blacklist’ from its content repository, for example, would allow for an AI to be trained on the firm’s internal code without fears of unwanted biases appearing in output.
Roese also pointed to the fact that neural networks can make faster connections between data that is unlabelled, as human labels may be seen as unnecessary or arbitrary. In this regard, Dell and Starburst’s data lakehouse could have an advantage over competitors in that it allows firms to quickly draw together data in a variety of forms.
"If everybody can access the data the same way, then they can have the fuel that they need to start working on their generative AI products,” said DeMare.
Recent Gartner research presented evidence that IT teams at many organizations are concerned about the risks of passing their data through public AI systems run by hyperscalers such as Azure OpenAI, and that many firms are weighing the safety of running their own on-premise AI models against the far lower costs of public AI.
DeMare claimed Starburst and Dell's project can help companies to find and manage sensitive data to ensure they have controls over what is and is not given over to public AI firms.
"Maybe you don't want to share all your data with a hyperscaler, which is a general requirement of those generative AI tools."

Rory Bathgate is Features and Multimedia Editor at ITPro, overseeing all in-depth content and case studies. He can also be found co-hosting the ITPro Podcast with Jane McCallion, swapping a keyboard for a microphone to discuss the latest learnings with thought leaders from across the tech sector.
In his free time, Rory enjoys photography, video editing, and good science fiction. After graduating from the University of Kent with a BA in English and American Literature, Rory undertook an MA in Eighteenth-Century Studies at King’s College London. He joined ITPro in 2022 as a graduate, following four years in student journalism. You can contact Rory at rory.bathgate@futurenet.com or on LinkedIn.
-
Tomorrow's fraud techniquesITPro Podcast Leaders need to proactive as attackers launch more consistent, sophisticated attacks
-
Met Office hails huge efficiency gains in first year of cloud supercomputing with Microsoft AzureNews In moving to the cloud, the Met Office has bolstered operational resilience and helped to deliver more accurate forecasts
-
Global demand for this one AI role has skyrocketed 283% in the last year aloneNews AI trainers are now among the most sought-after specialists around the world
-
Most executives have no idea how many employees are actually using AINews A concerning number of business leaders think their staff are using AI across most of their of tasks – the reality is quite different
-
UK firms are dragging their heels on AI training – shadow AI means they need to move fast to avoid unauthorized useNews With shadow AI rife, access to approved tools, clear guardrails, and training are needed to use the technology responsibly
-
OpenAI's big enterprise push needs systems integrators, so it's turning to consultancies to plug implementation gapsNews Consultancies such as Accenture and Capgemini will act as systems integrators and help shape AI strategies for OpenAI customers
-
Microsoft says fear of falling behind is driving an AI arms race among UK businesses – and it's fueling record adoption ratesNews New research shows AI is now a core part of UK business success strategies
-
CEOs aren't seeing any AI productivity gains, yet some tech industry leaders are still convinced AI will destroy white collar work within two yearsNews A massive survey by National Bureau of Economic Research shows limited AI impact, but continued hopes it'll boost productivity eventually
-
‘AI is no longer about experiments. It is about results’: Boards are pushing for faster returns on AI investments, and tech leaders can't keep paceNews AI projects are now being held to the same standards as any other business investment
-
AI isn’t making work easier, it’s intensifying it – researchers say teams are now facing 'unsustainable' workloads, cognitive strain, and higher levels of burnoutNews While workers report productivity gains with AI, that means they’re faced with bigger workloads